Machine Learning: Automated Learning (Part 1)
6 December 2016 by Eduardo García Martín
The technological wave we know as Big Data has transformed the business landscape. Demand is surging for AI systems capable of processing data at a scale and speed no human team can match.
This is happening across virtually every sector. Few business or public-sector activities cannot benefit from intelligent, automated data analysis.
The Value of Information: The New Gold Rush
The volume of data generated today far exceeds any human capacity to process it.
Once organizations have both the data and the systems to handle it, the real work begins: understanding that data, extracting knowledge from it, and turning it into value.
At small scale, this is something humans have always done — we access data, interpret it with our brains, and make decisions. But when we're dealing with gigabytes, terabytes, or petabytes of information, and decisions need to happen in milliseconds, human processing simply isn't in the picture.
Today, countless products, services, and the marketing strategies surrounding them depend on machines performing tasks like these automatically:
- Reading web pages with near-human comprehension.
- Recognizing faces in images posted on social media.
- Detecting emotion in the tone of a phone conversation.
- Answering customer questions in a chat interface.
- Understanding why and how people move geographically.
- Predicting energy consumption in buildings or industrial facilities.
- Inferring which films or songs a person is most likely to enjoy.
- Recommending diet and exercise plans tailored to an individual's health profile and genotype.
Machines are no longer optional — and we need those machines to interpret data, understand it, and draw conclusions intelligently.
In short, we need artificial cognitive systems: brains built from hardware and software, capable of making decisions on our behalf and handling millions of tasks that once required human judgment.
Machines Must Learn from Data the Way We Learn from Experience
Machine Learning replicates the human learning process through experience with data.
We need machines that can program themselves — machines that learn from their own experience with data.
Machine Learning is the discipline that addresses exactly this challenge. As data volumes exploded and cloud infrastructure matured, every major technology company moved aggressively into this space, offering cloud-based services for building applications that learn directly from the data they ingest.
Supervised vs. Unsupervised Machine Learning
Machine learning divides into two main areas: supervised learning and unsupervised learning. The distinction isn't simply about whether a human is watching — it's really about what you want to do with your data.
The easiest way to understand these learning algorithms is to think about how children learn. Reinforcement-style learning — where rewarded behaviors become more frequent and penalized behaviors fade — underpins a broad family of machine learning techniques used in artificial systems today.
This family is called supervised learning, and it requires human input to define what's correct and what isn't. In many cognitive computing applications, humans also supply some of the semantic context the algorithms need to function.
Consider a system trained to classify the different document types an office receives. Initially, humans must label a representative sample of examples so the machine can learn the patterns — after that, it generalizes on its own.
Supervised vs. unsupervised learning in Machine Learning.
Unsupervised learning, by contrast, works with unlabeled historical data. The goal is to explore that data and uncover structure — patterns the data itself reveals without any predefined categories.
A classic application is customer segmentation: grouping customers by similar characteristics or behaviors so that highly targeted marketing campaigns can be directed at each group.
Effective machine learning requires accounting for the full context surrounding the data: social dynamics, economic conditions, environment, location, and more. That broader awareness is what separates a model that generalizes well from one that simply memorizes.
Central to all of this is the concept of knowledge — a genuine understanding of a subject deep enough to form an informed opinion, answer questions, and reason about new situations.
More on this in the next installment. Stay tuned.