What is Machine Learning?

Neil D. Lawrence

2017-07-17

Introduction

General introduction to machine learning.
Highlight technical challenges and current solutions.
What is machine learning? And why is it important?

Rise of Machine Learning

Driven by data and computation
Fundamentally dependent on models

\[ \text{data} + \text{model} + \text{compute} \rightarrow \text{prediction} \]

Data Revolution

Large amounts of data and high interconnection bandwidth mean that we receive much of our information about the world around us through computers.

Efficiency

Economies driven by ‘production’.
Greater production comes with better efficiency.
- E.g. moving from gathering food to settled agriculture.
In the modern era one approach to becoming more efficient is automation of processes.
- E.g. manufacturing production lines

Physical Processes

Manufacturing processes consist of production lines and robotic automation.
Logistics can also be decomposed into supply chain processes.
Efficiency can be improved by automation.

Goods and Information

For modern society: management of flow of goods and information.
Flow of information is highly automated.
Processing of data is decomposed into stages in computer code.

Intervention

For all cases: manufacturing, logistics, data management
Pipeline requires human intervention from an operator.
Interventions create bottlenecks, slow the process.
Machine learning is a key technology in automating these manual stages.

Long Grass

Easy to replicate interventions have already been dealt with.
Components that still require human intervention are the knottier problems.
Difficult to decompose into stages which could then be further automated.
These components are ‘process-atoms’.
These are the “long grass” regions of technology.

Nature of Challenge

In manufacturing or logistics settings atoms are flexible manual skills.
- Requires emulation of a human’s motor skills.
In information processing: our flexible cognitive skills.
- Our ability to mentally process an image or some text.

Worked Example: Delivery Drones

Data Driven

Machine learning: replicate processes through direct use of data.
Aim to emulate cognitive processes through the use of data.
Use data to provide new approaches in control and optimization that should allow for emulation of human motor skills.

Process Emulation

Key idea: emulate the process as a mathematical function.
Each function has a set of parameters which control its behavior.
Learning is the process of changing these parameters to change the shape of the function.
Choice of which class of mathematical functions we use is a vital component of our model.
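
As a minimal sketch of this idea, the snippet below treats a straight line as the class of functions, with a slope and an offset as its parameters, and "learns" by nudging those parameters with gradient descent. The names `f` and `learn`, the learning rate, the step count and the toy observations are all assumptions made for illustration; they are not taken from the talk.

```python
def f(x, params):
    """A linear function; params = (slope, offset) control its shape."""
    slope, offset = params
    return slope * x + offset


def learn(xs, ys, params=(0.0, 0.0), rate=0.05, steps=2000):
    """Adjust the parameters by gradient descent on the mean squared error."""
    slope, offset = params
    n = len(xs)
    for _ in range(steps):
        # Prediction errors under the current parameter values.
        errors = [f(x, (slope, offset)) - y for x, y in zip(xs, ys)]
        grad_slope = 2.0 * sum(e * x for e, x in zip(errors, xs)) / n
        grad_offset = 2.0 * sum(errors) / n
        # Changing the parameters changes the shape of the function.
        slope -= rate * grad_slope
        offset -= rate * grad_offset
    return slope, offset


# Toy observations generated from y = 2x + 1 (illustrative, not real data).
xs = [0.0, 1.0, 2.0, 3.0, 4.0]
ys = [1.0, 3.0, 5.0, 7.0, 9.0]
slope, offset = learn(xs, ys)
```

Choosing a different class of functions (for example a polynomial instead of a line) would change what shapes the learned function can take, which is why that choice is part of the model.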

Polynomial Fit

Example of prediction: the pace of the Olympic marathon gold medalist is predicted using a regression fit. In this case the mathematical function directly predicts the winner's pace as a function of the year of the Olympics.
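
A regression fit of this kind might be sketched with NumPy as follows. The data here are synthetic and invented for illustration (they are not real Olympic results), the polynomial degree of 2 is an arbitrary choice, and `predict_pace` is a hypothetical helper name.

```python
import numpy as np

# Synthetic, made-up data (NOT real Olympic results): a "pace" generated
# from a known quadratic in the rescaled year, plus a little noise.
rng = np.random.default_rng(0)
year = np.arange(1900, 2020, 4, dtype=float)
x = (year - year.mean()) / year.std()      # rescale for numerical stability
true_coeffs = np.array([0.2, -0.5, 4.0])   # highest degree first
pace = np.polyval(true_coeffs, x) + rng.normal(scale=0.05, size=x.shape)

# Fit a degree-2 polynomial to the observations by least squares.
coeffs = np.polyfit(x, pace, deg=2)


def predict_pace(query_year):
    """Predict pace for a given year using the fitted polynomial."""
    xq = (query_year - year.mean()) / year.std()
    return np.polyval(coeffs, xq)
```

Calling `predict_pace(2020.0)` evaluates the fitted function just beyond the last synthetic year. With real data, the degree of the polynomial would be one of the modelling choices to examine.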

Artificial Intelligence

Machine learning is the principal technology underlying recent advances in artificial intelligence.
Different approach to that developed in classical artificial intelligence (sometimes referred to as “good old fashioned AI” or GOFAI).
GOFAI relied on symbolic logic as its mathematical engine.

Artificial Intelligence

Early AI used expert systems: a set of logical rules implemented to reconstruct expertise. For example, rules to decide whether or not someone has cancer.
Such rules prove hard to specify for very complex processes.
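
To make the rule-based approach concrete, here is a toy sketch of an expert system as an ordered list of hand-written rules. The domain (deciding whether a delivery drone may fly), the field names and every threshold are hypothetical and chosen only for illustration.

```python
# Hypothetical hand-written rules: each pairs a condition with a conclusion.
# The thresholds and field names are invented for this sketch.
RULES = [
    (lambda case: case["wind_speed_ms"] > 12.0, "grounded"),
    (lambda case: case["battery_fraction"] < 0.3, "grounded"),
    (lambda case: case["raining"] and case["payload_kg"] > 2.0, "grounded"),
    (lambda case: True, "cleared"),  # default rule when nothing else fires
]


def decide(case):
    """Return the conclusion of the first rule whose condition holds."""
    for condition, conclusion in RULES:
        if condition(case):
            return conclusion


calm_day = {"wind_speed_ms": 3.0, "battery_fraction": 0.9,
            "raining": False, "payload_kg": 1.0}
windy_day = {"wind_speed_ms": 15.0, "battery_fraction": 0.9,
             "raining": False, "payload_kg": 1.0}
```

Here `decide(calm_day)` falls through to the default rule and `decide(windy_day)` is caught by the first one. Every new circumstance needs another explicit rule, which illustrates why such rule sets become hard to specify as the process grows more complex.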

Data Science

Can split applications of machine learning broadly into data science and artificial intelligence.
Data science: making sense of ‘new data’, the large volumes of data from sensors and increased interconnectivity (big data, IoT)
Classical statistics: the question is formed first, and data is collected later.

Data Science

Data Science: data is first, questions come later.
Overlap through exploratory data analysis

Artificial Intelligence

Artificial intelligence originates in cybernetics
Challenge to recreate “intelligent” behavior.
Either general intelligence or emulation of human capabilities.
Machine learning is important because of success of data-driven artificial intelligence.
Data-driven artificial intelligence: instead of solving from first principles, collect data.

Machine Learning

Observe a system in practice.
Emulate its behavior with mathematics.

Design challenge: where to put the mathematical function.
Where it’s placed leads to different ML domains.