Introduction to
Machine Learning
DataTalks.Club
Machine Learning Zoomcamp
Session #1.1
DataTalks.Club — mlzoomcamp.com — @Al_Grigor
Imagine we have a car classifieds website
Pictures taken from olx.ua
DataTalks.Club — mlzoomcamp.com — @Al_Grigor
John Torcasio/unsplash (source)
I want to
sell my
car
DataTalks.Club — mlzoomcamp.com — @Al_Grigor
📱
DataTalks.Club — mlzoomcamp.com — @Al_Grigor
📱
DataTalks.Club — mlzoomcamp.com — @Al_Grigor
🤔
📱
DataTalks.Club — mlzoomcamp.com — @Al_Grigor
How can we help our user select the best price?
📱
🤔
DataTalks.Club — mlzoomcamp.com — @Al_Grigor
$1.1k
Price
$0.6k
$23k
What do we know about cars?
DataTalks.Club — mlzoomcamp.com — @Al_Grigor
1995
Year
1980
2016
$1.1k
Price
$0.6k
$23k
What do we know about cars?
DataTalks.Club — mlzoomcamp.com — @Al_Grigor
1995
Year
1980
2016
GAZ
Make
VAZ
BWM
$1.1k
Price
$0.6k
$23k
What do we know about cars?
DataTalks.Club — mlzoomcamp.com — @Al_Grigor
1995
Year
1980
2016
GAZ
Make
VAZ
BWM
200.000
Mileage
100.000
5.000
$1.1k
Price
$0.6k
$23k
What do we know about cars?
DataTalks.Club — mlzoomcamp.com — @Al_Grigor
1995
Year
1980
2016
GAZ
Make
VAZ
BWM
200.000
Mileage
100.000
5.000
$1.1k
Price
$0.6k
$23k
...
...
...
...
What do we know about cars?
DataTalks.Club — mlzoomcamp.com — @Al_Grigor
1995
Year
1980
2016
GAZ
Make
VAZ
BWM
200.000
Mileage
100.000
5.000
$1.1k
Price
$0.6k
$23k
...
...
...
...
👷
Using this information, an expert can determine the price
DataTalks.Club — mlzoomcamp.com — @Al_Grigor
DATA PATTERNS
👷
DataTalks.Club — mlzoomcamp.com — @Al_Grigor
DATA ML PATTERNS
DATA PATTERNS
If an expert can, so can a model!
👷
DataTalks.Club — mlzoomcamp.com — @Al_Grigor
1995
Year
1980
2016
GAZ
Make
VAZ
BWM
200.000
Mileage
100.000
5.000
$1.1k
Price
$0.6k
$23k
...
...
...
...
“Features”
what we know about cars
“Target”
what we want to predict
DataTalks.Club — mlzoomcamp.com — @Al_Grigor
1995
Year
1980
2016
GAZ
Make
VAZ
BWM
200.000
Mileage
100.000
5.000
$1.1k
Price
$0.6k
$23k
...
...
...
...
... ... ... ...
...
🚗
DataTalks.Club — mlzoomcamp.com — @Al_Grigor
1995
Year
1980
2016
GAZ
Make
VAZ
BWM
200.000
Mileage
100.000
5.000
$1.1k
Price
$0.6k
$23k
...
...
...
...
... ... ... ...
...
🚗
model
“Features” “Target”
train
Machine Learning
DataTalks.Club — mlzoomcamp.com — @Al_Grigor
1995
Year
1980
2016
GAZ
Make
VAZ
BWM
200.000
Mileage
100.000
5.000
$1.1k
Price
$0.6k
$23k
...
...
...
...
... ... ... ...
...
🚗
“Features” “Target”
predict
$1.5k
Price
$0.4k
$20k
...
“Predictions”
Using a model
model
DataTalks.Club — mlzoomcamp.com — @Al_Grigor
�
�
DataTalks.Club — mlzoomcamp.com — @Al_Grigor
📱 model
1995
GAZ
200.000
...
Year
Make
Mileage
...
DataTalks.Club — mlzoomcamp.com — @Al_Grigor
model
1995
GAZ
200.000
...
Year
Make
Mileage
...
📱
DataTalks.Club — mlzoomcamp.com — @Al_Grigor
📱
🥳
DataTalks.Club — mlzoomcamp.com — @Al_Grigor
Features
Target
Model training
ML Model
Summary
DataTalks.Club — mlzoomcamp.com — @Al_Grigor
1995
Year
1980
2016
GAZ
Make
VAZ
BWM
200.000
Mileage
100.000
5.000
...
...
...
...
... ... ... ...
$1.1k
Price
$0.6k
$23k
...
Model training
Summary
ML Model
DataTalks.Club — mlzoomcamp.com — @Al_Grigor
Features Predictions
Model
Predictions
Summary
DataTalks.Club — mlzoomcamp.com — @Al_Grigor
Model
1996
Year
1991
2018
Volvo
Make
GAZ
Audi
100.000
Mileage
50.000
2.000
...
...
...
...
... ... ... ...
$1.1k
Price
$0.6k
$23k
...
Predictions
Summary
DataTalks.Club — mlzoomcamp.com — @Al_Grigor
Next
Machine Learning vs Rule-Based System
● Spam detection example

ML Zoomcamp 1.1 - Introduction to Machine Learning