Ridho Rahmadi ML Models Learning May 10

Models and Learning
Ridho Rahmadi
Center of Data Science UII
May 10, 2020
Ridho Rahmadi (Center of Data Science UII) Models and Learning May 10, 2020 1 / 30

Human Intelligence

Artificial Intelligence (AI)
Note that most of processes are automatic procedure.

ML, DL, CM
Machine Learning (ML), Deep Learning (DL), and Causal Modeling (CM) are parts of
AI.

Redefining Data
49K photos in Instagram
3.9M Google searches
4.3M Youtube watch
473K Twitter tweets
12.9M text sent
750K Spotify stream
156M emails sent
154K Skype calls

Data To Expect
Estimated there are
> 2.500.000.000.000.000.000 bytes
generated per day

Perspectives in Data Science
Problem Activity Questions Examples
Association
P(y|x)
Seeing
What is?
How would seeing X
change my belief in Y ?
What does a symptom tell
me about a disease?
Intervention
P(y|do(x), z)
Doing
What if?
What if I do X = x?
What if I take aspirin, will
my headache be cured?
Counterfactual
P(yx|x0
, y0
)
Imagining
Why?
Was it X that caused Y ?
What if I had acted differ-
ently?
What if I had not been
smoking the past 2 years?

Machine Learning
Data {x, y}
House area
Calories intake
Supervised
Machine
Learning f
Linear regression
Polynomial regression
etc.
Linear regression model
E.g., House price,
Weight gain

Machine Learning
Data {x, y}
Twitter tweets
Clinical assessment
Students’ study hour
Supervised
Machine
Learning f
Logistic regression
Naiv̈e Bayes Classifier
Random forest
Support vector machine
etc.
Classification model
E.g., Hoax or not
Diabetes or not
Pass exam or not

Example when x and y are continuous
1 2 3 4
1
2
3
4
5
6
eat 1 cookie
eat 2 cookies
cookies
Kg
What if I eat 3 cookies?

Extend the problem
1,000 2,000 3,000 4,000
200
400
600
Area in m2
Price
What is the price of a house if the area is 558 M2?

A good model?
1,000 2,000 3,000 4,000
200
400
600
Area in m2
Price
Draw a line by connecting all the points like this?

A good model?
1,000 2,000 3,000 4,000
200
400
600
Area in m2
Price
Draw a line by connecting all the points like this? Our objective is a model
generalization; the model above will not fit well other data.

A better model?
1,000 2,000 3,000 4,000 5,000
0
200
400
600
800
Area in m2
Price
Draw a line like this?

Which one?
1,000 2,000 3,000 4,000 5,000
0
200
400
600
800
Area in m2
Price
But which line?

Linear Regression
1 Pick an initial line/model
h(θ) by randomly choosing
parameter θ
2 Compute the corresponding
cost function J
3 Update the line/model h(θ)
by updating θ that makes
J(θ) smaller, using, e.g.
gradient descent
4 Repeat steps 2 and 3 until
converges

When x continuous and y discrete
Cholesterol x1 Exercise x2 Status y
100 200 healthy
200 50 unhealthy
90 300 healthy
95 250 healthy
250 30 unhealthy
.
.
.
.
.
.
x2
x1
Given a training set consisting of two classes “healthy” or “unhealthy”,
what is the class of a new sample with x1 = 300, x2 = 20?

A good classifier?
x2
x1
Note that this is a slightly different data set with the previous one.

Which one?
x2
x1
Which classifier?

Support Vector Machine (SVM)
y
x
m
a
r
g
i
n

Unsupervised Learning
Cholesterol x1 Exercise x2
100 200
200 50
90 300
95 250
250 30
.
.
.
.
.
.
x2
x1
In unsupervised learning, our training set has no target variable y, that is,
{x(1), . . . , x(m)}, and thus regression and classification is no longer of
interest.

In unsupervised learning, we want to
find an interesting
patterns/structures in the data.
x2
x1

For example, clusters or smaller
groups in a data set
The idea: partitioning data into
distinct groups
observations within each
group are similar
observations in different
groups are different
x2
x1

An algorithm fo clustering: K-Means
1 Initialize cluster centroids
2 Repeat until convergence (no
change)
1 Assign each ith observation to
the closest cluster centroid
2 For each cluster, move the
centroid to the mean of
observations belong to the
cluster

K-Means
change)
cluster

Happy Learning!
Matur Nuwun
ridho.rahmadi@uii.ac.id

Ridho Rahmadi ML Models Learning May 10

Recommended

Recommended

More Related Content

Similar to Ridho Rahmadi ML Models Learning May 10

Similar to Ridho Rahmadi ML Models Learning May 10 (20)

Recently uploaded

Recently uploaded (20)

Ridho Rahmadi ML Models Learning May 10