The document discusses adversarial learning, robust learning, and scalable learning. It explores how machine learning algorithms can be exploited by adversaries and how to develop robust and scalable algorithms. Key contributions include showing vulnerabilities in classifiers to exploratory and causative attacks, developing models for multi-labeler learning from unreliable sources, and algorithms for online and client-server learning from noisy data streams. The goal is to learn effectively from unfaithful or incomplete training data and enable learning at large scales required for real-time applications.
This document presents a research thesis that investigates using Monte Carlo simulation as an asset allocation tool for real estate portfolios during different economic periods. The thesis acknowledges those who provided guidance and support. It includes an introduction outlining the research problem and question, as well as the methodology, literature review, research design, data analysis, conclusion and references. The research aims to determine if altering real estate asset allocation based on Monte Carlo simulation during growth, decline and stable periods can enhance portfolio performance compared to a static allocation. Portfolio performance is measured using the Sharpe ratio to assess risk-adjusted returns.
The document discusses various stability patterns and antipatterns related to system architecture and integration. It describes antipatterns like integration points, blocked threads, cascading failures, and attacks of self-denial that can cause system failures. It then presents patterns like circuit breakers, bulkheads, test harnesses, and decoupling middleware that can be used to detect failures early and prevent cascading failures from spreading. The document emphasizes the importance of monitoring dependencies, avoiding tight couplings, and using timeouts and fail-fast mechanisms to isolate failures and stop errors from propagating.
Apache Kafka is a distributed streaming platform that forms a key part of the infrastructure at many companies including Uber, Netflix and LinkedIn. In this talk, Matt gave a technical overview of Apache Kafka, discussed practical use cases of Kafka for IoT data and demonstrated how to ingest data from an MQTT server using Kafka Connect.
Big Data Spain 2018: How to build Weighted XGBoost ML model for Imbalance dat... (Alok Singh)
Alok Singh is a Principal Engineer at IBM CODAIT who has built multiple analytical frameworks and machine learning algorithms. The presentation provides an overview of building predictive models for imbalanced datasets using scikit-learn and XGBoost. It discusses challenges with imbalanced data, evaluation metrics like confusion matrix and ROC curves, and techniques for imbalanced learning including weighted classes, oversampling minorities and undersampling majorities, and SMOTE. The presentation concludes with a hands-on tutorial demonstrating these techniques on an imbalanced bank marketing dataset.
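The class-weighting and oversampling ideas summarized above can be sketched without any ML library. A minimal illustration: the `scale_pos_weight` ratio follows XGBoost's documented convention (negative count over positive count), while the oversampler is a generic random-duplication sketch, not SMOTE.

```python
import random

def scale_pos_weight(labels):
    # XGBoost's recommended setting for imbalanced binary data:
    # weight the positive class by n_negative / n_positive.
    pos = sum(1 for y in labels if y == 1)
    return (len(labels) - pos) / pos

def oversample_minority(samples, seed=0):
    # samples: list of (features, label) pairs with binary labels 0/1.
    # Randomly duplicate minority-class samples until classes balance.
    rng = random.Random(seed)
    by_class = {0: [], 1: []}
    for s in samples:
        by_class[s[1]].append(s)
    minority, majority = sorted(by_class.values(), key=len)
    extra = [rng.choice(minority) for _ in range(len(majority) - len(minority))]
    return majority + minority + extra
```

With 10 positives and 90 negatives, `scale_pos_weight` returns 9.0 and the oversampler yields a 90/90 split.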
This document discusses various techniques for machine learning when labeled training data is limited, including semi-supervised learning approaches that make use of unlabeled data. It describes assumptions like the clustering assumption, low density assumption, and manifold assumption that allow algorithms to learn from unlabeled data. Specific techniques covered include clustering algorithms, mixture models, self-training, and semi-supervised support vector machines.
This document discusses adversarial machine learning and how machine learning algorithms can be attacked. It gives examples of how naive Bayes, k-means clustering, and SVM algorithms can be subverted by manipulating input data or model parameters: a naive Bayes filter's accuracy can be degraded by adding benign words to spam messages, k-means clustering's false negative rate can be raised by injecting outlier points, and an SVM's decision boundary, and hence its predictions, can be steered by an attacker. The document advocates defenses such as ensembling multiple models and using robust learning methods.
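The benign-word attack on naive Bayes described above can be illustrated with a toy filter. The word scores below are invented log-likelihood ratios, not values from any real spam filter.

```python
# Hypothetical per-word evidence scores, log P(w|spam) - log P(w|ham),
# of the kind a trained naive Bayes spam filter accumulates.
WORD_SCORES = {"viagra": 3.0, "winner": 2.0, "free": 1.5,
               "meeting": -1.0, "report": -1.5, "thanks": -2.0}

def spam_score(words):
    # Naive Bayes sums per-word evidence; a positive total means "spam".
    return sum(WORD_SCORES.get(w, 0.0) for w in words)

spam = ["viagra", "winner", "free"]
assert spam_score(spam) > 0  # correctly blocked

# Good-word attack: pad the same message with benign vocabulary until
# the summed evidence falls below the decision threshold.
evasive = spam + ["meeting", "report", "thanks", "thanks", "thanks"]
assert spam_score(evasive) < 0  # now slips through
```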
Patrick Hall, Professor, AI Risk Management, The George Washington University
H2O Open Source GenAI World SF 2023
Language models are incredible engineering breakthroughs but require auditing and risk management before productization. These systems raise concerns about toxicity, transparency and reproducibility, intellectual property licensing and ownership, disinformation and misinformation, supply chains, and more. How can your organization leverage these new tools without taking on undue or unknown risks? While language models and associated risk management are in their infancy, a small number of best practices in governance and risk are starting to emerge. If you have a language model use case in mind, want to understand your risks, and do something about them, this presentation is for you!
This document discusses adversarial learning and the adversarial classification reverse engineering (ACRE) problem. The ACRE problem aims to efficiently learn enough about a classifier to construct adversarial attacks using a limited number of queries. The document presents algorithms for reverse engineering linear classifiers with continuous and boolean features. It shows ACRE learning is possible within a factor of 1+ε for continuous features and 2 for boolean features. Experimental results demonstrate the algorithm's effectiveness on spam filtering tasks. Future work directions are discussed around different classifier and cost function types.
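The query-based setting that ACRE formalizes can be illustrated with a toy black-box linear classifier: a binary search along one continuous feature locates the decision boundary to arbitrary precision using only membership queries. The weights below are invented, and this is a sketch of the setting, not the paper's algorithm.

```python
# Black-box linear classifier the attacker can only query.
W, B = [2.0, -1.0], 0.5  # hidden weights and bias

def query(x):
    return sum(w * xi for w, xi in zip(W, x)) + B > 0

def boundary_on_feature(x, i, lo=-100.0, hi=100.0, iters=60):
    # Binary-search the value of feature i at which the black-box label
    # flips, holding all other features fixed. Assumes the labels at lo
    # and hi differ, which holds when a linear boundary crosses the range.
    probe = x[:]
    probe[i] = lo
    label_lo = query(probe)
    probe[i] = hi
    assert query(probe) != label_lo
    for _ in range(iters):
        mid = (lo + hi) / 2.0
        probe[i] = mid
        if query(probe) == label_lo:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2.0
```

With the hidden weights above and `x = [0.0, 0.0]`, the search on feature 0 converges to the boundary value -0.25, where 2·x₀ + 0.5 changes sign.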
The document discusses various machine learning concepts like model overfitting, underfitting, missing values, stratification, feature selection, and incremental model building. It also discusses techniques for dealing with overfitting and underfitting like adding regularization. Feature engineering techniques like feature selection and creation are important preprocessing steps. Evaluation metrics like precision, recall, F1 score and NDCG are discussed for classification and ranking problems. The document emphasizes the importance of feature engineering and proper model evaluation.
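The evaluation metrics mentioned above are easy to state concretely; a minimal sketch of precision, recall, and F1 computed from confusion-matrix counts:

```python
def classification_metrics(tp, fp, fn):
    # Precision: of everything predicted positive, how much was right.
    precision = tp / (tp + fp)
    # Recall: of everything actually positive, how much was found.
    recall = tp / (tp + fn)
    # F1: harmonic mean, punishing imbalance between the two.
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1

p, r, f1 = classification_metrics(tp=8, fp=2, fn=8)
# precision = 0.8, recall = 0.5, F1 = 8/13 ≈ 0.615
```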
Introduction to MaxDiff Scaling of Importance - Parametric Marketing Slides (QuestionPro)
This document provides an overview of MaxDiff, a technique for evaluating preferences that asks respondents to choose best and worst options from sets. It notes limitations of traditional rating scales like scale bias and lack of constraints. MaxDiff forces trade-offs and provides richer data than ratings. Questions present lists and ask for most/least important. Results can be analyzed simply by counting choices or more advanced techniques can provide respondent-level utilities. The document provides examples and tips for effective MaxDiff surveys.
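The simple counting analysis mentioned above reduces to best-minus-worst tallies across respondents' choice tasks; a minimal sketch (the item names are invented):

```python
from collections import Counter

def maxdiff_counts(tasks):
    # tasks: one (best, worst) choice pair per MaxDiff question.
    # Counting analysis scores each item by how often it was picked
    # best minus how often it was picked worst (a fuller analysis
    # would also normalize by how often each item was shown).
    best = Counter(b for b, _ in tasks)
    worst = Counter(w for _, w in tasks)
    items = set(best) | set(worst)
    return {i: best[i] - worst[i] for i in items}

scores = maxdiff_counts([("price", "brand"), ("price", "support"),
                         ("quality", "brand")])
# price: +2, quality: +1, support: -1, brand: -2
```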
This document discusses key concepts in sampling design and procedures. It covers reasons for sampling such as pragmatic and cost reasons. It also discusses defining the target population and sampling frame. The document contrasts probability sampling techniques like simple random sampling, systematic sampling, and stratified sampling with non-probability techniques like convenience sampling and snowball sampling. It discusses factors to consider in determining sample size such as variance, desired confidence level and interval. Sample size formulas for estimating means and proportions are also provided.
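The standard sample-size formula for estimating a proportion can be written out directly; a sketch of the textbook formula n = z²·p(1-p)/e²:

```python
import math

def sample_size_for_proportion(z, p, e):
    # n = z^2 * p * (1 - p) / e^2, rounded up to a whole respondent.
    # z: z-value for the desired confidence level (1.96 for 95%),
    # p: anticipated proportion (0.5 is the conservative worst case),
    # e: desired margin of error (half-width of the interval).
    return math.ceil(z ** 2 * p * (1 - p) / e ** 2)

n = sample_size_for_proportion(z=1.96, p=0.5, e=0.05)
# 385 respondents for a ±5% margin at 95% confidence
```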
Adversarial Attacks on A.I. Systems — NextCon, Jan 2019 (anant90)
Machine learning is itself just another tool, susceptible to adversarial attacks. These can have huge implications, especially in a world with self-driving cars and other automation. In this talk, we will look at recent developments in adversarial attacks on AI systems, and how far we have come in mitigating these attacks.
Connections b/w active learning and model extraction (Anmol Dwivedi)
Codes on https://github.com/anmold-07/Model-Extraction-with-RL
https://www.usenix.org/conference/usenixsecurity20/presentation/chandrasekaran
This paper formalizes model extraction and discusses possible defense strategies by drawing parallels between model extraction and an established area of active learning. In particular, the authors show that recent advancements in the active learning domain can be used to implement powerful model extraction attacks and investigate possible defense strategies.
Correlation, causation and incrementally recommendation problems at netflix ... (Roelof van Zwol)
Within Netflix, personalization is a key differentiator, helping members quickly discover new content that matches their taste. Done well, it creates an immersive user experience; when a recommendation is out of tune, however, it is immediately noticed by our members. During this presentation I will cover some of the personalization and recommendation tasks that jointly define the Netflix user experience, which entertains more than 130M members worldwide. In particular, I will focus on several of the algorithmic challenges related to the launch of new Netflix originals in the service, and go over concepts such as causality, incrementality, and explore-exploit strategies.
The research presented in this talk represents the collaborative efforts of a team of research scientists and engineers at Netflix on our journey to create best in class user experiences.
- Machine learning models are negatively impacted by noisy or inconsistent labels in training data. This is a challenge for tasks like bug severity classification where labels can be subjective.
- A new evaluation metric called Krippendorff's alpha is proposed to measure agreement between labels while accounting for inconsistencies. It is shown to better reflect performance than accuracy when labels are inconsistent.
- Making "big data thick" by improving quality is an important future direction, but challenging at scale. Lightweight methods are needed to reduce noise without extensive manual labelling. Performance measures also need to account for noise inherent in some real-world problems.
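Krippendorff's alpha, mentioned above, can be computed from pairwise coincidences between labels. A minimal sketch under the standard nominal-distance formulation, treating each item's ratings as interchangeable (this is an illustrative implementation, not the authors' code):

```python
from collections import Counter
from itertools import permutations

def krippendorff_alpha_nominal(units):
    # units: one list of labels per rated item (e.g. per bug report),
    # containing however many ratings that item received.
    o_disagree = 0.0          # weighted count of disagreeing ordered pairs
    margins = Counter()       # how often each label value occurs overall
    for labels in units:
        m = len(labels)
        if m < 2:
            continue          # a single rating has nothing to agree with
        for a, b in permutations(labels, 2):
            if a != b:
                o_disagree += 1.0 / (m - 1)
        margins.update(labels)
    n = sum(margins.values())
    d_obs = o_disagree / n
    # Expected disagreement if labels were assigned at random with the
    # same overall frequencies.
    d_exp = (n * n - sum(c * c for c in margins.values())) / (n * (n - 1))
    return 1.0 - d_obs / d_exp
```

Consistent ratings give alpha near 1; chance-level labeling gives 0; systematic disagreement goes negative.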
Dr Murari Mandal from NUS presented, as part of the three-day OpenPOWER Industry Summit, on robustness in deep learning. He covered AI breakthroughs, performance improvements in AI models, adversarial attacks, attacks on semantic segmentation, attacks on object detectors, defending against adversarial attacks, and many other areas.
AI and ML Skills for the Testing World Tutorial (Tariq King)
Software continues to revolutionize the world, impacting nearly every aspect of our work, family, and personal life. Artificial intelligence (AI) and machine learning (ML) are playing key roles in this revolution through improvements in search results, recommendations, forecasts, and other predictions. AI and ML technologies are being used in platforms for digital assistants, home entertainment, medical diagnosis, customer support, and autonomous vehicles. Testing practitioners are recognizing the potential for advances in AI and ML to be leveraged for automated testing—an area that still requires significant manual effort. Tariq King and Jason Arbon introduce you to the world of AI for software testing. Learn the fundamentals behind autonomous and intelligent agents, ML approaches including Bayesian networks, decision tree learning, neural networks, and reinforcement learning. Discover how to apply these techniques to common testing tasks such as identifying testable features, generating test flows, and detecting erroneous states.
Machine Learning Experimentation at Sift Science (Sift Science)
Alex Paino, a Software Engineer at Sift Science, discusses how we use machine learning to prevent several types of abusive user behavior for thousands of customers. Measuring the accuracy of the thousands of classifiers used in a manner that correctly represents the value provided to customers is a huge challenge for us. Alex describes how we think about this problem and what we have done to address it. This includes an overview of the various tools and methodologies we employ that allow us to quickly summarize the results of an experiment, break ties in mixed result experiments, and drill into specific models and samples.
The importance of model fairness and interpretability in AI systems (Francesca Lazzeri, PhD)
Machine learning model fairness and interpretability are critical for data scientists, researchers and developers to explain their models and understand the value and accuracy of their findings. Interpretability is also important to debug machine learning models and make informed decisions about how to improve them.
In this session, Francesca will go over a few methods and tools that enable you to "unpack" machine learning models, gain insights into how and why they produce specific results, assess your AI systems' fairness, and mitigate any observed fairness issues.
Using open-source fairness and interpretability packages, attendees will learn how to:
- Explain model prediction by generating feature importance values for the entire model and/or individual data points.
- Achieve model interpretability on real-world datasets at scale, during training and inference.
- Use an interactive visualization dashboard to discover patterns in data and explanations at training time.
- Leverage additional interactive visualizations to assess which groups of users might be negatively impacted by a model and compare multiple models in terms of their fairness and performance.
Machine learning has become a must for improving insight, quality, and time to market. But it has also been called the 'high-interest credit card of technical debt', with challenges in managing both how it is applied and how its results are consumed.
Robust Filtering Schemes for Machine Learning Systems to Defend Adversarial A... (Kishor Datta Gupta)
This presentation discusses robust filtering schemes to defend machine learning systems against adversarial attacks. It outlines three main defense schemes: input filtering, output filtering, and an end-to-end protection scheme. The input filtering scheme uses a genetic algorithm to determine an optimal sequence of filters to detect adversarial examples. The output filtering scheme formulates the detection of adversarial inputs as an outlier detection problem. The end-to-end scheme integrates components for adversarial detection, filtering, and classification into a unified framework for protection. Experimental results show the proposed approaches can effectively detect various adversarial attack types while maintaining high classification accuracy.
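The output-filtering idea above, detecting adversarial inputs as outliers, can be illustrated with the simplest possible detector: a z-score test against statistics gathered on clean inputs. The scores and threshold below are invented; the presentation's actual scheme is more elaborate.

```python
import statistics

def fit_outlier_filter(clean_scores, k=3.0):
    # Flag any score more than k standard deviations from the mean
    # of scores observed on clean (non-adversarial) inputs.
    mu = statistics.mean(clean_scores)
    sigma = statistics.stdev(clean_scores)
    def is_adversarial(score):
        return abs(score - mu) > k * sigma
    return is_adversarial

# Hypothetical model confidences on known-clean inputs.
clean = [0.91, 0.88, 0.93, 0.90, 0.89, 0.92, 0.90, 0.91]
flag = fit_outlier_filter(clean)
# a wildly out-of-distribution confidence gets flagged; typical ones pass
```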
Federated Semi-Supervised Learning with Inter-Client Consistency & Disjoint L... (MLAI2)
While existing federated learning approaches mostly require that clients have fully-labeled data to train on, in realistic settings, data obtained at the client-side often comes without any accompanying labels. Such deficiency of labels may result from either high labeling cost, or difficulty of annotation due to the requirement of expert knowledge. Thus the private data at each client may be either partly labeled, or completely unlabeled with labeled data being available only at the server, which leads us to a new practical federated learning problem, namely Federated Semi-Supervised Learning (FSSL). In this work, we study two essential scenarios of FSSL based on the location of the labeled data. The first scenario considers a conventional case where clients have both labeled and unlabeled data (labels-at-client), and the second scenario considers a more challenging case, where the labeled data is only available at the server (labels-at-server). We then propose a novel method to tackle the problems, which we refer to as Federated Matching (FedMatch). FedMatch improves upon naive combinations of federated learning and semi-supervised learning approaches with a new inter-client consistency loss and decomposition of the parameters for disjoint learning on labeled and unlabeled data. Through extensive experimental validation of our method in the two different scenarios, we show that our method outperforms both local semi-supervised learning and baselines which naively combine federated learning with semi-supervised learning.
This document compares regression, support vector machines (SVM), and deep learning. It defines each as a supervised learning model that maps input data to output data using labeled training data. Regression uses weighted parameters and least squares error, SVM uses quadratic programming with constraints, and deep learning uses layered connection weights between nodes. The document provides recommendations for when each model is better suited, such as SVM for high-dimensional data, regression for small datasets or numerical prediction, and deep learning for tasks like image colorization or machine translation.
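The contrast between regression's least-squares objective and SVM's quadratic program is easiest to see in one dimension, where least squares has a closed form; a minimal sketch:

```python
def fit_line(xs, ys):
    # Ordinary least squares for y = a*x + b in one dimension:
    # a = cov(x, y) / var(x), b = mean(y) - a * mean(x).
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    var = sum((x - mx) ** 2 for x in xs)
    a = cov / var
    return a, my - a * mx

a, b = fit_line([1, 2, 3, 4], [3, 5, 7, 9])
# recovers y = 2x + 1 exactly: a == 2.0, b == 1.0
```

No iterative optimization or constraint solving is needed, which is part of why regression suits small datasets and numerical prediction.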
Spark + AI Summit - The Importance of Model Fairness and Interpretability in ... (Francesca Lazzeri, PhD)
Machine learning model fairness and interpretability are critical for data scientists, researchers, and developers to explain their models and understand the value and accuracy of their findings. Interpretability is also important to debug machine learning models and make informed decisions about how to improve them. In this session, Francesca will go over a few methods and tools that enable you to "unpack" machine learning models, gain insights into how and why they produce specific results, assess your AI systems' fairness, and mitigate any observed fairness issues.
Using open source fairness and interpretability packages, attendees will learn how to:
- Explain model prediction by generating feature importance values for the entire model and/or individual datapoints.
- Achieve model interpretability on real-world datasets at scale, during training and inference.
- Use an interactive visualization dashboard to discover patterns in data and explanations at training time.
- Leverage additional interactive visualizations to assess which groups of users might be negatively impacted by a model and compare multiple models in terms of their fairness and performance.
The document discusses explainability and bias in machine learning/AI models. It covers several topics:
1. Why explainability of models is important, including for laypeople using models and potential legal needs for explanations of decisions.
2. Methods for explainability including using interpretable models directly and post-hoc explainability methods like LIME and SHAP which provide feature attributions.
3. Issues with bias in machine learning models and different definitions of fairness. It also discusses techniques for measuring and mitigating bias, such as reweighting data or using adversarial learning.
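The feature-attribution idea behind post-hoc methods like LIME and SHAP can be illustrated with the simplest occlusion variant. This is not LIME or SHAP themselves, and the model below is a hypothetical linear scorer chosen so the attributions are easy to check by hand.

```python
def feature_attributions(model, x, baseline):
    # Occlusion-style attribution: replace one feature at a time with a
    # baseline value and record the change in the model's score.
    base_score = model(x)
    attributions = []
    for i in range(len(x)):
        occluded = x[:i] + [baseline[i]] + x[i + 1:]
        attributions.append(base_score - model(occluded))
    return attributions

# Hypothetical linear scorer: attribution recovers each weighted input.
model = lambda x: 2.0 * x[0] - 1.0 * x[1] + 0.5 * x[2]
attr = feature_attributions(model, [1.0, 1.0, 1.0], [0.0, 0.0, 0.0])
# attr == [2.0, -1.0, 0.5]
```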
The document discusses various machine learning concepts like model overfitting, underfitting, missing values, stratification, feature selection, and incremental model building. It also discusses techniques for dealing with overfitting and underfitting like adding regularization. Feature engineering techniques like feature selection and creation are important preprocessing steps. Evaluation metrics like precision, recall, F1 score and NDCG are discussed for classification and ranking problems. The document emphasizes the importance of feature engineering and proper model evaluation.
Introduction to MaxDiff Scaling of Importance - Parametric Marketing SlidesQuestionPro
This document provides an overview of MaxDiff, a technique for evaluating preferences that asks respondents to choose best and worst options from sets. It notes limitations of traditional rating scales like scale bias and lack of constraints. MaxDiff forces trade-offs and provides richer data than ratings. Questions present lists and ask for most/least important. Results can be analyzed simply by counting choices or more advanced techniques can provide respondent-level utilities. The document provides examples and tips for effective MaxDiff surveys.
This document discusses key concepts in sampling design and procedures. It covers reasons for sampling such as pragmatic and cost reasons. It also discusses defining the target population and sampling frame. The document contrasts probability sampling techniques like simple random sampling, systematic sampling, and stratified sampling with non-probability techniques like convenience sampling and snowball sampling. It discusses factors to consider in determining sample size such as variance, desired confidence level and interval. Sample size formulas for estimating means and proportions are also provided.
Adversarial Attacks on A.I. Systems — NextCon, Jan 2019anant90
Machine Learning is itself just another tool, susceptible to adversarial attacks. These can have huge implications, especially in a world with self-driving cars and other automation. In this talk, we will look at recent developments in the world of adversarial attacks on the A.I. systems, and how far we have come in mitigating these attacks.
Connections b/w active learning and model extractionAnmol Dwivedi
Codes on https://github.com/anmold-07/Model-Extraction-with-RL
https://www.usenix.org/conference/usenixsecurity20/presentation/chandrasekaran
This paper formalizes model extraction and discusses possible defense strategies by drawing parallels between model extraction and an established area of active learning. In particular, the authors show that recent advancements in the active learning domain can be used to implement powerful model extraction attacks and investigate possible defense strategies.
Correlation, causation and incrementally recommendation problems at netflix ...Roelof van Zwol
Within Netflix, personalization is a key differentiator, helping members to quickly discover new content that matches their taste. Done well, it creates an immersive user experience, however when the recommendation is out of tune, it is immediately noticed by our members. During this presentation I will cover some of the personalization and recommendation tasks that jointly define the Netflix user experience that entertains more that 130M members world wide. In particular, I will focus on several of the algorithmic challenges related to the launch of new Netflix originals in the service, and go over concepts such as causality, incrementality and explore-exploit strategies.
The research presented in this talk represents the collaborative efforts of a team of research scientists and engineers at Netflix on our journey to create best in class user experiences.
- Machine learning models are negatively impacted by noisy or inconsistent labels in training data. This is a challenge for tasks like bug severity classification where labels can be subjective.
- A new evaluation metric called Krippendorff's alpha is proposed to measure agreement between labels while accounting for inconsistencies. It is shown to better reflect performance than accuracy when labels are inconsistent.
- Making "big data thick" by improving quality is an important future direction, but challenging at scale. Lightweight methods are needed to reduce noise without extensive manual labelling. Performance measures also need to account for noise inherent in some real-world problems.
Dr Murari Mandal from NUS presented as part of 3 days OpenPOWER Industry summit about Robustness in Deep learning where he talked about AI Breakthroughs , Performance improments in AI models , Adversarial attacks , Attacks on semantic segmentation , Attacs on object detector , Defending Against adversarial attacks and many other areas.
AI and ML Skills for the Testing World TutorialTariq King
Software continues to revolutionize the world, impacting nearly every aspect of our work, family, and personal life. Artificial intelligence (AI) and machine learning (ML) are playing key roles in this revolution through improvements in search results, recommendations, forecasts, and other predictions. AI and ML technologies are being used in platforms for digital assistants, home entertainment, medical diagnosis, customer support, and autonomous vehicles. Testing practitioners are recognizing the potential for advances in AI and ML to be leveraged for automated testing—an area that still requires significant manual effort. Tariq King and Jason Arbon introduce you to the world of AI for software testing. Learn the fundamentals behind autonomous and intelligent agents, ML approaches including Bayesian networks, decision tree learning, neural networks, and reinforcement learning. Discover how to apply these techniques to common testing tasks such as identifying testable features, generating test flows, and detecting erroneous states.
Machine Learning Experimentation at Sift ScienceSift Science
Alex Paino, a Software Engineer at Sift Science, discusses how we use machine learning to prevent several types of abusive user behavior for thousands of customers. Measuring the accuracy of the thousands of classifiers used in a manner that correctly represents the value provided to customers is a huge challenge for us. Alex describes how we think about this problem and what we have done to address it. This includes an overview of the various tools and methodologies we employ that allow us to quickly summarize the results of an experiment, break ties in mixed result experiments, and drill into specific models and samples.
The importance of model fairness and interpretability in AI systemsFrancesca Lazzeri, PhD
Machine learning model fairness and interpretability are critical for data scientists, researchers and developers to explain their models and understand the value and accuracy of their findings. Interpretability is also important to debug machine learning models and make informed decisions about how to improve them.
In this session, Francesca will go over a few methods and tools that enable you to "unpack” machine learning models, gain insights into how and why they produce specific results, assess your AI systems fairness and mitigate any observed fairness issues.
Using open-source fairness and interpretability packages, attendees will learn how to:
- Explain model prediction by generating feature importance values for the entire model and/or individual data points.
- Achieve model interpretability on real-world datasets at scale, during training and inference.
- Use an interactive visualization dashboard to discover patterns in data and explanations at training time.
- Leverage additional interactive visualizations to assess which groups of users might be negatively impacted by a model and compare multiple models in terms of their fairness and performance.
Machine Learning has become a must to improve insight, quality and time to market. But it's also been called the 'high interest credit card of technical debt' with challenges in managing both how it's applied and how its results are consumed.
Robust Filtering Schemes for Machine Learning Systems to Defend Adversarial A...Kishor Datta Gupta
This presentation discusses robust filtering schemes to defend machine learning systems against adversarial attacks. It outlines three main defense schemes: input filtering, output filtering, and an end-to-end protection scheme. The input filtering scheme uses a genetic algorithm to determine an optimal sequence of filters to detect adversarial examples. The output filtering scheme formulates the detection of adversarial inputs as an outlier detection problem. The end-to-end scheme integrates components for adversarial detection, filtering, and classification into a unified framework for protection. Experimental results show the proposed approaches can effectively detect various adversarial attack types while maintaining high classification accuracy.
Federated Semi-Supervised Learning with Inter-Client Consistency & Disjoint L...MLAI2
While existing federated learning approaches mostly require that clients have fully-labeled data to train on, in realistic settings, data obtained at the client-side often comes without any accompanying labels. Such deficiency of labels may result from either high labeling cost, or difficulty of annotation due to the requirement of expert knowledge. Thus the private data at each client may be either partly labeled, or completely unlabeled with labeled data being available only at the server, which leads us to a new practical federated learning problem, namely Federated Semi-Supervised Learning (FSSL). In this work, we study two essential scenarios of FSSL based on the location of the labeled data. The first scenario considers a conventional case where clients have both labeled and unlabeled data (labels-at-client), and the second scenario considers a more challenging case, where the labeled data is only available at the server (labels-at-server). We then propose a novel method to tackle the problems, which we refer to as Federated Matching (FedMatch). FedMatch improves upon naive combinations of federated learning and semi-supervised learning approaches with a new inter-client consistency loss and decomposition of the parameters for disjoint learning on labeled and unlabeled data. Through extensive experimental validation of our method in the two different scenarios, we show that our method outperforms both local semi-supervised learning and baselines which naively combine federated learning with semi-supervised learning.
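As a rough illustration of why inter-client consistency helps, the sketch below accepts a pseudo-label for an unlabeled point only when helper clients' models agree with a confident local prediction. This is a crude NumPy stand-in, not the FedMatch algorithm (which uses a consistency loss and parameter decomposition); the function name and probabilities are made up:

```python
import numpy as np

def agreement_pseudo_labels(prob_local, prob_helpers, conf=0.8):
    """Keep a pseudo-label only when the local model is confident AND every
    helper client's model predicts the same class (consistency filter)."""
    local_label = prob_local.argmax(axis=1)
    confident = prob_local.max(axis=1) >= conf
    agree = np.ones(len(local_label), dtype=bool)
    for p in prob_helpers:
        agree &= (p.argmax(axis=1) == local_label)
    mask = confident & agree               # which pseudo-labels to train on
    return local_label, mask

# Two unlabeled points: the helpers agree on the first, disagree on the second.
p_local = np.array([[0.9, 0.1], [0.85, 0.15]])
helpers = [np.array([[0.8, 0.2], [0.3, 0.7]]),
           np.array([[0.7, 0.3], [0.9, 0.1]])]
labels, mask = agreement_pseudo_labels(p_local, helpers)
# Only the first point keeps its pseudo-label.
```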
This document compares regression, support vector machines (SVM), and deep learning. It defines each as a supervised learning model that maps input data to output data using labeled training data. Regression uses weighted parameters and least squares error, SVM uses quadratic programming with constraints, and deep learning uses layered connection weights between nodes. The document provides recommendations for when each model is better suited, such as SVM for high-dimensional data, regression for small datasets or numerical prediction, and deep learning for tasks like image colorization or machine translation.
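The regression description — weighted parameters fit by least-squares error — can be made concrete. A minimal ordinary-least-squares fit via the normal equations, on made-up data:

```python
import numpy as np

def fit_least_squares(X, y):
    """Ordinary least squares via the normal equations: w = (X^T X)^-1 X^T y."""
    Xb = np.hstack([X, np.ones((X.shape[0], 1))])   # append a bias column
    w = np.linalg.solve(Xb.T @ Xb, Xb.T @ y)
    return w[:-1], w[-1]

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 2))
y = 3.0 * X[:, 0] - 2.0 * X[:, 1] + 0.5 + 0.01 * rng.normal(size=200)
w, b = fit_least_squares(X, y)
# Recovers roughly w = [3, -2], b = 0.5
```

This closed form is what makes regression attractive for small numerical-prediction problems, as the comparison notes; SVMs and deep networks trade it for quadratic programming and gradient descent respectively.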
Spark + AI Summit - The Importance of Model Fairness and Interpretability in ...Francesca Lazzeri, PhD
The document discusses explainability and bias in machine learning/AI models. It covers several topics:
1. Why model explainability is important, both for laypeople who use models and for potential legal requirements to explain decisions.
2. Methods for explainability including using interpretable models directly and post-hoc explainability methods like LIME and SHAP which provide feature attributions.
3. Issues with bias in machine learning models and different definitions of fairness. It also discusses techniques for measuring and mitigating bias, such as reweighting data or using adversarial learning.
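One of the mitigation techniques mentioned, reweighting the data, can be sketched as Kamiran-Calders-style reweighing: each (group, label) cell is weighted so that group membership and label become statistically independent under the weighted distribution. The function and toy data below are illustrative, not from the document:

```python
import numpy as np

def reweighing_weights(group, label):
    """Weight for each sample in cell (g, l): P(g) * P(l) / P(g, l).
    Under these weights the group attribute is independent of the label."""
    n = len(group)
    w = np.empty(n)
    for g in np.unique(group):
        for l in np.unique(label):
            cell = (group == g) & (label == l)
            p_cell = cell.mean()
            if p_cell > 0:
                w[cell] = (group == g).mean() * (label == l).mean() / p_cell
    return w

# Group 0 is favoured (3 of 4 positive); group 1 is not (1 of 4 positive).
group = np.array([0, 0, 0, 0, 1, 1, 1, 1])
label = np.array([1, 1, 1, 0, 1, 0, 0, 0])
w = reweighing_weights(group, label)

rates = []                       # weighted positive rate per group
for g in (0, 1):
    sel = group == g
    rates.append(np.sum(w[sel] * label[sel]) / np.sum(w[sel]))
# Both weighted positive rates come out equal.
```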
1. From Adversarial Learning to Robust and Scalable Learning
Ph.D. Presentation
Han Xiao (I20), xiaoh@in.tum.de
Advisor: Prof. Dr. Claudia Eckert
2. Introduction Adversarial Learning Robust Learning Scalable Learning
Motivation
Machine learning algorithms in real-world applications are vulnerable to adversaries.
• Spam filtering: a spammer may disguise the spam by adding images and "good words" to cheat the filter (explorative attack).
• Recommendation systems: spam users may give false ratings on tail items, leading to a biased recommendation system (causative attack).
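To make the explorative attack concrete: an attacker who can only query a deployed classifier can locate its decision boundary by binary search, then submit spam modified just enough to cross to the benign side. A minimal sketch, with a hypothetical linear filter standing in for a real one (not the thesis's convex-inducing-classifier construction):

```python
import numpy as np

def find_boundary(classify, x_neg, x_pos, tol=1e-6):
    """Query-only attack: binary-search the segment between a known benign
    point (classified 0) and a known spam point (classified 1) to locate
    the decision boundary."""
    lo, hi = x_neg, x_pos
    while np.linalg.norm(hi - lo) > tol:
        mid = (lo + hi) / 2
        if classify(mid) == 1:
            hi = mid
        else:
            lo = mid
    return lo                       # just on the benign side of the boundary

# Hidden linear filter: flags x as spam when x[0] + x[1] > 1.
classify = lambda x: int(x[0] + x[1] > 1.0)
evading = find_boundary(classify, np.array([0.0, 0.0]), np.array([2.0, 2.0]))
# evading sits near (0.5, 0.5): classified benign, arbitrarily close to spam.
```

Each query halves the search interval, so only a few dozen queries suffice — which is exactly why query access alone already constitutes a practical threat.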
4. Why shall we care?
"Know your enemies and yourself, you will not be imperiled in a hundred battles."
Traditional machine learning and data mining rarely focus on adversarial settings, yet accounting for adversaries enables:
• Robust anti-virus software
• High-quality recommendation systems
• Spam-free social network services
• Cost-effective crowdsourcing systems
5. Related work and research idea
Each of four related areas rests on an assumption that breaks down in an adversarial setting:
• Multi-labeler learning assumes data are labeled by multiple labelers; here, some labelers are adversaries.
• Semi-supervised learning assumes data are partially labeled; here, even those limited labels cannot be fully trusted.
• Active learning assumes an oracle provides labels; here, the oracle can provide wrong labels.
• Outlier detection assumes noisy data points do not fit the distribution; here, the noise does not follow any predefined distribution.
6. Roadmap of my dissertation
• Adversarial learning: how can adversaries exploit the vulnerabilities of learning algorithms? Contributions: showed that convex-inducing classifiers are vulnerable to explorative attacks, and that SVMs are vulnerable to causative label-flip attacks.
• Robust learning: how to learn from unfaithful training data? Contribution: developed a hierarchical Gaussian process model and a graph-based model for multi-labeler learning.
• Robust and scalable learning: are current algorithms fast enough for online learning, and how to learn from a noisy data stream for real-time applications? Contributions: developed an approximate Gaussian process for online regression, and an online algorithm that learns from partially labeled data in a client-server setting.
18. Learning from multiple yet unreliable labelers
• Each instance is labeled by several labelers.
• A labeler can be genuine or an adversary.
• The ground-truth label is unknown.
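The core difficulty on this slide — estimating an unknown ground truth while downweighting adversarial labelers — can be sketched with a simple iterative weighted vote. This is a crude stand-in for the thesis's probabilistic models, and the labeler data are made up:

```python
import numpy as np

def weighted_vote(labels, n_iter=10):
    """labels: (n_labelers, n_instances) in {0, 1}.  Alternate between
    estimating each labeler's reliability (agreement with the consensus)
    and recomputing a reliability-weighted consensus."""
    consensus = (labels.mean(axis=0) > 0.5).astype(int)   # majority-vote init
    for _ in range(n_iter):
        reliability = (labels == consensus).mean(axis=1)
        w = np.clip(reliability - 0.5, 1e-3, None)        # adversaries get ~zero weight
        score = w @ labels / w.sum()
        consensus = (score > 0.5).astype(int)
    return consensus, reliability

truth = np.array([1, 0, 1, 1, 0, 0, 1, 0, 1, 0])
good1, good3 = truth.copy(), truth.copy()
good2 = truth.copy(); good2[0] = 1 - good2[0]    # one honest mistake
adversary = 1 - truth                            # flips every label
labels = np.vstack([good1, good2, good3, adversary])
consensus, reliability = weighted_vote(labels)
# The consensus recovers the truth and the adversary's reliability drops to ~0.
```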
19. Latent space model for connecting the input space and the label space
20. Gaussian process for modeling the joint probability
• A latent-space GP model and a labeler GP model, combined via maximum a posteriori estimation.
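The slide's specific models are not reproduced here, but standard Gaussian-process regression, on which they build, has a closed-form posterior mean. A minimal sketch with an RBF kernel (the lengthscale, noise level, and data are arbitrary):

```python
import numpy as np

def rbf(X1, X2, lengthscale=1.0):
    """RBF (squared-exponential) kernel matrix between two point sets."""
    d2 = ((X1[:, None, :] - X2[None, :, :]) ** 2).sum(-1)
    return np.exp(-0.5 * d2 / lengthscale ** 2)

def gp_posterior_mean(X, y, X_star, noise=0.1):
    """Standard GP regression mean: K_* (K + sigma^2 I)^-1 y."""
    K = rbf(X, X) + noise ** 2 * np.eye(len(X))
    K_star = rbf(X_star, X)
    return K_star @ np.linalg.solve(K, y)

X = np.linspace(0, 2 * np.pi, 30)[:, None]
y = np.sin(X[:, 0])
pred = gp_posterior_mean(X, y, np.array([[np.pi / 2]]))
# pred is close to sin(pi/2) = 1
```

The thesis's MAP estimation extends this by also inferring the latent "true" function behind multiple noisy labelers rather than fitting a single observed signal.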
21. Synthetic examples: recovering the ground truth from the responses of four observers
22. Synthetic examples: recovering the ground truth from the responses of four observers (continued)
23. A graph-based approach for the multi-labeler problem
Problem setting:
• Not all instances are labeled.
• A labeler labels only a subset of the instances.
• Some labelers are adversaries.
Goal:
• Compute the label and the uncertainty of each instance.
• Compute the confidence of each labeler.
Idea (joint smoothness on graphs):
• Instances that are similar in the item feature space should have similar labels.
• Labelers that are similar in the labeler feature space should have similar confidence.
24. Joint smoothness on the labeler graph and the instance graph
(Figure: a labeler similarity graph and an item similarity graph.)
• Instances that are close together should have similar predicted labels, unless their uncertainties are large.
• Predicted labels should be close to their assigned labels, unless the instance is uncertain or the corresponding labelers are not confident.
• Labelers that are close together should have similar confidence.
• The uncertainty of an instance/labeler should not be too large or too close to zero.
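A generic version of such a smoothness objective minimizes the graph Laplacian quadratic form plus a fit term on labeled nodes. The sketch below omits the per-instance uncertainties and labeler confidences from the slide; `mu` and the toy chain graph are illustrative:

```python
import numpy as np

def smooth_labels(W, y, labeled_mask, mu=1.0):
    """Minimise  f^T L f + mu * ||f - y||^2 over labeled nodes,
    where L = D - W is the graph Laplacian.  Closed form:
    f = (L + mu*M)^{-1} (mu * M y), with M = diag(labeled_mask)."""
    L = np.diag(W.sum(axis=1)) - W
    M = np.diag(labeled_mask.astype(float))
    return np.linalg.solve(L + mu * M, mu * M @ y)

# Chain graph 0-1-2-3-4 with labels only at the two ends.
W = np.zeros((5, 5))
for i in range(4):
    W[i, i + 1] = W[i + 1, i] = 1.0
y = np.array([1.0, 0.0, 0.0, 0.0, -1.0])
mask = np.array([True, False, False, False, True])
f = smooth_labels(W, y, mask, mu=10.0)
# f decreases smoothly from about +1 to about -1 along the chain.
```

The slide's model can be read as two coupled instances of this objective: one over the instance graph for labels, one over the labeler graph for confidences.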
30. Proposed method achieves better performance in less time
(Figure: accuracy, as root mean square error, and efficiency, as training and prediction time.)
31. Scalable robust learning in client-server settings
Goal: learn a good model under limited bandwidth, with the unlabeled data held by clients.
• Homogeneous clients: which instance should I query?
• Heterogeneous clients: which instance should I query, and whom should I ask for labeling?
32. Subset selection under a given budget
(Homogeneous) The client uploads only crucial data according to the selection policy.
Key steps: unlabeled data → candidate pool → selection policy → upload selections → two-learner model on the server → update selection policy.
Purpose:
• Select a small set of data from the candidate pool for uploading.
Requirements:
• The uploaded data should improve classification performance on the server.
• The selection procedure should be lightweight for the client.
• The selection policy should be lightweight for the network.
Method:
• Select by optimizing a function of two criteria: the utility of the instance (w.r.t. SCW) and its redundancy w.r.t. the candidate pool.
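A generic sketch of the utility-minus-redundancy idea (not the thesis's actual criterion, which measures utility w.r.t. SCW): greedily pick the instance whose utility, minus a penalty for similarity to already-selected points, is highest. The similarity kernel, utilities, and data are made up:

```python
import numpy as np

def select_subset(X, utility, k, lam=1.0):
    """Greedy selection: repeatedly pick the instance with the best
    utility minus lam * (similarity to the closest already-selected point)."""
    selected = []
    sim = lambda a, b: np.exp(-np.sum((a - b) ** 2))   # RBF-style similarity
    for _ in range(k):
        best, best_score = None, -np.inf
        for i in range(len(X)):
            if i in selected:
                continue
            redundancy = max((sim(X[i], X[j]) for j in selected), default=0.0)
            score = utility[i] - lam * redundancy
            if score > best_score:
                best, best_score = i, score
        selected.append(best)
    return selected

# Two tight clusters.  Utility alone would pick both points of cluster A,
# but the redundancy term pushes the second pick to cluster B.
X = np.array([[0.0, 0.0], [0.1, 0.0], [5.0, 5.0], [5.1, 5.0]])
utility = np.array([1.0, 0.9, 0.8, 0.7])
picked = select_subset(X, utility, k=2)
# picked == [0, 2]
```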
33. Server employs a two-learner model to learn from the client's unlabeled data
Purpose:
• Incrementally learn a binary classifier from unlabeled data.
Requirements:
• Leverage neighbor information to exploit the unlabeled data.
• Learn in an online fashion.
• Be efficient enough to handle large volumes of data.
• Be easily parameterized as a selection policy.
Method:
• A two-learner structure combining a harmonic solution (HS) and soft confidence-weighted (SCW) learning.
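Of the two learners, the harmonic solution has a standard closed form (in the style of Zhu et al.'s harmonic function for semi-supervised learning): labeled nodes are clamped, and each unlabeled node takes the weighted average of its neighbors. A minimal batch sketch on a toy chain graph — the thesis's version is online, which this does not show:

```python
import numpy as np

def harmonic_solution(W, y_l, labeled):
    """Clamp labeled nodes to y_l and solve for the unlabeled nodes:
    f_u = (D_uu - W_uu)^{-1} W_ul y_l, i.e. each unlabeled node equals
    the weighted average of its neighbours."""
    labeled = np.asarray(labeled)
    unlabeled = np.setdiff1d(np.arange(W.shape[0]), labeled)
    L = np.diag(W.sum(axis=1)) - W                     # graph Laplacian
    f_u = np.linalg.solve(L[np.ix_(unlabeled, unlabeled)],
                          W[np.ix_(unlabeled, labeled)] @ y_l)
    f = np.empty(W.shape[0])
    f[labeled], f[unlabeled] = y_l, f_u
    return f

# Chain 0-1-2-3 with node 0 labeled +1 and node 3 labeled -1.
W = np.zeros((4, 4))
for i in range(3):
    W[i, i + 1] = W[i + 1, i] = 1.0
f = harmonic_solution(W, np.array([1.0, -1.0]), labeled=[0, 3])
# f = [1, 1/3, -1/3, -1]
```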
34. Proposed selection strategy reduces communication cost and gives high accuracy

  Selection policy   Labeling rate      Sampling rate            Accuracy
  on client          (human effort)     (communication cost)     (avg. over 10 data sets)
  Full               100%               20%                      92.16%
  All                2%                 100%                     86.32%
  Rand               2%                 20%                      86.38%
  Proposed           2%                 20%                      87.08%
35. Heterogeneous clients: ask the most confident client to label the most uncertain instance
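The routing rule on this slide reduces to two argmax operations; the confidence and uncertainty scores below are made up for illustration:

```python
import numpy as np

def pick_query(client_confidence, instance_uncertainty):
    """Route the most uncertain instance to the most confident client."""
    return int(np.argmax(client_confidence)), int(np.argmax(instance_uncertainty))

conf = np.array([0.6, 0.9, 0.7])        # per-client confidence estimates
unc = np.array([0.1, 0.8, 0.3, 0.5])    # per-instance uncertainty estimates
client, instance = pick_query(conf, unc)
# client 1 is asked to label instance 1
```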
36. From adversarial learning to robust and scalable learning
(Recap of the dissertation roadmap from slide 6: the topics, problems, and contributions in adversarial learning, robust learning, and robust and scalable learning.)
37. Conclusion
• Traditional machine learning algorithms are vulnerable to attacks.
• Though the labelers may include adversaries, robust learning can still be achieved.
• Multi-labeler learning (crowdsourcing) could find more and more applications in the next couple of years.