SlideShare a Scribd company logo
1 of 26
Alleviating Privacy Attacks via
Causal Learning
Shruti Tople, Amit Sharma, Aditya V. Nori
Microsoft Research
https://arxiv.org/abs/1909.12732
https://github.com/microsoft/robustdg
Motivation: ML models leak information
about data points in the training set
Neural
Network
TrainingHealth Records
(HIV/AIDS
patients)
ML-as-a-service
Member of
Train Dataset
Non-member
Membership Inference Attacks
[SPโ€™17][CSFโ€™18][NDSSโ€™19][SPโ€™19]
The likely reason is overfitting
Output
85%
Output
95%
Overfitting to
dataset
โ€ข Neural networks or associational models
overfit to the training dataset
โ€ข Membership inference adversary exploits
differences in prediction score for training and
test data [CSFโ€™18]
Overfitting to
distribution
The likely reason is overfitting
โ€ข Neural networks or associational models
overfit to the training dataset
โ€ข Membership inference attacks exploit
differences in prediction score for training and
test data [CSFโ€™18]
โ€ข Privacy risk can increase when model is
deployed to different distributions
โ€ข E.g., Hospital in one region shares the model to
other regions
Output
85%
Output
95%
Overfitting to
dataset
Output
75%
Poor generalization across distributions exacerbates
membership inference risk.
Can causal ML
models help?
Can causal ML models help?
Contributions
1. Causal models provide stronger (differential) privacy guarantees than
associational models.
โ€ข Due to their better generalizability on new distributions.
2. And hence are more robust to membership inference attacks.
โ€ข As the training dataset size โ†’ โˆž, membership inference attackโ€™s accuracy drops to a
random guess.
3. We empirically demonstrate privacy benefits of causal models across 5 datasets.
โ€ข Associational models exhibit up to 80% attack accuracy whereas causal models exhibit
attack accuracy close to 50%.
Causal
Learning
Privacy
Disease
Severity
Background: Causal Learning
๐’€
Blood
Pressure
Heart
Rate
๐‘ฟ ๐’‘๐’‚๐’“๐’†๐’๐’• ๐‘ฟ ๐’‘๐’‚๐’“๐’†๐’๐’•
๐‘ฟ ๐Ÿ ๐‘ฟ ๐Ÿ
Weight Age
Use a structural causal model (SCM) that defines what
conditional probabilities are invariant across different
distributions [Pearlโ€™09].
Background: Causal Learning
Use a structural causal model (SCM) that defines what
conditional probabilities are invariant across different
distributions [Pearlโ€™09].
Causal Predictive Model: A prediction model based only
on the parents of the outcome Y.
What if SCM is not known? Learn an invariant feature
representation across distributions [ABGDโ€™19, MTSโ€™20].
For ML models, causal learning can be useful for
fairness [KLRSโ€™17]
explainability [DSZโ€™16, MTSโ€™19]
privacy [this work]
Disease
Severity
๐’€
Blood
Pressure
Heart
Rate
๐‘ฟ ๐’‘๐’‚๐’“๐’†๐’๐’• ๐‘ฟ ๐’‘๐’‚๐’“๐’†๐’๐’•
๐‘ฟ ๐Ÿ ๐‘ฟ ๐Ÿ
Weight Age
๐’€
๐‘‹๐‘†0 ๐‘‹ ๐‘ƒ๐ด
๐‘‹๐‘†2
๐‘‹๐‘†1
๐‘‹ ๐ถ๐ป
๐‘‹๐‘๐‘
Intervention
Why is a model based on causal parents
invariant across data distributions?
Why is a model based on causal parents
invariant across data distributions?
๐’€
๐‘‹๐‘†0 ๐‘‹ ๐‘ƒ๐ด
๐‘‹๐‘†2
๐‘‹๐‘†1
๐‘‹ ๐ถ๐ป
๐‘‹๐‘๐‘
Intervention
๐’€
๐‘‹๐‘†0 ๐‘‹ ๐‘ƒ๐ด
๐‘‹๐‘†2
๐‘‹๐‘†1
๐‘‹ ๐ถ๐ป
๐‘‹๐‘๐‘
๐‘ƒ(๐‘Œ|๐‘‹ ๐‘ƒ๐ด) is invariant across different distributions, unless there is a
change in true data-generating process for Y.
Result 1: Worst-case out-of-distribution error of a
causal model is lower than an associational model.
For any model โ„Ž, and ๐‘ƒโˆ— such that ๐‘ƒโˆ— ๐‘Œ ๐‘‹ ๐‘ƒ๐ด = ๐‘ƒ(๐‘Œ|๐‘‹ ๐‘ƒ๐ด),
In-Distribution Error (IDE)= ๐ˆ๐ƒ๐„ ๐ ๐’‰, ๐’š = ๐‹ ๐‘ท ๐’‰, ๐’š โˆ’ ๐‹ ๐‘บโˆผP(๐’‰, ๐’š)
Expected loss on the same distribution as the train data
Out-of-Distribution Error (ODE)=๐Ž๐ƒ๐„ ๐,๐โˆ— ๐’‰, ๐’š = ๐‹ ๐‘ทโˆ— ๐’‰, ๐’š โˆ’ ๐‹ ๐‘บโˆผP ๐’‰, ๐’š
Expected loss on a different distribution ๐‘ƒโˆ—
than the train data
Result 1: Worst-case out-of-distribution error of a
causal model is lower than an associational model.
For any model โ„Ž, and ๐‘ƒโˆ— such that ๐‘ƒโˆ— ๐‘Œ ๐‘‹ ๐‘ƒ๐ด = ๐‘ƒ(๐‘Œ|๐‘‹ ๐‘ƒ๐ด),
In-Distribution Error (IDE)= ๐ˆ๐ƒ๐„ ๐ ๐’‰, ๐’š = ๐‹ ๐‘ท ๐’‰, ๐’š โˆ’ ๐‹ ๐‘บโˆผP(๐’‰, ๐’š)
Expected loss on the same distribution as the train data
Out-of-Distribution Error (ODE)=๐Ž๐ƒ๐„ ๐,๐โˆ— ๐’‰, ๐’š = ๐‹ ๐‘ทโˆ— ๐’‰, ๐’š โˆ’ ๐‹ ๐‘บโˆผP ๐’‰, ๐’š
Expected loss on a different distribution ๐‘ƒโˆ—
than the train data
Proof Idea. Simple case: Assume ๐‘ฆ = ๐‘“(๐’™) is deterministic.
๐Ž๐ƒ๐„ ๐,๐โˆ— ๐’‰ ๐œ, ๐’š โ‰ค ๐ˆ๐ƒ๐„ ๐(๐’‰ ๐’„, ๐’š) + ๐’…๐’Š๐’”๐’„ ๐‹ ๐‘ท, ๐‘ทโˆ—
Discrepancy
b/w ๐‘ท and ๐‘ทโˆ—
distributions
Causal Model
Result 1: Worst-case out-of-distribution error of a
causal model is lower than an associational model.
For any model โ„Ž, and ๐‘ƒโˆ— such that ๐‘ƒโˆ— ๐‘Œ ๐‘‹ ๐‘ƒ๐ด = ๐‘ƒ(๐‘Œ|๐‘‹ ๐‘ƒ๐ด),
In-Distribution Error (IDE)= ๐ˆ๐ƒ๐„ ๐ ๐’‰, ๐’š = ๐‹ ๐‘ท ๐’‰, ๐’š โˆ’ ๐‹ ๐‘บโˆผP(๐’‰, ๐’š)
Expected loss on the same distribution as the train data
Out-of-Distribution Error (ODE)=๐Ž๐ƒ๐„ ๐,๐โˆ— ๐’‰, ๐’š = ๐‹ ๐‘ทโˆ— ๐’‰, ๐’š โˆ’ ๐‹ ๐‘บโˆผP ๐’‰, ๐’š
Expected loss on a different distribution ๐‘ƒโˆ—
than the train data
Proof Idea. Simple case: Assume ๐‘ฆ = ๐‘“(๐’™) is deterministic.
๐Ž๐ƒ๐„ ๐,๐โˆ— ๐’‰ ๐œ, ๐’š โ‰ค ๐ˆ๐ƒ๐„ ๐(๐’‰ ๐’„, ๐’š) + ๐’…๐’Š๐’”๐’„ ๐‹ ๐‘ท, ๐‘ทโˆ—
๐Ž๐ƒ๐„ ๐,๐โˆ— ๐’‰ ๐’‚, ๐’š โ‰ค ๐ˆ๐ƒ๐„ ๐ ๐’‰ ๐’‚, ๐’š + ๐’…๐’Š๐’”๐’„ ๐‹ ๐‘ท, ๐‘ทโˆ—
+ ๐‹ ๐‘ทโˆ—(๐’‰ ๐’‚,๐‘ท
๐‘ถ๐‘ท๐‘ป
, ๐’š)
โ‡’ max
๐โˆ—
๐Ž๐ƒ๐„๐๐จ๐ฎ๐ง๐ ๐,๐โˆ— ๐’‰ ๐œ, ๐’š โ‰ค max
๐โˆ—
๐Ž๐ƒ๐„๐๐จ๐ฎ๐ง๐ ๐,๐โˆ— ๐’‰ ๐’‚, ๐’š
Discrepancy
b/w ๐‘ท and ๐‘ทโˆ—
distributions
Optimal ๐’‰ ๐’‚ on P is
not optimal on ๐‘ทโˆ—
Causal Model
Assoc. Model
Result 1: Worst-case out-of-distribution error of a
causal model is lower than an associational model.
And better generalization results in lower
sensitivity for a causal model
Sensitivity: If a single data point ๐’™, ๐‘ฆ โˆผ ๐‘ƒโˆ— is added to the train
dataset ๐‘† to create ๐‘†โ€ฒ, how much does the learnt model h ๐‘†
min
change?
Since the optimal causal model is the same across all ๐‘ƒโˆ—
, adding
any ๐’™, ๐‘ฆ โˆผ ๐‘ƒโˆ— has less impact on a trained causal model.
Sensitivity for a causal
model
Sensitivity for an
associational model
Main Result: A causal model has stronger
Differential Privacy guarantees
Let M be a mechanism that returns a ML model trained over dataset ๐‘†, M(๐‘†) = โ„Ž.
Differential Privacy [DRโ€™14]: A learning mechanism M satisfies ๐œ–-differential
privacy if for any two datasets, ๐‘†, ๐‘†โ€ฒ that differ in one data point,
Pr(M ๐‘† โˆˆ๐ป)
Pr(M ๐‘†โ€ฒ โˆˆ๐ป)
โ‰ค ๐‘’ ๐œ–.
(Smaller ๐œ– values provide better privacy guarantees)
Since lower sensitivity โ‡’ lower ๐œ–,
Theorem: When equivalent Laplace noise is added and models are trained on same
dataset, causal mechanism MC provides ๐œ– ๐ถ-DP and associational mechanism MA
provides ๐œ– ๐ด-DP guarantees such that:
๐ ๐’„ โ‰ค ๐ ๐‘จ
Therefore, causal models are more robust to
membership inference (MI) attacks
Advantage of an MI adversary:
(True Positive Rate โ€“ False Positive Rate)
in detecting whether ๐‘ฅ is from training dataset or not.
[From Yeom et al. CSFโ€™18] Membership advantage of an adversary is bounded by
๐‘’ ๐œ–
โˆ’ 1.
Since the optimal causal models are the same for ๐‘ƒ and ๐‘ƒโˆ—,
As ๐‘› โ†’ โˆž, membership advantage of causal model โ†’ 0.
Theorem: When trained on the same dataset of size ๐‘›, membership
advantage of a causal model is lower than the membership advantage for an
associational model.
Empirical
Evaluation
Goal: Compare MI attack accuracy between
causal and associational models
[BN] When true causal structure is known
Datasets generated from Bayesian networks: Child, Sachs, Water, Alarm
Causal model: MLE estimation based on Yโ€™s parents
Associational model: Neural networks with 3 linear layers
๐‘ƒโˆ—: Noise added to conditional probabilities (uniform or additive)
[MNIST] When true causal structure is unknown
Colored MNIST dataset (Digits are correlated with color)
Causal Model: Invariant Risk Minimization that utilizes ๐‘ƒ ๐‘Œ ๐‘‹ ๐‘ƒ๐ด is same across distributions [ABGDโ€™19]
Associational Model: Empirical Risk Minimization using the same NN architecture
๐‘ƒโˆ—: Different correlations between color and digit than the train dataset
Attacker Model: Predict whether an input belongs to train dataset or not
[BN] With uniform noise, MI attack accuracy
for a causal model is near a random guess
80%
50%
For associational models, the attacker can guess membership in training set with 80% accuracy.
[BN-Child] With uniform noise, MI attack accuracy
for a causal model is near a random guess
80%
50%
For associational models, the attacker can guess membership in training set with 80% accuracy.
Privacy without loss in utility: Causal & DNN models achieve same prediction accuracy.
[BN-Child] MI Attack accuracy increases with
amount of noise for associational models, but
stays constant at 50% for causal models
[BN] Consistent results across all four datasets
High attack accuracy for associational
models when ๐‘ƒโˆ—
(Test2) has uniform noise.
Same classification accuracy between
causal and associational models.
[MNIST] MI attack accuracy is lower for invariant
risk minimizer compared to associational model
IRM model motivated by causal reasoning has 53% attack accuracy, close to random.
Associational model also fails to generalize: 16% accuracy on test set.
Model
Train
Accuracy
(%)
Test
Accuracy
(%)
Attack
Accuracy
(%)
Causal Model
(IRM)
70 69 53
Associational
Model (ERM)
87 16 66
Conclusion
โ€ข Established theoretical connection between causality and differential privacy.
โ€ข Demonstrated the benefits of causal ML models for alleviating privacy attacks,
both theoretically and empirically.
โ€ข Code available at https://github.com/microsoft/robustdg
Future work: Investigate robustness of causal models with other kinds of
adversarial attacks.
Causal
Learning
Privacy
thank you!
Amit Sharma
Microsoft Research
References
โ€ข [ABGDโ€™19] Martin Arjovsky, Lรฉon Bottou, Ishaan Gulrajani, and David Lopez-Paz. Invariant risk minimization. arXiv
preprint arXiv:1907.02893, 2019.
โ€ข [CSFโ€™18] Yeom, S., Giacomelli, I., Fredrikson, M., and Jha, S. Privacy risk in machine learning: Analyzing the connection
to overfitting. CSF 2018.
โ€ข [DRโ€™14] Cynthia Dwork, Aaron Roth, et al. The algorithmic foundations of differential privacy. Foundations and
Trends in Theoretical Computer Science, 9(3โ€“4):211โ€“407, 2014.
โ€ข [DSZโ€™16] Anupam Datta, Shayak Sen, and Yair Zick. Algorithmic transparency via quantitative input influence: Theory
and experiments with learning systems. In Security and Privacy (SP), 2016 IEEE Symposium on, pp. 598โ€“617. IEEE,
2016
โ€ข [KLRSโ€™17] Matt J Kusner, Joshua Loftus, Chris Russell, and Ricardo Silva. Counterfactual fairness. In Advances in
Neural Information Processing Systems, pp. 4066โ€“4076, 2017.
โ€ข [MTSโ€™19] Mahajan, Divyat, Chenhao Tan, and Amit Sharma. "Preserving Causal Constraints in Counterfactual
Explanations for Machine Learning Classifiers." arXiv preprint arXiv:1912.03277 (2019).
โ€ข [MTSโ€™20] Mahajan, Divyat, Shruti Tople and Amit Sharma. โ€œDomain Generalization using Causal Matchingโ€. arXiv
preprint arXiv:2006.07500, 2020.
โ€ข [NDSSโ€™19] Salem, A., Zhang, Y., Humbert, M., Fritz, M., and Backes, M. Ml-leaks: Model and data independent
membership inference attacks and defenses on machine learning models. NDSS 2019.
โ€ข [SPโ€™17] Shokri, R., Stronati, M., Song, C., and Shmatikov, V. Membership inference attacks against machine learning
models. Security and Privacy (SP), 2017.
โ€ข [SPโ€™19] Nasr, M., Shokri, R., and Houmansadr, A. Comprehensive privacy analysis of deep learning: Stand-alone and
federated learning under passive and active white-box inference attacks. Security and Privacy (SP), 2019.

More Related Content

What's hot

Module 6: Ensemble Algorithms
Module 6:  Ensemble AlgorithmsModule 6:  Ensemble Algorithms
Module 6: Ensemble AlgorithmsSara Hooker
ย 
Repurposing Classification & Regression Trees for Causal Research with High-D...
Repurposing Classification & Regression Trees for Causal Research with High-D...Repurposing Classification & Regression Trees for Causal Research with High-D...
Repurposing Classification & Regression Trees for Causal Research with High-D...Galit Shmueli
ย 
Causal data mining: Identifying causal effects at scale
Causal data mining: Identifying causal effects at scaleCausal data mining: Identifying causal effects at scale
Causal data mining: Identifying causal effects at scaleAmit Sharma
ย 
To Explain, To Predict, or To Describe?
To Explain, To Predict, or To Describe?To Explain, To Predict, or To Describe?
To Explain, To Predict, or To Describe?Galit Shmueli
ย 
Machine learning session6(decision trees random forrest)
Machine learning   session6(decision trees random forrest)Machine learning   session6(decision trees random forrest)
Machine learning session6(decision trees random forrest)Abhimanyu Dwivedi
ย 
PyGotham 2016
PyGotham 2016PyGotham 2016
PyGotham 2016Manojit Nandi
ย 
Module 3: Linear Regression
Module 3:  Linear RegressionModule 3:  Linear Regression
Module 3: Linear RegressionSara Hooker
ย 
Repurposing predictive tools for causal research
Repurposing predictive tools for causal researchRepurposing predictive tools for causal research
Repurposing predictive tools for causal researchGalit Shmueli
ย 
sigir2018tutorial
sigir2018tutorialsigir2018tutorial
sigir2018tutorialTetsuya Sakai
ย 
Module 2: Machine Learning Deep Dive
Module 2:  Machine Learning Deep DiveModule 2:  Machine Learning Deep Dive
Module 2: Machine Learning Deep DiveSara Hooker
ย 
Module 5: Decision Trees
Module 5: Decision TreesModule 5: Decision Trees
Module 5: Decision TreesSara Hooker
ย 
To explain or to predict
To explain or to predictTo explain or to predict
To explain or to predictGalit Shmueli
ย 
Module 1.2 data preparation
Module 1.2  data preparationModule 1.2  data preparation
Module 1.2 data preparationSara Hooker
ย 
Statistical Modeling in 3D: Describing, Explaining and Predicting
Statistical Modeling in 3D: Describing, Explaining and PredictingStatistical Modeling in 3D: Describing, Explaining and Predicting
Statistical Modeling in 3D: Describing, Explaining and PredictingGalit Shmueli
ย 
Statistical Approaches to Missing Data
Statistical Approaches to Missing DataStatistical Approaches to Missing Data
Statistical Approaches to Missing DataDataCards
ย 
Big Data - To Explain or To Predict? Talk at U Toronto's Rotman School of Ma...
Big Data - To Explain or To Predict?  Talk at U Toronto's Rotman School of Ma...Big Data - To Explain or To Predict?  Talk at U Toronto's Rotman School of Ma...
Big Data - To Explain or To Predict? Talk at U Toronto's Rotman School of Ma...Galit Shmueli
ย 

What's hot (18)

Module 6: Ensemble Algorithms
Module 6:  Ensemble AlgorithmsModule 6:  Ensemble Algorithms
Module 6: Ensemble Algorithms
ย 
Repurposing Classification & Regression Trees for Causal Research with High-D...
Repurposing Classification & Regression Trees for Causal Research with High-D...Repurposing Classification & Regression Trees for Causal Research with High-D...
Repurposing Classification & Regression Trees for Causal Research with High-D...
ย 
Causal data mining: Identifying causal effects at scale
Causal data mining: Identifying causal effects at scaleCausal data mining: Identifying causal effects at scale
Causal data mining: Identifying causal effects at scale
ย 
To Explain, To Predict, or To Describe?
To Explain, To Predict, or To Describe?To Explain, To Predict, or To Describe?
To Explain, To Predict, or To Describe?
ย 
Machine learning session6(decision trees random forrest)
Machine learning   session6(decision trees random forrest)Machine learning   session6(decision trees random forrest)
Machine learning session6(decision trees random forrest)
ย 
PyGotham 2016
PyGotham 2016PyGotham 2016
PyGotham 2016
ย 
Module 3: Linear Regression
Module 3:  Linear RegressionModule 3:  Linear Regression
Module 3: Linear Regression
ย 
Repurposing predictive tools for causal research
Repurposing predictive tools for causal researchRepurposing predictive tools for causal research
Repurposing predictive tools for causal research
ย 
sigir2018tutorial
sigir2018tutorialsigir2018tutorial
sigir2018tutorial
ย 
Module 2: Machine Learning Deep Dive
Module 2:  Machine Learning Deep DiveModule 2:  Machine Learning Deep Dive
Module 2: Machine Learning Deep Dive
ย 
sigir2020
sigir2020sigir2020
sigir2020
ย 
Hypothesis testing
Hypothesis testingHypothesis testing
Hypothesis testing
ย 
Module 5: Decision Trees
Module 5: Decision TreesModule 5: Decision Trees
Module 5: Decision Trees
ย 
To explain or to predict
To explain or to predictTo explain or to predict
To explain or to predict
ย 
Module 1.2 data preparation
Module 1.2  data preparationModule 1.2  data preparation
Module 1.2 data preparation
ย 
Statistical Modeling in 3D: Describing, Explaining and Predicting
Statistical Modeling in 3D: Describing, Explaining and PredictingStatistical Modeling in 3D: Describing, Explaining and Predicting
Statistical Modeling in 3D: Describing, Explaining and Predicting
ย 
Statistical Approaches to Missing Data
Statistical Approaches to Missing DataStatistical Approaches to Missing Data
Statistical Approaches to Missing Data
ย 
Big Data - To Explain or To Predict? Talk at U Toronto's Rotman School of Ma...
Big Data - To Explain or To Predict?  Talk at U Toronto's Rotman School of Ma...Big Data - To Explain or To Predict?  Talk at U Toronto's Rotman School of Ma...
Big Data - To Explain or To Predict? Talk at U Toronto's Rotman School of Ma...
ย 

Similar to Causal Learning Boosts Privacy for ML Models

Adversarial examples in deep learning (Gregory Chatel)
Adversarial examples in deep learning (Gregory Chatel)Adversarial examples in deep learning (Gregory Chatel)
Adversarial examples in deep learning (Gregory Chatel)MeetupDataScienceRoma
ย 
Probability density estimation using Product of Conditional Experts
Probability density estimation using Product of Conditional ExpertsProbability density estimation using Product of Conditional Experts
Probability density estimation using Product of Conditional ExpertsChirag Gupta
ย 
Summary.ppt
Summary.pptSummary.ppt
Summary.pptbutest
ย 
DATA-LEVEL HYBRID STRATEGY SELECTION FOR DISK FAULT PREDICTION MODEL BASED ON...
DATA-LEVEL HYBRID STRATEGY SELECTION FOR DISK FAULT PREDICTION MODEL BASED ON...DATA-LEVEL HYBRID STRATEGY SELECTION FOR DISK FAULT PREDICTION MODEL BASED ON...
DATA-LEVEL HYBRID STRATEGY SELECTION FOR DISK FAULT PREDICTION MODEL BASED ON...IJCI JOURNAL
ย 
Intro to Model Selection
Intro to Model SelectionIntro to Model Selection
Intro to Model Selectionchenhm
ย 
Modeling strategies for definitive screening designs using jmp and r
Modeling strategies for definitive  screening designs using jmp and rModeling strategies for definitive  screening designs using jmp and r
Modeling strategies for definitive screening designs using jmp and rPhilip Ramsey
ย 
Dealing with imbalanced data sets.pdf
Dealing with imbalanced data sets.pdfDealing with imbalanced data sets.pdf
Dealing with imbalanced data sets.pdfNagaVarthini
ย 
Morse-Smale Regression for Risk Modeling
Morse-Smale Regression for Risk ModelingMorse-Smale Regression for Risk Modeling
Morse-Smale Regression for Risk ModelingColleen Farrelly
ย 
Top 20 Data Science Interview Questions and Answers in 2023.pdf
Top 20 Data Science Interview Questions and Answers in 2023.pdfTop 20 Data Science Interview Questions and Answers in 2023.pdf
Top 20 Data Science Interview Questions and Answers in 2023.pdfAnanthReddy38
ย 
Machine learning module 2
Machine learning module 2Machine learning module 2
Machine learning module 2Gokulks007
ย 
Adversarial examples reading comprehension system
Adversarial examples reading comprehension systemAdversarial examples reading comprehension system
Adversarial examples reading comprehension systemMasa Kato
ย 
SPSS statistics - get help using SPSS
SPSS statistics - get help using SPSSSPSS statistics - get help using SPSS
SPSS statistics - get help using SPSScsula its training
ย 
Download It
Download ItDownload It
Download Itbutest
ย 
Predire il futuro con Machine Learning & Big Data
Predire il futuro con Machine Learning & Big DataPredire il futuro con Machine Learning & Big Data
Predire il futuro con Machine Learning & Big DataData Driven Innovation
ย 
PyData Miami 2019, Quantum Generalized Linear Models
PyData Miami 2019, Quantum Generalized Linear ModelsPyData Miami 2019, Quantum Generalized Linear Models
PyData Miami 2019, Quantum Generalized Linear ModelsColleen Farrelly
ย 
AIAA-SDM-PEMF-2013
AIAA-SDM-PEMF-2013AIAA-SDM-PEMF-2013
AIAA-SDM-PEMF-2013OptiModel
ย 
Model validation strategies ftc 2018
Model validation strategies ftc 2018Model validation strategies ftc 2018
Model validation strategies ftc 2018Philip Ramsey
ย 
Predictive data analytics models and their applications
Predictive data analytics models and their applicationsPredictive data analytics models and their applications
Predictive data analytics models and their applicationsBharathi Raja Asoka Chakravarthi
ย 
Multi-Cluster Based Approach for skewed Data in Data Mining
Multi-Cluster Based Approach for skewed Data in Data MiningMulti-Cluster Based Approach for skewed Data in Data Mining
Multi-Cluster Based Approach for skewed Data in Data MiningIOSR Journals
ย 

Similar to Causal Learning Boosts Privacy for ML Models (20)

Adversarial examples in deep learning (Gregory Chatel)
Adversarial examples in deep learning (Gregory Chatel)Adversarial examples in deep learning (Gregory Chatel)
Adversarial examples in deep learning (Gregory Chatel)
ย 
Probability density estimation using Product of Conditional Experts
Probability density estimation using Product of Conditional ExpertsProbability density estimation using Product of Conditional Experts
Probability density estimation using Product of Conditional Experts
ย 
Summary.ppt
Summary.pptSummary.ppt
Summary.ppt
ย 
DATA-LEVEL HYBRID STRATEGY SELECTION FOR DISK FAULT PREDICTION MODEL BASED ON...
DATA-LEVEL HYBRID STRATEGY SELECTION FOR DISK FAULT PREDICTION MODEL BASED ON...DATA-LEVEL HYBRID STRATEGY SELECTION FOR DISK FAULT PREDICTION MODEL BASED ON...
DATA-LEVEL HYBRID STRATEGY SELECTION FOR DISK FAULT PREDICTION MODEL BASED ON...
ย 
Intro to Model Selection
Intro to Model SelectionIntro to Model Selection
Intro to Model Selection
ย 
Modeling strategies for definitive screening designs using jmp and r
Modeling strategies for definitive  screening designs using jmp and rModeling strategies for definitive  screening designs using jmp and r
Modeling strategies for definitive screening designs using jmp and r
ย 
Dealing with imbalanced data sets.pdf
Dealing with imbalanced data sets.pdfDealing with imbalanced data sets.pdf
Dealing with imbalanced data sets.pdf
ย 
Morse-Smale Regression for Risk Modeling
Morse-Smale Regression for Risk ModelingMorse-Smale Regression for Risk Modeling
Morse-Smale Regression for Risk Modeling
ย 
Top 20 Data Science Interview Questions and Answers in 2023.pdf
Top 20 Data Science Interview Questions and Answers in 2023.pdfTop 20 Data Science Interview Questions and Answers in 2023.pdf
Top 20 Data Science Interview Questions and Answers in 2023.pdf
ย 
Machine learning module 2
Machine learning module 2Machine learning module 2
Machine learning module 2
ย 
Adversarial examples reading comprehension system
Adversarial examples reading comprehension systemAdversarial examples reading comprehension system
Adversarial examples reading comprehension system
ย 
SPSS statistics - get help using SPSS
SPSS statistics - get help using SPSSSPSS statistics - get help using SPSS
SPSS statistics - get help using SPSS
ย 
Download It
Download ItDownload It
Download It
ย 
Predire il futuro con Machine Learning & Big Data
Predire il futuro con Machine Learning & Big DataPredire il futuro con Machine Learning & Big Data
Predire il futuro con Machine Learning & Big Data
ย 
PyData Miami 2019, Quantum Generalized Linear Models
PyData Miami 2019, Quantum Generalized Linear ModelsPyData Miami 2019, Quantum Generalized Linear Models
PyData Miami 2019, Quantum Generalized Linear Models
ย 
Bank loan purchase modeling
Bank loan purchase modelingBank loan purchase modeling
Bank loan purchase modeling
ย 
AIAA-SDM-PEMF-2013
AIAA-SDM-PEMF-2013AIAA-SDM-PEMF-2013
AIAA-SDM-PEMF-2013
ย 
Model validation strategies ftc 2018
Model validation strategies ftc 2018Model validation strategies ftc 2018
Model validation strategies ftc 2018
ย 
Predictive data analytics models and their applications
Predictive data analytics models and their applicationsPredictive data analytics models and their applications
Predictive data analytics models and their applications
ย 
Multi-Cluster Based Approach for skewed Data in Data Mining
Multi-Cluster Based Approach for skewed Data in Data MiningMulti-Cluster Based Approach for skewed Data in Data Mining
Multi-Cluster Based Approach for skewed Data in Data Mining
ย 

More from Amit Sharma

DoWhy Python library for causal inference: An End-to-End tool
DoWhy Python library for causal inference: An End-to-End toolDoWhy Python library for causal inference: An End-to-End tool
DoWhy Python library for causal inference: An End-to-End toolAmit Sharma
ย 
The Impact of Computing Systems | Causal inference in practice
The Impact of Computing Systems | Causal inference in practiceThe Impact of Computing Systems | Causal inference in practice
The Impact of Computing Systems | Causal inference in practiceAmit Sharma
ย 
Artificial Intelligence for Societal Impact
Artificial Intelligence for Societal ImpactArtificial Intelligence for Societal Impact
Artificial Intelligence for Societal ImpactAmit Sharma
ย 
Measuring effectiveness of machine learning systems
Measuring effectiveness of machine learning systemsMeasuring effectiveness of machine learning systems
Measuring effectiveness of machine learning systemsAmit Sharma
ย 
Auditing search engines for differential satisfaction across demographics
Auditing search engines for differential satisfaction across demographicsAuditing search engines for differential satisfaction across demographics
Auditing search engines for differential satisfaction across demographicsAmit Sharma
ย 
Causal inference in data science
Causal inference in data scienceCausal inference in data science
Causal inference in data scienceAmit Sharma
ย 
Causal inference in online systems: Methods, pitfalls and best practices
Causal inference in online systems: Methods, pitfalls and best practicesCausal inference in online systems: Methods, pitfalls and best practices
Causal inference in online systems: Methods, pitfalls and best practicesAmit Sharma
ย 
Equivalence causal frameworks: SEMs, Graphical models and Potential Outcomes
Equivalence causal frameworks: SEMs, Graphical models and Potential OutcomesEquivalence causal frameworks: SEMs, Graphical models and Potential Outcomes
Equivalence causal frameworks: SEMs, Graphical models and Potential OutcomesAmit Sharma
ย 
Estimating the causal impact of recommender systems
Estimating the causal impact of recommender systemsEstimating the causal impact of recommender systems
Estimating the causal impact of recommender systemsAmit Sharma
ย 
Data mining for causal inference: Effect of recommendations on Amazon.com
Data mining for causal inference: Effect of recommendations on Amazon.comData mining for causal inference: Effect of recommendations on Amazon.com
Data mining for causal inference: Effect of recommendations on Amazon.comAmit Sharma
ย 
Estimating influence of online activity feeds on people's actions
Estimating influence of online activity feeds on people's actionsEstimating influence of online activity feeds on people's actions
Estimating influence of online activity feeds on people's actionsAmit Sharma
ย 
From prediction to causation: Causal inference in online systems
From prediction to causation: Causal inference in online systemsFrom prediction to causation: Causal inference in online systems
From prediction to causation: Causal inference in online systemsAmit Sharma
ย 
The interplay of personal preference and social influence in sharing networks...
The interplay of personal preference and social influence in sharing networks...The interplay of personal preference and social influence in sharing networks...
The interplay of personal preference and social influence in sharing networks...Amit Sharma
ย 
The role of social connections in shaping our preferences
The role of social connections in shaping our preferencesThe role of social connections in shaping our preferences
The role of social connections in shaping our preferencesAmit Sharma
ย 
[RecSys '13]Pairwise Learning: Experiments with Community Recommendation on L...
[RecSys '13]Pairwise Learning: Experiments with Community Recommendation on L...[RecSys '13]Pairwise Learning: Experiments with Community Recommendation on L...
[RecSys '13]Pairwise Learning: Experiments with Community Recommendation on L...Amit Sharma
ย 
RSWEB 2013: A research platform for social recommendation
RSWEB 2013: A research platform for social recommendationRSWEB 2013: A research platform for social recommendation
RSWEB 2013: A research platform for social recommendationAmit Sharma
ย 

More from Amit Sharma (16)

DoWhy Python library for causal inference: An End-to-End tool
DoWhy Python library for causal inference: An End-to-End toolDoWhy Python library for causal inference: An End-to-End tool
DoWhy Python library for causal inference: An End-to-End tool
ย 
The Impact of Computing Systems | Causal inference in practice
The Impact of Computing Systems | Causal inference in practiceThe Impact of Computing Systems | Causal inference in practice
The Impact of Computing Systems | Causal inference in practice
ย 
Artificial Intelligence for Societal Impact
Artificial Intelligence for Societal ImpactArtificial Intelligence for Societal Impact
Artificial Intelligence for Societal Impact
ย 
Measuring effectiveness of machine learning systems
Measuring effectiveness of machine learning systemsMeasuring effectiveness of machine learning systems
Measuring effectiveness of machine learning systems
ย 
Auditing search engines for differential satisfaction across demographics
Auditing search engines for differential satisfaction across demographicsAuditing search engines for differential satisfaction across demographics
Auditing search engines for differential satisfaction across demographics
ย 
Causal inference in data science
Causal inference in data scienceCausal inference in data science
Causal inference in data science
ย 
Causal inference in online systems: Methods, pitfalls and best practices
Causal inference in online systems: Methods, pitfalls and best practicesCausal inference in online systems: Methods, pitfalls and best practices
Causal inference in online systems: Methods, pitfalls and best practices
ย 
Equivalence causal frameworks: SEMs, Graphical models and Potential Outcomes
Equivalence causal frameworks: SEMs, Graphical models and Potential OutcomesEquivalence causal frameworks: SEMs, Graphical models and Potential Outcomes
Equivalence causal frameworks: SEMs, Graphical models and Potential Outcomes
ย 
Estimating the causal impact of recommender systems
Estimating the causal impact of recommender systemsEstimating the causal impact of recommender systems
Estimating the causal impact of recommender systems
ย 
Data mining for causal inference: Effect of recommendations on Amazon.com
Data mining for causal inference: Effect of recommendations on Amazon.comData mining for causal inference: Effect of recommendations on Amazon.com
Data mining for causal inference: Effect of recommendations on Amazon.com
ย 
Estimating influence of online activity feeds on people's actions
Estimating influence of online activity feeds on people's actionsEstimating influence of online activity feeds on people's actions
Estimating influence of online activity feeds on people's actions
ย 
From prediction to causation: Causal inference in online systems
From prediction to causation: Causal inference in online systemsFrom prediction to causation: Causal inference in online systems
From prediction to causation: Causal inference in online systems
ย 
The interplay of personal preference and social influence in sharing networks...
The interplay of personal preference and social influence in sharing networks...The interplay of personal preference and social influence in sharing networks...
The interplay of personal preference and social influence in sharing networks...
ย 
The role of social connections in shaping our preferences
The role of social connections in shaping our preferencesThe role of social connections in shaping our preferences
The role of social connections in shaping our preferences
ย 
[RecSys '13]Pairwise Learning: Experiments with Community Recommendation on L...
[RecSys '13]Pairwise Learning: Experiments with Community Recommendation on L...[RecSys '13]Pairwise Learning: Experiments with Community Recommendation on L...
[RecSys '13]Pairwise Learning: Experiments with Community Recommendation on L...
ย 
RSWEB 2013: A research platform for social recommendation
RSWEB 2013: A research platform for social recommendationRSWEB 2013: A research platform for social recommendation
RSWEB 2013: A research platform for social recommendation
ย 

Recently uploaded

GFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxGFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxAleenaTreesaSaji
ย 
Call Girls in Munirka Delhi ๐Ÿ’ฏCall Us ๐Ÿ”9953322196๐Ÿ” ๐Ÿ’ฏEscort.
Call Girls in Munirka Delhi ๐Ÿ’ฏCall Us ๐Ÿ”9953322196๐Ÿ” ๐Ÿ’ฏEscort.Call Girls in Munirka Delhi ๐Ÿ’ฏCall Us ๐Ÿ”9953322196๐Ÿ” ๐Ÿ’ฏEscort.
Call Girls in Munirka Delhi ๐Ÿ’ฏCall Us ๐Ÿ”9953322196๐Ÿ” ๐Ÿ’ฏEscort.aasikanpl
ย 
Cultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxCultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxpradhanghanshyam7136
ย 
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...jana861314
ย 
Call Girls in Munirka Delhi ๐Ÿ’ฏCall Us ๐Ÿ”8264348440๐Ÿ”
Call Girls in Munirka Delhi ๐Ÿ’ฏCall Us ๐Ÿ”8264348440๐Ÿ”Call Girls in Munirka Delhi ๐Ÿ’ฏCall Us ๐Ÿ”8264348440๐Ÿ”
Call Girls in Munirka Delhi ๐Ÿ’ฏCall Us ๐Ÿ”8264348440๐Ÿ”soniya singh
ย 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSarthak Sekhar Mondal
ย 
Boyles law module in the grade 10 science
Boyles law module in the grade 10 scienceBoyles law module in the grade 10 science
Boyles law module in the grade 10 sciencefloriejanemacaya1
ย 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTSรฉrgio Sacani
ย 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bSรฉrgio Sacani
ย 
Lucknow ๐Ÿ’‹ Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow ๐Ÿ’‹ Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow ๐Ÿ’‹ Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow ๐Ÿ’‹ Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...anilsa9823
ย 
NAVSEA PEO USC - Unmanned & Small Combatants 26Oct23.pdf
NAVSEA PEO USC - Unmanned & Small Combatants 26Oct23.pdfNAVSEA PEO USC - Unmanned & Small Combatants 26Oct23.pdf
NAVSEA PEO USC - Unmanned & Small Combatants 26Oct23.pdfWadeK3
ย 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxUmerFayaz5
ย 
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCESTERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCEPRINCE C P
ย 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoSรฉrgio Sacani
ย 
G9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptG9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptMAESTRELLAMesa2
ย 
Grafana in space: Monitoring Japan's SLIM moon lander in real time
Grafana in space: Monitoring Japan's SLIM moon lander  in real timeGrafana in space: Monitoring Japan's SLIM moon lander  in real time
Grafana in space: Monitoring Japan's SLIM moon lander in real timeSatoshi NAKAHIRA
ย 
Artificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PArtificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PPRINCE C P
ย 
Work, Energy and Power for class 10 ICSE Physics
Work, Energy and Power for class 10 ICSE PhysicsWork, Energy and Power for class 10 ICSE Physics
Work, Energy and Power for class 10 ICSE Physicsvishikhakeshava1
ย 

Recently uploaded (20)

GFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxGFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptx
ย 
Call Girls in Munirka Delhi ๐Ÿ’ฏCall Us ๐Ÿ”9953322196๐Ÿ” ๐Ÿ’ฏEscort.
Call Girls in Munirka Delhi ๐Ÿ’ฏCall Us ๐Ÿ”9953322196๐Ÿ” ๐Ÿ’ฏEscort.Call Girls in Munirka Delhi ๐Ÿ’ฏCall Us ๐Ÿ”9953322196๐Ÿ” ๐Ÿ’ฏEscort.
Call Girls in Munirka Delhi ๐Ÿ’ฏCall Us ๐Ÿ”9953322196๐Ÿ” ๐Ÿ’ฏEscort.
ย 
Cultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxCultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptx
ย 
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
ย 
The Philosophy of Science
The Philosophy of ScienceThe Philosophy of Science
The Philosophy of Science
ย 
Call Girls in Munirka Delhi ๐Ÿ’ฏCall Us ๐Ÿ”8264348440๐Ÿ”
Call Girls in Munirka Delhi ๐Ÿ’ฏCall Us ๐Ÿ”8264348440๐Ÿ”Call Girls in Munirka Delhi ๐Ÿ’ฏCall Us ๐Ÿ”8264348440๐Ÿ”
Call Girls in Munirka Delhi ๐Ÿ’ฏCall Us ๐Ÿ”8264348440๐Ÿ”
ย 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
ย 
Engler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomyEngler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomy
ย 
Boyles law module in the grade 10 science
Boyles law module in the grade 10 scienceBoyles law module in the grade 10 science
Boyles law module in the grade 10 science
ย 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOST
ย 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
ย 
Lucknow ๐Ÿ’‹ Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow ๐Ÿ’‹ Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow ๐Ÿ’‹ Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow ๐Ÿ’‹ Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
ย 
NAVSEA PEO USC - Unmanned & Small Combatants 26Oct23.pdf
NAVSEA PEO USC - Unmanned & Small Combatants 26Oct23.pdfNAVSEA PEO USC - Unmanned & Small Combatants 26Oct23.pdf
NAVSEA PEO USC - Unmanned & Small Combatants 26Oct23.pdf
ย 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptx
ย 
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCESTERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
ย 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on Io
ย 
G9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptG9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.ppt
ย 
Grafana in space: Monitoring Japan's SLIM moon lander in real time
Grafana in space: Monitoring Japan's SLIM moon lander  in real timeGrafana in space: Monitoring Japan's SLIM moon lander  in real time
Grafana in space: Monitoring Japan's SLIM moon lander in real time
ย 
Artificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PArtificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C P
ย 
Work, Energy and Power for class 10 ICSE Physics
Work, Energy and Power for class 10 ICSE PhysicsWork, Energy and Power for class 10 ICSE Physics
Work, Energy and Power for class 10 ICSE Physics
ย 

Causal Learning Boosts Privacy for ML Models

  • 1. Alleviating Privacy Attacks via Causal Learning Shruti Tople, Amit Sharma, Aditya V. Nori Microsoft Research https://arxiv.org/abs/1909.12732 https://github.com/microsoft/robustdg
  • 2. Motivation: ML models leak information about data points in the training set Neural Network TrainingHealth Records (HIV/AIDS patients) ML-as-a-service Member of Train Dataset Non-member Membership Inference Attacks [SPโ€™17][CSFโ€™18][NDSSโ€™19][SPโ€™19]
  • 3. The likely reason is overfitting Output 85% Output 95% Overfitting to dataset โ€ข Neural networks or associational models overfit to the training dataset โ€ข Membership inference adversary exploits differences in prediction score for training and test data [CSFโ€™18]
  • 4. Overfitting to distribution The likely reason is overfitting โ€ข Neural networks or associational models overfit to the training dataset โ€ข Membership inference attacks exploit differences in prediction score for training and test data [CSFโ€™18] โ€ข Privacy risk can increase when model is deployed to different distributions โ€ข E.g., Hospital in one region shares the model to other regions Output 85% Output 95% Overfitting to dataset Output 75% Poor generalization across distributions exacerbates membership inference risk.
  • 6. Can causal ML models help? Contributions 1. Causal models provide stronger (differential) privacy guarantees than associational models. โ€ข Due to their better generalizability on new distributions. 2. And hence are more robust to membership inference attacks. โ€ข As the training dataset size โ†’ โˆž, membership inference attackโ€™s accuracy drops to a random guess. 3. We empirically demonstrate privacy benefits of causal models across 5 datasets. โ€ข Associational models exhibit up to 80% attack accuracy whereas causal models exhibit attack accuracy close to 50%. Causal Learning Privacy
  • 7. Disease Severity Background: Causal Learning ๐’€ Blood Pressure Heart Rate ๐‘ฟ ๐’‘๐’‚๐’“๐’†๐’๐’• ๐‘ฟ ๐’‘๐’‚๐’“๐’†๐’๐’• ๐‘ฟ ๐Ÿ ๐‘ฟ ๐Ÿ Weight Age Use a structural causal model (SCM) that defines what conditional probabilities are invariant across different distributions [Pearlโ€™09].
  • 8. Background: Causal Learning Use a structural causal model (SCM) that defines what conditional probabilities are invariant across different distributions [Pearlโ€™09]. Causal Predictive Model: A prediction model based only on the parents of the outcome Y. What if SCM is not known? Learn an invariant feature representation across distributions [ABGDโ€™19, MTSโ€™20]. For ML models, causal learning can be useful for fairness [KLRSโ€™17] explainability [DSZโ€™16, MTSโ€™19] privacy [this work] Disease Severity ๐’€ Blood Pressure Heart Rate ๐‘ฟ ๐’‘๐’‚๐’“๐’†๐’๐’• ๐‘ฟ ๐’‘๐’‚๐’“๐’†๐’๐’• ๐‘ฟ ๐Ÿ ๐‘ฟ ๐Ÿ Weight Age
  • 9. ๐’€ ๐‘‹๐‘†0 ๐‘‹ ๐‘ƒ๐ด ๐‘‹๐‘†2 ๐‘‹๐‘†1 ๐‘‹ ๐ถ๐ป ๐‘‹๐‘๐‘ Intervention Why is a model based on causal parents invariant across data distributions?
  • 10. Why is a model based on causal parents invariant across data distributions? ๐’€ ๐‘‹๐‘†0 ๐‘‹ ๐‘ƒ๐ด ๐‘‹๐‘†2 ๐‘‹๐‘†1 ๐‘‹ ๐ถ๐ป ๐‘‹๐‘๐‘ Intervention ๐’€ ๐‘‹๐‘†0 ๐‘‹ ๐‘ƒ๐ด ๐‘‹๐‘†2 ๐‘‹๐‘†1 ๐‘‹ ๐ถ๐ป ๐‘‹๐‘๐‘ ๐‘ƒ(๐‘Œ|๐‘‹ ๐‘ƒ๐ด) is invariant across different distributions, unless there is a change in true data-generating process for Y.
  • 11. Result 1: Worst-case out-of-distribution error of a causal model is lower than an associational model.
  • 12. For any model โ„Ž, and ๐‘ƒโˆ— such that ๐‘ƒโˆ— ๐‘Œ ๐‘‹ ๐‘ƒ๐ด = ๐‘ƒ(๐‘Œ|๐‘‹ ๐‘ƒ๐ด), In-Distribution Error (IDE)= ๐ˆ๐ƒ๐„ ๐ ๐’‰, ๐’š = ๐‹ ๐‘ท ๐’‰, ๐’š โˆ’ ๐‹ ๐‘บโˆผP(๐’‰, ๐’š) Expected loss on the same distribution as the train data Out-of-Distribution Error (ODE)=๐Ž๐ƒ๐„ ๐,๐โˆ— ๐’‰, ๐’š = ๐‹ ๐‘ทโˆ— ๐’‰, ๐’š โˆ’ ๐‹ ๐‘บโˆผP ๐’‰, ๐’š Expected loss on a different distribution ๐‘ƒโˆ— than the train data Result 1: Worst-case out-of-distribution error of a causal model is lower than an associational model.
  • 13. For any model โ„Ž, and ๐‘ƒโˆ— such that ๐‘ƒโˆ— ๐‘Œ ๐‘‹ ๐‘ƒ๐ด = ๐‘ƒ(๐‘Œ|๐‘‹ ๐‘ƒ๐ด), In-Distribution Error (IDE)= ๐ˆ๐ƒ๐„ ๐ ๐’‰, ๐’š = ๐‹ ๐‘ท ๐’‰, ๐’š โˆ’ ๐‹ ๐‘บโˆผP(๐’‰, ๐’š) Expected loss on the same distribution as the train data Out-of-Distribution Error (ODE)=๐Ž๐ƒ๐„ ๐,๐โˆ— ๐’‰, ๐’š = ๐‹ ๐‘ทโˆ— ๐’‰, ๐’š โˆ’ ๐‹ ๐‘บโˆผP ๐’‰, ๐’š Expected loss on a different distribution ๐‘ƒโˆ— than the train data Proof Idea. Simple case: Assume ๐‘ฆ = ๐‘“(๐’™) is deterministic. ๐Ž๐ƒ๐„ ๐,๐โˆ— ๐’‰ ๐œ, ๐’š โ‰ค ๐ˆ๐ƒ๐„ ๐(๐’‰ ๐’„, ๐’š) + ๐’…๐’Š๐’”๐’„ ๐‹ ๐‘ท, ๐‘ทโˆ— Discrepancy b/w ๐‘ท and ๐‘ทโˆ— distributions Causal Model Result 1: Worst-case out-of-distribution error of a causal model is lower than an associational model.
  • 14. For any model โ„Ž, and ๐‘ƒโˆ— such that ๐‘ƒโˆ— ๐‘Œ ๐‘‹ ๐‘ƒ๐ด = ๐‘ƒ(๐‘Œ|๐‘‹ ๐‘ƒ๐ด), In-Distribution Error (IDE)= ๐ˆ๐ƒ๐„ ๐ ๐’‰, ๐’š = ๐‹ ๐‘ท ๐’‰, ๐’š โˆ’ ๐‹ ๐‘บโˆผP(๐’‰, ๐’š) Expected loss on the same distribution as the train data Out-of-Distribution Error (ODE)=๐Ž๐ƒ๐„ ๐,๐โˆ— ๐’‰, ๐’š = ๐‹ ๐‘ทโˆ— ๐’‰, ๐’š โˆ’ ๐‹ ๐‘บโˆผP ๐’‰, ๐’š Expected loss on a different distribution ๐‘ƒโˆ— than the train data Proof Idea. Simple case: Assume ๐‘ฆ = ๐‘“(๐’™) is deterministic. ๐Ž๐ƒ๐„ ๐,๐โˆ— ๐’‰ ๐œ, ๐’š โ‰ค ๐ˆ๐ƒ๐„ ๐(๐’‰ ๐’„, ๐’š) + ๐’…๐’Š๐’”๐’„ ๐‹ ๐‘ท, ๐‘ทโˆ— ๐Ž๐ƒ๐„ ๐,๐โˆ— ๐’‰ ๐’‚, ๐’š โ‰ค ๐ˆ๐ƒ๐„ ๐ ๐’‰ ๐’‚, ๐’š + ๐’…๐’Š๐’”๐’„ ๐‹ ๐‘ท, ๐‘ทโˆ— + ๐‹ ๐‘ทโˆ—(๐’‰ ๐’‚,๐‘ท ๐‘ถ๐‘ท๐‘ป , ๐’š) โ‡’ max ๐โˆ— ๐Ž๐ƒ๐„๐๐จ๐ฎ๐ง๐ ๐,๐โˆ— ๐’‰ ๐œ, ๐’š โ‰ค max ๐โˆ— ๐Ž๐ƒ๐„๐๐จ๐ฎ๐ง๐ ๐,๐โˆ— ๐’‰ ๐’‚, ๐’š Discrepancy b/w ๐‘ท and ๐‘ทโˆ— distributions Optimal ๐’‰ ๐’‚ on P is not optimal on ๐‘ทโˆ— Causal Model Assoc. Model Result 1: Worst-case out-of-distribution error of a causal model is lower than an associational model.
  • 15. And better generalization results in lower sensitivity for a causal model Sensitivity: If a single data point ๐’™, ๐‘ฆ โˆผ ๐‘ƒโˆ— is added to the train dataset ๐‘† to create ๐‘†โ€ฒ, how much does the learnt model h ๐‘† min change? Since the optimal causal model is the same across all ๐‘ƒโˆ— , adding any ๐’™, ๐‘ฆ โˆผ ๐‘ƒโˆ— has less impact on a trained causal model. Sensitivity for a causal model Sensitivity for an associational model
  • 16. Main Result: A causal model has stronger Differential Privacy guarantees Let M be a mechanism that returns a ML model trained over dataset ๐‘†, M(๐‘†) = โ„Ž. Differential Privacy [DRโ€™14]: A learning mechanism M satisfies ๐œ–-differential privacy if for any two datasets, ๐‘†, ๐‘†โ€ฒ that differ in one data point, Pr(M ๐‘† โˆˆ๐ป) Pr(M ๐‘†โ€ฒ โˆˆ๐ป) โ‰ค ๐‘’ ๐œ–. (Smaller ๐œ– values provide better privacy guarantees) Since lower sensitivity โ‡’ lower ๐œ–, Theorem: When equivalent Laplace noise is added and models are trained on same dataset, causal mechanism MC provides ๐œ– ๐ถ-DP and associational mechanism MA provides ๐œ– ๐ด-DP guarantees such that: ๐ ๐’„ โ‰ค ๐ ๐‘จ
  • 17. Therefore, causal models are more robust to membership inference (MI) attacks Advantage of an MI adversary: (True Positive Rate โ€“ False Positive Rate) in detecting whether ๐‘ฅ is from training dataset or not. [From Yeom et al. CSFโ€™18] Membership advantage of an adversary is bounded by ๐‘’ ๐œ– โˆ’ 1. Since the optimal causal models are the same for ๐‘ƒ and ๐‘ƒโˆ—, As ๐‘› โ†’ โˆž, membership advantage of causal model โ†’ 0. Theorem: When trained on the same dataset of size ๐‘›, membership advantage of a causal model is lower than the membership advantage for an associational model.
  • 19. Goal: Compare MI attack accuracy between causal and associational models [BN] When true causal structure is known Datasets generated from Bayesian networks: Child, Sachs, Water, Alarm Causal model: MLE estimation based on Yโ€™s parents Associational model: Neural networks with 3 linear layers ๐‘ƒโˆ—: Noise added to conditional probabilities (uniform or additive) [MNIST] When true causal structure is unknown Colored MNIST dataset (Digits are correlated with color) Causal Model: Invariant Risk Minimization that utilizes ๐‘ƒ ๐‘Œ ๐‘‹ ๐‘ƒ๐ด is same across distributions [ABGDโ€™19] Associational Model: Empirical Risk Minimization using the same NN architecture ๐‘ƒโˆ—: Different correlations between color and digit than the train dataset Attacker Model: Predict whether an input belongs to train dataset or not
  • 20. [BN] With uniform noise, MI attack accuracy for a causal model is near a random guess 80% 50% For associational models, the attacker can guess membership in training set with 80% accuracy.
  • 21. [BN-Child] With uniform noise, MI attack accuracy for a causal model is near a random guess 80% 50% For associational models, the attacker can guess membership in training set with 80% accuracy. Privacy without loss in utility: Causal & DNN models achieve same prediction accuracy.
  • 22. [BN-Child] MI Attack accuracy increases with amount of noise for associational models, but stays constant at 50% for causal models
  • 23. [BN] Consistent results across all four datasets High attack accuracy for associational models when ๐‘ƒโˆ— (Test2) has uniform noise. Same classification accuracy between causal and associational models.
  • 24. [MNIST] MI attack accuracy is lower for invariant risk minimizer compared to associational model IRM model motivated by causal reasoning has 53% attack accuracy, close to random. Associational model also fails to generalize: 16% accuracy on test set. Model Train Accuracy (%) Test Accuracy (%) Attack Accuracy (%) Causal Model (IRM) 70 69 53 Associational Model (ERM) 87 16 66
  • 25. Conclusion โ€ข Established theoretical connection between causality and differential privacy. โ€ข Demonstrated the benefits of causal ML models for alleviating privacy attacks, both theoretically and empirically. โ€ข Code available at https://github.com/microsoft/robustdg Future work: Investigate robustness of causal models with other kinds of adversarial attacks. Causal Learning Privacy thank you! Amit Sharma Microsoft Research
  • 26. References โ€ข [ABGDโ€™19] Martin Arjovsky, Lรฉon Bottou, Ishaan Gulrajani, and David Lopez-Paz. Invariant risk minimization. arXiv preprint arXiv:1907.02893, 2019. โ€ข [CSFโ€™18] Yeom, S., Giacomelli, I., Fredrikson, M., and Jha, S. Privacy risk in machine learning: Analyzing the connection to overfitting. CSF 2018. โ€ข [DRโ€™14] Cynthia Dwork, Aaron Roth, et al. The algorithmic foundations of differential privacy. Foundations and Trends in Theoretical Computer Science, 9(3โ€“4):211โ€“407, 2014. โ€ข [DSZโ€™16] Anupam Datta, Shayak Sen, and Yair Zick. Algorithmic transparency via quantitative input influence: Theory and experiments with learning systems. In Security and Privacy (SP), 2016 IEEE Symposium on, pp. 598โ€“617. IEEE, 2016 โ€ข [KLRSโ€™17] Matt J Kusner, Joshua Loftus, Chris Russell, and Ricardo Silva. Counterfactual fairness. In Advances in Neural Information Processing Systems, pp. 4066โ€“4076, 2017. โ€ข [MTSโ€™19] Mahajan, Divyat, Chenhao Tan, and Amit Sharma. "Preserving Causal Constraints in Counterfactual Explanations for Machine Learning Classifiers." arXiv preprint arXiv:1912.03277 (2019). โ€ข [MTSโ€™20] Mahajan, Divyat, Shruti Tople and Amit Sharma. โ€œDomain Generalization using Causal Matchingโ€. arXiv preprint arXiv:2006.07500, 2020. โ€ข [NDSSโ€™19] Salem, A., Zhang, Y., Humbert, M., Fritz, M., and Backes, M. Ml-leaks: Model and data independent membership inference attacks and defenses on machine learning models. NDSS 2019. โ€ข [SPโ€™17] Shokri, R., Stronati, M., Song, C., and Shmatikov, V. Membership inference attacks against machine learning models. Security and Privacy (SP), 2017. โ€ข [SPโ€™19] Nasr, M., Shokri, R., and Houmansadr, A. Comprehensive privacy analysis of deep learning: Stand-alone and federated learning under passive and active white-box inference attacks. Security and Privacy (SP), 2019.