SlideShare a Scribd company logo
1 of 14
Download to read offline
Cool vs. Creepy: Ethics in Data Science
Cathy Cooper
3 February 2017
© 2016 SAP SE or an SAP affiliate company. All rights reserved. 2Public
(Image of a hand with a credit card + JetBlue plane to Orlando?)
xxx
Planes, meals, theme
parks and … credit
cards?
© 2016 SAP SE or an SAP affiliate company. All rights reserved. 3Public
Connection to ethics & data science
DATA SCIENCE
Data Analysis Applications
EXPERIENCE
Transparency Permission
© 2016 SAP SE or an SAP affiliate company. All rights reserved. 4Public
Data, Analysis and Applications: Biased Data
Google tested machine learning analysis on their News data to try to find relationships between
concepts.
One of the examples was the following “gender + profession” comparison.
man + computer programmer vs. woman + _________
__________________ = Homemaker
Why? Because the system was trained with an inherently biased source.
© 2016 SAP SE or an SAP affiliate company. All rights reserved. 5Public
Data, Analysis and Applications: Anonymized Data
How many time & location records does it
take to link cell phone usage data to an
individual?
Only four
© 2016 SAP SE or an SAP affiliate company. All rights reserved. 6Public
Data, Analysis and Applications: Context
A financial institution was looking to predict customer churn.
Who did they identify as the prime target for their retention offers?
Spouses getting their finances in order before filing for divorce.
A test was conducted using just social media sentiment analysis to predict changes in US
unemployment rate.
It did predict an unemployment drop – but it was wrong. Why?
Relying only on the tool – and poor timing: Steve Jobs had just died.
© 2016 SAP SE or an SAP affiliate company. All rights reserved. 7Public
Data, Analysis and Applications: Data/Predictions Change Behavior
and Vice Versa
GPS
Flu projection
© 2016 SAP SE or an SAP affiliate company. All rights reserved. 8Public
Data, Analysis and Applications: Human Intervention Required
“… people have
too much trust in
data to be
intrinsically
objective, even
though it is in fact
only as good as
the human
processes that
collected it.” --
Cathy O’Neill
“Humans will need to check on the
outcomes and see if the models and the
algorithms and the rules are performing
as intended, and then intervene if they
don’t.” – Tom Davenport
© 2016 SAP SE or an SAP affiliate company. All rights reserved. 9Public
Transparency: Try it for yourself
University of Cambridge The Psychometrics Centre uses your Facebook profile and activity for:
► YouAreWhatYouLike describe openness to new ideas, extraversion and introversion, your warmth
or competitiveness, and other personality traits.
► Apply Magic Sauce, predicts your politics, relationship status, sexual orientation, gender, and
more.
Pennebaker Conglomerate’s
► AnalyzeWords leverages linguistics to discover the personality you (other others) portray on
Twitter. You can analyze anyone, not just yourself.
© 2016 SAP SE or an SAP affiliate company. All rights reserved. 10Public
Transparency: How It Can Work
Banks have to prove:
►their algorithms are not based on variables like race & gender
►their models don’t focus on patterns that disfavor specific demographic groups
and
they have to allow outside data scientists to assess their models for code or data that might have a
discriminatory effect.
Consider what it will takes to build transparency into AI system design: human access and
action plan
© 2016 SAP SE or an SAP affiliate company. All rights reserved. 11Public
Permission/Fairness
Challenge and opportunity:
► Additional uses for previously
collected/aggregated data in
new types of analysis,
applications and businesses
► Coming legal and regulatory
requirements to audit and
correct algorithm-based
decision making
© 2016 SAP SE or an SAP affiliate company. All rights reserved. 12Public
In closing…
“… algorithms have to be designed with fairness and legality in
mind, with standards that are understandable to everyone, from
the business leader to the people being scored.” – Cathy O’Neill,
“Unmasking Unconscious Bias in Algorithms”, The Digitalist,
data scientist and author Weapons of Math Destruction
“…a fundamental rethinking and a careful approach to software creation itself is needed.
Extra care has to be put into training the people who create the systems, and into
incorporating research into the machine learning algorithms so you don’t accidentally create
more bias. The system has to be re-trained to think differently.” – Yvonne Baur, “Is Machine
Learning Sexist?”, Techcrunch, and Head of Predictive Analytics/Machine Learning, SAP
SuccessFactors,
© 2016 SAP SE or an SAP affiliate company. All rights reserved. 13Public
Additional Readings and Resources on Ethics & Data Science
http://www.digitalistmag.com/digital-supply-
networks/2017/01/16/cathy-oneil-
unmasking-unconscious-bias-in-algorithms-
04839140
http://www.digitalistmag.com/executive-
research/how-ai-can-end-bias
http://www.digitalistmag.com/executive-
research/an-ai-shares-my-office
http://www.digitalistmag.com/digital-
economy/digital-
futures/2017/01/23/machine-learning-real-
business-intelligence-04830225
http://www.digitalistmag.com/executive-
research/empathy-the-killer-app-for-
artificial-intelligence
http://www.digitaltrends.com/cool-
tech/women-in-artificial-intelligence/
https://techcrunch.com/2016/10/11/is-
machine-learning-sexist/
https://open.sap.com/courses/ds1
http://www.slate.com/articles/technology/fut
ure_tense/2014/10/youarewhatyoulike_find_
out_what_algorithms_can_tell_about_you_b
ased_on_your.html
http://searchcio.techtarget.com/opinion/Dat
a-products-introduce-ethical-dilemmas-for-
data-scientists
http://searchcio.techtarget.com/opinion/Big-
data-ethics-Why-the-CIO-needs-to-get-
involved
http://searchcloudapplications.techtarget.co
m/feature/Big-data-collection-efforts-spark-
an-information-ethics-debate
https://arstechnica.com/science/2015/11/ne
w-flu-tracker-uses-google-search-data-
better-than-google/
https://gigaom.com/2013/10/09/the-upside-
of-prism-at-least-were-talking-about-data-
privacy-or-lack-the-thereof/
http://nymag.com/thecut/2016/09/cathy-
oneils-weapons-of-math-destruction-math-
is-biased.html?mid=twitter-share-thecut
https://www.bloomberg.com/news/articles/2
016-06-23/artificial-intelligence-has-a-sea-of-
dudes-problem
http://boingboing.net/2016/01/06/weapons-
of-math-destruction-h.html
http://fortune.com/2015/12/04/ethics-
training-for-data-scientists/
https://gigaom.com/2013/03/25/why-the-
collision-of-big-data-and-privacy-will-
require-a-new-realpolitik/
http://www.nature.com/articles/srep01376
http://www.slate.com/articles/technology/fut
ure_tense/2016/02/how_to_bring_better_ethi
cs_to_data_science.htmlhttp://searchcio.tec
htarget.com/opinion/Big-data-bad-analytics
http://www.slate.com/articles/technology/fut
ure_tense/2016/02/how_to_bring_better_ethi
cs_to_data_science.html
https://open.sap.com/courses/ml1
https://motherboard.vice.com/en_us/article/
big-data-cambridge-analytica-brexit-trump
Thank you
Contact information:
Cathy Cooper
Sr. Director, Big Data Analytics
cathy.cooper@sap.com
212 653 9555
@catcooper1

More Related Content

What's hot

Move It Don't Lose It: Is Your Big Data Collecting Dust?
Move It Don't Lose It: Is Your Big Data Collecting Dust?Move It Don't Lose It: Is Your Big Data Collecting Dust?
Move It Don't Lose It: Is Your Big Data Collecting Dust?Jennifer Walker
 
Big Data Management: Work Smarter Not Harder
Big Data Management: Work Smarter Not HarderBig Data Management: Work Smarter Not Harder
Big Data Management: Work Smarter Not HarderJennifer Walker
 
Predictive Analytics: Business Perspective & Use Cases
Predictive Analytics: Business Perspective & Use CasesPredictive Analytics: Business Perspective & Use Cases
Predictive Analytics: Business Perspective & Use CasesCagri Sarigoz
 
Hadoop: Data Storage Locker or Agile Analytics Platform? It’s Up to You.
Hadoop: Data Storage Locker or Agile Analytics Platform? It’s Up to You.Hadoop: Data Storage Locker or Agile Analytics Platform? It’s Up to You.
Hadoop: Data Storage Locker or Agile Analytics Platform? It’s Up to You.Jennifer Walker
 
What is Data Science? Daniel D Gutierrez
What is Data Science? Daniel D GutierrezWhat is Data Science? Daniel D Gutierrez
What is Data Science? Daniel D Gutierrezamuletc
 
Naghi Prasad at AI Frontiers: Building AI systems to automate enterprise proc...
Naghi Prasad at AI Frontiers: Building AI systems to automate enterprise proc...Naghi Prasad at AI Frontiers: Building AI systems to automate enterprise proc...
Naghi Prasad at AI Frontiers: Building AI systems to automate enterprise proc...AI Frontiers
 
Algorithmic Bias: Challenges and Opportunities for AI in Healthcare
Algorithmic Bias:  Challenges and Opportunities for AI in HealthcareAlgorithmic Bias:  Challenges and Opportunities for AI in Healthcare
Algorithmic Bias: Challenges and Opportunities for AI in HealthcareGregory Nelson
 
The role of Organisational Network Analysis in People Analytics
The role of Organisational Network Analysis in People AnalyticsThe role of Organisational Network Analysis in People Analytics
The role of Organisational Network Analysis in People AnalyticsDavid Green
 
Onboarding AI by Jana Eggers
Onboarding AI by Jana EggersOnboarding AI by Jana Eggers
Onboarding AI by Jana EggersGlobant
 
How semantic search changes recruitment - Glen Cathey
How semantic search changes recruitment - Glen CatheyHow semantic search changes recruitment - Glen Cathey
How semantic search changes recruitment - Glen CatheyTextkernel
 
Intro to Data Science for Non-Data Scientists
Intro to Data Science for Non-Data ScientistsIntro to Data Science for Non-Data Scientists
Intro to Data Science for Non-Data ScientistsSri Ambati
 
Introduction to Data Science (Data Summit, 2017)
Introduction to Data Science (Data Summit, 2017)Introduction to Data Science (Data Summit, 2017)
Introduction to Data Science (Data Summit, 2017)Caserta
 
Rady School Master of Science Business Analytics (MSBA) Program Overview
Rady School Master of Science Business Analytics (MSBA) Program OverviewRady School Master of Science Business Analytics (MSBA) Program Overview
Rady School Master of Science Business Analytics (MSBA) Program OverviewUC San Diego Rady School of Management
 
Build Intelligent Fraud Prevention with Machine Learning and Graphs
Build Intelligent Fraud Prevention with Machine Learning and GraphsBuild Intelligent Fraud Prevention with Machine Learning and Graphs
Build Intelligent Fraud Prevention with Machine Learning and GraphsNeo4j
 
Is Data Scientist the Sexiest Job of the 21st century?
Is Data Scientist the Sexiest Job of the 21st century?Is Data Scientist the Sexiest Job of the 21st century?
Is Data Scientist the Sexiest Job of the 21st century?Edureka!
 
Data Science Applications | Data Science For Beginners | Data Science Trainin...
Data Science Applications | Data Science For Beginners | Data Science Trainin...Data Science Applications | Data Science For Beginners | Data Science Trainin...
Data Science Applications | Data Science For Beginners | Data Science Trainin...Edureka!
 

What's hot (20)

Move It Don't Lose It: Is Your Big Data Collecting Dust?
Move It Don't Lose It: Is Your Big Data Collecting Dust?Move It Don't Lose It: Is Your Big Data Collecting Dust?
Move It Don't Lose It: Is Your Big Data Collecting Dust?
 
Big Data Management: Work Smarter Not Harder
Big Data Management: Work Smarter Not HarderBig Data Management: Work Smarter Not Harder
Big Data Management: Work Smarter Not Harder
 
Predictive Analytics: Business Perspective & Use Cases
Predictive Analytics: Business Perspective & Use CasesPredictive Analytics: Business Perspective & Use Cases
Predictive Analytics: Business Perspective & Use Cases
 
Hadoop: Data Storage Locker or Agile Analytics Platform? It’s Up to You.
Hadoop: Data Storage Locker or Agile Analytics Platform? It’s Up to You.Hadoop: Data Storage Locker or Agile Analytics Platform? It’s Up to You.
Hadoop: Data Storage Locker or Agile Analytics Platform? It’s Up to You.
 
The REAL face of Big Data
The REAL face of Big DataThe REAL face of Big Data
The REAL face of Big Data
 
Data Scientist Enablement roadmap 1.0
Data Scientist Enablement roadmap 1.0Data Scientist Enablement roadmap 1.0
Data Scientist Enablement roadmap 1.0
 
What is Data Science? Daniel D Gutierrez
What is Data Science? Daniel D GutierrezWhat is Data Science? Daniel D Gutierrez
What is Data Science? Daniel D Gutierrez
 
Naghi Prasad at AI Frontiers: Building AI systems to automate enterprise proc...
Naghi Prasad at AI Frontiers: Building AI systems to automate enterprise proc...Naghi Prasad at AI Frontiers: Building AI systems to automate enterprise proc...
Naghi Prasad at AI Frontiers: Building AI systems to automate enterprise proc...
 
Algorithmic Bias: Challenges and Opportunities for AI in Healthcare
Algorithmic Bias:  Challenges and Opportunities for AI in HealthcareAlgorithmic Bias:  Challenges and Opportunities for AI in Healthcare
Algorithmic Bias: Challenges and Opportunities for AI in Healthcare
 
The role of Organisational Network Analysis in People Analytics
The role of Organisational Network Analysis in People AnalyticsThe role of Organisational Network Analysis in People Analytics
The role of Organisational Network Analysis in People Analytics
 
Onboarding AI by Jana Eggers
Onboarding AI by Jana EggersOnboarding AI by Jana Eggers
Onboarding AI by Jana Eggers
 
How semantic search changes recruitment - Glen Cathey
How semantic search changes recruitment - Glen CatheyHow semantic search changes recruitment - Glen Cathey
How semantic search changes recruitment - Glen Cathey
 
Intro to Data Science for Non-Data Scientists
Intro to Data Science for Non-Data ScientistsIntro to Data Science for Non-Data Scientists
Intro to Data Science for Non-Data Scientists
 
The 25 Predictions About The Future Of Big Data
The 25 Predictions About The Future Of Big DataThe 25 Predictions About The Future Of Big Data
The 25 Predictions About The Future Of Big Data
 
Introduction to Data Science (Data Summit, 2017)
Introduction to Data Science (Data Summit, 2017)Introduction to Data Science (Data Summit, 2017)
Introduction to Data Science (Data Summit, 2017)
 
Big Data for HR
Big Data for HRBig Data for HR
Big Data for HR
 
Rady School Master of Science Business Analytics (MSBA) Program Overview
Rady School Master of Science Business Analytics (MSBA) Program OverviewRady School Master of Science Business Analytics (MSBA) Program Overview
Rady School Master of Science Business Analytics (MSBA) Program Overview
 
Build Intelligent Fraud Prevention with Machine Learning and Graphs
Build Intelligent Fraud Prevention with Machine Learning and GraphsBuild Intelligent Fraud Prevention with Machine Learning and Graphs
Build Intelligent Fraud Prevention with Machine Learning and Graphs
 
Is Data Scientist the Sexiest Job of the 21st century?
Is Data Scientist the Sexiest Job of the 21st century?Is Data Scientist the Sexiest Job of the 21st century?
Is Data Scientist the Sexiest Job of the 21st century?
 
Data Science Applications | Data Science For Beginners | Data Science Trainin...
Data Science Applications | Data Science For Beginners | Data Science Trainin...Data Science Applications | Data Science For Beginners | Data Science Trainin...
Data Science Applications | Data Science For Beginners | Data Science Trainin...
 

Similar to Cool vs Creepy - Ethics and Data Science - Cooper 2Feb

7 Things You Should Know About People Analytics
7 Things You Should Know About People Analytics7 Things You Should Know About People Analytics
7 Things You Should Know About People AnalyticsPixentia
 
Analytics- Dawn of the Cognitive Era.PDF
Analytics- Dawn of the Cognitive Era.PDFAnalytics- Dawn of the Cognitive Era.PDF
Analytics- Dawn of the Cognitive Era.PDFMacGregor Olson
 
Machine Learning for Business - Eight Best Practices for Getting Started
Machine Learning for Business - Eight Best Practices for Getting StartedMachine Learning for Business - Eight Best Practices for Getting Started
Machine Learning for Business - Eight Best Practices for Getting StartedBhupesh Chaurasia
 
Building and deploying AI/ML models on AWS for Biosciences professionals
Building and deploying AI/ML models on AWS for Biosciences professionalsBuilding and deploying AI/ML models on AWS for Biosciences professionals
Building and deploying AI/ML models on AWS for Biosciences professionalsjavier ramirez
 
Introduction To Predictive Modelling
Introduction To Predictive ModellingIntroduction To Predictive Modelling
Introduction To Predictive ModellingSpotle.ai
 
Fact vs. Fiction: How Innovations in AI Will Intersect with Recruitment in th...
Fact vs. Fiction: How Innovations in AI Will Intersect with Recruitment in th...Fact vs. Fiction: How Innovations in AI Will Intersect with Recruitment in th...
Fact vs. Fiction: How Innovations in AI Will Intersect with Recruitment in th...CareerBuilder
 
SAS an open ecosystem for Artifical Intelligence - Dean Zouari
SAS an open ecosystem for Artifical Intelligence - Dean ZouariSAS an open ecosystem for Artifical Intelligence - Dean Zouari
SAS an open ecosystem for Artifical Intelligence - Dean ZouariInstitute of Contemporary Sciences
 
MPCA-SAS-innovators-flight-plan-ai.pdf
MPCA-SAS-innovators-flight-plan-ai.pdfMPCA-SAS-innovators-flight-plan-ai.pdf
MPCA-SAS-innovators-flight-plan-ai.pdfProsper85
 
P 01 ins_analytics_ai_in_life_case_studies_2017_10_16_v12
P 01 ins_analytics_ai_in_life_case_studies_2017_10_16_v12P 01 ins_analytics_ai_in_life_case_studies_2017_10_16_v12
P 01 ins_analytics_ai_in_life_case_studies_2017_10_16_v12Vishwa Kolla
 
Big Data = Big Headache? Using People Analytics to Fuel ROI
Big Data = Big Headache? Using People Analytics to Fuel ROIBig Data = Big Headache? Using People Analytics to Fuel ROI
Big Data = Big Headache? Using People Analytics to Fuel ROItalent.imperative
 
Regression and correlation
Regression and correlationRegression and correlation
Regression and correlationVrushaliSolanke
 
Paradigm4 Research Report: Leaving Data on the table
Paradigm4 Research Report: Leaving Data on the tableParadigm4 Research Report: Leaving Data on the table
Paradigm4 Research Report: Leaving Data on the tableParadigm4
 
Assignment of data science ds4 mubeen
Assignment of data science ds4 mubeenAssignment of data science ds4 mubeen
Assignment of data science ds4 mubeenmubeenasghar3
 
In-Depth Data Analytics
In-Depth Data AnalyticsIn-Depth Data Analytics
In-Depth Data AnalyticsYASH GAIKWAD
 
FAQ for the Predictive Testing of Opportunities
FAQ for the Predictive Testing of OpportunitiesFAQ for the Predictive Testing of Opportunities
FAQ for the Predictive Testing of OpportunitiesThe Inovo Group
 
The value of big data
The value of big dataThe value of big data
The value of big dataSeymourSloan
 
Artificial Intelligence
Artificial IntelligenceArtificial Intelligence
Artificial IntelligenceEnes Bolfidan
 
Embracing data science
Embracing data scienceEmbracing data science
Embracing data scienceVipul Kalamkar
 

Similar to Cool vs Creepy - Ethics and Data Science - Cooper 2Feb (20)

7 Things You Should Know About People Analytics
7 Things You Should Know About People Analytics7 Things You Should Know About People Analytics
7 Things You Should Know About People Analytics
 
Analytics- Dawn of the Cognitive Era.PDF
Analytics- Dawn of the Cognitive Era.PDFAnalytics- Dawn of the Cognitive Era.PDF
Analytics- Dawn of the Cognitive Era.PDF
 
Machine Learning for Business - Eight Best Practices for Getting Started
Machine Learning for Business - Eight Best Practices for Getting StartedMachine Learning for Business - Eight Best Practices for Getting Started
Machine Learning for Business - Eight Best Practices for Getting Started
 
Building and deploying AI/ML models on AWS for Biosciences professionals
Building and deploying AI/ML models on AWS for Biosciences professionalsBuilding and deploying AI/ML models on AWS for Biosciences professionals
Building and deploying AI/ML models on AWS for Biosciences professionals
 
Introduction To Predictive Modelling
Introduction To Predictive ModellingIntroduction To Predictive Modelling
Introduction To Predictive Modelling
 
What is data science ?
What is data science ?What is data science ?
What is data science ?
 
Building data science teams
Building data science teamsBuilding data science teams
Building data science teams
 
Fact vs. Fiction: How Innovations in AI Will Intersect with Recruitment in th...
Fact vs. Fiction: How Innovations in AI Will Intersect with Recruitment in th...Fact vs. Fiction: How Innovations in AI Will Intersect with Recruitment in th...
Fact vs. Fiction: How Innovations in AI Will Intersect with Recruitment in th...
 
SAS an open ecosystem for Artifical Intelligence - Dean Zouari
SAS an open ecosystem for Artifical Intelligence - Dean ZouariSAS an open ecosystem for Artifical Intelligence - Dean Zouari
SAS an open ecosystem for Artifical Intelligence - Dean Zouari
 
MPCA-SAS-innovators-flight-plan-ai.pdf
MPCA-SAS-innovators-flight-plan-ai.pdfMPCA-SAS-innovators-flight-plan-ai.pdf
MPCA-SAS-innovators-flight-plan-ai.pdf
 
P 01 ins_analytics_ai_in_life_case_studies_2017_10_16_v12
P 01 ins_analytics_ai_in_life_case_studies_2017_10_16_v12P 01 ins_analytics_ai_in_life_case_studies_2017_10_16_v12
P 01 ins_analytics_ai_in_life_case_studies_2017_10_16_v12
 
Big Data = Big Headache? Using People Analytics to Fuel ROI
Big Data = Big Headache? Using People Analytics to Fuel ROIBig Data = Big Headache? Using People Analytics to Fuel ROI
Big Data = Big Headache? Using People Analytics to Fuel ROI
 
Regression and correlation
Regression and correlationRegression and correlation
Regression and correlation
 
Paradigm4 Research Report: Leaving Data on the table
Paradigm4 Research Report: Leaving Data on the tableParadigm4 Research Report: Leaving Data on the table
Paradigm4 Research Report: Leaving Data on the table
 
Assignment of data science ds4 mubeen
Assignment of data science ds4 mubeenAssignment of data science ds4 mubeen
Assignment of data science ds4 mubeen
 
In-Depth Data Analytics
In-Depth Data AnalyticsIn-Depth Data Analytics
In-Depth Data Analytics
 
FAQ for the Predictive Testing of Opportunities
FAQ for the Predictive Testing of OpportunitiesFAQ for the Predictive Testing of Opportunities
FAQ for the Predictive Testing of Opportunities
 
The value of big data
The value of big dataThe value of big data
The value of big data
 
Artificial Intelligence
Artificial IntelligenceArtificial Intelligence
Artificial Intelligence
 
Embracing data science
Embracing data scienceEmbracing data science
Embracing data science
 

Cool vs Creepy - Ethics and Data Science - Cooper 2Feb

  • 1. Cool vs. Creepy: Ethics in Data Science Cathy Cooper 3 February 2017
  • 2. © 2016 SAP SE or an SAP affiliate company. All rights reserved. 2Public (Image of a hand with a credit card + JetBlue plane to Orlando?) xxx Planes, meals, theme parks and … credit cards?
  • 3. © 2016 SAP SE or an SAP affiliate company. All rights reserved. 3Public Connection to ethics & data science DATA SCIENCE Data Analysis Applications EXPERIENCE Transparency Permission
  • 4. © 2016 SAP SE or an SAP affiliate company. All rights reserved. 4Public Data, Analysis and Applications: Biased Data Google tested machine learning analysis on their News data to try to find relationships between concepts. One of the examples was the following “gender + profession” comparison. man + computer programmer vs. woman + _________ __________________ = Homemaker Why? Because the system was trained with an inherently biased source.
  • 5. © 2016 SAP SE or an SAP affiliate company. All rights reserved. 5Public Data, Analysis and Applications: Anonymized Data How many time & location records does it take to link cell phone usage data to an individual? Only four
  • 6. © 2016 SAP SE or an SAP affiliate company. All rights reserved. 6Public Data, Analysis and Applications: Context A financial institution was looking to predict customer churn. Who did they identify as the prime target for their retention offers? Spouses getting their finances in order before filing for divorce. A test was conducted using just social media sentiment analysis to predict changes in US unemployment rate. It did predict an unemployment drop – but it was wrong. Why? Relying only on the tool – and poor timing: Steve Jobs had just died.
  • 7. © 2016 SAP SE or an SAP affiliate company. All rights reserved. 7Public Data, Analysis and Applications: Data/Predictions Change Behavior and Vice Versa GPS Flu projection
  • 8. © 2016 SAP SE or an SAP affiliate company. All rights reserved. 8Public Data, Analysis and Applications: Human Intervention Required “… people have too much trust in data to be intrinsically objective, even though it is in fact only as good as the human processes that collected it.” -- Cathy O’Neill “Humans will need to check on the outcomes and see if the models and the algorithms and the rules are performing as intended, and then intervene if they don’t.” – Tom Davenport
  • 9. © 2016 SAP SE or an SAP affiliate company. All rights reserved. 9Public Transparency: Try it for yourself University of Cambridge The Psychometrics Centre uses your Facebook profile and activity for: ► YouAreWhatYouLike describe openness to new ideas, extraversion and introversion, your warmth or competitiveness, and other personality traits. ► Apply Magic Sauce, predicts your politics, relationship status, sexual orientation, gender, and more. Pennebaker Conglomerate’s ► AnalyzeWords leverages linguistics to discover the personality you (other others) portray on Twitter. You can analyze anyone, not just yourself.
  • 10. © 2016 SAP SE or an SAP affiliate company. All rights reserved. 10Public Transparency: How It Can Work Banks have to prove: ►their algorithms are not based on variables like race & gender ►their models don’t focus on patterns that disfavor specific demographic groups and they have to allow outside data scientists to assess their models for code or data that might have a discriminatory effect. Consider what it will takes to build transparency into AI system design: human access and action plan
  • 11. © 2016 SAP SE or an SAP affiliate company. All rights reserved. 11Public Permission/Fairness Challenge and opportunity: ► Additional uses for previously collected/aggregated data in new types of analysis, applications and businesses ► Coming legal and regulatory requirements to audit and correct algorithm-based decision making
  • 12. © 2016 SAP SE or an SAP affiliate company. All rights reserved. 12Public In closing… “… algorithms have to be designed with fairness and legality in mind, with standards that are understandable to everyone, from the business leader to the people being scored.” – Cathy O’Neill, “Unmasking Unconscious Bias in Algorithms”, The Digitalist, data scientist and author Weapons of Math Destruction “…a fundamental rethinking and a careful approach to software creation itself is needed. Extra care has to be put into training the people who create the systems, and into incorporating research into the machine learning algorithms so you don’t accidentally create more bias. The system has to be re-trained to think differently.” – Yvonne Baur, “Is Machine Learning Sexist?”, Techcrunch, and Head of Predictive Analytics/Machine Learning, SAP SuccessFactors,
  • 13. © 2016 SAP SE or an SAP affiliate company. All rights reserved. 13Public Additional Readings and Resources on Ethics & Data Science http://www.digitalistmag.com/digital-supply- networks/2017/01/16/cathy-oneil- unmasking-unconscious-bias-in-algorithms- 04839140 http://www.digitalistmag.com/executive- research/how-ai-can-end-bias http://www.digitalistmag.com/executive- research/an-ai-shares-my-office http://www.digitalistmag.com/digital- economy/digital- futures/2017/01/23/machine-learning-real- business-intelligence-04830225 http://www.digitalistmag.com/executive- research/empathy-the-killer-app-for- artificial-intelligence http://www.digitaltrends.com/cool- tech/women-in-artificial-intelligence/ https://techcrunch.com/2016/10/11/is- machine-learning-sexist/ https://open.sap.com/courses/ds1 http://www.slate.com/articles/technology/fut ure_tense/2014/10/youarewhatyoulike_find_ out_what_algorithms_can_tell_about_you_b ased_on_your.html http://searchcio.techtarget.com/opinion/Dat a-products-introduce-ethical-dilemmas-for- data-scientists http://searchcio.techtarget.com/opinion/Big- data-ethics-Why-the-CIO-needs-to-get- involved http://searchcloudapplications.techtarget.co m/feature/Big-data-collection-efforts-spark- an-information-ethics-debate https://arstechnica.com/science/2015/11/ne w-flu-tracker-uses-google-search-data- better-than-google/ https://gigaom.com/2013/10/09/the-upside- of-prism-at-least-were-talking-about-data- privacy-or-lack-the-thereof/ http://nymag.com/thecut/2016/09/cathy- oneils-weapons-of-math-destruction-math- is-biased.html?mid=twitter-share-thecut https://www.bloomberg.com/news/articles/2 016-06-23/artificial-intelligence-has-a-sea-of- dudes-problem http://boingboing.net/2016/01/06/weapons- of-math-destruction-h.html http://fortune.com/2015/12/04/ethics- training-for-data-scientists/ https://gigaom.com/2013/03/25/why-the- collision-of-big-data-and-privacy-will- require-a-new-realpolitik/ http://www.nature.com/articles/srep01376 http://www.slate.com/articles/technology/fut ure_tense/2016/02/how_to_bring_better_ethi cs_to_data_science.htmlhttp://searchcio.tec htarget.com/opinion/Big-data-bad-analytics http://www.slate.com/articles/technology/fut ure_tense/2016/02/how_to_bring_better_ethi cs_to_data_science.html https://open.sap.com/courses/ml1 https://motherboard.vice.com/en_us/article/ big-data-cambridge-analytica-brexit-trump
  • 14. Thank you Contact information: Cathy Cooper Sr. Director, Big Data Analytics cathy.cooper@sap.com 212 653 9555 @catcooper1