SlideShare a Scribd company logo
1 of 42
DATA SCIENCE
IIT AGNE MARCH 2019
ANURAG WAKHLU, CFA, MBA
IIT BOMBAY ’93
@ANURAGWAKHLU @ANURAG3DS
WHAT
IS
THIS?
Bob: i can i i everything else . . . . . . . . . . . . . .
Alice: balls have zero to me to me to me to me to me to me to me
to me to
Bob: you i everything else . . . . . . . . . . . . . .
Alice: balls have a ball to me to me to me to me to me to me to
me
Bob: i i can i i i everything else . . . . . . . . . . . . . .
Alice: balls have a ball to me to me to me to me to me to me to
me
Bob: i . . . . . . . . . . . . . . . . . . .
Alice: balls have zero to me to me to me to me to me to me to me
to me to
Bob: you i i i i i everything else . . . . . . . . . . . . . .
Alice: balls have 0 to me to me to me to me to me to me to me to
me to
WHAT
IS
DATA
SCIENC
E
Data science is the extraction of relevant insights from data
WHAT DO YOU DO IN DATA SCIENCE?
• Classification (e.g., spam or not spam)
• Pattern detection and grouping (classification without known classes)
• Anomaly detection (e.g., fraud detection)
• Recognition (image, text, audio, video, facial, …)
• Actionable insights (via dashboards, reports, visualizations, …)
• Automated processes and decision-making (e.g., credit card approval)
• Scoring and ranking (e.g., FICO score)
• Segmentation (e.g., demographic-based marketing)
• Optimization (e.g., risk management)
WHAT CAN YOU DO WITH DATA SCIENCE
Recommendation
s
Fraud detection
Customer sentiment
analysis
Churn, Next Best Action, Propensity
Predictive Shipping
Supply Chain
Optimization
Price
Optimization
Clickstream
analytics
VISUALIZATION
VISUALIZATION
GROWTH OF CONNECTED THINGS
127 new devices connecting to the Internet every
second.
99% OF THINGS ARE STILL NOT
CONNECTED
A BRIEF HISTORY OF … DATA
2.5 Quintillion (Exabytes) data / day
(2018)
10 TB = printed Library of
Congress
90% of the world’s data
was created in last 2
years.
1.7 megabytes of new
information will be
created / second / person,
by 2020
A Boeing 787 aircraft could generate 40 TBs per hour of flight.
An ADV car will churn out 4,000 GB of data per hour of driving
In just 10 minutes, 16 players with
6 balls can produce almost 13
million data points! (soccer)
“You’re capturing real-time
data at every point, on every
single food product.”
- Walmart, Food Trust
blockchain
CERN LHC ~ 40TB/S DURING PARTICLE
SMASHING
NASA ~ CREATES 15 TB/DAY
NASDAQ, NYSE, CBOE ~ 1 TB / DAY EACH
IMAGE
PROCESSIN
G
Self Driving Car
GOOGLE ASSISTANT MAKES A PHONE
CALL...
AI FOR BETTER DETECTION OF CANCER
DEEP FAKES
• Deep dreaming
•
AlphaGo first learned from studying 30 million
moves of expert human play. AlphaGo Zero just
learned the rules and played.
DATA SCIENCE IN FINANCIAL SERVICES
• Dataminr analyzes billions of tweets to monitor the entire
world – predicting stock movements
• 56% of hedge funds said they used AI/ML for investing
• Blackrock (largest Asset Manager, 6.5 TN AUM) using AI for
investing.
• JPM Chase (largest bank, 2.6 TN assets) using AI to
“deepen customer engagements.”
• Risk management
• Algorithmic trading
“Machine managed portfolio will out
perform a human managed one, in 7
years” – AW 
DATA SCIENCE IN BANKING
Financial Conditions
Policy Liquidity
Quantity Liquidity
Domestic Liquidity
Equity Exposures
Bond Exposures
Money Flows
Monetized Savings
Momentum TedSpread
OIS spread
10yr, 2yr CMT
Convexity at 5yr
5 yr inflation + 5
years
Banks' swap
spreads
CB Credit Risk
Index
Sentiment Index
Dollar Sentiment
Trade weighted $
2-10yr Yield Curve
BAA-AAA credit
spread
Mkt PE & EPS
VIX, S&P 500, FTSE
EuroStoxx, MSCI EM
USD/ GBP, EUR
MARKET VOLATILITY PREDICTION - DATA
MODEL BUILDING PROCESS
Rule modelingInput enhancements
Classify output
Lag/lead the factors
Remove correlations
Induction for
optimization
Transform inputs,
outputs
Analyze explanatory
power
Operational
Monitoring
Compute risk
&probability
What if scenario
modeling
Monitor incoming
data
Expert Analysis
Analyze rules for
purity
Analyze rules for
causality
Analyze new
requirements
Feedback
PREDICTIVE RISK MAP VS. MARKETS…
What is
happenin
g in
global
markets?The RBA
surprised …
cutting its …
rate by 0.25 …a
level last seen
in late 2009
Heightened
global risk in
May – Jul
2013
Japan starts
QE
The US Fed
“Taper” talk
starts
3D RISK MAP – ADVANCED VISUALISATION
Factors for a high out performance… Insight: Brazilian Real…for
real?
FINANCIAL PORTFOLIO CONSTRUCTION
PERSON
A OF A
DATA
SCIENTIS
T
Credit- Stephan
Kolassa – Data Science
Expert – SAP
Switzerland AG
Business
Domain
Knowledge
& soft skills
Math, Stats,
Data
Engineering,
Programming
WHY BE A DATA SCIENTIST
• If you like data 
• Data scientists today are akin to the Wall Street “quants”
of the 1980s 1990s.. And 2000s
• Salary $120-160K +
Sexiest Job of the 21st Century – Harvard Business Rev

More Related Content

Similar to Data Science for Business

Data mining final year project in ludhiana
Data mining final year project in ludhianaData mining final year project in ludhiana
Data mining final year project in ludhianadeepikakaler1
 
Data Science - An emerging Stream of Science with its Spreading Reach & Impact
Data Science - An emerging Stream of Science with its Spreading Reach & ImpactData Science - An emerging Stream of Science with its Spreading Reach & Impact
Data Science - An emerging Stream of Science with its Spreading Reach & ImpactDr. Sunil Kr. Pandey
 
What is Big Data
What is Big Data What is Big Data
What is Big Data Hani Saif
 
Data Driven Disruption - Why Marketing and Advertising in WA lags - ADMA WA 2...
Data Driven Disruption - Why Marketing and Advertising in WA lags - ADMA WA 2...Data Driven Disruption - Why Marketing and Advertising in WA lags - ADMA WA 2...
Data Driven Disruption - Why Marketing and Advertising in WA lags - ADMA WA 2...Coert Du Plessis (杜康)
 
Bigger and Better: Employing a Holistic Strategy for Big Data toward a Strong...
Bigger and Better: Employing a Holistic Strategy for Big Data toward a Strong...Bigger and Better: Employing a Holistic Strategy for Big Data toward a Strong...
Bigger and Better: Employing a Holistic Strategy for Big Data toward a Strong...IT Network marcus evans
 
Qu'est ce que le Big Data ? Avec Victoria Galano Data Scientist chez Air France
Qu'est ce que le Big Data ? Avec Victoria Galano Data Scientist chez Air FranceQu'est ce que le Big Data ? Avec Victoria Galano Data Scientist chez Air France
Qu'est ce que le Big Data ? Avec Victoria Galano Data Scientist chez Air FranceJedha Bootcamp
 
introduction to data science
introduction to data scienceintroduction to data science
introduction to data sciencebhavesh lande
 
Rise of Prediction: Managing AI, Risk & Ethics (June 2019)
Rise of Prediction: Managing AI, Risk & Ethics (June 2019)Rise of Prediction: Managing AI, Risk & Ethics (June 2019)
Rise of Prediction: Managing AI, Risk & Ethics (June 2019)PickAxes & Shovels
 
BI, AI/ML, Use Cases, Business Impact and how to get started
BI, AI/ML, Use Cases, Business Impact and how to get startedBI, AI/ML, Use Cases, Business Impact and how to get started
BI, AI/ML, Use Cases, Business Impact and how to get startedKarthick S
 
Demystifying Data Science
Demystifying Data ScienceDemystifying Data Science
Demystifying Data ScienceJonathan Sedar
 
AI Governance – The Responsible Use of AI
AI Governance – The Responsible Use of AIAI Governance – The Responsible Use of AI
AI Governance – The Responsible Use of AINUS-ISS
 
Entering the Data Analytics industry
Entering the Data Analytics industryEntering the Data Analytics industry
Entering the Data Analytics industryGramener
 
Data science and business analytics
Data  science and business analyticsData  science and business analytics
Data science and business analyticsInbavalli Valli
 
Sl12 opportunities in challenging markets
Sl12   opportunities in challenging marketsSl12   opportunities in challenging markets
Sl12 opportunities in challenging marketscabotmoney
 
The value of storytelling through data
The value of storytelling through dataThe value of storytelling through data
The value of storytelling through dataGramener
 
AI For Good Bad guys, messy data, & NLP
AI For Good  Bad guys, messy data, & NLPAI For Good  Bad guys, messy data, & NLP
AI For Good Bad guys, messy data, & NLPChristopher Mack
 

Similar to Data Science for Business (20)

Data mining final year project in ludhiana
Data mining final year project in ludhianaData mining final year project in ludhiana
Data mining final year project in ludhiana
 
Rulex big data and analytics
Rulex big data and analyticsRulex big data and analytics
Rulex big data and analytics
 
Data mining
Data miningData mining
Data mining
 
Data Science - An emerging Stream of Science with its Spreading Reach & Impact
Data Science - An emerging Stream of Science with its Spreading Reach & ImpactData Science - An emerging Stream of Science with its Spreading Reach & Impact
Data Science - An emerging Stream of Science with its Spreading Reach & Impact
 
What is Big Data
What is Big Data What is Big Data
What is Big Data
 
Data Driven Disruption - Why Marketing and Advertising in WA lags - ADMA WA 2...
Data Driven Disruption - Why Marketing and Advertising in WA lags - ADMA WA 2...Data Driven Disruption - Why Marketing and Advertising in WA lags - ADMA WA 2...
Data Driven Disruption - Why Marketing and Advertising in WA lags - ADMA WA 2...
 
Bigger and Better: Employing a Holistic Strategy for Big Data toward a Strong...
Bigger and Better: Employing a Holistic Strategy for Big Data toward a Strong...Bigger and Better: Employing a Holistic Strategy for Big Data toward a Strong...
Bigger and Better: Employing a Holistic Strategy for Big Data toward a Strong...
 
Big data
Big dataBig data
Big data
 
Qu'est ce que le Big Data ? Avec Victoria Galano Data Scientist chez Air France
Qu'est ce que le Big Data ? Avec Victoria Galano Data Scientist chez Air FranceQu'est ce que le Big Data ? Avec Victoria Galano Data Scientist chez Air France
Qu'est ce que le Big Data ? Avec Victoria Galano Data Scientist chez Air France
 
introduction to data science
introduction to data scienceintroduction to data science
introduction to data science
 
Rise of Prediction: Managing AI, Risk & Ethics (June 2019)
Rise of Prediction: Managing AI, Risk & Ethics (June 2019)Rise of Prediction: Managing AI, Risk & Ethics (June 2019)
Rise of Prediction: Managing AI, Risk & Ethics (June 2019)
 
BI, AI/ML, Use Cases, Business Impact and how to get started
BI, AI/ML, Use Cases, Business Impact and how to get startedBI, AI/ML, Use Cases, Business Impact and how to get started
BI, AI/ML, Use Cases, Business Impact and how to get started
 
Demystifying Data Science
Demystifying Data ScienceDemystifying Data Science
Demystifying Data Science
 
AI Governance – The Responsible Use of AI
AI Governance – The Responsible Use of AIAI Governance – The Responsible Use of AI
AI Governance – The Responsible Use of AI
 
Data Science Webinar
Data Science WebinarData Science Webinar
Data Science Webinar
 
Entering the Data Analytics industry
Entering the Data Analytics industryEntering the Data Analytics industry
Entering the Data Analytics industry
 
Data science and business analytics
Data  science and business analyticsData  science and business analytics
Data science and business analytics
 
Sl12 opportunities in challenging markets
Sl12   opportunities in challenging marketsSl12   opportunities in challenging markets
Sl12 opportunities in challenging markets
 
The value of storytelling through data
The value of storytelling through dataThe value of storytelling through data
The value of storytelling through data
 
AI For Good Bad guys, messy data, & NLP
AI For Good  Bad guys, messy data, & NLPAI For Good  Bad guys, messy data, & NLP
AI For Good Bad guys, messy data, & NLP
 

Recently uploaded

Russian Call girl in Ajman +971563133746 Ajman Call girl Service
Russian Call girl in Ajman +971563133746 Ajman Call girl ServiceRussian Call girl in Ajman +971563133746 Ajman Call girl Service
Russian Call girl in Ajman +971563133746 Ajman Call girl Servicegwenoracqe6
 
SEO Growth Program-Digital optimization Specialist
SEO Growth Program-Digital optimization SpecialistSEO Growth Program-Digital optimization Specialist
SEO Growth Program-Digital optimization SpecialistKHM Anwar
 
Lucknow ❤CALL GIRL 88759*99948 ❤CALL GIRLS IN Lucknow ESCORT SERVICE❤CALL GIRL
Lucknow ❤CALL GIRL 88759*99948 ❤CALL GIRLS IN Lucknow ESCORT SERVICE❤CALL GIRLLucknow ❤CALL GIRL 88759*99948 ❤CALL GIRLS IN Lucknow ESCORT SERVICE❤CALL GIRL
Lucknow ❤CALL GIRL 88759*99948 ❤CALL GIRLS IN Lucknow ESCORT SERVICE❤CALL GIRLimonikaupta
 
Networking in the Penumbra presented by Geoff Huston at NZNOG
Networking in the Penumbra presented by Geoff Huston at NZNOGNetworking in the Penumbra presented by Geoff Huston at NZNOG
Networking in the Penumbra presented by Geoff Huston at NZNOGAPNIC
 
Call Girls In Saket Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Saket Delhi 💯Call Us 🔝8264348440🔝Call Girls In Saket Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Saket Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
Top Rated Pune Call Girls Daund ⟟ 6297143586 ⟟ Call Me For Genuine Sex Servi...
Top Rated  Pune Call Girls Daund ⟟ 6297143586 ⟟ Call Me For Genuine Sex Servi...Top Rated  Pune Call Girls Daund ⟟ 6297143586 ⟟ Call Me For Genuine Sex Servi...
Top Rated Pune Call Girls Daund ⟟ 6297143586 ⟟ Call Me For Genuine Sex Servi...Call Girls in Nagpur High Profile
 
Low Rate Young Call Girls in Sector 63 Mamura Noida ✔️☆9289244007✔️☆ Female E...
Low Rate Young Call Girls in Sector 63 Mamura Noida ✔️☆9289244007✔️☆ Female E...Low Rate Young Call Girls in Sector 63 Mamura Noida ✔️☆9289244007✔️☆ Female E...
Low Rate Young Call Girls in Sector 63 Mamura Noida ✔️☆9289244007✔️☆ Female E...SofiyaSharma5
 
₹5.5k {Cash Payment}New Friends Colony Call Girls In [Delhi NIHARIKA] 🔝|97111...
₹5.5k {Cash Payment}New Friends Colony Call Girls In [Delhi NIHARIKA] 🔝|97111...₹5.5k {Cash Payment}New Friends Colony Call Girls In [Delhi NIHARIKA] 🔝|97111...
₹5.5k {Cash Payment}New Friends Colony Call Girls In [Delhi NIHARIKA] 🔝|97111...Diya Sharma
 
Call Girls in Mayur Vihar ✔️ 9711199171 ✔️ Delhi ✔️ Enjoy Call Girls With Our...
Call Girls in Mayur Vihar ✔️ 9711199171 ✔️ Delhi ✔️ Enjoy Call Girls With Our...Call Girls in Mayur Vihar ✔️ 9711199171 ✔️ Delhi ✔️ Enjoy Call Girls With Our...
Call Girls in Mayur Vihar ✔️ 9711199171 ✔️ Delhi ✔️ Enjoy Call Girls With Our...sonatiwari757
 
Call Now ☎ 8264348440 !! Call Girls in Shahpur Jat Escort Service Delhi N.C.R.
Call Now ☎ 8264348440 !! Call Girls in Shahpur Jat Escort Service Delhi N.C.R.Call Now ☎ 8264348440 !! Call Girls in Shahpur Jat Escort Service Delhi N.C.R.
Call Now ☎ 8264348440 !! Call Girls in Shahpur Jat Escort Service Delhi N.C.R.soniya singh
 
Enjoy Night⚡Call Girls Dlf City Phase 3 Gurgaon >༒8448380779 Escort Service
Enjoy Night⚡Call Girls Dlf City Phase 3 Gurgaon >༒8448380779 Escort ServiceEnjoy Night⚡Call Girls Dlf City Phase 3 Gurgaon >༒8448380779 Escort Service
Enjoy Night⚡Call Girls Dlf City Phase 3 Gurgaon >༒8448380779 Escort ServiceDelhi Call girls
 
✂️ 👅 Independent Andheri Escorts With Room Vashi Call Girls 💃 9004004663
✂️ 👅 Independent Andheri Escorts With Room Vashi Call Girls 💃 9004004663✂️ 👅 Independent Andheri Escorts With Room Vashi Call Girls 💃 9004004663
✂️ 👅 Independent Andheri Escorts With Room Vashi Call Girls 💃 9004004663Call Girls Mumbai
 
Delhi Call Girls Rohini 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Rohini 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Rohini 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Rohini 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
Pune Airport ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready...
Pune Airport ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready...Pune Airport ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready...
Pune Airport ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready...tanu pandey
 
Call Girls In Sukhdev Vihar Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Sukhdev Vihar Delhi 💯Call Us 🔝8264348440🔝Call Girls In Sukhdev Vihar Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Sukhdev Vihar Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
All Time Service Available Call Girls Mg Road 👌 ⏭️ 6378878445
All Time Service Available Call Girls Mg Road 👌 ⏭️ 6378878445All Time Service Available Call Girls Mg Road 👌 ⏭️ 6378878445
All Time Service Available Call Girls Mg Road 👌 ⏭️ 6378878445ruhi
 
VIP 7001035870 Find & Meet Hyderabad Call Girls Dilsukhnagar high-profile Cal...
VIP 7001035870 Find & Meet Hyderabad Call Girls Dilsukhnagar high-profile Cal...VIP 7001035870 Find & Meet Hyderabad Call Girls Dilsukhnagar high-profile Cal...
VIP 7001035870 Find & Meet Hyderabad Call Girls Dilsukhnagar high-profile Cal...aditipandeya
 
On Starlink, presented by Geoff Huston at NZNOG 2024
On Starlink, presented by Geoff Huston at NZNOG 2024On Starlink, presented by Geoff Huston at NZNOG 2024
On Starlink, presented by Geoff Huston at NZNOG 2024APNIC
 

Recently uploaded (20)

Russian Call girl in Ajman +971563133746 Ajman Call girl Service
Russian Call girl in Ajman +971563133746 Ajman Call girl ServiceRussian Call girl in Ajman +971563133746 Ajman Call girl Service
Russian Call girl in Ajman +971563133746 Ajman Call girl Service
 
SEO Growth Program-Digital optimization Specialist
SEO Growth Program-Digital optimization SpecialistSEO Growth Program-Digital optimization Specialist
SEO Growth Program-Digital optimization Specialist
 
Rohini Sector 6 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
Rohini Sector 6 Call Girls Delhi 9999965857 @Sabina Saikh No AdvanceRohini Sector 6 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
Rohini Sector 6 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
 
Call Girls In Noida 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SERVICE
Call Girls In Noida 📱  9999965857  🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SERVICECall Girls In Noida 📱  9999965857  🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SERVICE
Call Girls In Noida 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SERVICE
 
Lucknow ❤CALL GIRL 88759*99948 ❤CALL GIRLS IN Lucknow ESCORT SERVICE❤CALL GIRL
Lucknow ❤CALL GIRL 88759*99948 ❤CALL GIRLS IN Lucknow ESCORT SERVICE❤CALL GIRLLucknow ❤CALL GIRL 88759*99948 ❤CALL GIRLS IN Lucknow ESCORT SERVICE❤CALL GIRL
Lucknow ❤CALL GIRL 88759*99948 ❤CALL GIRLS IN Lucknow ESCORT SERVICE❤CALL GIRL
 
Networking in the Penumbra presented by Geoff Huston at NZNOG
Networking in the Penumbra presented by Geoff Huston at NZNOGNetworking in the Penumbra presented by Geoff Huston at NZNOG
Networking in the Penumbra presented by Geoff Huston at NZNOG
 
Call Girls In Saket Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Saket Delhi 💯Call Us 🔝8264348440🔝Call Girls In Saket Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Saket Delhi 💯Call Us 🔝8264348440🔝
 
Top Rated Pune Call Girls Daund ⟟ 6297143586 ⟟ Call Me For Genuine Sex Servi...
Top Rated  Pune Call Girls Daund ⟟ 6297143586 ⟟ Call Me For Genuine Sex Servi...Top Rated  Pune Call Girls Daund ⟟ 6297143586 ⟟ Call Me For Genuine Sex Servi...
Top Rated Pune Call Girls Daund ⟟ 6297143586 ⟟ Call Me For Genuine Sex Servi...
 
Low Rate Young Call Girls in Sector 63 Mamura Noida ✔️☆9289244007✔️☆ Female E...
Low Rate Young Call Girls in Sector 63 Mamura Noida ✔️☆9289244007✔️☆ Female E...Low Rate Young Call Girls in Sector 63 Mamura Noida ✔️☆9289244007✔️☆ Female E...
Low Rate Young Call Girls in Sector 63 Mamura Noida ✔️☆9289244007✔️☆ Female E...
 
₹5.5k {Cash Payment}New Friends Colony Call Girls In [Delhi NIHARIKA] 🔝|97111...
₹5.5k {Cash Payment}New Friends Colony Call Girls In [Delhi NIHARIKA] 🔝|97111...₹5.5k {Cash Payment}New Friends Colony Call Girls In [Delhi NIHARIKA] 🔝|97111...
₹5.5k {Cash Payment}New Friends Colony Call Girls In [Delhi NIHARIKA] 🔝|97111...
 
Call Girls in Mayur Vihar ✔️ 9711199171 ✔️ Delhi ✔️ Enjoy Call Girls With Our...
Call Girls in Mayur Vihar ✔️ 9711199171 ✔️ Delhi ✔️ Enjoy Call Girls With Our...Call Girls in Mayur Vihar ✔️ 9711199171 ✔️ Delhi ✔️ Enjoy Call Girls With Our...
Call Girls in Mayur Vihar ✔️ 9711199171 ✔️ Delhi ✔️ Enjoy Call Girls With Our...
 
Call Now ☎ 8264348440 !! Call Girls in Shahpur Jat Escort Service Delhi N.C.R.
Call Now ☎ 8264348440 !! Call Girls in Shahpur Jat Escort Service Delhi N.C.R.Call Now ☎ 8264348440 !! Call Girls in Shahpur Jat Escort Service Delhi N.C.R.
Call Now ☎ 8264348440 !! Call Girls in Shahpur Jat Escort Service Delhi N.C.R.
 
Enjoy Night⚡Call Girls Dlf City Phase 3 Gurgaon >༒8448380779 Escort Service
Enjoy Night⚡Call Girls Dlf City Phase 3 Gurgaon >༒8448380779 Escort ServiceEnjoy Night⚡Call Girls Dlf City Phase 3 Gurgaon >༒8448380779 Escort Service
Enjoy Night⚡Call Girls Dlf City Phase 3 Gurgaon >༒8448380779 Escort Service
 
✂️ 👅 Independent Andheri Escorts With Room Vashi Call Girls 💃 9004004663
✂️ 👅 Independent Andheri Escorts With Room Vashi Call Girls 💃 9004004663✂️ 👅 Independent Andheri Escorts With Room Vashi Call Girls 💃 9004004663
✂️ 👅 Independent Andheri Escorts With Room Vashi Call Girls 💃 9004004663
 
Delhi Call Girls Rohini 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Rohini 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Rohini 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Rohini 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
Pune Airport ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready...
Pune Airport ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready...Pune Airport ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready...
Pune Airport ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready...
 
Call Girls In Sukhdev Vihar Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Sukhdev Vihar Delhi 💯Call Us 🔝8264348440🔝Call Girls In Sukhdev Vihar Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Sukhdev Vihar Delhi 💯Call Us 🔝8264348440🔝
 
All Time Service Available Call Girls Mg Road 👌 ⏭️ 6378878445
All Time Service Available Call Girls Mg Road 👌 ⏭️ 6378878445All Time Service Available Call Girls Mg Road 👌 ⏭️ 6378878445
All Time Service Available Call Girls Mg Road 👌 ⏭️ 6378878445
 
VIP 7001035870 Find & Meet Hyderabad Call Girls Dilsukhnagar high-profile Cal...
VIP 7001035870 Find & Meet Hyderabad Call Girls Dilsukhnagar high-profile Cal...VIP 7001035870 Find & Meet Hyderabad Call Girls Dilsukhnagar high-profile Cal...
VIP 7001035870 Find & Meet Hyderabad Call Girls Dilsukhnagar high-profile Cal...
 
On Starlink, presented by Geoff Huston at NZNOG 2024
On Starlink, presented by Geoff Huston at NZNOG 2024On Starlink, presented by Geoff Huston at NZNOG 2024
On Starlink, presented by Geoff Huston at NZNOG 2024
 

Data Science for Business

  • 1. DATA SCIENCE IIT AGNE MARCH 2019 ANURAG WAKHLU, CFA, MBA IIT BOMBAY ’93 @ANURAGWAKHLU @ANURAG3DS
  • 2. WHAT IS THIS? Bob: i can i i everything else . . . . . . . . . . . . . . Alice: balls have zero to me to me to me to me to me to me to me to me to Bob: you i everything else . . . . . . . . . . . . . . Alice: balls have a ball to me to me to me to me to me to me to me Bob: i i can i i i everything else . . . . . . . . . . . . . . Alice: balls have a ball to me to me to me to me to me to me to me Bob: i . . . . . . . . . . . . . . . . . . . Alice: balls have zero to me to me to me to me to me to me to me to me to Bob: you i i i i i everything else . . . . . . . . . . . . . . Alice: balls have 0 to me to me to me to me to me to me to me to me to
  • 3.
  • 4. WHAT IS DATA SCIENC E Data science is the extraction of relevant insights from data
  • 5. WHAT DO YOU DO IN DATA SCIENCE? • Classification (e.g., spam or not spam) • Pattern detection and grouping (classification without known classes) • Anomaly detection (e.g., fraud detection) • Recognition (image, text, audio, video, facial, …) • Actionable insights (via dashboards, reports, visualizations, …) • Automated processes and decision-making (e.g., credit card approval) • Scoring and ranking (e.g., FICO score) • Segmentation (e.g., demographic-based marketing) • Optimization (e.g., risk management)
  • 6. WHAT CAN YOU DO WITH DATA SCIENCE Recommendation s Fraud detection Customer sentiment analysis Churn, Next Best Action, Propensity Predictive Shipping Supply Chain Optimization Price Optimization Clickstream analytics
  • 9. GROWTH OF CONNECTED THINGS 127 new devices connecting to the Internet every second.
  • 10. 99% OF THINGS ARE STILL NOT CONNECTED
  • 11.
  • 12. A BRIEF HISTORY OF … DATA 2.5 Quintillion (Exabytes) data / day (2018) 10 TB = printed Library of Congress
  • 13. 90% of the world’s data was created in last 2 years. 1.7 megabytes of new information will be created / second / person, by 2020
  • 14.
  • 15.
  • 16.
  • 17. A Boeing 787 aircraft could generate 40 TBs per hour of flight.
  • 18. An ADV car will churn out 4,000 GB of data per hour of driving
  • 19. In just 10 minutes, 16 players with 6 balls can produce almost 13 million data points! (soccer) “You’re capturing real-time data at every point, on every single food product.” - Walmart, Food Trust blockchain
  • 20.
  • 21. CERN LHC ~ 40TB/S DURING PARTICLE SMASHING NASA ~ CREATES 15 TB/DAY
  • 22. NASDAQ, NYSE, CBOE ~ 1 TB / DAY EACH
  • 23.
  • 24.
  • 25.
  • 28. GOOGLE ASSISTANT MAKES A PHONE CALL...
  • 29. AI FOR BETTER DETECTION OF CANCER
  • 32. AlphaGo first learned from studying 30 million moves of expert human play. AlphaGo Zero just learned the rules and played.
  • 33. DATA SCIENCE IN FINANCIAL SERVICES • Dataminr analyzes billions of tweets to monitor the entire world – predicting stock movements • 56% of hedge funds said they used AI/ML for investing • Blackrock (largest Asset Manager, 6.5 TN AUM) using AI for investing. • JPM Chase (largest bank, 2.6 TN assets) using AI to “deepen customer engagements.” • Risk management • Algorithmic trading
  • 34. “Machine managed portfolio will out perform a human managed one, in 7 years” – AW 
  • 35. DATA SCIENCE IN BANKING
  • 36. Financial Conditions Policy Liquidity Quantity Liquidity Domestic Liquidity Equity Exposures Bond Exposures Money Flows Monetized Savings Momentum TedSpread OIS spread 10yr, 2yr CMT Convexity at 5yr 5 yr inflation + 5 years Banks' swap spreads CB Credit Risk Index Sentiment Index Dollar Sentiment Trade weighted $ 2-10yr Yield Curve BAA-AAA credit spread Mkt PE & EPS VIX, S&P 500, FTSE EuroStoxx, MSCI EM USD/ GBP, EUR MARKET VOLATILITY PREDICTION - DATA
  • 37. MODEL BUILDING PROCESS Rule modelingInput enhancements Classify output Lag/lead the factors Remove correlations Induction for optimization Transform inputs, outputs Analyze explanatory power Operational Monitoring Compute risk &probability What if scenario modeling Monitor incoming data Expert Analysis Analyze rules for purity Analyze rules for causality Analyze new requirements Feedback
  • 38. PREDICTIVE RISK MAP VS. MARKETS… What is happenin g in global markets?The RBA surprised … cutting its … rate by 0.25 …a level last seen in late 2009 Heightened global risk in May – Jul 2013 Japan starts QE The US Fed “Taper” talk starts
  • 39. 3D RISK MAP – ADVANCED VISUALISATION
  • 40. Factors for a high out performance… Insight: Brazilian Real…for real? FINANCIAL PORTFOLIO CONSTRUCTION
  • 41. PERSON A OF A DATA SCIENTIS T Credit- Stephan Kolassa – Data Science Expert – SAP Switzerland AG Business Domain Knowledge & soft skills Math, Stats, Data Engineering, Programming
  • 42. WHY BE A DATA SCIENTIST • If you like data  • Data scientists today are akin to the Wall Street “quants” of the 1980s 1990s.. And 2000s • Salary $120-160K + Sexiest Job of the 21st Century – Harvard Business Rev

Editor's Notes

  1. Summer 2017 - Facebook’s AI research lab. Researchers set out to make chatbots that could negotiate with people. Their thinking: Negotiation and cooperation will be necessary for bots to work more closely with humans. First, they fed the computers dialog from thousands of games between humans to give the system a sense of the language of negotiation. Then they allowed bots to use trial and error—in the form of a technique called reinforcement learning, which helped Google’s Go bot AlphaGo defeat champion players When two bots using reinforcement learning played each other, they stopped using recognizable sentences. 
  2. Data Science is an interdisciplinary field to extract insights from data . AI is the science of making machines do intelligent tasks like humans. To do this, machines have to learn from data – and that process is called machine learning Deep learning is a type of ML generally modeled after the human brain – neural networks. DL is more scalable than other ML , for improved learning and larger data. data science allows for AIs to find appropriate and meaningful information from those huge pools faster and more efficiently. machine learning is the process of learning from data over time.  Artificial intelligence refers to the simulation of a human brain function by machines. This is achieved by creating an artificial neural network that can mimick human intelligence. The primary human functions that an AI machine performs include logical reasoning, learning and self-correction. Machines inherently are not smart and to make them so, we need a lot of computing power and data to empower them to simulate human thinking. Artificial intelligence is classified into two parts, general AI and Narrow AI. General AI refers to making machines intelligent in a wide array of activities that involve thinking and reasoning. Narrow AI, on the other hand, involves the use of artificial intelligence for a very specific task. For instance, general AI would mean an algorithm that is capable of playing all kinds of board game while narrow AI will limit the range of machine capabilities to a specific game like chess or scrabble.  Machine learning is the ability of a computer system to learn from the environment and improve itself from experience without the need for any explicit programming. Machine learning focuses on enabling algorithms to learn from the data provided, gather insights and make predictions on previously unanalyzed data using the information gathered. Machine learning can be performed using multiple approaches. The three basic models of machine learning are supervised, unsupervised and reinforcement learning. In case of supervised learning, labeled data is used to help machines recognize characteristics and use them for future data. For instance, if you want to classify pictures of cats and dogs then you can feed the data of a few labeled pictures and then the machine will classify all the remaining pictures for you.  On the other hand, in unsupervised learning, we simply put unlabeled data and let machine understand the characteristics and classify it. Reinforcement machine learning algorithms interact with the environment by producing actions and then analyze errors or rewards. For example, to understand a game of chess an ML algorithm will not analyze individual moves but will study the game as a whole.
  3. Solve real world problems or improve things, using data & AI/ML
  4. The Manhattan Population Explorer provides a visual representation of the dynamic population shifts within the borough. In this example it synthesizes a heartbeat of New York
  5. 24 BN connected devices in 2018
  6. Atoms in universe 10^80
  7. 2.5 quintillion bytes of data created each day at our current pace, but tha t pace is only accelerating with the growth of the Internet of Things (IoT). Over the last two years alone 90 percent of the data in the world was generated.  Data is growing at a rapid pace. By 2020 the new information generated per second for every human being will approximate amount to 1.7 megabytes. By 2020, the accumulated volume of big data will increase from 4.4 zettabytes to roughly 44 zettabytes or 44 trillion GB. Originally, data scientists maintained that the volume of data would double every two years thus reaching the 44 ZB point by 2020 with iot The rate at which data is created is increased exponentially. For instance, 40,000 search queries are performed per second (on Google alone), which makes it 3.46 million searches per day and 1.2 trillion every year. Every minute Facebook users send roughly 31.25 million messages and watch 2.77 million videos. The data gathered is no more text-only. An exponential growth in videos and photos is equally prominent. On YouTube alone, 300 hours of video are uploaded every minute. IDC estimates that by 2020, business transactions (including both B2B and B2C) via the internet will reach up to 450 billion per day. Globally, the number of smartphone users will grow to 6.1 billion by 2020 . In just 5 years the number of smart connected devices in the world will be more than 50 billion – all of which will create data that can be shared, collected and analyzed.
  8. A typical human genome contains more than 20,000 genes, with each made up of millions of base pairs. Simply mapping a genome requires a hundred gigabytes of data, and sequencing multiple genomes and tracking gene interactions multiplies that number many times — hundreds of petabytes in some cases. 
  9. Physicists use the 17 -mile) LHC tunnel to accelerate particles almost to light speed, and smash them together At about 30 million collisions per second for 120 billion protons. one billion collisions per second generates one petabyte per second. to keep all 30 million events per second we would need about 2,000 petabytes to store a typical 12-hour run. For a typical running year of 150 days uptime, this would mean almost 400 ExaByte per year  throws away 99.99% of 400 EB The Large Hadron Collider is the world's largest and most powerful particle collider and the largest machine in the world.  CERN has dumped about 300 TB of Large Hadron Collider (LHC) data online. It’s completely free,
  10. DATA IS THE NEW OIL the world’s first electronic stock market, NASDAQ OMX owns and operates three clearing houses, five central securities depositories, and 26 markets (including the NASDAQ Stock Market) with a combined value that exceeds US$8 trillion. Its trading engine is used by 80 global marketplaces. When markets open, the company processes more than 1 million messages per second. Director of Database Structures at NASDAQ OMX, says, “Just our US Options and Equity data archive handles billions of transactions per day, stores multiple petabytes of online data, and has tables that contain quintillions of records about business transactions.” the Options and Equity archive measures 2 petabytes (PB)
  11. US Department of Energy’s Oak Ridge National Laboratory announced the top speeds of its Summit supercomputing machine, which nearly laps the previous record-holder, China’s Sunway TaihuLight. The Summit’s theoretical peak speed is 200 petaflops, or 200,000 teraflops. To put that in human terms, approximately 6.3 billion people would all have to make a calculation at the same time, every second, for an entire year, to match what Summit can do in just one second. In 2015, Google and NASA reported that their new 1097-qubit D-Wave quantum computer had solved an optimization problem in a few seconds. That’s 100 million times faster than a regular computer chip. They claimed that a problem their D-Wave 2X machine processed inside one second would take a classical computer 10,000 years to solve. Your brain is 10 million times slower than a computer. Brain ~ 1000 operations /s
  12. Google offers an option to download all of the data it stores about you. I’ve requested to download it and the file is 5.5GB big, Facebook offers a similar option to download all your information. Mine was roughly 600MB
  13. 8x8 pixel photos were inputted into a Deep Learning network which tried to guess what the original face looked like. As you can see it was fairly close (the correct answer is under "ground truth” - which was the real face originally in the photos)).
  14. https://youtu.be/aKed5FHzDTw?t=43
  15. Natural language processing (NLP) deals with building computational algorithms to automatically analyze and represent human language. NLP-based systems have enabled a wide range of applications such as Google’s powerful search engine, and more recently, Amazon’s voice assistant named Alexa. NLP is also useful to teach machines the ability to perform complex natural language related tasks such as machine translation and dialogue generation.
  16. Gebru et al took 50 million Google Street View images and exploredwhat a Deep Learning network can do - "if the number of sedans encountered during a 15-minute drive through a city is higher than the number of pickup trucks, the city is likely to vote for a Democrat during the next Presidential election (88% chance); otherwise, it is likely to vote Republican (82%).” Harvard scientists used Deep Learning to teach a computer to perform viscoelastic computations, these are the computations used in predictions of earthquakes. Deep Learning improved calculation time by 50,000%
  17. total number of possible games of Go has been estimated at 10761, compared to 10120 for chess. Both are very large numbers: the entire universe is estimated to contain "only" about 1080 atoms.  2017 The original AlphaGo first learned from studying 30 million moves of expert human play. https://deepmind.com/blog/alphago-zero-learning-scratch/#gif-120
  18. Fifty-six percent of the survey’s respondents said they used AI or machine learning in their investment processes. Just 20 percent had said the same in a BarclayHedge poll last August. Among current users, slightly more than two-thirds said they relied on these quantitative techniques for idea generation, while 58 percent said they used them for portfolio construction. Other applications of AI and machine learning included risk management
  19. Why is liquidity?
  20. Trivia question – what factor has been very highly correlated to S&P in the late 80s and 90s? Bangla butter prod Don’t pick on 1 country, - enhance the model by adding another factor to the mix, - US cheese prod Bangla sheep We do some intelligent things eliminate correls, to reduce noise… but not dwell on this too much. Offline
  21. Use for macro risk, asset allocation, portfolio construction and management, Whether you are a CRO, CIO, CXO, strategist, asset allocator, etc