SlideShare a Scribd company logo
1 of 13
Download to read offline
How to program your way into Data
Science?
Eeshan Chatterjee
Data Scientist @ MediaIQ Digital
https://in.linkedin.com/in/eeshanchatterjee
www.github.com/EeshanChatterjee
What is Data?
Google Definition:
● Facts and statistics collected together for reference or analysis.
● The quantities, characters, or symbols on which operations are performed by a computer, which may be stored
and transmitted in the form of electrical signals and recorded on magnetic, optical, or mechanical recording media.
● Things known or assumed as facts, making the basis of reasoning or calculation.
Umm... OK. But what is data in the business world?
Lets simplify the entire thing.
If you can Observe it, Record it, Store it and Measure it, It's gonna help your business.
This is the data that is important to you.
What data does my business
generate?
Each and every department, right from the CEO's Office, to the janitorial division collects data.
Stored!
People Data
Sales Data
Customer Satisfaction Data
Industrial Production
& Wastage Data Travel Data
Energy Data
Now the Buzzword: Data Science
The Basics
How did we arrive at Data Science?
Measure KPIs
Model Key Metrics
Operations
Research
The Era of
Business Intelligence
Dashboards
Frequent Updates
Business Analytics
The Era of
Data Science
Cockpits
Distributed
Computation
Federated Data
Intelligent Systems
Guess What didn't Change: Help Business make Better Decisions!
The Era of
Statistical Insight
The Basics
If it's always been the same core job, can a statistician call himself a Data Scientist?
Well... Not exactly. Today the job has diversified, demanding a wider skillset!
Data Design
Architect
DataEngineer
Requirement/Business
Analyst
Math &
Statistics
Business
&
Domain
Tech &
Computer
Science
DESIGNTHINKING
}
But.. Programming for Everything?
Actually, Yes. Let's look at a popular cheatsheet circulating on the internet.
Infographic courtesy: http://nirvacana.com/thoughts/becoming-a-data-scientist/
Guess what, We
can't tick off 15%
of this checklist
without
programming!
Programming for Math
Scripting
Language
Packages
Data
Structures
Notebooks &
Markdown
Plotting
Techniques
Classes &
Functions
Cross-
Language
Execution
The Algo Whiz Codebook
● Choose your scripting
language. R & Python
are the popular chioces.
● Use what's out there.
Prebuilt packages for
almost every technique
are freely available for
use.
● Interactive plots cut
down EDA time by a
huge margin.
R or Python?
The holy grail of data science choices! It is indeed difficult to choose between the two.
Their capabilities are pretty much the same. So, Which one do I choose?*
Choose R When Choose Python When
● You are begining to
explore your data
● You are looking to find
one-time insight or
developing analysis
methodology
● You want to try out a
broad spectrum of
techniques to find best
ensembles to use
● You have a good
understanding of the
data and techniques you
want to use
● You want to deploy your
analysis methodology
as a persistant large-
scale production system
● You want to train deep
models on GPUs
* This one is based on my experience and opinion. It has worked for me.
The next person you ask, will have a different take on the matter.
Programming for Tech
Data Platforms
Ingestion &
Management
Services
JAVA
Distribution &
Scale
Hadoop, Yarn,
Scala, JADE...
JAVA
Efficient
Processing
Low level
Subroutines
C++
GPGPU & Large
Scale ML
CUDA, OpenGL,
MPI
C/C++
The Scale-Out Toolbox
● C++ and JAVA form the
backbone of almost
every at-scale data
system
● Most NoSQL &
NewSQL databases are
based on Java
● Large scale machine
learning with millions of
data points most
certainly need GPU
scale processing.
Programming for the Business
Image courtesy: http://exposedata.com/tutorial/canvas/
The Decision-Maker's
Cockpit
● Interactive charts allow
answering of business
questions intuitive.
● Real time updates allow
decisions based on the
latest information
available.
● Bird's eye and drill down
capabilities allow for
multiple perspectives
without losing context.
Design Thinking and Programming
Design Thinking let's you break down and analyse the problem and synthesize the best solution
from multiple solutions possible.
At-Scale
Solution
Desired
Future
State
Complication 1
Roadblock 2
Issue 3
Possible
Solution 1
Possible
Solution 2
Possible
Solution 3
Possible
Solution 4
Prototype
Solution 4
Prototype
Solution 3
Prototype
Solution 2
Prototype
Solution 1
Consumption
Current
State
Define | Ideate | Prototype | Iterate | Develop | Deploy
Questions?
Eeshan Chatterjee
eeshanchatterjee@gmail.com
Thank You!

More Related Content

What's hot

Data Culture Series - Keynote - 24th feb
Data Culture Series - Keynote - 24th febData Culture Series - Keynote - 24th feb
Data Culture Series - Keynote - 24th febJonathan Woodward
 
Top career opportunities in data science
Top career opportunities in data scienceTop career opportunities in data science
Top career opportunities in data scienceTanyaAgarwal71
 
A Practical-ish Introduction to Data Science
A Practical-ish Introduction to Data ScienceA Practical-ish Introduction to Data Science
A Practical-ish Introduction to Data ScienceMark West
 
Who is a Data Scientist? | How to become a Data Scientist? | Data Science Cou...
Who is a Data Scientist? | How to become a Data Scientist? | Data Science Cou...Who is a Data Scientist? | How to become a Data Scientist? | Data Science Cou...
Who is a Data Scientist? | How to become a Data Scientist? | Data Science Cou...Edureka!
 
Helping data scientists escape the seduction of the sandbox - Krish Swamy, We...
Helping data scientists escape the seduction of the sandbox - Krish Swamy, We...Helping data scientists escape the seduction of the sandbox - Krish Swamy, We...
Helping data scientists escape the seduction of the sandbox - Krish Swamy, We...Sri Ambati
 
Things you need to know about big data
Things you need to know about big dataThings you need to know about big data
Things you need to know about big dataLantern Institute
 
Top 8 Data Science Tools | Open Source Tools for Data Scientists | Edureka
Top 8 Data Science Tools | Open Source Tools for Data Scientists | EdurekaTop 8 Data Science Tools | Open Source Tools for Data Scientists | Edureka
Top 8 Data Science Tools | Open Source Tools for Data Scientists | EdurekaEdureka!
 
Planning Your Data Science Projects
Planning Your Data Science ProjectsPlanning Your Data Science Projects
Planning Your Data Science ProjectsSpotle.ai
 
Data Science Salon: Introduction to Machine Learning - Marketing Use Case
Data Science Salon: Introduction to Machine Learning - Marketing Use CaseData Science Salon: Introduction to Machine Learning - Marketing Use Case
Data Science Salon: Introduction to Machine Learning - Marketing Use CaseFormulatedby
 
Demystify big data data science
Demystify big data  data scienceDemystify big data  data science
Demystify big data data scienceMahesh Kumar CV
 
Data science | What is Data science
Data science | What is Data scienceData science | What is Data science
Data science | What is Data scienceShilpaKrishna6
 
Leveraging Open Source Automated Data Science Tools
Leveraging Open Source Automated Data Science ToolsLeveraging Open Source Automated Data Science Tools
Leveraging Open Source Automated Data Science ToolsDomino Data Lab
 
Putting data science in your business a first utility feedback
Putting data science in your business a first utility feedbackPutting data science in your business a first utility feedback
Putting data science in your business a first utility feedbackPeculium Crypto
 

What's hot (19)

Data science
Data scienceData science
Data science
 
Future of datascience
Future of datascienceFuture of datascience
Future of datascience
 
The REAL face of Big Data
The REAL face of Big DataThe REAL face of Big Data
The REAL face of Big Data
 
Data Culture Series - Keynote - 24th feb
Data Culture Series - Keynote - 24th febData Culture Series - Keynote - 24th feb
Data Culture Series - Keynote - 24th feb
 
Top career opportunities in data science
Top career opportunities in data scienceTop career opportunities in data science
Top career opportunities in data science
 
A Practical-ish Introduction to Data Science
A Practical-ish Introduction to Data ScienceA Practical-ish Introduction to Data Science
A Practical-ish Introduction to Data Science
 
Vikrant data scientist
Vikrant data scientistVikrant data scientist
Vikrant data scientist
 
Who is a Data Scientist? | How to become a Data Scientist? | Data Science Cou...
Who is a Data Scientist? | How to become a Data Scientist? | Data Science Cou...Who is a Data Scientist? | How to become a Data Scientist? | Data Science Cou...
Who is a Data Scientist? | How to become a Data Scientist? | Data Science Cou...
 
Helping data scientists escape the seduction of the sandbox - Krish Swamy, We...
Helping data scientists escape the seduction of the sandbox - Krish Swamy, We...Helping data scientists escape the seduction of the sandbox - Krish Swamy, We...
Helping data scientists escape the seduction of the sandbox - Krish Swamy, We...
 
Things you need to know about big data
Things you need to know about big dataThings you need to know about big data
Things you need to know about big data
 
Top 8 Data Science Tools | Open Source Tools for Data Scientists | Edureka
Top 8 Data Science Tools | Open Source Tools for Data Scientists | EdurekaTop 8 Data Science Tools | Open Source Tools for Data Scientists | Edureka
Top 8 Data Science Tools | Open Source Tools for Data Scientists | Edureka
 
Planning Your Data Science Projects
Planning Your Data Science ProjectsPlanning Your Data Science Projects
Planning Your Data Science Projects
 
Introduction to Data Analytics
Introduction to Data AnalyticsIntroduction to Data Analytics
Introduction to Data Analytics
 
Data Science Salon: Introduction to Machine Learning - Marketing Use Case
Data Science Salon: Introduction to Machine Learning - Marketing Use CaseData Science Salon: Introduction to Machine Learning - Marketing Use Case
Data Science Salon: Introduction to Machine Learning - Marketing Use Case
 
Demystify big data data science
Demystify big data  data scienceDemystify big data  data science
Demystify big data data science
 
Data science | What is Data science
Data science | What is Data scienceData science | What is Data science
Data science | What is Data science
 
Leveraging Open Source Automated Data Science Tools
Leveraging Open Source Automated Data Science ToolsLeveraging Open Source Automated Data Science Tools
Leveraging Open Source Automated Data Science Tools
 
Data Scientist
Data ScientistData Scientist
Data Scientist
 
Putting data science in your business a first utility feedback
Putting data science in your business a first utility feedbackPutting data science in your business a first utility feedback
Putting data science in your business a first utility feedback
 

Viewers also liked

Executive Post Graduate Program in Management (batch - 06) from IMT Ghaziabad
Executive Post Graduate Program in Management (batch - 06) from IMT GhaziabadExecutive Post Graduate Program in Management (batch - 06) from IMT Ghaziabad
Executive Post Graduate Program in Management (batch - 06) from IMT Ghaziabadniitimperia01
 
Certificates Transcript
Certificates TranscriptCertificates Transcript
Certificates TranscriptJoanne Rohner
 
Microsoft word
Microsoft wordMicrosoft word
Microsoft worddfrancon19
 
Vlaamse Vereniging Huisvestingsmijen Geothermie en sociale huisvesting edb
Vlaamse Vereniging Huisvestingsmijen Geothermie en sociale huisvesting edbVlaamse Vereniging Huisvestingsmijen Geothermie en sociale huisvesting edb
Vlaamse Vereniging Huisvestingsmijen Geothermie en sociale huisvesting edbGeert Schoofs
 
Secretaria salud y dh 2016 final final
Secretaria salud y dh  2016 final  finalSecretaria salud y dh  2016 final  final
Secretaria salud y dh 2016 final finalAnalia Vallejo
 
Proyecto sala teatro
Proyecto sala teatroProyecto sala teatro
Proyecto sala teatroDanilo Rojas
 
joanne rohner resume 2015
joanne rohner resume 2015joanne rohner resume 2015
joanne rohner resume 2015Joanne Rohner
 
Diseño de ductos de aire acondicionado
Diseño de ductos de aire acondicionadoDiseño de ductos de aire acondicionado
Diseño de ductos de aire acondicionadojhon inga herrera
 
Planosana1 layout2
Planosana1 layout2Planosana1 layout2
Planosana1 layout2David Durán
 
Raiza 2 recover planta baja
Raiza 2 recover planta bajaRaiza 2 recover planta baja
Raiza 2 recover planta bajaDavid Durán
 
Plano san felipe nuevo estacion chivacoa
Plano san felipe nuevo estacion chivacoaPlano san felipe nuevo estacion chivacoa
Plano san felipe nuevo estacion chivacoaDavid Durán
 

Viewers also liked (17)

Executive Post Graduate Program in Management (batch - 06) from IMT Ghaziabad
Executive Post Graduate Program in Management (batch - 06) from IMT GhaziabadExecutive Post Graduate Program in Management (batch - 06) from IMT Ghaziabad
Executive Post Graduate Program in Management (batch - 06) from IMT Ghaziabad
 
Certificates Transcript
Certificates TranscriptCertificates Transcript
Certificates Transcript
 
Microsoft word
Microsoft wordMicrosoft word
Microsoft word
 
Updated
UpdatedUpdated
Updated
 
Business analyst
Business analystBusiness analyst
Business analyst
 
Vlaamse Vereniging Huisvestingsmijen Geothermie en sociale huisvesting edb
Vlaamse Vereniging Huisvestingsmijen Geothermie en sociale huisvesting edbVlaamse Vereniging Huisvestingsmijen Geothermie en sociale huisvesting edb
Vlaamse Vereniging Huisvestingsmijen Geothermie en sociale huisvesting edb
 
mahfouz CV
mahfouz CVmahfouz CV
mahfouz CV
 
Secretaria salud y dh 2016 final final
Secretaria salud y dh  2016 final  finalSecretaria salud y dh  2016 final  final
Secretaria salud y dh 2016 final final
 
Proyecto sala teatro
Proyecto sala teatroProyecto sala teatro
Proyecto sala teatro
 
Hitos infraestructura vial en antioquia
Hitos infraestructura vial en antioquiaHitos infraestructura vial en antioquia
Hitos infraestructura vial en antioquia
 
Montaje de aire acondicionado split
Montaje de aire acondicionado splitMontaje de aire acondicionado split
Montaje de aire acondicionado split
 
joanne rohner resume 2015
joanne rohner resume 2015joanne rohner resume 2015
joanne rohner resume 2015
 
Diseño de ductos de aire acondicionado
Diseño de ductos de aire acondicionadoDiseño de ductos de aire acondicionado
Diseño de ductos de aire acondicionado
 
Planosana1 layout2
Planosana1 layout2Planosana1 layout2
Planosana1 layout2
 
0823 2002
0823 20020823 2002
0823 2002
 
Raiza 2 recover planta baja
Raiza 2 recover planta bajaRaiza 2 recover planta baja
Raiza 2 recover planta baja
 
Plano san felipe nuevo estacion chivacoa
Plano san felipe nuevo estacion chivacoaPlano san felipe nuevo estacion chivacoa
Plano san felipe nuevo estacion chivacoa
 

Similar to How to program your way into data science?

Data analytics presentation- Management career institute
Data analytics presentation- Management career institute Data analytics presentation- Management career institute
Data analytics presentation- Management career institute PoojaPatidar11
 
What is Data analytics? How is data analytics a better career option?
What is Data analytics? How is data analytics a better career option?What is Data analytics? How is data analytics a better career option?
What is Data analytics? How is data analytics a better career option?Aspire Techsoft Academy
 
The 3 Key Barriers Keeping Companies from Deploying Data Products
The 3 Key Barriers Keeping Companies from Deploying Data Products The 3 Key Barriers Keeping Companies from Deploying Data Products
The 3 Key Barriers Keeping Companies from Deploying Data Products Dataiku
 
Big Data overview
Big Data overviewBig Data overview
Big Data overviewalexisroos
 
Unlocking Insights_ The Power of Data Analytics in the Modern World.pptx
Unlocking Insights_ The Power of Data Analytics in the Modern World.pptxUnlocking Insights_ The Power of Data Analytics in the Modern World.pptx
Unlocking Insights_ The Power of Data Analytics in the Modern World.pptxAPTRON Solutions Noida
 
Top Big data Analytics tools: Emerging trends and Best practices
Top Big data Analytics tools: Emerging trends and Best practicesTop Big data Analytics tools: Emerging trends and Best practices
Top Big data Analytics tools: Emerging trends and Best practicesSpringPeople
 
How to add machine learning to your applications today
How to add machine learning to your applications todayHow to add machine learning to your applications today
How to add machine learning to your applications todayMichal Hodinka
 
How Data Virtualization Puts Machine Learning into Production (APAC)
How Data Virtualization Puts Machine Learning into Production (APAC)How Data Virtualization Puts Machine Learning into Production (APAC)
How Data Virtualization Puts Machine Learning into Production (APAC)Denodo
 
Data Science Environment with R on openSUSE Leap 15.1
Data Science Environment with R on openSUSE Leap 15.1Data Science Environment with R on openSUSE Leap 15.1
Data Science Environment with R on openSUSE Leap 15.1Sabar Suwarsono
 
data science and business analytics
data science and business analyticsdata science and business analytics
data science and business analyticssunnypatil1778
 
Top 10 areas of expertise in data science
Top 10 areas of expertise in data scienceTop 10 areas of expertise in data science
Top 10 areas of expertise in data scienceGlobalTechCouncil
 
Data Con LA 2022 - Demystifying the Art of Business Intelligence and Data Ana...
Data Con LA 2022 - Demystifying the Art of Business Intelligence and Data Ana...Data Con LA 2022 - Demystifying the Art of Business Intelligence and Data Ana...
Data Con LA 2022 - Demystifying the Art of Business Intelligence and Data Ana...Data Con LA
 
Big data and Marketing by Edward Chenard
Big data and Marketing by Edward ChenardBig data and Marketing by Edward Chenard
Big data and Marketing by Edward ChenardEdward Chenard
 
#MarketingShake - Edward Chenard - Descubrí el poder del Big Data para Transf...
#MarketingShake - Edward Chenard - Descubrí el poder del Big Data para Transf...#MarketingShake - Edward Chenard - Descubrí el poder del Big Data para Transf...
#MarketingShake - Edward Chenard - Descubrí el poder del Big Data para Transf...amdia
 
Data science Nagarajan and madhav.pptx
Data science Nagarajan and madhav.pptxData science Nagarajan and madhav.pptx
Data science Nagarajan and madhav.pptxNagarajanG35
 
Credit card fraud detection using python machine learning
Credit card fraud detection using python machine learningCredit card fraud detection using python machine learning
Credit card fraud detection using python machine learningSandeep Garg
 

Similar to How to program your way into data science? (20)

Data analytics presentation- Management career institute
Data analytics presentation- Management career institute Data analytics presentation- Management career institute
Data analytics presentation- Management career institute
 
What is Data analytics? How is data analytics a better career option?
What is Data analytics? How is data analytics a better career option?What is Data analytics? How is data analytics a better career option?
What is Data analytics? How is data analytics a better career option?
 
The 3 Key Barriers Keeping Companies from Deploying Data Products
The 3 Key Barriers Keeping Companies from Deploying Data Products The 3 Key Barriers Keeping Companies from Deploying Data Products
The 3 Key Barriers Keeping Companies from Deploying Data Products
 
Big Data overview
Big Data overviewBig Data overview
Big Data overview
 
Unlocking Insights_ The Power of Data Analytics in the Modern World.pptx
Unlocking Insights_ The Power of Data Analytics in the Modern World.pptxUnlocking Insights_ The Power of Data Analytics in the Modern World.pptx
Unlocking Insights_ The Power of Data Analytics in the Modern World.pptx
 
Top Big data Analytics tools: Emerging trends and Best practices
Top Big data Analytics tools: Emerging trends and Best practicesTop Big data Analytics tools: Emerging trends and Best practices
Top Big data Analytics tools: Emerging trends and Best practices
 
Best Data Science Hybrid Course in Pune.
Best Data Science Hybrid Course in Pune.Best Data Science Hybrid Course in Pune.
Best Data Science Hybrid Course in Pune.
 
How to add machine learning to your applications today
How to add machine learning to your applications todayHow to add machine learning to your applications today
How to add machine learning to your applications today
 
What is business analytics
What is business analyticsWhat is business analytics
What is business analytics
 
How Data Virtualization Puts Machine Learning into Production (APAC)
How Data Virtualization Puts Machine Learning into Production (APAC)How Data Virtualization Puts Machine Learning into Production (APAC)
How Data Virtualization Puts Machine Learning into Production (APAC)
 
Data Science Environment with R on openSUSE Leap 15.1
Data Science Environment with R on openSUSE Leap 15.1Data Science Environment with R on openSUSE Leap 15.1
Data Science Environment with R on openSUSE Leap 15.1
 
data science and business analytics
data science and business analyticsdata science and business analytics
data science and business analytics
 
Top 10 areas of expertise in data science
Top 10 areas of expertise in data scienceTop 10 areas of expertise in data science
Top 10 areas of expertise in data science
 
Demystifying Data Science
Demystifying Data ScienceDemystifying Data Science
Demystifying Data Science
 
Data Con LA 2022 - Demystifying the Art of Business Intelligence and Data Ana...
Data Con LA 2022 - Demystifying the Art of Business Intelligence and Data Ana...Data Con LA 2022 - Demystifying the Art of Business Intelligence and Data Ana...
Data Con LA 2022 - Demystifying the Art of Business Intelligence and Data Ana...
 
Big data and Marketing by Edward Chenard
Big data and Marketing by Edward ChenardBig data and Marketing by Edward Chenard
Big data and Marketing by Edward Chenard
 
#MarketingShake - Edward Chenard - Descubrí el poder del Big Data para Transf...
#MarketingShake - Edward Chenard - Descubrí el poder del Big Data para Transf...#MarketingShake - Edward Chenard - Descubrí el poder del Big Data para Transf...
#MarketingShake - Edward Chenard - Descubrí el poder del Big Data para Transf...
 
Proposed Talk Outline for Pycon2017
Proposed Talk Outline for Pycon2017 Proposed Talk Outline for Pycon2017
Proposed Talk Outline for Pycon2017
 
Data science Nagarajan and madhav.pptx
Data science Nagarajan and madhav.pptxData science Nagarajan and madhav.pptx
Data science Nagarajan and madhav.pptx
 
Credit card fraud detection using python machine learning
Credit card fraud detection using python machine learningCredit card fraud detection using python machine learning
Credit card fraud detection using python machine learning
 

More from DeZyre

Top 10 Data Visualization Tools
Top 10 Data Visualization ToolsTop 10 Data Visualization Tools
Top 10 Data Visualization ToolsDeZyre
 
Data Scientist Skills
Data Scientist SkillsData Scientist Skills
Data Scientist SkillsDeZyre
 
What companies hiring data scientists and hadoop developers are looking for?
What companies hiring data scientists and hadoop developers are looking for?What companies hiring data scientists and hadoop developers are looking for?
What companies hiring data scientists and hadoop developers are looking for?DeZyre
 
Big Data Timeline
Big Data TimelineBig Data Timeline
Big Data TimelineDeZyre
 
Big Data Use Cases
Big Data Use CasesBig Data Use Cases
Big Data Use CasesDeZyre
 
Big data hadoop salary trends
Big data hadoop salary trendsBig data hadoop salary trends
Big data hadoop salary trendsDeZyre
 
Stay updated through online hackathons
Stay updated through online hackathonsStay updated through online hackathons
Stay updated through online hackathonsDeZyre
 
How to become a data scientist
How to become a data scientistHow to become a data scientist
How to become a data scientistDeZyre
 
How big data is transforming BI
How big data is transforming BIHow big data is transforming BI
How big data is transforming BIDeZyre
 
Sports and Big data
Sports and Big dataSports and Big data
Sports and Big dataDeZyre
 
Internet of Things
Internet of ThingsInternet of Things
Internet of ThingsDeZyre
 
Big data in healthcare
Big data in healthcareBig data in healthcare
Big data in healthcareDeZyre
 
What is big data
What is big data What is big data
What is big data DeZyre
 
25 things that make Amazons Jeff Bezos, Jeff Bezos
25 things that make Amazons Jeff Bezos, Jeff Bezos25 things that make Amazons Jeff Bezos, Jeff Bezos
25 things that make Amazons Jeff Bezos, Jeff BezosDeZyre
 

More from DeZyre (14)

Top 10 Data Visualization Tools
Top 10 Data Visualization ToolsTop 10 Data Visualization Tools
Top 10 Data Visualization Tools
 
Data Scientist Skills
Data Scientist SkillsData Scientist Skills
Data Scientist Skills
 
What companies hiring data scientists and hadoop developers are looking for?
What companies hiring data scientists and hadoop developers are looking for?What companies hiring data scientists and hadoop developers are looking for?
What companies hiring data scientists and hadoop developers are looking for?
 
Big Data Timeline
Big Data TimelineBig Data Timeline
Big Data Timeline
 
Big Data Use Cases
Big Data Use CasesBig Data Use Cases
Big Data Use Cases
 
Big data hadoop salary trends
Big data hadoop salary trendsBig data hadoop salary trends
Big data hadoop salary trends
 
Stay updated through online hackathons
Stay updated through online hackathonsStay updated through online hackathons
Stay updated through online hackathons
 
How to become a data scientist
How to become a data scientistHow to become a data scientist
How to become a data scientist
 
How big data is transforming BI
How big data is transforming BIHow big data is transforming BI
How big data is transforming BI
 
Sports and Big data
Sports and Big dataSports and Big data
Sports and Big data
 
Internet of Things
Internet of ThingsInternet of Things
Internet of Things
 
Big data in healthcare
Big data in healthcareBig data in healthcare
Big data in healthcare
 
What is big data
What is big data What is big data
What is big data
 
25 things that make Amazons Jeff Bezos, Jeff Bezos
25 things that make Amazons Jeff Bezos, Jeff Bezos25 things that make Amazons Jeff Bezos, Jeff Bezos
25 things that make Amazons Jeff Bezos, Jeff Bezos
 

Recently uploaded

Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一ffjhghh
 
Predicting Employee Churn: A Data-Driven Approach Project Presentation
Predicting Employee Churn: A Data-Driven Approach Project PresentationPredicting Employee Churn: A Data-Driven Approach Project Presentation
Predicting Employee Churn: A Data-Driven Approach Project PresentationBoston Institute of Analytics
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxJohnnyPlasten
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...dajasot375
 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts ServiceSapana Sha
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxStephen266013
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Callshivangimorya083
 
Digi Khata Problem along complete plan.pptx
Digi Khata Problem along complete plan.pptxDigi Khata Problem along complete plan.pptx
Digi Khata Problem along complete plan.pptxTanveerAhmed817946
 
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Delhi Call girls
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Sapana Sha
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsappssapnasaifi408
 
Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...
Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...
Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...shivangimorya083
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingNeil Barnes
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...Florian Roscheck
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130Suhani Kapoor
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfSocial Samosa
 

Recently uploaded (20)

Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
 
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一
 
Predicting Employee Churn: A Data-Driven Approach Project Presentation
Predicting Employee Churn: A Data-Driven Approach Project PresentationPredicting Employee Churn: A Data-Driven Approach Project Presentation
Predicting Employee Churn: A Data-Driven Approach Project Presentation
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptx
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts Service
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docx
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
 
Digi Khata Problem along complete plan.pptx
Digi Khata Problem along complete plan.pptxDigi Khata Problem along complete plan.pptx
Digi Khata Problem along complete plan.pptx
 
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
 
E-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptxE-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptx
 
Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...
Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...
Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data Storytelling
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
 

How to program your way into data science?

  • 1. How to program your way into Data Science? Eeshan Chatterjee Data Scientist @ MediaIQ Digital https://in.linkedin.com/in/eeshanchatterjee www.github.com/EeshanChatterjee
  • 2. What is Data? Google Definition: ● Facts and statistics collected together for reference or analysis. ● The quantities, characters, or symbols on which operations are performed by a computer, which may be stored and transmitted in the form of electrical signals and recorded on magnetic, optical, or mechanical recording media. ● Things known or assumed as facts, making the basis of reasoning or calculation. Umm... OK. But what is data in the business world? Lets simplify the entire thing. If you can Observe it, Record it, Store it and Measure it, It's gonna help your business. This is the data that is important to you.
  • 3. What data does my business generate? Each and every department, right from the CEO's Office, to the janitorial division collects data. Stored! People Data Sales Data Customer Satisfaction Data Industrial Production & Wastage Data Travel Data Energy Data
  • 4. Now the Buzzword: Data Science
  • 5. The Basics How did we arrive at Data Science? Measure KPIs Model Key Metrics Operations Research The Era of Business Intelligence Dashboards Frequent Updates Business Analytics The Era of Data Science Cockpits Distributed Computation Federated Data Intelligent Systems Guess What didn't Change: Help Business make Better Decisions! The Era of Statistical Insight
  • 6. The Basics If it's always been the same core job, can a statistician call himself a Data Scientist? Well... Not exactly. Today the job has diversified, demanding a wider skillset! Data Design Architect DataEngineer Requirement/Business Analyst Math & Statistics Business & Domain Tech & Computer Science DESIGNTHINKING }
  • 7. But.. Programming for Everything? Actually, Yes. Let's look at a popular cheatsheet circulating on the internet. Infographic courtesy: http://nirvacana.com/thoughts/becoming-a-data-scientist/ Guess what, We can't tick off 15% of this checklist without programming!
  • 8. Programming for Math Scripting Language Packages Data Structures Notebooks & Markdown Plotting Techniques Classes & Functions Cross- Language Execution The Algo Whiz Codebook ● Choose your scripting language. R & Python are the popular chioces. ● Use what's out there. Prebuilt packages for almost every technique are freely available for use. ● Interactive plots cut down EDA time by a huge margin.
  • 9. R or Python? The holy grail of data science choices! It is indeed difficult to choose between the two. Their capabilities are pretty much the same. So, Which one do I choose?* Choose R When Choose Python When ● You are begining to explore your data ● You are looking to find one-time insight or developing analysis methodology ● You want to try out a broad spectrum of techniques to find best ensembles to use ● You have a good understanding of the data and techniques you want to use ● You want to deploy your analysis methodology as a persistant large- scale production system ● You want to train deep models on GPUs * This one is based on my experience and opinion. It has worked for me. The next person you ask, will have a different take on the matter.
  • 10. Programming for Tech Data Platforms Ingestion & Management Services JAVA Distribution & Scale Hadoop, Yarn, Scala, JADE... JAVA Efficient Processing Low level Subroutines C++ GPGPU & Large Scale ML CUDA, OpenGL, MPI C/C++ The Scale-Out Toolbox ● C++ and JAVA form the backbone of almost every at-scale data system ● Most NoSQL & NewSQL databases are based on Java ● Large scale machine learning with millions of data points most certainly need GPU scale processing.
  • 11. Programming for the Business Image courtesy: http://exposedata.com/tutorial/canvas/ The Decision-Maker's Cockpit ● Interactive charts allow answering of business questions intuitive. ● Real time updates allow decisions based on the latest information available. ● Bird's eye and drill down capabilities allow for multiple perspectives without losing context.
  • 12. Design Thinking and Programming Design Thinking let's you break down and analyse the problem and synthesize the best solution from multiple solutions possible. At-Scale Solution Desired Future State Complication 1 Roadblock 2 Issue 3 Possible Solution 1 Possible Solution 2 Possible Solution 3 Possible Solution 4 Prototype Solution 4 Prototype Solution 3 Prototype Solution 2 Prototype Solution 1 Consumption Current State Define | Ideate | Prototype | Iterate | Develop | Deploy