SlideShare a Scribd company logo
1 of 35
Download to read offline
DATA+SCIENCE
A FIRST COURSE
What is Data Science?
Data Science is, in general terms,
the extraction of knowledge from
data
What is Data Science?
Data is increasingly cheap and ubiquitous. We
are collecting and analyzing data,
unprecedented in variety, complexity and
scale.
At the same time, new technologies are
emerging to organize and make sense of this
avalanche of data.
What is Data Science?
Data Science is an interdisciplinary subject
employing concepts and techniques from
mathematics, statistics, computer science
and economics.
It is used to identify patterns and regularities in
data, affecting all aspects of work and society
from medicine to marketing to scientific
research.
Who is a Data Scientist?
A data scientist is someone who is
better at statistics than most
software engineers and better at
software engineering than most
statisticians
Who is a Data Scientist?
A Data Scientist is a professional
with the training and curiosity to
make discoveries while swimming in
an ocean of data; communicating
what they learn and suggesting its
implications for new decisions.
Who is a Data Scientist?
They identify and combine rich and potentially
incomplete data sources, and bring structure to
large quantities of formless data, making
analysis possible.
They engage decision makers in an ongoing
conversation based on the implications of the
data for products, processes, and decisions.
Who is a Data Scientist?
★ A Data Scientist should have solid
quantitative and analytic skills
Statistical
Modelling
Experimental
Design
Bayesian
Inference
Machine
Learning
Information
Theory
Complex
Systems
Who is a Data Scientist?
★ A Data Scientist should be a good
programmer
Scripting:
e.g. python
Statistical
Packages: e.g. R
Databases: SQL
and NoSQL
MapReduce
concepts
Hadoop and
Hive/Pig
Computer
Science
Who is a Data Scientist?
In addition, a Data Scientist should
★ excel at communication and visualization
★ understand economics and business
concepts
★ be curious and creative
Demand for Data Scientists
Demand for Data Scientists
There is a growing demand for data-savvy
professionals in businesses, public agencies,
and nonprofits.
There is a limited supply of professionals who
can efficiently work with data at scale.
Thus, the salaries for data engineers, data
scientists, statisticians, and data analysts
have increased rapidly.
A recent study by the McKinsey Global
Institute estimates that there will be four to
five million jobs in the U.S. requiring data
analysis skills by 2018, and that large numbers
of positions will only be filled through training
or retraining.
In a survey of 816 data professionals in 53
countries, O’Reilly Media report a median
annual salary for Data Science professionals
as $98,000.
SQL, R, Python and Excel are the top earning
skills.
Data Science in India
According to a survey by Gartner
★ In 2013, the Data Analytics market in India
was $1.6 Billion with a growth rate of 8%
★ By 2018, the market is projected to be $3.7
Billion
"For the fourth year in a row, analytics ranks as the No.
1 priority in Gartner's CIO [India] Survey." Bhavish Sood,
research director at Gartner explains.
India is one of the strongest countries in the Data
Science marketplace that boasts of clients including
Facebook, GE, NASA, Tesco and Merck. It can
potentially build a talent pipeline for data scientists that
are virtually non-existent today.
India will need 200,000 data scientists in the next few
years. A single company, Wipro, already has as many as
8,000 people in analytics functions.
Data Science in India
The median annual salary for a Data Scientists in
India is Rs 670,665
The highest paying skills are
Python, Machine Learning,
Statistical Analysis, Big Data
Analytics, and R.
Bengal Chamber proposes smart and
green city for business analytics firms
The Bengal Chamber of Commerce and Industry has
taken an initiative to set up a smart city for business
analytics in West Bengal.
The project would involve service providers like KPMG
Advisory Services and PricewaterhouseCoopers,
corporate consumers, education institutions such as
Indian Institute of Technology Kharagpur, the Indian
Statistical Institute, and the Indian Institute of
Management, Calcutta.
How can you be a Data Scientist?
A Master’s degree is a natural route to be a Data
Scientist.
Massive Open Online Courses (MOOCs) give access to
self-learning at a low cost (often free), but leave it to the
student to identify a suitable set of courses and tools to
round out a coherent skill set.
Bootcamps offer students a practical and structured
learning environment at a far more affordable rate
compared with obtaining a Master’s Degree.
Master’s Degree
Duration 9 - 20 months
Faculty University Professors
Learning Theory and Assignments
Outcome Degree
Projects Practicum and Internship
Placement University Recruiting
Examples UC Berkeley, NYU, NCSU
IIT+IIM+ISI
Tuition $20,000 - $70,000 (US)
₹20,000,000 (India)
Self-Learning (MOOCs)
Duration 6 - 18 months (part time)
Faculty University Professors
(recorded lectures)
Learning Self guided
Outcome Certificate
Projects Projects on own time
Placement Self-driven job search
Examples Coursera, Udacity
Tuition Free- $500 (US)
Bootcamps
Duration 2 - 3 months
Faculty Professors & Data Scientists
Learning Experiential Learning
Outcome Certificate and Portfolio
Projects Built-In Projects
Placement Hiring Day and
Placement Assistance
Examples Zipfan, Metis, Data Incubator
Tuition Free - $16,000 (US)
The Course
Data+Science: A First Course is an intensive
eight-week program based on the bootcamp
model, organized by The Data+Science
Initiative.
It is designed to teach and train graduates in
quantitative fields to take an entry-level
position as a data scientist.
Objectives of the Course
Upon graduating a student will:
1. Have a clear understanding of and practical
experience with the process of designing,
implementing, and communicating the results of a
data science project.
2. Understand the landscape of data science tools and
their applications, and be prepared to identify and
dig into new technologies and algorithms needed
for the job at hand.
Overview
Data science gives valuable meaning to large sets
of complex and unstructured data.
The focus is around concepts and techniques to
mine, store, analyse and visualize data.
Data science is a highly interdisciplinary drawing
from fields such as computer science (algorithms
and databases), statistics (hypothesis testing and
inference), artificial intelligence (pattern
recognition and machine learning).
Course Content
Data Mining (⅛):
identifying data sources; extracting, cleaning
and verifying structured and unstructured data
Data Storage (¼):
structuring, storage and retrieval of data;
including big data and NoSQL
Data Analysis (½):
descriptive and inferential analysis; predictive
modelling, risk analysis and decision making
Data Visualization (⅛)
Course Content
Graduating students will:
1. Be proficient in statistical concepts and
mathematical techniques including correlation
functions, inference and hypothesis testing.
2. Be able to make predictive analyses by modelling
stochastic processes based on available data.
3. Learn and apply Machine Learning concepts to
solve data science problems
Course Content
4. Be capable coders in Python and R, including the
related packages and toolsets most commonly
used in data science.
5. Know the fundamentals of data visualization and
have experience creating static and dynamic data
visuals using JavaScript and D3.js.
6. Have introductory exposure to big data tools and
architecture such as the Hadoop stack, know when
these tools are necessary, and be poised to quickly
train up and utilize them in a big data project.
Prerequisites
Basic Statistics and Probability
descriptive statistics and distributions
Linear Algebra
vectors and matrices
Calculus and Differential Equations
basic calculus and finding extrema, ordinary
differential equations
Programming
basic proficiency in any programming language
Preferred Subjects
Computer Science
algorithms, data structures and databases
Advanced Statistics
bayesian inference and stochoastic processes
Statistical Mechanics/Information Theory
entropy, information, complexity
Economics
supply/demand, game theory
Web Development
HTML, CSS and Javascript
Eligibility
Anyone meeting the prerequisite criteria is
eligible, determined by a qualifying exam, with
preference given to those with knowledge of
the preferred subjects.
However, we would prefer applicants to have a
bachelor’s degree in a quantitative field, such
as: Engineering, Physics, Mathematics,
Statistics, Economics or Computer
Applications.
Course Details
The course consists of 24 classes over 8 weeks.
Each class (Mondays, Wednesdays, Fridays) is 6
hours in duration (10AM-4PM) including a lunch
hour.
Morning sessions consists of lectures and
discussions while the afternoons is a guided
programming session.
In addition, instructors will be available for office
hours at scheduled times.
Course Projects
The course is divided into three parts.
Part A (Weeks 1-4): daily programming projects
executed individually or in groups
Part B (Weeks 5-8): weekly projects in groups
drawn from the industry
Part C (Weeks 9-11, optional): course project in
groups with biweekly meetings with instructors
Benefits
Employment: Students will have the skill set and
portfolio to find employment as an entry level
data scientist. Such a skill set is in great demand,
both domestically as well as in developed
countries.
Research: Since Data Science is at the core of
academic research, our students, armed with the
knowledge, portfolio and recommendation will
find easier admission to universities, especially
abroad.

More Related Content

What's hot

Data Science Tutorial | Introduction To Data Science | Data Science Training ...
Data Science Tutorial | Introduction To Data Science | Data Science Training ...Data Science Tutorial | Introduction To Data Science | Data Science Training ...
Data Science Tutorial | Introduction To Data Science | Data Science Training ...Edureka!
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data ScienceEdureka!
 
Responsible Data Use in AI - core tech pillars
Responsible Data Use in AI - core tech pillarsResponsible Data Use in AI - core tech pillars
Responsible Data Use in AI - core tech pillarsSofus Macskássy
 
Application of data mining
Application of data miningApplication of data mining
Application of data miningSHIVANI SONI
 
Business Intelligence & Predictive Analytic by Prof. Lili Saghafi
Business Intelligence & Predictive Analytic by Prof. Lili SaghafiBusiness Intelligence & Predictive Analytic by Prof. Lili Saghafi
Business Intelligence & Predictive Analytic by Prof. Lili SaghafiProfessor Lili Saghafi
 
Data Science Training | Data Science Tutorial | Data Science Certification | ...
Data Science Training | Data Science Tutorial | Data Science Certification | ...Data Science Training | Data Science Tutorial | Data Science Certification | ...
Data Science Training | Data Science Tutorial | Data Science Certification | ...Edureka!
 
Data Analyst Job Description | Edureka
Data Analyst Job Description | EdurekaData Analyst Job Description | Edureka
Data Analyst Job Description | EdurekaEdureka!
 
Foundation of Digital Forensics
Foundation of Digital ForensicsFoundation of Digital Forensics
Foundation of Digital ForensicsVictor C. Sovichea
 
Data Mining Tools / Orange
Data Mining Tools / OrangeData Mining Tools / Orange
Data Mining Tools / OrangeYasemin Karaman
 
An introduction to open data
An introduction to open dataAn introduction to open data
An introduction to open dataSally Lait
 
Exploratory data analysis data visualization
Exploratory data analysis data visualizationExploratory data analysis data visualization
Exploratory data analysis data visualizationDr. Hamdan Al-Sabri
 
Introduction to data science
Introduction to data scienceIntroduction to data science
Introduction to data scienceMahir Haque
 
Introduction on Data Science
Introduction on Data ScienceIntroduction on Data Science
Introduction on Data ScienceEdureka!
 
Data Science vs Machine Learning – What’s The Difference? | Data Science Cour...
Data Science vs Machine Learning – What’s The Difference? | Data Science Cour...Data Science vs Machine Learning – What’s The Difference? | Data Science Cour...
Data Science vs Machine Learning – What’s The Difference? | Data Science Cour...Edureka!
 
Top 10 Applications Of Artificial Intelligence | Edureka
Top 10 Applications Of Artificial Intelligence | EdurekaTop 10 Applications Of Artificial Intelligence | Edureka
Top 10 Applications Of Artificial Intelligence | EdurekaEdureka!
 
APPLICATION OF DATA SCIENCE IN HEALTHCARE
APPLICATION OF DATA SCIENCE IN HEALTHCAREAPPLICATION OF DATA SCIENCE IN HEALTHCARE
APPLICATION OF DATA SCIENCE IN HEALTHCAREAnnaAntony16
 
Data masking in sas
Data masking in sasData masking in sas
Data masking in sasMurphy Choy
 

What's hot (20)

Data Science Tutorial | Introduction To Data Science | Data Science Training ...
Data Science Tutorial | Introduction To Data Science | Data Science Training ...Data Science Tutorial | Introduction To Data Science | Data Science Training ...
Data Science Tutorial | Introduction To Data Science | Data Science Training ...
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Responsible Data Use in AI - core tech pillars
Responsible Data Use in AI - core tech pillarsResponsible Data Use in AI - core tech pillars
Responsible Data Use in AI - core tech pillars
 
Application of data mining
Application of data miningApplication of data mining
Application of data mining
 
Business Intelligence & Predictive Analytic by Prof. Lili Saghafi
Business Intelligence & Predictive Analytic by Prof. Lili SaghafiBusiness Intelligence & Predictive Analytic by Prof. Lili Saghafi
Business Intelligence & Predictive Analytic by Prof. Lili Saghafi
 
Data Science Training | Data Science Tutorial | Data Science Certification | ...
Data Science Training | Data Science Tutorial | Data Science Certification | ...Data Science Training | Data Science Tutorial | Data Science Certification | ...
Data Science Training | Data Science Tutorial | Data Science Certification | ...
 
Data Analyst Job Description | Edureka
Data Analyst Job Description | EdurekaData Analyst Job Description | Edureka
Data Analyst Job Description | Edureka
 
Foundation of Digital Forensics
Foundation of Digital ForensicsFoundation of Digital Forensics
Foundation of Digital Forensics
 
Data science
Data science Data science
Data science
 
Data Mining Tools / Orange
Data Mining Tools / OrangeData Mining Tools / Orange
Data Mining Tools / Orange
 
Data science
Data scienceData science
Data science
 
An introduction to open data
An introduction to open dataAn introduction to open data
An introduction to open data
 
Exploratory data analysis data visualization
Exploratory data analysis data visualizationExploratory data analysis data visualization
Exploratory data analysis data visualization
 
Introduction to data science
Introduction to data scienceIntroduction to data science
Introduction to data science
 
Introduction on Data Science
Introduction on Data ScienceIntroduction on Data Science
Introduction on Data Science
 
Computer forensic
Computer forensicComputer forensic
Computer forensic
 
Data Science vs Machine Learning – What’s The Difference? | Data Science Cour...
Data Science vs Machine Learning – What’s The Difference? | Data Science Cour...Data Science vs Machine Learning – What’s The Difference? | Data Science Cour...
Data Science vs Machine Learning – What’s The Difference? | Data Science Cour...
 
Top 10 Applications Of Artificial Intelligence | Edureka
Top 10 Applications Of Artificial Intelligence | EdurekaTop 10 Applications Of Artificial Intelligence | Edureka
Top 10 Applications Of Artificial Intelligence | Edureka
 
APPLICATION OF DATA SCIENCE IN HEALTHCARE
APPLICATION OF DATA SCIENCE IN HEALTHCAREAPPLICATION OF DATA SCIENCE IN HEALTHCARE
APPLICATION OF DATA SCIENCE IN HEALTHCARE
 
Data masking in sas
Data masking in sasData masking in sas
Data masking in sas
 

Viewers also liked

Socializing Big Data: Collaborative Opportunities in Computer Science, the So...
Socializing Big Data: Collaborative Opportunities in Computer Science, the So...Socializing Big Data: Collaborative Opportunities in Computer Science, the So...
Socializing Big Data: Collaborative Opportunities in Computer Science, the So...Sheryl Grant
 
Buy Embedded Systems Projects,B tech Final Year Projects Online
Buy Embedded Systems Projects,B tech Final Year Projects OnlineBuy Embedded Systems Projects,B tech Final Year Projects Online
Buy Embedded Systems Projects,B tech Final Year Projects OnlineTechnogroovy
 
Fall Directors 2014: Junior/Upperclass Research Projects Presentation
Fall Directors 2014: Junior/Upperclass Research Projects PresentationFall Directors 2014: Junior/Upperclass Research Projects Presentation
Fall Directors 2014: Junior/Upperclass Research Projects PresentationBonner Foundation
 
Designing Course-Based, Student-Faculty Collaborative Research Projects Usi...
Designing Course-Based,  Student-Faculty Collaborative  Research Projects Usi...Designing Course-Based,  Student-Faculty Collaborative  Research Projects Usi...
Designing Course-Based, Student-Faculty Collaborative Research Projects Usi...Rebecca Davis
 
Mostra cultural Emec Paulo Freire e Cecília Meireles 2011
Mostra cultural Emec Paulo Freire e Cecília Meireles  2011Mostra cultural Emec Paulo Freire e Cecília Meireles  2011
Mostra cultural Emec Paulo Freire e Cecília Meireles 2011elianehistoriarte
 
School Science Projects based on Experiments
School Science Projects based on ExperimentsSchool Science Projects based on Experiments
School Science Projects based on ExperimentsHiran Amarasekera
 
Job satisfaction Research based project
Job satisfaction Research based projectJob satisfaction Research based project
Job satisfaction Research based projectHARSH SHAH
 
Bootstrapping Machine Learning
Bootstrapping Machine LearningBootstrapping Machine Learning
Bootstrapping Machine LearningLouis Dorard
 
Big Data application - OSS / BSS
Big Data application - OSS / BSSBig Data application - OSS / BSS
Big Data application - OSS / BSSKeyur Thakore
 
Advanced engineering math 8 e solutions manual evens kreyszig
Advanced engineering math 8 e solutions manual evens   kreyszigAdvanced engineering math 8 e solutions manual evens   kreyszig
Advanced engineering math 8 e solutions manual evens kreyszigBianca Iris Estrada Rivera
 
DevOps 101 - Moving Fast with Confidence
DevOps 101 - Moving Fast with ConfidenceDevOps 101 - Moving Fast with Confidence
DevOps 101 - Moving Fast with ConfidenceNew Relic
 

Viewers also liked (20)

Algorithm Class is a Training Institute on C, C++, CPP, DS, JAVA, data struct...
Algorithm Class is a Training Institute on C, C++, CPP, DS, JAVA, data struct...Algorithm Class is a Training Institute on C, C++, CPP, DS, JAVA, data struct...
Algorithm Class is a Training Institute on C, C++, CPP, DS, JAVA, data struct...
 
Mastering Data Structures | data structures training Hyderabad
Mastering Data Structures | data structures training HyderabadMastering Data Structures | data structures training Hyderabad
Mastering Data Structures | data structures training Hyderabad
 
Algorithm Class at KPHB (C, C++ Course Training Institute in KPHB, Kukatpally...
Algorithm Class at KPHB (C, C++ Course Training Institute in KPHB, Kukatpally...Algorithm Class at KPHB (C, C++ Course Training Institute in KPHB, Kukatpally...
Algorithm Class at KPHB (C, C++ Course Training Institute in KPHB, Kukatpally...
 
Socializing Big Data: Collaborative Opportunities in Computer Science, the So...
Socializing Big Data: Collaborative Opportunities in Computer Science, the So...Socializing Big Data: Collaborative Opportunities in Computer Science, the So...
Socializing Big Data: Collaborative Opportunities in Computer Science, the So...
 
Buy Embedded Systems Projects,B tech Final Year Projects Online
Buy Embedded Systems Projects,B tech Final Year Projects OnlineBuy Embedded Systems Projects,B tech Final Year Projects Online
Buy Embedded Systems Projects,B tech Final Year Projects Online
 
Fall Directors 2014: Junior/Upperclass Research Projects Presentation
Fall Directors 2014: Junior/Upperclass Research Projects PresentationFall Directors 2014: Junior/Upperclass Research Projects Presentation
Fall Directors 2014: Junior/Upperclass Research Projects Presentation
 
Novo lar planalto
Novo  lar  planaltoNovo  lar  planalto
Novo lar planalto
 
Novo lar parque das águas
Novo lar parque das águasNovo lar parque das águas
Novo lar parque das águas
 
Novo lar veneza
Novo lar  venezaNovo lar  veneza
Novo lar veneza
 
Linux training in chandigarh
Linux training in chandigarhLinux training in chandigarh
Linux training in chandigarh
 
Designing Course-Based, Student-Faculty Collaborative Research Projects Usi...
Designing Course-Based,  Student-Faculty Collaborative  Research Projects Usi...Designing Course-Based,  Student-Faculty Collaborative  Research Projects Usi...
Designing Course-Based, Student-Faculty Collaborative Research Projects Usi...
 
Building Data Teams
Building Data TeamsBuilding Data Teams
Building Data Teams
 
Mostra cultural Emec Paulo Freire e Cecília Meireles 2011
Mostra cultural Emec Paulo Freire e Cecília Meireles  2011Mostra cultural Emec Paulo Freire e Cecília Meireles  2011
Mostra cultural Emec Paulo Freire e Cecília Meireles 2011
 
School Science Projects based on Experiments
School Science Projects based on ExperimentsSchool Science Projects based on Experiments
School Science Projects based on Experiments
 
Data Mining (Predict The Future)
Data Mining (Predict The Future)Data Mining (Predict The Future)
Data Mining (Predict The Future)
 
Job satisfaction Research based project
Job satisfaction Research based projectJob satisfaction Research based project
Job satisfaction Research based project
 
Bootstrapping Machine Learning
Bootstrapping Machine LearningBootstrapping Machine Learning
Bootstrapping Machine Learning
 
Big Data application - OSS / BSS
Big Data application - OSS / BSSBig Data application - OSS / BSS
Big Data application - OSS / BSS
 
Advanced engineering math 8 e solutions manual evens kreyszig
Advanced engineering math 8 e solutions manual evens   kreyszigAdvanced engineering math 8 e solutions manual evens   kreyszig
Advanced engineering math 8 e solutions manual evens kreyszig
 
DevOps 101 - Moving Fast with Confidence
DevOps 101 - Moving Fast with ConfidenceDevOps 101 - Moving Fast with Confidence
DevOps 101 - Moving Fast with Confidence
 

Similar to Data+Science : A First Course

Certified Data Science Training in Pune-March
Certified Data Science Training in Pune-MarchCertified Data Science Training in Pune-March
Certified Data Science Training in Pune-MarchDataMites
 
Certified Data Science Course in Pune-March
Certified Data Science Course in Pune-MarchCertified Data Science Course in Pune-March
Certified Data Science Course in Pune-MarchDataMites
 
Certified Data Science Course in Pune-March
Certified Data Science Course in Pune-MarchCertified Data Science Course in Pune-March
Certified Data Science Course in Pune-MarchDataMites
 
Certified Data Science Training in Chennai-March
Certified Data Science Training in Chennai-MarchCertified Data Science Training in Chennai-March
Certified Data Science Training in Chennai-MarchDataMites
 
Certified Data Scientist Course in Chennai-March
Certified Data Scientist Course in Chennai-MarchCertified Data Scientist Course in Chennai-March
Certified Data Scientist Course in Chennai-MarchDataMites
 
Data science course in Moradabad.pdf
Data science course in Moradabad.pdfData science course in Moradabad.pdf
Data science course in Moradabad.pdfKajal Digital
 
Certified Data Science Course in Pune-March
Certified Data Science Course in Pune-MarchCertified Data Science Course in Pune-March
Certified Data Science Course in Pune-MarchDataMites
 
Certified Data Science Course in Pune-May
Certified Data Science Course in Pune-MayCertified Data Science Course in Pune-May
Certified Data Science Course in Pune-MayDataMites
 
Certified Data Science Course in Chennai-March
Certified Data Science Course in Chennai-MarchCertified Data Science Course in Chennai-March
Certified Data Science Course in Chennai-MarchDataMites
 
Certified Data Scientist Training in Pune-May.pptx
Certified Data Scientist Training in Pune-May.pptxCertified Data Scientist Training in Pune-May.pptx
Certified Data Scientist Training in Pune-May.pptxDataMites
 
Data Science (Moradabad).pdf
Data Science (Moradabad).pdfData Science (Moradabad).pdf
Data Science (Moradabad).pdfUmar khan
 
Introduction to Data Science.pdf
Introduction to Data Science.pdfIntroduction to Data Science.pdf
Introduction to Data Science.pdfUniversity of Sindh
 
Data Science Certification in Pune-March
Data Science Certification in Pune-MarchData Science Certification in Pune-March
Data Science Certification in Pune-MarchDataMites
 
Data Science Course after 12th A Comprehensive Guide.pptx
Data Science Course after 12th A Comprehensive Guide.pptxData Science Course after 12th A Comprehensive Guide.pptx
Data Science Course after 12th A Comprehensive Guide.pptxAvinash Sharma
 
From Data to Discovery: The Journey of a Data Scientist
From Data to Discovery: The Journey of a Data ScientistFrom Data to Discovery: The Journey of a Data Scientist
From Data to Discovery: The Journey of a Data ScientistUncodemy
 
Data Science Certification in Chennai-March
Data Science Certification in Chennai-MarchData Science Certification in Chennai-March
Data Science Certification in Chennai-MarchDataMites
 
ds.pptx
ds.pptxds.pptx
ds.pptxElves3
 
The Analytics and Data Science Landscape
The Analytics and Data Science LandscapeThe Analytics and Data Science Landscape
The Analytics and Data Science LandscapePhilip Bourne
 

Similar to Data+Science : A First Course (20)

Certified Data Science Training in Pune-March
Certified Data Science Training in Pune-MarchCertified Data Science Training in Pune-March
Certified Data Science Training in Pune-March
 
Certified Data Science Course in Pune-March
Certified Data Science Course in Pune-MarchCertified Data Science Course in Pune-March
Certified Data Science Course in Pune-March
 
Certified Data Science Course in Pune-March
Certified Data Science Course in Pune-MarchCertified Data Science Course in Pune-March
Certified Data Science Course in Pune-March
 
Certified Data Science Training in Chennai-March
Certified Data Science Training in Chennai-MarchCertified Data Science Training in Chennai-March
Certified Data Science Training in Chennai-March
 
Certified Data Scientist Course in Chennai-March
Certified Data Scientist Course in Chennai-MarchCertified Data Scientist Course in Chennai-March
Certified Data Scientist Course in Chennai-March
 
Data science course in Moradabad.pdf
Data science course in Moradabad.pdfData science course in Moradabad.pdf
Data science course in Moradabad.pdf
 
Certified Data Science Course in Pune-March
Certified Data Science Course in Pune-MarchCertified Data Science Course in Pune-March
Certified Data Science Course in Pune-March
 
Certified Data Science Course in Pune-May
Certified Data Science Course in Pune-MayCertified Data Science Course in Pune-May
Certified Data Science Course in Pune-May
 
Data Science.pptx
Data Science.pptxData Science.pptx
Data Science.pptx
 
Certified Data Science Course in Chennai-March
Certified Data Science Course in Chennai-MarchCertified Data Science Course in Chennai-March
Certified Data Science Course in Chennai-March
 
Certified Data Scientist Training in Pune-May.pptx
Certified Data Scientist Training in Pune-May.pptxCertified Data Scientist Training in Pune-May.pptx
Certified Data Scientist Training in Pune-May.pptx
 
Data Science (Moradabad).pdf
Data Science (Moradabad).pdfData Science (Moradabad).pdf
Data Science (Moradabad).pdf
 
Introduction to Data Science.pdf
Introduction to Data Science.pdfIntroduction to Data Science.pdf
Introduction to Data Science.pdf
 
Data Science Certification in Pune-March
Data Science Certification in Pune-MarchData Science Certification in Pune-March
Data Science Certification in Pune-March
 
Data Science Course after 12th A Comprehensive Guide.pptx
Data Science Course after 12th A Comprehensive Guide.pptxData Science Course after 12th A Comprehensive Guide.pptx
Data Science Course after 12th A Comprehensive Guide.pptx
 
From Data to Discovery: The Journey of a Data Scientist
From Data to Discovery: The Journey of a Data ScientistFrom Data to Discovery: The Journey of a Data Scientist
From Data to Discovery: The Journey of a Data Scientist
 
Data Science Certification in Chennai-March
Data Science Certification in Chennai-MarchData Science Certification in Chennai-March
Data Science Certification in Chennai-March
 
ds.pptx
ds.pptxds.pptx
ds.pptx
 
How to crack down big data?
How to crack down big data? How to crack down big data?
How to crack down big data?
 
The Analytics and Data Science Landscape
The Analytics and Data Science LandscapeThe Analytics and Data Science Landscape
The Analytics and Data Science Landscape
 

Recently uploaded

Data Analysis Project Presentation : NYC Shooting Cluster Analysis
Data Analysis Project Presentation : NYC Shooting Cluster AnalysisData Analysis Project Presentation : NYC Shooting Cluster Analysis
Data Analysis Project Presentation : NYC Shooting Cluster AnalysisBoston Institute of Analytics
 
Bios of leading Astrologers & Researchers
Bios of leading Astrologers & ResearchersBios of leading Astrologers & Researchers
Bios of leading Astrologers & Researchersdarmandersingh4580
 
如何办理(UCLA毕业证书)加州大学洛杉矶分校毕业证成绩单学位证留信学历认证原件一样
如何办理(UCLA毕业证书)加州大学洛杉矶分校毕业证成绩单学位证留信学历认证原件一样如何办理(UCLA毕业证书)加州大学洛杉矶分校毕业证成绩单学位证留信学历认证原件一样
如何办理(UCLA毕业证书)加州大学洛杉矶分校毕业证成绩单学位证留信学历认证原件一样jk0tkvfv
 
Displacement, Velocity, Acceleration, and Second Derivatives
Displacement, Velocity, Acceleration, and Second DerivativesDisplacement, Velocity, Acceleration, and Second Derivatives
Displacement, Velocity, Acceleration, and Second Derivatives23050636
 
NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...
NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...
NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...Amil baba
 
Seven tools of quality control.slideshare
Seven tools of quality control.slideshareSeven tools of quality control.slideshare
Seven tools of quality control.slideshareraiaryan448
 
Genuine love spell caster )! ,+27834335081) Ex lover back permanently in At...
Genuine love spell caster )! ,+27834335081)   Ex lover back permanently in At...Genuine love spell caster )! ,+27834335081)   Ex lover back permanently in At...
Genuine love spell caster )! ,+27834335081) Ex lover back permanently in At...BabaJohn3
 
1:1原版定制伦敦政治经济学院毕业证(LSE毕业证)成绩单学位证书留信学历认证
1:1原版定制伦敦政治经济学院毕业证(LSE毕业证)成绩单学位证书留信学历认证1:1原版定制伦敦政治经济学院毕业证(LSE毕业证)成绩单学位证书留信学历认证
1:1原版定制伦敦政治经济学院毕业证(LSE毕业证)成绩单学位证书留信学历认证dq9vz1isj
 
What is Insertion Sort. Its basic information
What is Insertion Sort. Its basic informationWhat is Insertion Sort. Its basic information
What is Insertion Sort. Its basic informationmuqadasqasim10
 
原件一样伦敦国王学院毕业证成绩单留信学历认证
原件一样伦敦国王学院毕业证成绩单留信学历认证原件一样伦敦国王学院毕业证成绩单留信学历认证
原件一样伦敦国王学院毕业证成绩单留信学历认证pwgnohujw
 
Digital Marketing Demystified: Expert Tips from Samantha Rae Coolbeth
Digital Marketing Demystified: Expert Tips from Samantha Rae CoolbethDigital Marketing Demystified: Expert Tips from Samantha Rae Coolbeth
Digital Marketing Demystified: Expert Tips from Samantha Rae CoolbethSamantha Rae Coolbeth
 
如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证
如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证
如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证zifhagzkk
 
Identify Customer Segments to Create Customer Offers for Each Segment - Appli...
Identify Customer Segments to Create Customer Offers for Each Segment - Appli...Identify Customer Segments to Create Customer Offers for Each Segment - Appli...
Identify Customer Segments to Create Customer Offers for Each Segment - Appli...ThinkInnovation
 
Identify Rules that Predict Patient’s Heart Disease - An Application of Decis...
Identify Rules that Predict Patient’s Heart Disease - An Application of Decis...Identify Rules that Predict Patient’s Heart Disease - An Application of Decis...
Identify Rules that Predict Patient’s Heart Disease - An Application of Decis...ThinkInnovation
 
Aggregations - The Elasticsearch "GROUP BY"
Aggregations - The Elasticsearch "GROUP BY"Aggregations - The Elasticsearch "GROUP BY"
Aggregations - The Elasticsearch "GROUP BY"John Sobanski
 
1:1原版定制利物浦大学毕业证(Liverpool毕业证)成绩单学位证书留信学历认证
1:1原版定制利物浦大学毕业证(Liverpool毕业证)成绩单学位证书留信学历认证1:1原版定制利物浦大学毕业证(Liverpool毕业证)成绩单学位证书留信学历认证
1:1原版定制利物浦大学毕业证(Liverpool毕业证)成绩单学位证书留信学历认证ppy8zfkfm
 
obat aborsi Bontang wa 081336238223 jual obat aborsi cytotec asli di Bontang6...
obat aborsi Bontang wa 081336238223 jual obat aborsi cytotec asli di Bontang6...obat aborsi Bontang wa 081336238223 jual obat aborsi cytotec asli di Bontang6...
obat aborsi Bontang wa 081336238223 jual obat aborsi cytotec asli di Bontang6...yulianti213969
 
Jual Obat Aborsi Bandung (Asli No.1) Wa 082134680322 Klinik Obat Penggugur Ka...
Jual Obat Aborsi Bandung (Asli No.1) Wa 082134680322 Klinik Obat Penggugur Ka...Jual Obat Aborsi Bandung (Asli No.1) Wa 082134680322 Klinik Obat Penggugur Ka...
Jual Obat Aborsi Bandung (Asli No.1) Wa 082134680322 Klinik Obat Penggugur Ka...Klinik Aborsi
 
Audience Researchndfhcvnfgvgbhujhgfv.pptx
Audience Researchndfhcvnfgvgbhujhgfv.pptxAudience Researchndfhcvnfgvgbhujhgfv.pptx
Audience Researchndfhcvnfgvgbhujhgfv.pptxStephen266013
 
Credit Card Fraud Detection: Safeguarding Transactions in the Digital Age
Credit Card Fraud Detection: Safeguarding Transactions in the Digital AgeCredit Card Fraud Detection: Safeguarding Transactions in the Digital Age
Credit Card Fraud Detection: Safeguarding Transactions in the Digital AgeBoston Institute of Analytics
 

Recently uploaded (20)

Data Analysis Project Presentation : NYC Shooting Cluster Analysis
Data Analysis Project Presentation : NYC Shooting Cluster AnalysisData Analysis Project Presentation : NYC Shooting Cluster Analysis
Data Analysis Project Presentation : NYC Shooting Cluster Analysis
 
Bios of leading Astrologers & Researchers
Bios of leading Astrologers & ResearchersBios of leading Astrologers & Researchers
Bios of leading Astrologers & Researchers
 
如何办理(UCLA毕业证书)加州大学洛杉矶分校毕业证成绩单学位证留信学历认证原件一样
如何办理(UCLA毕业证书)加州大学洛杉矶分校毕业证成绩单学位证留信学历认证原件一样如何办理(UCLA毕业证书)加州大学洛杉矶分校毕业证成绩单学位证留信学历认证原件一样
如何办理(UCLA毕业证书)加州大学洛杉矶分校毕业证成绩单学位证留信学历认证原件一样
 
Displacement, Velocity, Acceleration, and Second Derivatives
Displacement, Velocity, Acceleration, and Second DerivativesDisplacement, Velocity, Acceleration, and Second Derivatives
Displacement, Velocity, Acceleration, and Second Derivatives
 
NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...
NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...
NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...
 
Seven tools of quality control.slideshare
Seven tools of quality control.slideshareSeven tools of quality control.slideshare
Seven tools of quality control.slideshare
 
Genuine love spell caster )! ,+27834335081) Ex lover back permanently in At...
Genuine love spell caster )! ,+27834335081)   Ex lover back permanently in At...Genuine love spell caster )! ,+27834335081)   Ex lover back permanently in At...
Genuine love spell caster )! ,+27834335081) Ex lover back permanently in At...
 
1:1原版定制伦敦政治经济学院毕业证(LSE毕业证)成绩单学位证书留信学历认证
1:1原版定制伦敦政治经济学院毕业证(LSE毕业证)成绩单学位证书留信学历认证1:1原版定制伦敦政治经济学院毕业证(LSE毕业证)成绩单学位证书留信学历认证
1:1原版定制伦敦政治经济学院毕业证(LSE毕业证)成绩单学位证书留信学历认证
 
What is Insertion Sort. Its basic information
What is Insertion Sort. Its basic informationWhat is Insertion Sort. Its basic information
What is Insertion Sort. Its basic information
 
原件一样伦敦国王学院毕业证成绩单留信学历认证
原件一样伦敦国王学院毕业证成绩单留信学历认证原件一样伦敦国王学院毕业证成绩单留信学历认证
原件一样伦敦国王学院毕业证成绩单留信学历认证
 
Digital Marketing Demystified: Expert Tips from Samantha Rae Coolbeth
Digital Marketing Demystified: Expert Tips from Samantha Rae CoolbethDigital Marketing Demystified: Expert Tips from Samantha Rae Coolbeth
Digital Marketing Demystified: Expert Tips from Samantha Rae Coolbeth
 
如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证
如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证
如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证
 
Identify Customer Segments to Create Customer Offers for Each Segment - Appli...
Identify Customer Segments to Create Customer Offers for Each Segment - Appli...Identify Customer Segments to Create Customer Offers for Each Segment - Appli...
Identify Customer Segments to Create Customer Offers for Each Segment - Appli...
 
Identify Rules that Predict Patient’s Heart Disease - An Application of Decis...
Identify Rules that Predict Patient’s Heart Disease - An Application of Decis...Identify Rules that Predict Patient’s Heart Disease - An Application of Decis...
Identify Rules that Predict Patient’s Heart Disease - An Application of Decis...
 
Aggregations - The Elasticsearch "GROUP BY"
Aggregations - The Elasticsearch "GROUP BY"Aggregations - The Elasticsearch "GROUP BY"
Aggregations - The Elasticsearch "GROUP BY"
 
1:1原版定制利物浦大学毕业证(Liverpool毕业证)成绩单学位证书留信学历认证
1:1原版定制利物浦大学毕业证(Liverpool毕业证)成绩单学位证书留信学历认证1:1原版定制利物浦大学毕业证(Liverpool毕业证)成绩单学位证书留信学历认证
1:1原版定制利物浦大学毕业证(Liverpool毕业证)成绩单学位证书留信学历认证
 
obat aborsi Bontang wa 081336238223 jual obat aborsi cytotec asli di Bontang6...
obat aborsi Bontang wa 081336238223 jual obat aborsi cytotec asli di Bontang6...obat aborsi Bontang wa 081336238223 jual obat aborsi cytotec asli di Bontang6...
obat aborsi Bontang wa 081336238223 jual obat aborsi cytotec asli di Bontang6...
 
Jual Obat Aborsi Bandung (Asli No.1) Wa 082134680322 Klinik Obat Penggugur Ka...
Jual Obat Aborsi Bandung (Asli No.1) Wa 082134680322 Klinik Obat Penggugur Ka...Jual Obat Aborsi Bandung (Asli No.1) Wa 082134680322 Klinik Obat Penggugur Ka...
Jual Obat Aborsi Bandung (Asli No.1) Wa 082134680322 Klinik Obat Penggugur Ka...
 
Audience Researchndfhcvnfgvgbhujhgfv.pptx
Audience Researchndfhcvnfgvgbhujhgfv.pptxAudience Researchndfhcvnfgvgbhujhgfv.pptx
Audience Researchndfhcvnfgvgbhujhgfv.pptx
 
Credit Card Fraud Detection: Safeguarding Transactions in the Digital Age
Credit Card Fraud Detection: Safeguarding Transactions in the Digital AgeCredit Card Fraud Detection: Safeguarding Transactions in the Digital Age
Credit Card Fraud Detection: Safeguarding Transactions in the Digital Age
 

Data+Science : A First Course

  • 2. What is Data Science? Data Science is, in general terms, the extraction of knowledge from data
  • 3. What is Data Science? Data is increasingly cheap and ubiquitous. We are collecting and analyzing data, unprecedented in variety, complexity and scale. At the same time, new technologies are emerging to organize and make sense of this avalanche of data.
  • 4. What is Data Science? Data Science is an interdisciplinary subject employing concepts and techniques from mathematics, statistics, computer science and economics. It is used to identify patterns and regularities in data, affecting all aspects of work and society from medicine to marketing to scientific research.
  • 5. Who is a Data Scientist? A data scientist is someone who is better at statistics than most software engineers and better at software engineering than most statisticians
  • 6. Who is a Data Scientist? A Data Scientist is a professional with the training and curiosity to make discoveries while swimming in an ocean of data; communicating what they learn and suggesting its implications for new decisions.
  • 7. Who is a Data Scientist? They identify and combine rich and potentially incomplete data sources, and bring structure to large quantities of formless data, making analysis possible. They engage decision makers in an ongoing conversation based on the implications of the data for products, processes, and decisions.
  • 8. Who is a Data Scientist? ★ A Data Scientist should have solid quantitative and analytic skills Statistical Modelling Experimental Design Bayesian Inference Machine Learning Information Theory Complex Systems
  • 9. Who is a Data Scientist? ★ A Data Scientist should be a good programmer Scripting: e.g. python Statistical Packages: e.g. R Databases: SQL and NoSQL MapReduce concepts Hadoop and Hive/Pig Computer Science
  • 10. Who is a Data Scientist? In addition, a Data Scientist should ★ excel at communication and visualization ★ understand economics and business concepts ★ be curious and creative
  • 11. Demand for Data Scientists
  • 12. Demand for Data Scientists There is a growing demand for data-savvy professionals in businesses, public agencies, and nonprofits. There is a limited supply of professionals who can efficiently work with data at scale. Thus, the salaries for data engineers, data scientists, statisticians, and data analysts have increased rapidly.
  • 13. A recent study by the McKinsey Global Institute estimates that there will be four to five million jobs in the U.S. requiring data analysis skills by 2018, and that large numbers of positions will only be filled through training or retraining.
  • 14. In a survey of 816 data professionals in 53 countries, O’Reilly Media report a median annual salary for Data Science professionals as $98,000. SQL, R, Python and Excel are the top earning skills.
  • 15. Data Science in India According to a survey by Gartner ★ In 2013, the Data Analytics market in India was $1.6 Billion with a growth rate of 8% ★ By 2018, the market is projected to be $3.7 Billion "For the fourth year in a row, analytics ranks as the No. 1 priority in Gartner's CIO [India] Survey." Bhavish Sood, research director at Gartner explains.
  • 16. India is one of the strongest countries in the Data Science marketplace that boasts of clients including Facebook, GE, NASA, Tesco and Merck. It can potentially build a talent pipeline for data scientists that are virtually non-existent today. India will need 200,000 data scientists in the next few years. A single company, Wipro, already has as many as 8,000 people in analytics functions.
  • 17. Data Science in India The median annual salary for a Data Scientists in India is Rs 670,665 The highest paying skills are Python, Machine Learning, Statistical Analysis, Big Data Analytics, and R.
  • 18. Bengal Chamber proposes smart and green city for business analytics firms The Bengal Chamber of Commerce and Industry has taken an initiative to set up a smart city for business analytics in West Bengal. The project would involve service providers like KPMG Advisory Services and PricewaterhouseCoopers, corporate consumers, education institutions such as Indian Institute of Technology Kharagpur, the Indian Statistical Institute, and the Indian Institute of Management, Calcutta.
  • 19.
  • 20. How can you be a Data Scientist? A Master’s degree is a natural route to be a Data Scientist. Massive Open Online Courses (MOOCs) give access to self-learning at a low cost (often free), but leave it to the student to identify a suitable set of courses and tools to round out a coherent skill set. Bootcamps offer students a practical and structured learning environment at a far more affordable rate compared with obtaining a Master’s Degree.
  • 21. Master’s Degree Duration 9 - 20 months Faculty University Professors Learning Theory and Assignments Outcome Degree Projects Practicum and Internship Placement University Recruiting Examples UC Berkeley, NYU, NCSU IIT+IIM+ISI Tuition $20,000 - $70,000 (US) ₹20,000,000 (India)
  • 22. Self-Learning (MOOCs) Duration 6 - 18 months (part time) Faculty University Professors (recorded lectures) Learning Self guided Outcome Certificate Projects Projects on own time Placement Self-driven job search Examples Coursera, Udacity Tuition Free- $500 (US)
  • 23. Bootcamps Duration 2 - 3 months Faculty Professors & Data Scientists Learning Experiential Learning Outcome Certificate and Portfolio Projects Built-In Projects Placement Hiring Day and Placement Assistance Examples Zipfan, Metis, Data Incubator Tuition Free - $16,000 (US)
  • 24. The Course Data+Science: A First Course is an intensive eight-week program based on the bootcamp model, organized by The Data+Science Initiative. It is designed to teach and train graduates in quantitative fields to take an entry-level position as a data scientist.
  • 25. Objectives of the Course Upon graduating a student will: 1. Have a clear understanding of and practical experience with the process of designing, implementing, and communicating the results of a data science project. 2. Understand the landscape of data science tools and their applications, and be prepared to identify and dig into new technologies and algorithms needed for the job at hand.
  • 26. Overview Data science gives valuable meaning to large sets of complex and unstructured data. The focus is around concepts and techniques to mine, store, analyse and visualize data. Data science is a highly interdisciplinary drawing from fields such as computer science (algorithms and databases), statistics (hypothesis testing and inference), artificial intelligence (pattern recognition and machine learning).
  • 27. Course Content Data Mining (⅛): identifying data sources; extracting, cleaning and verifying structured and unstructured data Data Storage (¼): structuring, storage and retrieval of data; including big data and NoSQL Data Analysis (½): descriptive and inferential analysis; predictive modelling, risk analysis and decision making Data Visualization (⅛)
  • 28. Course Content Graduating students will: 1. Be proficient in statistical concepts and mathematical techniques including correlation functions, inference and hypothesis testing. 2. Be able to make predictive analyses by modelling stochastic processes based on available data. 3. Learn and apply Machine Learning concepts to solve data science problems
  • 29. Course Content 4. Be capable coders in Python and R, including the related packages and toolsets most commonly used in data science. 5. Know the fundamentals of data visualization and have experience creating static and dynamic data visuals using JavaScript and D3.js. 6. Have introductory exposure to big data tools and architecture such as the Hadoop stack, know when these tools are necessary, and be poised to quickly train up and utilize them in a big data project.
  • 30. Prerequisites Basic Statistics and Probability descriptive statistics and distributions Linear Algebra vectors and matrices Calculus and Differential Equations basic calculus and finding extrema, ordinary differential equations Programming basic proficiency in any programming language
  • 31. Preferred Subjects Computer Science algorithms, data structures and databases Advanced Statistics bayesian inference and stochoastic processes Statistical Mechanics/Information Theory entropy, information, complexity Economics supply/demand, game theory Web Development HTML, CSS and Javascript
  • 32. Eligibility Anyone meeting the prerequisite criteria is eligible, determined by a qualifying exam, with preference given to those with knowledge of the preferred subjects. However, we would prefer applicants to have a bachelor’s degree in a quantitative field, such as: Engineering, Physics, Mathematics, Statistics, Economics or Computer Applications.
  • 33. Course Details The course consists of 24 classes over 8 weeks. Each class (Mondays, Wednesdays, Fridays) is 6 hours in duration (10AM-4PM) including a lunch hour. Morning sessions consists of lectures and discussions while the afternoons is a guided programming session. In addition, instructors will be available for office hours at scheduled times.
  • 34. Course Projects The course is divided into three parts. Part A (Weeks 1-4): daily programming projects executed individually or in groups Part B (Weeks 5-8): weekly projects in groups drawn from the industry Part C (Weeks 9-11, optional): course project in groups with biweekly meetings with instructors
  • 35. Benefits Employment: Students will have the skill set and portfolio to find employment as an entry level data scientist. Such a skill set is in great demand, both domestically as well as in developed countries. Research: Since Data Science is at the core of academic research, our students, armed with the knowledge, portfolio and recommendation will find easier admission to universities, especially abroad.