SlideShare a Scribd company logo
1 of 13
INTRODUCTION TO DATA
SCIENCE
CHAPTER 1
“Introduction to Data Science : Practical Approach with R and Python ”
B.Uma Maheswari and R Sujatha
Copyright @ 2021 Wiley India Pvt. Ltd. All rights reserved.
LEARNING OBJECTIVES
•Understand the concept of data science
•Briefly learn the history of data science
•Learn about the fundamental fields related to data science
•Understand the different terminologies related to data science like big
data,
•Business intelligence, data mining, artificial intelligence, machine learning
and deep learning.
•Learn about the different types of analytics- descriptive, diagnostic,
predictive and prescriptive
•Learn briefly about the applications of data science.
•Comprehend the data science process model
DATA SCIENCE
Data Science is the science
of understanding data
using processes, tools and
techniques which aid in
decision making. It
involves techniques for
identifying, collecting and
exploring the data using
colorful plots and graphs
HISTORY OF DATA SCIENCE
John W.Tukey, a mathematician in his article “The Future of Data
Analysis”.
John Chambers, Consulting Professor, Stanford University. The S
system is the basis for all the future statistical programming
languages including the R language which will be discussed in this
book
Jeff Wu, Coco - Cola chair in Engineering Statistics and Professor at
Georgia Tech coined the term “Data Science” in 1997
William Cleveland , Distinguished Professor of Statistics and Professor
of Computer Science at Purdue University authored many books on
data visualization
Leo Breiman, distinguished statistician at the University of California,
Berkeley was one of the pioneers in ‘machine learning.
WHY IS DATA SCIENCE RECEIVING
SO MUCH ATTENTION
•Increasing usage of internet which has generated more data.
•Growing usage of smart phones, tablets and digital devices
•Increasing usage of social media
•Increasing computational capability with both hardware and software
becoming powerful by the day.
•Programming languages to work with such data are freely available through
open source platforms.
•Programmers across the world are creating complex algorithms and
contributing to the open source developers’ community.
•Easy and speedy access to such data for every individual or organization
irrespective of the size of the concern.
•Storage of data becoming cheaper.
DATA, DATA AND MORE DATA
Every minute on the internet,
Zoom hosts 2,08,333 participants in
meetings
Netflix users stream 4,00,444 hours
of video
Instagram users post 3,47,222 stories.
YouTube users upload 500 hours of
video
Twitter gains 319 new users
Facebook users share 1,50,000
messages
Linkedin users apply for 69,444 jobs
Amazon ships 6,659 packages
Whatsapp users share 4,16,66,667
According to the data captured by the cloud
software company Domo, as on April 2020,
internet has reached 59% of the world
population.
FUNDAMENTAL FIELDS OF STUDY
RELATING TO DATA SCIENCE
Data
Science
Computer
Science
Mathematics Statistics
Domain
Knowledge
BIG DATA
Business Intelligence
Business Intelligence (BI) involves gathering, pre-processing and most importantly
presenting such data using data-visualization tools and techniques through charts, plots,
tables and dashboards
DATA MINING
Data mining is the technology used for processing large
volume of data
Generate inferences from data such as
Identifying trends in stock prices
Categorizing customers on the basis of their preferences
Ascertaining the purchasing patterns of customers
Predicting student performance in an educational institution
 Lie detection in dealing with criminals etc.
Applications of data mining can be seen in the field of
agriculture, education, industrial engineering, marketing,
healthcare etc.
ARTIFICIAL INTELLIGENCE-
MACHINE LEARNING-DEEP
LEARNING
Artificial Intelligence:AI is the design of smart
machines or algorithms which can perform functions
or tasks that generally requires human intelligence
Machine Learning:Machine Learning (ML) is a subset
of artificial intelligence which refers to the modelling
techniques, where the model learns on its own without
human intervention.
Deep Learning:Deep learning is a part of machine
learning which works more effectively on larger
datasets and aims at pattern recognition by imitating
the human brain.
TYPES OF ANALYTICS
• Descriptive
Analytics
What has
happened ?
• Diagnostic
Analytics
Why did it
happen?
• Predictive
Analytics
What will
happen ?
• Prescriptive
Analytics
What should
we do ?
DATA SCIENCE PROCESS MODEL
Objective
The project
objective needs to
be identified
Data collection
Collate the data
from the different
sources
Exploratory
Data analysis
(Chapter 3)
Data
visualization
(Chapter 4)
Dimensionality
reduction
(Chapter 5)
Model
building
(Chapter 7-14)

More Related Content

Similar to Chapter 1 Introduction to Datascience (1).pptx

Untitled document.pdf
Untitled document.pdfUntitled document.pdf
Untitled document.pdf
MuhammadTahiriqbal13
 
A New Paradigm on Analytic-Driven Information and Automation V2.pdf
A New Paradigm on Analytic-Driven Information and Automation V2.pdfA New Paradigm on Analytic-Driven Information and Automation V2.pdf
A New Paradigm on Analytic-Driven Information and Automation V2.pdf
ArmyTrilidiaDevegaSK
 

Similar to Chapter 1 Introduction to Datascience (1).pptx (20)

Big Data & DS Analytics for PAARL
Big Data & DS Analytics for PAARLBig Data & DS Analytics for PAARL
Big Data & DS Analytics for PAARL
 
Untitled document.pdf
Untitled document.pdfUntitled document.pdf
Untitled document.pdf
 
INTRODUCTION TO DATA SCIENCE -CONCEPTS.pptx
INTRODUCTION TO DATA SCIENCE -CONCEPTS.pptxINTRODUCTION TO DATA SCIENCE -CONCEPTS.pptx
INTRODUCTION TO DATA SCIENCE -CONCEPTS.pptx
 
Data science
Data scienceData science
Data science
 
Datascience
DatascienceDatascience
Datascience
 
Lecture_1_Intro_toDS&AI.pptx
Lecture_1_Intro_toDS&AI.pptxLecture_1_Intro_toDS&AI.pptx
Lecture_1_Intro_toDS&AI.pptx
 
L3 Big Data and Application.pptx
L3  Big Data and Application.pptxL3  Big Data and Application.pptx
L3 Big Data and Application.pptx
 
Colloquium(7)_DataScience:ShivShaktiGhosh&MohitGarg
Colloquium(7)_DataScience:ShivShaktiGhosh&MohitGargColloquium(7)_DataScience:ShivShaktiGhosh&MohitGarg
Colloquium(7)_DataScience:ShivShaktiGhosh&MohitGarg
 
Big Data for Library Services (2017)
Big Data for Library Services (2017)Big Data for Library Services (2017)
Big Data for Library Services (2017)
 
Data Science ppt for the asjdbhsadbmsnc.pptx
Data Science ppt for the asjdbhsadbmsnc.pptxData Science ppt for the asjdbhsadbmsnc.pptx
Data Science ppt for the asjdbhsadbmsnc.pptx
 
A New Paradigm on Analytic-Driven Information and Automation V2.pdf
A New Paradigm on Analytic-Driven Information and Automation V2.pdfA New Paradigm on Analytic-Driven Information and Automation V2.pdf
A New Paradigm on Analytic-Driven Information and Automation V2.pdf
 
Data science
Data science Data science
Data science
 
Introduction to Data Science: Unveiling Insights Hidden in Data
Introduction to Data Science: Unveiling Insights Hidden in DataIntroduction to Data Science: Unveiling Insights Hidden in Data
Introduction to Data Science: Unveiling Insights Hidden in Data
 
data science
data sciencedata science
data science
 
data science
data sciencedata science
data science
 
How to Enhance Your Career with AI
How to Enhance Your Career with AIHow to Enhance Your Career with AI
How to Enhance Your Career with AI
 
Data Science - An emerging Stream of Science with its Spreading Reach & Impact
Data Science - An emerging Stream of Science with its Spreading Reach & ImpactData Science - An emerging Stream of Science with its Spreading Reach & Impact
Data Science - An emerging Stream of Science with its Spreading Reach & Impact
 
Presentation 3.pdf
Presentation 3.pdfPresentation 3.pdf
Presentation 3.pdf
 
DEALING CRISIS MANAGEMENT USING AI
DEALING CRISIS MANAGEMENT USING AIDEALING CRISIS MANAGEMENT USING AI
DEALING CRISIS MANAGEMENT USING AI
 
DEALING CRISIS MANAGEMENT USING AI
DEALING CRISIS MANAGEMENT USING AIDEALING CRISIS MANAGEMENT USING AI
DEALING CRISIS MANAGEMENT USING AI
 

Recently uploaded

Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts ServiceCall Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptx
JohnnyPlasten
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
amitlee9823
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
shivangimorya083
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
amitlee9823
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
amitlee9823
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
MarinCaroMartnezBerg
 

Recently uploaded (20)

Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts ServiceCall Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
 
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptx
 
Data-Analysis for Chicago Crime Data 2023
Data-Analysis for Chicago Crime Data  2023Data-Analysis for Chicago Crime Data  2023
Data-Analysis for Chicago Crime Data 2023
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptx
 
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFx
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptx
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
 
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 

Chapter 1 Introduction to Datascience (1).pptx

  • 1. INTRODUCTION TO DATA SCIENCE CHAPTER 1 “Introduction to Data Science : Practical Approach with R and Python ” B.Uma Maheswari and R Sujatha Copyright @ 2021 Wiley India Pvt. Ltd. All rights reserved.
  • 2. LEARNING OBJECTIVES •Understand the concept of data science •Briefly learn the history of data science •Learn about the fundamental fields related to data science •Understand the different terminologies related to data science like big data, •Business intelligence, data mining, artificial intelligence, machine learning and deep learning. •Learn about the different types of analytics- descriptive, diagnostic, predictive and prescriptive •Learn briefly about the applications of data science. •Comprehend the data science process model
  • 3. DATA SCIENCE Data Science is the science of understanding data using processes, tools and techniques which aid in decision making. It involves techniques for identifying, collecting and exploring the data using colorful plots and graphs
  • 4. HISTORY OF DATA SCIENCE John W.Tukey, a mathematician in his article “The Future of Data Analysis”. John Chambers, Consulting Professor, Stanford University. The S system is the basis for all the future statistical programming languages including the R language which will be discussed in this book Jeff Wu, Coco - Cola chair in Engineering Statistics and Professor at Georgia Tech coined the term “Data Science” in 1997 William Cleveland , Distinguished Professor of Statistics and Professor of Computer Science at Purdue University authored many books on data visualization Leo Breiman, distinguished statistician at the University of California, Berkeley was one of the pioneers in ‘machine learning.
  • 5. WHY IS DATA SCIENCE RECEIVING SO MUCH ATTENTION •Increasing usage of internet which has generated more data. •Growing usage of smart phones, tablets and digital devices •Increasing usage of social media •Increasing computational capability with both hardware and software becoming powerful by the day. •Programming languages to work with such data are freely available through open source platforms. •Programmers across the world are creating complex algorithms and contributing to the open source developers’ community. •Easy and speedy access to such data for every individual or organization irrespective of the size of the concern. •Storage of data becoming cheaper.
  • 6. DATA, DATA AND MORE DATA Every minute on the internet, Zoom hosts 2,08,333 participants in meetings Netflix users stream 4,00,444 hours of video Instagram users post 3,47,222 stories. YouTube users upload 500 hours of video Twitter gains 319 new users Facebook users share 1,50,000 messages Linkedin users apply for 69,444 jobs Amazon ships 6,659 packages Whatsapp users share 4,16,66,667 According to the data captured by the cloud software company Domo, as on April 2020, internet has reached 59% of the world population.
  • 7. FUNDAMENTAL FIELDS OF STUDY RELATING TO DATA SCIENCE Data Science Computer Science Mathematics Statistics Domain Knowledge
  • 9. Business Intelligence Business Intelligence (BI) involves gathering, pre-processing and most importantly presenting such data using data-visualization tools and techniques through charts, plots, tables and dashboards
  • 10. DATA MINING Data mining is the technology used for processing large volume of data Generate inferences from data such as Identifying trends in stock prices Categorizing customers on the basis of their preferences Ascertaining the purchasing patterns of customers Predicting student performance in an educational institution  Lie detection in dealing with criminals etc. Applications of data mining can be seen in the field of agriculture, education, industrial engineering, marketing, healthcare etc.
  • 11. ARTIFICIAL INTELLIGENCE- MACHINE LEARNING-DEEP LEARNING Artificial Intelligence:AI is the design of smart machines or algorithms which can perform functions or tasks that generally requires human intelligence Machine Learning:Machine Learning (ML) is a subset of artificial intelligence which refers to the modelling techniques, where the model learns on its own without human intervention. Deep Learning:Deep learning is a part of machine learning which works more effectively on larger datasets and aims at pattern recognition by imitating the human brain.
  • 12. TYPES OF ANALYTICS • Descriptive Analytics What has happened ? • Diagnostic Analytics Why did it happen? • Predictive Analytics What will happen ? • Prescriptive Analytics What should we do ?
  • 13. DATA SCIENCE PROCESS MODEL Objective The project objective needs to be identified Data collection Collate the data from the different sources Exploratory Data analysis (Chapter 3) Data visualization (Chapter 4) Dimensionality reduction (Chapter 5) Model building (Chapter 7-14)