SlideShare a Scribd company logo
MISSION
CHRIST is a nurturing ground for an individual’s
holistic development to make effective contribution to
the society in a dynamic environment
VISION
Excellence and Service
CORE VALUES
Faith in God | Moral Uprightness
Love of Fellow Beings
Social Responsibility | Pursuit of Excellence
MTH341C - PRINCIPLES OF DATA SCIENCE
Week1: 18 to 23 July 2022
Department of Data Science and Statistics, CHRIST (DEEMED TO BE
UNIVERSITY)
BANGALORE, KARNATAKA, INDIA
Introduction to Data Science
Dr. UMME SALMA M
Assistant Professor
Ummesalma.m@christuniversity.in
Excellence and Service
CHRIST
Deemed to be University
Class Details
● Programme
○ MSC Mathematics
● Course
○ MTH341C
○ PRINCIPLES OF DATA SCIENCE
● Unit 1
○ Introduction To Data Science and Big Data
● Topic 1
○ Data Science Market
● Material
○ Online resources
2
Excellence and Service
CHRIST
Deemed to be University
Outline
3
➔KYS
➔Data Science Family
➔Skills and Jobs
➔Resources
Excellence and Service
CHRIST
Deemed to be University
KYS :)
Gender; Region; +2/UG Groups; Interest;
Project; Goal
4
Excellence and Service
CHRIST
Deemed to be University
Data Science
5
Excellence and Service
CHRIST
Deemed to be University
6
Excellence and Service
CHRIST
Deemed to be University
Data Science Virtual Machine
7
Excellence and Service
CHRIST
Deemed to be University
Data Science Family
8
● Data Science is a field of science than mere data
● Data Mining is mainly about
finding useful information in a dataset and utilizing that information to
uncover hidden patterns.
● Data Analytics involves tools and techniques
○ [information resulting from the systematic analysis of data or
statistics]
& Data Mining
Excellence and Service
CHRIST
Deemed to be University
9
Excellence and Service
CHRIST
Deemed to be University
10
Excellence and Service
CHRIST
Deemed to be University
11
Excellence and Service
CHRIST
Deemed to be University
12
Excellence and Service
CHRIST
Deemed to be University
Data Science Steps
13
Step 1: The first step of this process is setting a research goal. The main purpose here is making sure all the
stakeholders understand the what, how, and why of the project.
Step 2: The second phase is data retrieval. You want to have data available for analysis, so this step
includes finding suitable data and getting access to the data from the data owner. The result is data in its
raw form, which probably needs polishing and transformation beforeit becomes usable.
Step 3: Data transformation converts a raw form into directly usable form. To achieve this, you’ll detectand
correctdifferentkinds of errors in the data, combine data from differentdata sources,and transform it. If you
have successfullycompletedthis step, you can progress to data visualization and modeling.
Step 4: Data Exploration helps to gain a deep understanding of the data. You’ll look for patterns, correlations,
and deviations based on visual and descriptive techniques.The insights you gain from this phase will enable
you to start modeling.
Step 5: Data modelling is the phase to attempt to gain the insights or make the predictions stated in your
projectcharter. Now is the time to bring out the heavy guns, but rememberresearchhas taught us that often
(but not always) a combinationof simple models tends to outperform one complicatedmodel.
Step 6:Presentation and automation is all about presenting your results and automating the analysis, if needed.
Excellence and Service
CHRIST
Deemed to be University
Data Science Steps Outcome
14
Step1 Outcome:Clear Understanding of the goals of research and its context.
A projectcharter requires teamwork, and your input covers at least the following:
■ A clear researchgoal
■ The projectmissionand context
■ How you’re going to perform your analysis
■ What resources you expectto use
■ Proof that it’s an achievable project,or proof of concepts
■ Deliverables and a measure of success
■ A timeline
Step 2 Outcome:Sometimesyou need to go into the field and designa data collectionprocess
yourself,but most of the time you won’t be involved in this step.
Step 3 Outcome:Getting access to data is another difficulttask. Organizations understand the
value and sensitivity of data and oftenhave policies in place so everyone has access to what
they need and nothing more. Don’t be afraid to shop around.
Excellence and Service
CHRIST
Deemed to be University
15
Step 4 Outcome:Cleansing data
Data cleansing is a subprocess of the data science processthat focuses on
removing rrors in your data so your data becomesa true and consistent
representationof the processes itoriginates from.
Combiningdata from differentdata sources
Step 5 Outcome:Working Model based upon the
requirement
Step 6: Deployed Model
Excellence and Service
CHRIST
Deemed to be University
16
Excellence and Service
CHRIST
Deemed to be University
17
Excellence and Service
CHRIST
Deemed to be University
18
Excellence and Service
CHRIST
Deemed to be University
19
Excellence and Service
CHRIST
Deemed to be University
20
Source:https://wheebox.com/assets/pdf/ISR_Report_2020.pdf
Excellence and Service
CHRIST
Deemed to be University
21
Source:https://wheebox.com/assets/pdf/ISR_Report_2021.pdf
Excellence and Service
CHRIST
Deemed to be University
22
Excellence and Service
CHRIST
Deemed to be University
23
Source: https://analyticsindiamag.com/why-you-may-not-be-getting-a-call-back-for-that-data-science-job/
Excellence and Service
CHRIST
Deemed to be University
Skills
and
Jobs
24
Source:https://blog.udacity.com/2014/11/data-science-job-skills.html
Excellence and Service
CHRIST
Deemed to be University
25
Souce: https://www.gartner.com/smarterwithgartner/gartner-top-10-data-and-analytics-trends-for-2021/
Excellence and Service
CHRIST
Deemed to be University
Data Repositories
26
•Google DatasetSearch.
•Kaggle.
•Data.Gov.
•Datahub.io.
•UCI Machine Learning Repository.
•Earth Data.
•CERN Open Data Portal.
•Global Health ObservatoryData Repository.
•NCBI
•CERT
•NCRB
•Indiastat
Excellence and Service
CHRIST
Deemed to be University
Resources
27
● https://www.kdnuggets.com
● https://www.kaggle.com/
● https://www.analyticsvidhya.com/
● https://towardsdatascience.com
● https://machinelearningmastery.com/
● https://pydata.org/
● https://www.meetup.com/topics/data-science/
arXiv ; GitHub; MOOCS
Excellence and Service
CHRIST
Deemed to be University
THANKYOU
Next Topic: Unit 1: Chapter 1
Data Science in a Big Data World
Next session: Monday 12.00 PM
28

More Related Content

What's hot

Predictive Modelling
Predictive ModellingPredictive Modelling
Predictive ModellingRajiv Advani
 
Introduction to Data Mining
Introduction to Data Mining Introduction to Data Mining
Introduction to Data Mining
Sushil Kulkarni
 
data mining
data miningdata mining
data mining
manasa polu
 
Lect12 graph mining
Lect12 graph miningLect12 graph mining
Lect12 graph mining
Houw Liong The
 
Data Mining: Concepts and Techniques (3rd ed.) - Chapter 3 preprocessing
Data Mining:  Concepts and Techniques (3rd ed.)- Chapter 3 preprocessingData Mining:  Concepts and Techniques (3rd ed.)- Chapter 3 preprocessing
Data Mining: Concepts and Techniques (3rd ed.) - Chapter 3 preprocessing
Salah Amean
 
Chapter 1 - Introduction
Chapter 1 - IntroductionChapter 1 - Introduction
Chapter 1 - Introduction
Charles Deledalle
 
Data Wrangling
Data WranglingData Wrangling
Data Wrangling
Gramener
 
02 Data Mining
02 Data Mining02 Data Mining
2.6 support vector machines and associative classifiers revised
2.6 support vector machines and associative classifiers revised2.6 support vector machines and associative classifiers revised
2.6 support vector machines and associative classifiers revised
Krish_ver2
 
Cure, Clustering Algorithm
Cure, Clustering AlgorithmCure, Clustering Algorithm
Cure, Clustering Algorithm
Lino Possamai
 
Anomaly Detection using Deep Auto-Encoders
Anomaly Detection using Deep Auto-EncodersAnomaly Detection using Deep Auto-Encoders
Anomaly Detection using Deep Auto-Encoders
Gianmario Spacagna
 
Data Visualization in Data Science
Data Visualization in Data ScienceData Visualization in Data Science
Data Visualization in Data Science
Maloy Manna, PMP®
 
Unit 2.pptx
Unit 2.pptxUnit 2.pptx
Linear models for classification
Linear models for classificationLinear models for classification
Linear models for classification
Sung Yub Kim
 
3D Laser Scanning
3D Laser Scanning 3D Laser Scanning
3D Laser Scanning
Ron Reed, MBA, PMP
 
K-Folds Cross Validation Method
K-Folds Cross Validation MethodK-Folds Cross Validation Method
K-Folds Cross Validation Method
SHUBHAM GUPTA
 
3.7 outlier analysis
3.7 outlier analysis3.7 outlier analysis
3.7 outlier analysis
Krish_ver2
 
Introduction to Machine Learning with SciKit-Learn
Introduction to Machine Learning with SciKit-LearnIntroduction to Machine Learning with SciKit-Learn
Introduction to Machine Learning with SciKit-Learn
Benjamin Bengfort
 

What's hot (20)

Predictive Modelling
Predictive ModellingPredictive Modelling
Predictive Modelling
 
Introduction to Data Mining
Introduction to Data Mining Introduction to Data Mining
Introduction to Data Mining
 
data mining
data miningdata mining
data mining
 
Lect12 graph mining
Lect12 graph miningLect12 graph mining
Lect12 graph mining
 
Data Mining: Concepts and Techniques (3rd ed.) - Chapter 3 preprocessing
Data Mining:  Concepts and Techniques (3rd ed.)- Chapter 3 preprocessingData Mining:  Concepts and Techniques (3rd ed.)- Chapter 3 preprocessing
Data Mining: Concepts and Techniques (3rd ed.) - Chapter 3 preprocessing
 
Chapter 1 - Introduction
Chapter 1 - IntroductionChapter 1 - Introduction
Chapter 1 - Introduction
 
Data Wrangling
Data WranglingData Wrangling
Data Wrangling
 
02 Data Mining
02 Data Mining02 Data Mining
02 Data Mining
 
2.6 support vector machines and associative classifiers revised
2.6 support vector machines and associative classifiers revised2.6 support vector machines and associative classifiers revised
2.6 support vector machines and associative classifiers revised
 
Data preprocessing ng
Data preprocessing   ngData preprocessing   ng
Data preprocessing ng
 
Cure, Clustering Algorithm
Cure, Clustering AlgorithmCure, Clustering Algorithm
Cure, Clustering Algorithm
 
Anomaly Detection using Deep Auto-Encoders
Anomaly Detection using Deep Auto-EncodersAnomaly Detection using Deep Auto-Encoders
Anomaly Detection using Deep Auto-Encoders
 
Data preprocessing
Data preprocessingData preprocessing
Data preprocessing
 
Data Visualization in Data Science
Data Visualization in Data ScienceData Visualization in Data Science
Data Visualization in Data Science
 
Unit 2.pptx
Unit 2.pptxUnit 2.pptx
Unit 2.pptx
 
Linear models for classification
Linear models for classificationLinear models for classification
Linear models for classification
 
3D Laser Scanning
3D Laser Scanning 3D Laser Scanning
3D Laser Scanning
 
K-Folds Cross Validation Method
K-Folds Cross Validation MethodK-Folds Cross Validation Method
K-Folds Cross Validation Method
 
3.7 outlier analysis
3.7 outlier analysis3.7 outlier analysis
3.7 outlier analysis
 
Introduction to Machine Learning with SciKit-Learn
Introduction to Machine Learning with SciKit-LearnIntroduction to Machine Learning with SciKit-Learn
Introduction to Machine Learning with SciKit-Learn
 

Similar to Introduction to Data Science

TTG Int.LTD Data Mining Technique
TTG Int.LTD Data Mining TechniqueTTG Int.LTD Data Mining Technique
TTG Int.LTD Data Mining Technique
Mehmet Beyaz
 
knowledge discovery and data mining approach in databases (2)
knowledge discovery and data mining approach in databases (2)knowledge discovery and data mining approach in databases (2)
knowledge discovery and data mining approach in databases (2)
Kartik Kalpande Patil
 
Introducition to Data scinece compiled by hu
Introducition to Data scinece compiled by huIntroducition to Data scinece compiled by hu
Introducition to Data scinece compiled by hu
wekineheshete
 
Luciano uvi hackfest.28.10.2020
Luciano uvi hackfest.28.10.2020Luciano uvi hackfest.28.10.2020
Luciano uvi hackfest.28.10.2020
Joanne Luciano
 
Data science unit1
Data science unit1Data science unit1
Data science unit1
varshakumar21
 
Introduction to Data Analytics.pptx
Introduction to Data Analytics.pptxIntroduction to Data Analytics.pptx
Introduction to Data Analytics.pptx
DikshantSharma63
 
Who Owns Faculty Data?: Fairness and transparency in UCLA's new academic HR s...
Who Owns Faculty Data?: Fairness and transparency in UCLA's new academic HR s...Who Owns Faculty Data?: Fairness and transparency in UCLA's new academic HR s...
Who Owns Faculty Data?: Fairness and transparency in UCLA's new academic HR s...
chloejreynolds
 
Data science.chapter-1,2,3
Data science.chapter-1,2,3Data science.chapter-1,2,3
Data science.chapter-1,2,3
varshakumar21
 
Recommendation of Data Mining Technique in Higher Education Prof. Priya Thaka...
Recommendation of Data Mining Technique in Higher Education Prof. Priya Thaka...Recommendation of Data Mining Technique in Higher Education Prof. Priya Thaka...
Recommendation of Data Mining Technique in Higher Education Prof. Priya Thaka...
ijceronline
 
STUDENTS’ PERFORMANCE PREDICTION SYSTEM USING MULTI AGENT DATA MINING TECHNIQUE
STUDENTS’ PERFORMANCE PREDICTION SYSTEM USING MULTI AGENT DATA MINING TECHNIQUESTUDENTS’ PERFORMANCE PREDICTION SYSTEM USING MULTI AGENT DATA MINING TECHNIQUE
STUDENTS’ PERFORMANCE PREDICTION SYSTEM USING MULTI AGENT DATA MINING TECHNIQUE
IJDKP
 
The Analytics and Data Science Landscape
The Analytics and Data Science LandscapeThe Analytics and Data Science Landscape
The Analytics and Data Science Landscape
Philip Bourne
 
KDD assignmnt data.docx
KDD assignmnt data.docxKDD assignmnt data.docx
KDD assignmnt data.docx
AbihaAkter201153203
 
Introduction to Data Science - Week 3 - Steps involved in Data Science
Introduction to Data Science - Week 3 - Steps involved in Data ScienceIntroduction to Data Science - Week 3 - Steps involved in Data Science
Introduction to Data Science - Week 3 - Steps involved in Data Science
Ferdin Joe John Joseph PhD
 
Christ PPT Template.pptx
Christ PPT Template.pptxChrist PPT Template.pptx
Christ PPT Template.pptx
SivaCheralathan
 
Unit 3.pdf
Unit 3.pdfUnit 3.pdf
Unit 3.pdf
Manisha Shinde
 
DataSpryng Overview
DataSpryng OverviewDataSpryng Overview
DataSpryng Overview
jkvr
 
Data Science (Moradabad).pdf
Data Science (Moradabad).pdfData Science (Moradabad).pdf
Data Science (Moradabad).pdf
Umar khan
 
التنقيب في البيانات - Data Mining
التنقيب في البيانات -  Data Miningالتنقيب في البيانات -  Data Mining
التنقيب في البيانات - Data Mining
nabil_alsharafi
 
Experience unparalleled data-driven success with our cutting-edge Data Scienc...
Experience unparalleled data-driven success with our cutting-edge Data Scienc...Experience unparalleled data-driven success with our cutting-edge Data Scienc...
Experience unparalleled data-driven success with our cutting-edge Data Scienc...
proitbridgePvtLtd
 
Study and Analysis of K-Means Clustering Algorithm Using Rapidminer
Study and Analysis of K-Means Clustering Algorithm Using RapidminerStudy and Analysis of K-Means Clustering Algorithm Using Rapidminer
Study and Analysis of K-Means Clustering Algorithm Using Rapidminer
IJERA Editor
 

Similar to Introduction to Data Science (20)

TTG Int.LTD Data Mining Technique
TTG Int.LTD Data Mining TechniqueTTG Int.LTD Data Mining Technique
TTG Int.LTD Data Mining Technique
 
knowledge discovery and data mining approach in databases (2)
knowledge discovery and data mining approach in databases (2)knowledge discovery and data mining approach in databases (2)
knowledge discovery and data mining approach in databases (2)
 
Introducition to Data scinece compiled by hu
Introducition to Data scinece compiled by huIntroducition to Data scinece compiled by hu
Introducition to Data scinece compiled by hu
 
Luciano uvi hackfest.28.10.2020
Luciano uvi hackfest.28.10.2020Luciano uvi hackfest.28.10.2020
Luciano uvi hackfest.28.10.2020
 
Data science unit1
Data science unit1Data science unit1
Data science unit1
 
Introduction to Data Analytics.pptx
Introduction to Data Analytics.pptxIntroduction to Data Analytics.pptx
Introduction to Data Analytics.pptx
 
Who Owns Faculty Data?: Fairness and transparency in UCLA's new academic HR s...
Who Owns Faculty Data?: Fairness and transparency in UCLA's new academic HR s...Who Owns Faculty Data?: Fairness and transparency in UCLA's new academic HR s...
Who Owns Faculty Data?: Fairness and transparency in UCLA's new academic HR s...
 
Data science.chapter-1,2,3
Data science.chapter-1,2,3Data science.chapter-1,2,3
Data science.chapter-1,2,3
 
Recommendation of Data Mining Technique in Higher Education Prof. Priya Thaka...
Recommendation of Data Mining Technique in Higher Education Prof. Priya Thaka...Recommendation of Data Mining Technique in Higher Education Prof. Priya Thaka...
Recommendation of Data Mining Technique in Higher Education Prof. Priya Thaka...
 
STUDENTS’ PERFORMANCE PREDICTION SYSTEM USING MULTI AGENT DATA MINING TECHNIQUE
STUDENTS’ PERFORMANCE PREDICTION SYSTEM USING MULTI AGENT DATA MINING TECHNIQUESTUDENTS’ PERFORMANCE PREDICTION SYSTEM USING MULTI AGENT DATA MINING TECHNIQUE
STUDENTS’ PERFORMANCE PREDICTION SYSTEM USING MULTI AGENT DATA MINING TECHNIQUE
 
The Analytics and Data Science Landscape
The Analytics and Data Science LandscapeThe Analytics and Data Science Landscape
The Analytics and Data Science Landscape
 
KDD assignmnt data.docx
KDD assignmnt data.docxKDD assignmnt data.docx
KDD assignmnt data.docx
 
Introduction to Data Science - Week 3 - Steps involved in Data Science
Introduction to Data Science - Week 3 - Steps involved in Data ScienceIntroduction to Data Science - Week 3 - Steps involved in Data Science
Introduction to Data Science - Week 3 - Steps involved in Data Science
 
Christ PPT Template.pptx
Christ PPT Template.pptxChrist PPT Template.pptx
Christ PPT Template.pptx
 
Unit 3.pdf
Unit 3.pdfUnit 3.pdf
Unit 3.pdf
 
DataSpryng Overview
DataSpryng OverviewDataSpryng Overview
DataSpryng Overview
 
Data Science (Moradabad).pdf
Data Science (Moradabad).pdfData Science (Moradabad).pdf
Data Science (Moradabad).pdf
 
التنقيب في البيانات - Data Mining
التنقيب في البيانات -  Data Miningالتنقيب في البيانات -  Data Mining
التنقيب في البيانات - Data Mining
 
Experience unparalleled data-driven success with our cutting-edge Data Scienc...
Experience unparalleled data-driven success with our cutting-edge Data Scienc...Experience unparalleled data-driven success with our cutting-edge Data Scienc...
Experience unparalleled data-driven success with our cutting-edge Data Scienc...
 
Study and Analysis of K-Means Clustering Algorithm Using Rapidminer
Study and Analysis of K-Means Clustering Algorithm Using RapidminerStudy and Analysis of K-Means Clustering Algorithm Using Rapidminer
Study and Analysis of K-Means Clustering Algorithm Using Rapidminer
 

More from UmmeSalmaM1

Welcome to Python Programming.pptx
Welcome to Python Programming.pptxWelcome to Python Programming.pptx
Welcome to Python Programming.pptx
UmmeSalmaM1
 
Role of digital technology in autism a case study
Role of digital technology in autism a case studyRole of digital technology in autism a case study
Role of digital technology in autism a case study
UmmeSalmaM1
 
Programming for data science in python
Programming for data science in pythonProgramming for data science in python
Programming for data science in python
UmmeSalmaM1
 
Demography basedhybridrecommendersystemformovierecommendation
Demography basedhybridrecommendersystemformovierecommendationDemography basedhybridrecommendersystemformovierecommendation
Demography basedhybridrecommendersystemformovierecommendation
UmmeSalmaM1
 
Datascience and python
Datascience and pythonDatascience and python
Datascience and python
UmmeSalmaM1
 
Machine learning visual_quiz
Machine learning visual_quizMachine learning visual_quiz
Machine learning visual_quiz
UmmeSalmaM1
 
Lecture1 introduction to machine learning
Lecture1 introduction to machine learningLecture1 introduction to machine learning
Lecture1 introduction to machine learning
UmmeSalmaM1
 
The Art of Entrepreneurship
The Art of Entrepreneurship The Art of Entrepreneurship
The Art of Entrepreneurship
UmmeSalmaM1
 
Impact of Learning Functions on Prediction of Stock Data
Impact of Learning Functions on Prediction of Stock DataImpact of Learning Functions on Prediction of Stock Data
Impact of Learning Functions on Prediction of Stock Data
UmmeSalmaM1
 

More from UmmeSalmaM1 (9)

Welcome to Python Programming.pptx
Welcome to Python Programming.pptxWelcome to Python Programming.pptx
Welcome to Python Programming.pptx
 
Role of digital technology in autism a case study
Role of digital technology in autism a case studyRole of digital technology in autism a case study
Role of digital technology in autism a case study
 
Programming for data science in python
Programming for data science in pythonProgramming for data science in python
Programming for data science in python
 
Demography basedhybridrecommendersystemformovierecommendation
Demography basedhybridrecommendersystemformovierecommendationDemography basedhybridrecommendersystemformovierecommendation
Demography basedhybridrecommendersystemformovierecommendation
 
Datascience and python
Datascience and pythonDatascience and python
Datascience and python
 
Machine learning visual_quiz
Machine learning visual_quizMachine learning visual_quiz
Machine learning visual_quiz
 
Lecture1 introduction to machine learning
Lecture1 introduction to machine learningLecture1 introduction to machine learning
Lecture1 introduction to machine learning
 
The Art of Entrepreneurship
The Art of Entrepreneurship The Art of Entrepreneurship
The Art of Entrepreneurship
 
Impact of Learning Functions on Prediction of Stock Data
Impact of Learning Functions on Prediction of Stock DataImpact of Learning Functions on Prediction of Stock Data
Impact of Learning Functions on Prediction of Stock Data
 

Recently uploaded

一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
ewymefz
 
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
v3tuleee
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
axoqas
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
ewymefz
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
slg6lamcq
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
Subhajit Sahu
 
一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单
ocavb
 
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
pchutichetpong
 
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
NABLAS株式会社
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
enxupq
 
Q1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundQ1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year Rebound
Oppotus
 
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
nscud
 
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
u86oixdj
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
Opendatabay
 
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
ewymefz
 
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
ewymefz
 
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdfSample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Linda486226
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
axoqas
 
一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单
ewymefz
 
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
ahzuo
 

Recently uploaded (20)

一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
 
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
 
一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单
 
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
 
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
 
Q1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundQ1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year Rebound
 
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
 
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
 
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
 
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
 
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdfSample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
 
一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单
 
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
 

Introduction to Data Science

  • 1. MISSION CHRIST is a nurturing ground for an individual’s holistic development to make effective contribution to the society in a dynamic environment VISION Excellence and Service CORE VALUES Faith in God | Moral Uprightness Love of Fellow Beings Social Responsibility | Pursuit of Excellence MTH341C - PRINCIPLES OF DATA SCIENCE Week1: 18 to 23 July 2022 Department of Data Science and Statistics, CHRIST (DEEMED TO BE UNIVERSITY) BANGALORE, KARNATAKA, INDIA Introduction to Data Science Dr. UMME SALMA M Assistant Professor Ummesalma.m@christuniversity.in
  • 2. Excellence and Service CHRIST Deemed to be University Class Details ● Programme ○ MSC Mathematics ● Course ○ MTH341C ○ PRINCIPLES OF DATA SCIENCE ● Unit 1 ○ Introduction To Data Science and Big Data ● Topic 1 ○ Data Science Market ● Material ○ Online resources 2
  • 3. Excellence and Service CHRIST Deemed to be University Outline 3 ➔KYS ➔Data Science Family ➔Skills and Jobs ➔Resources
  • 4. Excellence and Service CHRIST Deemed to be University KYS :) Gender; Region; +2/UG Groups; Interest; Project; Goal 4
  • 5. Excellence and Service CHRIST Deemed to be University Data Science 5
  • 7. Excellence and Service CHRIST Deemed to be University Data Science Virtual Machine 7
  • 8. Excellence and Service CHRIST Deemed to be University Data Science Family 8 ● Data Science is a field of science than mere data ● Data Mining is mainly about finding useful information in a dataset and utilizing that information to uncover hidden patterns. ● Data Analytics involves tools and techniques ○ [information resulting from the systematic analysis of data or statistics] & Data Mining
  • 13. Excellence and Service CHRIST Deemed to be University Data Science Steps 13 Step 1: The first step of this process is setting a research goal. The main purpose here is making sure all the stakeholders understand the what, how, and why of the project. Step 2: The second phase is data retrieval. You want to have data available for analysis, so this step includes finding suitable data and getting access to the data from the data owner. The result is data in its raw form, which probably needs polishing and transformation beforeit becomes usable. Step 3: Data transformation converts a raw form into directly usable form. To achieve this, you’ll detectand correctdifferentkinds of errors in the data, combine data from differentdata sources,and transform it. If you have successfullycompletedthis step, you can progress to data visualization and modeling. Step 4: Data Exploration helps to gain a deep understanding of the data. You’ll look for patterns, correlations, and deviations based on visual and descriptive techniques.The insights you gain from this phase will enable you to start modeling. Step 5: Data modelling is the phase to attempt to gain the insights or make the predictions stated in your projectcharter. Now is the time to bring out the heavy guns, but rememberresearchhas taught us that often (but not always) a combinationof simple models tends to outperform one complicatedmodel. Step 6:Presentation and automation is all about presenting your results and automating the analysis, if needed.
  • 14. Excellence and Service CHRIST Deemed to be University Data Science Steps Outcome 14 Step1 Outcome:Clear Understanding of the goals of research and its context. A projectcharter requires teamwork, and your input covers at least the following: ■ A clear researchgoal ■ The projectmissionand context ■ How you’re going to perform your analysis ■ What resources you expectto use ■ Proof that it’s an achievable project,or proof of concepts ■ Deliverables and a measure of success ■ A timeline Step 2 Outcome:Sometimesyou need to go into the field and designa data collectionprocess yourself,but most of the time you won’t be involved in this step. Step 3 Outcome:Getting access to data is another difficulttask. Organizations understand the value and sensitivity of data and oftenhave policies in place so everyone has access to what they need and nothing more. Don’t be afraid to shop around.
  • 15. Excellence and Service CHRIST Deemed to be University 15 Step 4 Outcome:Cleansing data Data cleansing is a subprocess of the data science processthat focuses on removing rrors in your data so your data becomesa true and consistent representationof the processes itoriginates from. Combiningdata from differentdata sources Step 5 Outcome:Working Model based upon the requirement Step 6: Deployed Model
  • 20. Excellence and Service CHRIST Deemed to be University 20 Source:https://wheebox.com/assets/pdf/ISR_Report_2020.pdf
  • 21. Excellence and Service CHRIST Deemed to be University 21 Source:https://wheebox.com/assets/pdf/ISR_Report_2021.pdf
  • 23. Excellence and Service CHRIST Deemed to be University 23 Source: https://analyticsindiamag.com/why-you-may-not-be-getting-a-call-back-for-that-data-science-job/
  • 24. Excellence and Service CHRIST Deemed to be University Skills and Jobs 24 Source:https://blog.udacity.com/2014/11/data-science-job-skills.html
  • 25. Excellence and Service CHRIST Deemed to be University 25 Souce: https://www.gartner.com/smarterwithgartner/gartner-top-10-data-and-analytics-trends-for-2021/
  • 26. Excellence and Service CHRIST Deemed to be University Data Repositories 26 •Google DatasetSearch. •Kaggle. •Data.Gov. •Datahub.io. •UCI Machine Learning Repository. •Earth Data. •CERN Open Data Portal. •Global Health ObservatoryData Repository. •NCBI •CERT •NCRB •Indiastat
  • 27. Excellence and Service CHRIST Deemed to be University Resources 27 ● https://www.kdnuggets.com ● https://www.kaggle.com/ ● https://www.analyticsvidhya.com/ ● https://towardsdatascience.com ● https://machinelearningmastery.com/ ● https://pydata.org/ ● https://www.meetup.com/topics/data-science/ arXiv ; GitHub; MOOCS
  • 28. Excellence and Service CHRIST Deemed to be University THANKYOU Next Topic: Unit 1: Chapter 1 Data Science in a Big Data World Next session: Monday 12.00 PM 28