SlideShare a Scribd company logo
1 of 15
Amity Institute of Information Technology
Introduction to Data Science
BSc.IT/BCA/ DUAL VI Semester
Faculty: Dr. Shambhu Kumar Jha
1
Amity Institute of Information Technology
Module I : Introduction
What is Data Science
Why Data Science
Component of Data science
Data science Process
2
Amity Institute of Information Technology
Introduction
Let’s start by understanding what is data science?
– Have you ever wondered how Amazon, eBay suggest items
for you to buy?
– How Gmail filters your emails in the spam and non-spam
categories?
– How Netflix predicts the shows of your liking?
3
Amity Institute of Information Technology
What is Data Science?
• Data Science is the area of study which involves extracting
insights from vast amounts of data using various scientific methods,
algorithms, and processes.
• It helps you to discover hidden patterns from the raw data.
• The term Data Science has emerged because of the evolution of
mathematical statistics, data analysis, and big data.
• Data Science is an interdisciplinary field that allows you to extract
knowledge from structured or unstructured data.
• Data science enables you to translate a business problem into a
research project and then translate it back into a practical solution.4
Amity Institute of Information Technology
What is Data Science?
5
Data science is all
about using data to
solve problems.
Amity Institute of Information Technology
Why Data Science?
Significant advantages of using Data Analytics Technology:
• Data is the oil for today’s world.
• With right tools, technologies, algorithms, we can use data to convert
into distinct business advantage
• Data Science can help you to detect fraud using advanced machine
learning algorithms
• It helps you to prevent any significant monetary losses
• Allows to build intelligence ability in machines
• You can perform sentiment analysis to gauge customer brand loyalty
• It enables you to take better and faster decisions
• It helps you to recommend the right product to the right customer to
enhance your business
6
Amity Institute of Information Technology
Data Science & Statistics
7
Amity Institute of Information Technology
Data Science Components
8
Amity Institute of Information Technology
Data Science Components
• Statistics: Statistics is the most critical unit of Data Science basics,
and it is the method or science of collecting and analysing numerical
data in large quantities to get useful insights.
• Visualization:
• Visualization technique helps you access huge amounts of data in easy
to understand and digestible visuals.
• Machine Learning:
• Machine Learning explores the building and study of algorithms that
learn to make predictions about unforeseen/future data.
• Deep Learning:
• Deep Learning method is new machine learning research where the
algorithm selects the analysis model to follow.
9
Amity Institute of Information Technology
Data Science Process
10
Amity Institute of Information Technology
Data Science Process
Discovery:
• Discovery step involves acquiring data from all the
identified internal & external sources, which helps you
answer the business question.
The data can be:
• Logs from webservers
• Data gathered from social media
• Census datasets
• Data streamed from online sources using APIs
11
Amity Institute of Information Technology
Data Science Process
Preparation: ETL :Extraction Transformation Loading
Data can have many inconsistencies like:
• Missing values,
• Blank columns,
• An incorrect data format, which needs to be cleaned.
You need to process, explore, and condition data before
modelling.
The cleaner your data, the better are your predictions.
12
Amity Institute of Information Technology
Data Science Process
Model Planning:
• In this stage, you need to determine the method and
technique to draw the relation between input variables.
• Planning for a model is performed by using different
statistical formulas and visualization tools.
• SQL analysis services, R, and SAS/access are some of the
tools used for this purpose.
13
Amity Institute of Information Technology
Data Science Process
Model Building:
• In this step, the actual model building process starts.
• Here, Data scientist distributes datasets for training and
testing.
• Techniques like association, classification, and clustering
are applied to the training data set.
• The model, once prepared, is tested against the “testing”
dataset.
14
Amity Institute of Information Technology
Data Science Process
Operationalize:
• You deliver the final baselined model with reports, code,
and technical documents in this stage.
• Model is deployed into a real-time production environment
after thorough testing.
Communicate Results
• In this stage, the key findings are communicated to all
stakeholders.
• This helps you decide if the project results are a success or a
failure based on the inputs from the model.
15

More Related Content

Similar to L1 Introduction DS.pptx

Data science training in hyd ppt (1)
Data science training in hyd ppt (1)Data science training in hyd ppt (1)
Data science training in hyd ppt (1)SayyedYusufali
 
Data science training institute in hyderabad
Data science training institute in hyderabadData science training institute in hyderabad
Data science training institute in hyderabadVamsiNihal
 
Data science training in Hyderabad
Data science  training in HyderabadData science  training in Hyderabad
Data science training in Hyderabadsaitejavella
 
Data science training Hyderabad
Data science training HyderabadData science training Hyderabad
Data science training HyderabadNithinsunil1
 
Data science online training in hyderabad
Data science online training in hyderabadData science online training in hyderabad
Data science online training in hyderabadVamsiNihal
 
Data science training in hyd ppt (1)
Data science training in hyd ppt (1)Data science training in hyd ppt (1)
Data science training in hyd ppt (1)SayyedYusufali
 
data science training and placement
data science training and placementdata science training and placement
data science training and placementSaiprasadVella
 
online data science training
online data science trainingonline data science training
online data science trainingDIGITALSAI1
 
Data science online training in hyderabad
Data science online training in hyderabadData science online training in hyderabad
Data science online training in hyderabadVamsiNihal
 
data science online training in hyderabad
data science online training in hyderabaddata science online training in hyderabad
data science online training in hyderabadVamsiNihal
 
Best data science training in Hyderabad
Best data science training in HyderabadBest data science training in Hyderabad
Best data science training in HyderabadKumarNaik21
 
Data science training Hyderabad
Data science training HyderabadData science training Hyderabad
Data science training HyderabadNithinsunil1
 
Data+Science+in+Python+-+Data+Prep+&+EDA.pdf
Data+Science+in+Python+-+Data+Prep+&+EDA.pdfData+Science+in+Python+-+Data+Prep+&+EDA.pdf
Data+Science+in+Python+-+Data+Prep+&+EDA.pdfneelakandan2001kpm
 
Data science training in hyd ppt converted (1)
Data science training in hyd ppt converted (1)Data science training in hyd ppt converted (1)
Data science training in hyd ppt converted (1)SayyedYusufali
 
Data science training in hyd pdf converted (1)
Data science training in hyd pdf converted (1)Data science training in hyd pdf converted (1)
Data science training in hyd pdf converted (1)SayyedYusufali
 
Data science training in hydpdf converted (1)
Data science training in hydpdf  converted (1)Data science training in hydpdf  converted (1)
Data science training in hydpdf converted (1)SayyedYusufali
 
Data Science Training and Placement
Data Science Training and PlacementData Science Training and Placement
Data Science Training and PlacementAkhilGGM
 
Machine learning and big data
Machine learning and big dataMachine learning and big data
Machine learning and big dataPoo Kuan Hoong
 
Data Analytics Course In Surat.pdf
Data Analytics Course In Surat.pdfData Analytics Course In Surat.pdf
Data Analytics Course In Surat.pdfSujata Gupta
 
The path to be a data scientist
The path to be a data scientistThe path to be a data scientist
The path to be a data scientistPoo Kuan Hoong
 

Similar to L1 Introduction DS.pptx (20)

Data science training in hyd ppt (1)
Data science training in hyd ppt (1)Data science training in hyd ppt (1)
Data science training in hyd ppt (1)
 
Data science training institute in hyderabad
Data science training institute in hyderabadData science training institute in hyderabad
Data science training institute in hyderabad
 
Data science training in Hyderabad
Data science  training in HyderabadData science  training in Hyderabad
Data science training in Hyderabad
 
Data science training Hyderabad
Data science training HyderabadData science training Hyderabad
Data science training Hyderabad
 
Data science online training in hyderabad
Data science online training in hyderabadData science online training in hyderabad
Data science online training in hyderabad
 
Data science training in hyd ppt (1)
Data science training in hyd ppt (1)Data science training in hyd ppt (1)
Data science training in hyd ppt (1)
 
data science training and placement
data science training and placementdata science training and placement
data science training and placement
 
online data science training
online data science trainingonline data science training
online data science training
 
Data science online training in hyderabad
Data science online training in hyderabadData science online training in hyderabad
Data science online training in hyderabad
 
data science online training in hyderabad
data science online training in hyderabaddata science online training in hyderabad
data science online training in hyderabad
 
Best data science training in Hyderabad
Best data science training in HyderabadBest data science training in Hyderabad
Best data science training in Hyderabad
 
Data science training Hyderabad
Data science training HyderabadData science training Hyderabad
Data science training Hyderabad
 
Data+Science+in+Python+-+Data+Prep+&+EDA.pdf
Data+Science+in+Python+-+Data+Prep+&+EDA.pdfData+Science+in+Python+-+Data+Prep+&+EDA.pdf
Data+Science+in+Python+-+Data+Prep+&+EDA.pdf
 
Data science training in hyd ppt converted (1)
Data science training in hyd ppt converted (1)Data science training in hyd ppt converted (1)
Data science training in hyd ppt converted (1)
 
Data science training in hyd pdf converted (1)
Data science training in hyd pdf converted (1)Data science training in hyd pdf converted (1)
Data science training in hyd pdf converted (1)
 
Data science training in hydpdf converted (1)
Data science training in hydpdf  converted (1)Data science training in hydpdf  converted (1)
Data science training in hydpdf converted (1)
 
Data Science Training and Placement
Data Science Training and PlacementData Science Training and Placement
Data Science Training and Placement
 
Machine learning and big data
Machine learning and big dataMachine learning and big data
Machine learning and big data
 
Data Analytics Course In Surat.pdf
Data Analytics Course In Surat.pdfData Analytics Course In Surat.pdf
Data Analytics Course In Surat.pdf
 
The path to be a data scientist
The path to be a data scientistThe path to be a data scientist
The path to be a data scientist
 

More from Shambhavi Vats

L4 Intro Statistical Inferencing.pptx
L4 Intro Statistical Inferencing.pptxL4 Intro Statistical Inferencing.pptx
L4 Intro Statistical Inferencing.pptxShambhavi Vats
 
Vaastav talwar a010145020052 adbms psda vaastav talwar
Vaastav talwar a010145020052 adbms psda   vaastav talwarVaastav talwar a010145020052 adbms psda   vaastav talwar
Vaastav talwar a010145020052 adbms psda vaastav talwarShambhavi Vats
 
38 object-concepts (1)
38 object-concepts (1)38 object-concepts (1)
38 object-concepts (1)Shambhavi Vats
 
Aakarsh 038 csit142_lab_work (1)
Aakarsh 038 csit142_lab_work (1)Aakarsh 038 csit142_lab_work (1)
Aakarsh 038 csit142_lab_work (1)Shambhavi Vats
 
wool from fibre to fabric
wool from fibre to fabricwool from fibre to fabric
wool from fibre to fabricShambhavi Vats
 
road side rain water harvesting
road side rain water harvestingroad side rain water harvesting
road side rain water harvestingShambhavi Vats
 

More from Shambhavi Vats (9)

L4 Intro Statistical Inferencing.pptx
L4 Intro Statistical Inferencing.pptxL4 Intro Statistical Inferencing.pptx
L4 Intro Statistical Inferencing.pptx
 
Vaastav talwar a010145020052 adbms psda vaastav talwar
Vaastav talwar a010145020052 adbms psda   vaastav talwarVaastav talwar a010145020052 adbms psda   vaastav talwar
Vaastav talwar a010145020052 adbms psda vaastav talwar
 
38 object-concepts (1)
38 object-concepts (1)38 object-concepts (1)
38 object-concepts (1)
 
Bca 2b
Bca 2bBca 2b
Bca 2b
 
Aakarsh 038 csit142_lab_work (1)
Aakarsh 038 csit142_lab_work (1)Aakarsh 038 csit142_lab_work (1)
Aakarsh 038 csit142_lab_work (1)
 
wool from fibre to fabric
wool from fibre to fabricwool from fibre to fabric
wool from fibre to fabric
 
urban livehood
urban livehoodurban livehood
urban livehood
 
road side rain water harvesting
road side rain water harvestingroad side rain water harvesting
road side rain water harvesting
 
Water
WaterWater
Water
 

Recently uploaded

꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Callshivangimorya083
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...soniya singh
 
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一F La
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130Suhani Kapoor
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfLars Albertsson
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsappssapnasaifi408
 
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDINTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDRafezzaman
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]📊 Markus Baersch
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfgstagge
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Sapana Sha
 
9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home Service9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home ServiceSapana Sha
 
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)jennyeacort
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...dajasot375
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024thyngster
 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts ServiceSapana Sha
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingNeil Barnes
 
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...ThinkInnovation
 

Recently uploaded (20)

꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
 
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdf
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
 
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDINTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdf
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
 
9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home Service9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home Service
 
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts Service
 
E-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptxE-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptx
 
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
 
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data Storytelling
 
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
 

L1 Introduction DS.pptx

  • 1. Amity Institute of Information Technology Introduction to Data Science BSc.IT/BCA/ DUAL VI Semester Faculty: Dr. Shambhu Kumar Jha 1
  • 2. Amity Institute of Information Technology Module I : Introduction What is Data Science Why Data Science Component of Data science Data science Process 2
  • 3. Amity Institute of Information Technology Introduction Let’s start by understanding what is data science? – Have you ever wondered how Amazon, eBay suggest items for you to buy? – How Gmail filters your emails in the spam and non-spam categories? – How Netflix predicts the shows of your liking? 3
  • 4. Amity Institute of Information Technology What is Data Science? • Data Science is the area of study which involves extracting insights from vast amounts of data using various scientific methods, algorithms, and processes. • It helps you to discover hidden patterns from the raw data. • The term Data Science has emerged because of the evolution of mathematical statistics, data analysis, and big data. • Data Science is an interdisciplinary field that allows you to extract knowledge from structured or unstructured data. • Data science enables you to translate a business problem into a research project and then translate it back into a practical solution.4
  • 5. Amity Institute of Information Technology What is Data Science? 5 Data science is all about using data to solve problems.
  • 6. Amity Institute of Information Technology Why Data Science? Significant advantages of using Data Analytics Technology: • Data is the oil for today’s world. • With right tools, technologies, algorithms, we can use data to convert into distinct business advantage • Data Science can help you to detect fraud using advanced machine learning algorithms • It helps you to prevent any significant monetary losses • Allows to build intelligence ability in machines • You can perform sentiment analysis to gauge customer brand loyalty • It enables you to take better and faster decisions • It helps you to recommend the right product to the right customer to enhance your business 6
  • 7. Amity Institute of Information Technology Data Science & Statistics 7
  • 8. Amity Institute of Information Technology Data Science Components 8
  • 9. Amity Institute of Information Technology Data Science Components • Statistics: Statistics is the most critical unit of Data Science basics, and it is the method or science of collecting and analysing numerical data in large quantities to get useful insights. • Visualization: • Visualization technique helps you access huge amounts of data in easy to understand and digestible visuals. • Machine Learning: • Machine Learning explores the building and study of algorithms that learn to make predictions about unforeseen/future data. • Deep Learning: • Deep Learning method is new machine learning research where the algorithm selects the analysis model to follow. 9
  • 10. Amity Institute of Information Technology Data Science Process 10
  • 11. Amity Institute of Information Technology Data Science Process Discovery: • Discovery step involves acquiring data from all the identified internal & external sources, which helps you answer the business question. The data can be: • Logs from webservers • Data gathered from social media • Census datasets • Data streamed from online sources using APIs 11
  • 12. Amity Institute of Information Technology Data Science Process Preparation: ETL :Extraction Transformation Loading Data can have many inconsistencies like: • Missing values, • Blank columns, • An incorrect data format, which needs to be cleaned. You need to process, explore, and condition data before modelling. The cleaner your data, the better are your predictions. 12
  • 13. Amity Institute of Information Technology Data Science Process Model Planning: • In this stage, you need to determine the method and technique to draw the relation between input variables. • Planning for a model is performed by using different statistical formulas and visualization tools. • SQL analysis services, R, and SAS/access are some of the tools used for this purpose. 13
  • 14. Amity Institute of Information Technology Data Science Process Model Building: • In this step, the actual model building process starts. • Here, Data scientist distributes datasets for training and testing. • Techniques like association, classification, and clustering are applied to the training data set. • The model, once prepared, is tested against the “testing” dataset. 14
  • 15. Amity Institute of Information Technology Data Science Process Operationalize: • You deliver the final baselined model with reports, code, and technical documents in this stage. • Model is deployed into a real-time production environment after thorough testing. Communicate Results • In this stage, the key findings are communicated to all stakeholders. • This helps you decide if the project results are a success or a failure based on the inputs from the model. 15

Editor's Notes

  1. 1