SlideShare a Scribd company logo

Introduction to Data Science

Christy Abraham Joy
Christy Abraham JoySoftware Engineer- Analytics at GadgEon Smart Systems

A brief introduction to DataScience with explaining of the concepts, algorithms, machine learning, supervised and unsupervised learning, clustering, statistics, data preprocessing, real-world applications etc. It's part of a Data Science Corner Campaign where I will be discussing the fundamentals of DataScience, AIML, Statistics etc.

Introduction to Data Science

Christy Abraham Joy
Christy Abraham JoySoftware Engineer- Analytics at GadgEon Smart Systems

A brief introduction to DataScience with explaining of the concepts, algorithms, machine learning, supervised and unsupervised learning, clustering, statistics, data preprocessing, real-world applications etc. It's part of a Data Science Corner Campaign where I will be discussing the fundamentals of DataScience, AIML, Statistics etc.

Introduction to Data Science

1 of 51
Download to read offline
Introduction to
DATA SCIENCE
Introduction to Data Science
Introduction to Data Science
Introduction to Data Science
Challenges deep-dive
Why the Hype Around
Data Science?
● The demand for data scientists will soar by 28% by 2023
● Data scientist roles have grown over 650% since 2012, but
currently, 35,000 people in the US have data science skills,
while hundreds of companies are hiring for those roles.
● Software engineering is a common starting point for
professionals who are in the top five fasting growing jobs today.
● Data Science gives you career flexibility
Who are Data Scientist?

Recommended

Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next Tessa Mero
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...RachelPearson36
 
The State of Enterprise Software
The State of Enterprise SoftwareThe State of Enterprise Software
The State of Enterprise SoftwareGreylock Partners
 
The Future of Everything
The Future of EverythingThe Future of Everything
The Future of EverythingCharbel Zeaiter
 
Things That Don't Matter in Your Presentation!
Things That Don't Matter in Your Presentation!Things That Don't Matter in Your Presentation!
Things That Don't Matter in Your Presentation!Ayman Sadiq
 
How to Use Social Media to Influence the World
How to Use Social Media to Influence the WorldHow to Use Social Media to Influence the World
How to Use Social Media to Influence the WorldSean Si
 

More Related Content

What's hot

Landscape of AI/ML in 2023
Landscape of AI/ML in 2023Landscape of AI/ML in 2023
Landscape of AI/ML in 2023HyunJoon Jung
 
leewayhertz.com-How to build a generative AI solution From prototyping to pro...
leewayhertz.com-How to build a generative AI solution From prototyping to pro...leewayhertz.com-How to build a generative AI solution From prototyping to pro...
leewayhertz.com-How to build a generative AI solution From prototyping to pro...robertsamuel23
 
Artificial intelligence (ai) and its impact to business
Artificial intelligence (ai) and its impact to businessArtificial intelligence (ai) and its impact to business
Artificial intelligence (ai) and its impact to businesspaul young cpa, cga
 
Hegazi_ChatGPT_Book.pdf
Hegazi_ChatGPT_Book.pdfHegazi_ChatGPT_Book.pdf
Hegazi_ChatGPT_Book.pdfAmirHegazi1
 
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...DevGAMM Conference
 
Machine Learning Model Deployment: Strategy to Implementation
Machine Learning Model Deployment: Strategy to ImplementationMachine Learning Model Deployment: Strategy to Implementation
Machine Learning Model Deployment: Strategy to ImplementationDataWorks Summit
 
SXSW 2016 takeaways
SXSW 2016 takeawaysSXSW 2016 takeaways
SXSW 2016 takeawaysHavas
 
Everything to know about ChatGPT
Everything to know about ChatGPTEverything to know about ChatGPT
Everything to know about ChatGPTKnoldus Inc.
 
Introduction To Machine Learning
Introduction To Machine LearningIntroduction To Machine Learning
Introduction To Machine LearningKnoldus Inc.
 
SlideShare Experts - 7 Experts Reveal Their Presentation Design Secrets
SlideShare Experts - 7 Experts Reveal Their Presentation Design SecretsSlideShare Experts - 7 Experts Reveal Their Presentation Design Secrets
SlideShare Experts - 7 Experts Reveal Their Presentation Design SecretsEugene Cheng
 
Working With Big Data
Working With Big DataWorking With Big Data
Working With Big DataSeth Familian
 
TEDx Manchester: AI & The Future of Work
TEDx Manchester: AI & The Future of WorkTEDx Manchester: AI & The Future of Work
TEDx Manchester: AI & The Future of WorkVolker Hirsch
 
Data Culture Keynote and Exec Track Birm Dec 8th
Data Culture Keynote and Exec Track Birm Dec 8thData Culture Keynote and Exec Track Birm Dec 8th
Data Culture Keynote and Exec Track Birm Dec 8thJonathan Woodward
 
Design Ethics for Artificial Intelligence
Design Ethics for Artificial IntelligenceDesign Ethics for Artificial Intelligence
Design Ethics for Artificial IntelligenceCharbel Zeaiter
 
20 Ideas for your Website Homepage Content
20 Ideas for your Website Homepage Content20 Ideas for your Website Homepage Content
20 Ideas for your Website Homepage ContentBarry Feldman
 
ChatGPT 101 - Vancouver ChatGPT Experts
ChatGPT 101 - Vancouver ChatGPT ExpertsChatGPT 101 - Vancouver ChatGPT Experts
ChatGPT 101 - Vancouver ChatGPT ExpertsAli Tavanayan
 

What's hot (20)

Landscape of AI/ML in 2023
Landscape of AI/ML in 2023Landscape of AI/ML in 2023
Landscape of AI/ML in 2023
 
leewayhertz.com-How to build a generative AI solution From prototyping to pro...
leewayhertz.com-How to build a generative AI solution From prototyping to pro...leewayhertz.com-How to build a generative AI solution From prototyping to pro...
leewayhertz.com-How to build a generative AI solution From prototyping to pro...
 
Artificial intelligence (ai) and its impact to business
Artificial intelligence (ai) and its impact to businessArtificial intelligence (ai) and its impact to business
Artificial intelligence (ai) and its impact to business
 
Hegazi_ChatGPT_Book.pdf
Hegazi_ChatGPT_Book.pdfHegazi_ChatGPT_Book.pdf
Hegazi_ChatGPT_Book.pdf
 
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
 
Unlocking the Power of ChatGPT
Unlocking the Power of ChatGPTUnlocking the Power of ChatGPT
Unlocking the Power of ChatGPT
 
Behind the Scenes of ChatGPT.pptx
Behind the Scenes of ChatGPT.pptxBehind the Scenes of ChatGPT.pptx
Behind the Scenes of ChatGPT.pptx
 
Machine Learning Model Deployment: Strategy to Implementation
Machine Learning Model Deployment: Strategy to ImplementationMachine Learning Model Deployment: Strategy to Implementation
Machine Learning Model Deployment: Strategy to Implementation
 
ChatGPT webinar slides
ChatGPT webinar slidesChatGPT webinar slides
ChatGPT webinar slides
 
SXSW 2016 takeaways
SXSW 2016 takeawaysSXSW 2016 takeaways
SXSW 2016 takeaways
 
Everything to know about ChatGPT
Everything to know about ChatGPTEverything to know about ChatGPT
Everything to know about ChatGPT
 
Introduction To Machine Learning
Introduction To Machine LearningIntroduction To Machine Learning
Introduction To Machine Learning
 
SlideShare Experts - 7 Experts Reveal Their Presentation Design Secrets
SlideShare Experts - 7 Experts Reveal Their Presentation Design SecretsSlideShare Experts - 7 Experts Reveal Their Presentation Design Secrets
SlideShare Experts - 7 Experts Reveal Their Presentation Design Secrets
 
Working With Big Data
Working With Big DataWorking With Big Data
Working With Big Data
 
TEDx Manchester: AI & The Future of Work
TEDx Manchester: AI & The Future of WorkTEDx Manchester: AI & The Future of Work
TEDx Manchester: AI & The Future of Work
 
Data Culture Keynote and Exec Track Birm Dec 8th
Data Culture Keynote and Exec Track Birm Dec 8thData Culture Keynote and Exec Track Birm Dec 8th
Data Culture Keynote and Exec Track Birm Dec 8th
 
Design Ethics for Artificial Intelligence
Design Ethics for Artificial IntelligenceDesign Ethics for Artificial Intelligence
Design Ethics for Artificial Intelligence
 
20 Ideas for your Website Homepage Content
20 Ideas for your Website Homepage Content20 Ideas for your Website Homepage Content
20 Ideas for your Website Homepage Content
 
Journey of Generative AI
Journey of Generative AIJourney of Generative AI
Journey of Generative AI
 
ChatGPT 101 - Vancouver ChatGPT Experts
ChatGPT 101 - Vancouver ChatGPT ExpertsChatGPT 101 - Vancouver ChatGPT Experts
ChatGPT 101 - Vancouver ChatGPT Experts
 

Similar to Introduction to Data Science

INTERNSHIP ON MAcHINE LEARNING.pptx
INTERNSHIP ON MAcHINE LEARNING.pptxINTERNSHIP ON MAcHINE LEARNING.pptx
INTERNSHIP ON MAcHINE LEARNING.pptxsrikanthkallem1
 
Machine learning applications nurturing growth of various business domains
Machine learning applications nurturing growth of various business domainsMachine learning applications nurturing growth of various business domains
Machine learning applications nurturing growth of various business domainsShrutika Oswal
 
Machine Learning Ch 1.ppt
Machine Learning Ch 1.pptMachine Learning Ch 1.ppt
Machine Learning Ch 1.pptARVIND SARDAR
 
Machine Learning
Machine LearningMachine Learning
Machine LearningAmit Kumar
 
BIG DATA AND MACHINE LEARNING
BIG DATA AND MACHINE LEARNINGBIG DATA AND MACHINE LEARNING
BIG DATA AND MACHINE LEARNINGUmair Shafique
 
introduction to machine learning
introduction to machine learningintroduction to machine learning
introduction to machine learningJohnson Ubah
 
detailed Presentation on supervised learning
 detailed Presentation on supervised learning detailed Presentation on supervised learning
detailed Presentation on supervised learningZAMANCHBWN
 
machine_learning_section1_ebook.pdf
machine_learning_section1_ebook.pdfmachine_learning_section1_ebook.pdf
machine_learning_section1_ebook.pdfagfi
 
Applied Artificial Intelligence Unit 3 Semester 3 MSc IT Part 2 Mumbai Univer...
Applied Artificial Intelligence Unit 3 Semester 3 MSc IT Part 2 Mumbai Univer...Applied Artificial Intelligence Unit 3 Semester 3 MSc IT Part 2 Mumbai Univer...
Applied Artificial Intelligence Unit 3 Semester 3 MSc IT Part 2 Mumbai Univer...Madhav Mishra
 
Training_Report_on_Machine_Learning.docx
Training_Report_on_Machine_Learning.docxTraining_Report_on_Machine_Learning.docx
Training_Report_on_Machine_Learning.docxShubhamBishnoi14
 
How to build machine learning apps.pdf
How to build machine learning apps.pdfHow to build machine learning apps.pdf
How to build machine learning apps.pdfStephenAmell4
 
How to build machine learning apps.pdf
How to build machine learning apps.pdfHow to build machine learning apps.pdf
How to build machine learning apps.pdfAnastasiaSteele10
 
How to build machine learning apps.pdf
How to build machine learning apps.pdfHow to build machine learning apps.pdf
How to build machine learning apps.pdfJamieDornan2
 
How to build machine learning apps.pdf
How to build machine learning apps.pdfHow to build machine learning apps.pdf
How to build machine learning apps.pdfStephenAmell4
 
How to build machine learning apps.pdf
How to build machine learning apps.pdfHow to build machine learning apps.pdf
How to build machine learning apps.pdfAnastasiaSteele10
 
Supervised Machine Learning Techniques common algorithms and its application
Supervised Machine Learning Techniques common algorithms and its applicationSupervised Machine Learning Techniques common algorithms and its application
Supervised Machine Learning Techniques common algorithms and its applicationTara ram Goyal
 
Big data, big opportunities
Big data, big opportunitiesBig data, big opportunities
Big data, big opportunitiesChouaieb NEMRI
 
How to build machine learning apps.pdf
How to build machine learning apps.pdfHow to build machine learning apps.pdf
How to build machine learning apps.pdfJamieDornan2
 
Popular Machine Learning Myths
Popular Machine Learning Myths Popular Machine Learning Myths
Popular Machine Learning Myths Rock Interview
 

Similar to Introduction to Data Science (20)

INTERNSHIP ON MAcHINE LEARNING.pptx
INTERNSHIP ON MAcHINE LEARNING.pptxINTERNSHIP ON MAcHINE LEARNING.pptx
INTERNSHIP ON MAcHINE LEARNING.pptx
 
Machine learning applications nurturing growth of various business domains
Machine learning applications nurturing growth of various business domainsMachine learning applications nurturing growth of various business domains
Machine learning applications nurturing growth of various business domains
 
Machine Learning Ch 1.ppt
Machine Learning Ch 1.pptMachine Learning Ch 1.ppt
Machine Learning Ch 1.ppt
 
Machine Learning
Machine LearningMachine Learning
Machine Learning
 
BIG DATA AND MACHINE LEARNING
BIG DATA AND MACHINE LEARNINGBIG DATA AND MACHINE LEARNING
BIG DATA AND MACHINE LEARNING
 
introduction to machine learning
introduction to machine learningintroduction to machine learning
introduction to machine learning
 
detailed Presentation on supervised learning
 detailed Presentation on supervised learning detailed Presentation on supervised learning
detailed Presentation on supervised learning
 
AI.pdf
AI.pdfAI.pdf
AI.pdf
 
machine_learning_section1_ebook.pdf
machine_learning_section1_ebook.pdfmachine_learning_section1_ebook.pdf
machine_learning_section1_ebook.pdf
 
Applied Artificial Intelligence Unit 3 Semester 3 MSc IT Part 2 Mumbai Univer...
Applied Artificial Intelligence Unit 3 Semester 3 MSc IT Part 2 Mumbai Univer...Applied Artificial Intelligence Unit 3 Semester 3 MSc IT Part 2 Mumbai Univer...
Applied Artificial Intelligence Unit 3 Semester 3 MSc IT Part 2 Mumbai Univer...
 
Training_Report_on_Machine_Learning.docx
Training_Report_on_Machine_Learning.docxTraining_Report_on_Machine_Learning.docx
Training_Report_on_Machine_Learning.docx
 
How to build machine learning apps.pdf
How to build machine learning apps.pdfHow to build machine learning apps.pdf
How to build machine learning apps.pdf
 
How to build machine learning apps.pdf
How to build machine learning apps.pdfHow to build machine learning apps.pdf
How to build machine learning apps.pdf
 
How to build machine learning apps.pdf
How to build machine learning apps.pdfHow to build machine learning apps.pdf
How to build machine learning apps.pdf
 
How to build machine learning apps.pdf
How to build machine learning apps.pdfHow to build machine learning apps.pdf
How to build machine learning apps.pdf
 
How to build machine learning apps.pdf
How to build machine learning apps.pdfHow to build machine learning apps.pdf
How to build machine learning apps.pdf
 
Supervised Machine Learning Techniques common algorithms and its application
Supervised Machine Learning Techniques common algorithms and its applicationSupervised Machine Learning Techniques common algorithms and its application
Supervised Machine Learning Techniques common algorithms and its application
 
Big data, big opportunities
Big data, big opportunitiesBig data, big opportunities
Big data, big opportunities
 
How to build machine learning apps.pdf
How to build machine learning apps.pdfHow to build machine learning apps.pdf
How to build machine learning apps.pdf
 
Popular Machine Learning Myths
Popular Machine Learning Myths Popular Machine Learning Myths
Popular Machine Learning Myths
 

Recently uploaded

Till now I am learning.pptx
Till now I am learning.pptxTill now I am learning.pptx
Till now I am learning.pptxpranavraj960252
 
IFM 2024 Consumer Confidence INDEX Report (Jan 15,24).pdf
IFM  2024 Consumer Confidence INDEX Report (Jan 15,24).pdfIFM  2024 Consumer Confidence INDEX Report (Jan 15,24).pdf
IFM 2024 Consumer Confidence INDEX Report (Jan 15,24).pdfnhutnguyen355078
 
DATA SCIENCE APPLICATION IN TRANSPORTATION.pdf
DATA SCIENCE APPLICATION IN TRANSPORTATION.pdfDATA SCIENCE APPLICATION IN TRANSPORTATION.pdf
DATA SCIENCE APPLICATION IN TRANSPORTATION.pdfkanishkakb3
 
Applications of Data Science in YouTube for income generation.pdf
Applications of Data Science in YouTube for income generation.pdfApplications of Data Science in YouTube for income generation.pdf
Applications of Data Science in YouTube for income generation.pdfNukathotiSreeharshav
 
UNVEILING VALUE-BASED BIDDING SECRETS FOR MAXIMUM EFFICIENCY GAIN.pptx
UNVEILING VALUE-BASED BIDDING SECRETS FOR MAXIMUM EFFICIENCY GAIN.pptxUNVEILING VALUE-BASED BIDDING SECRETS FOR MAXIMUM EFFICIENCY GAIN.pptx
UNVEILING VALUE-BASED BIDDING SECRETS FOR MAXIMUM EFFICIENCY GAIN.pptxDoug Hall
 
Machine Learning with Tree-Based Models in Python
Machine Learning with Tree-Based Models in PythonMachine Learning with Tree-Based Models in Python
Machine Learning with Tree-Based Models in PythonVICTOR MAESTRE RAMIREZ
 
Application of Data Science in Cybersecurity.pptx
Application of Data Science in Cybersecurity.pptxApplication of Data Science in Cybersecurity.pptx
Application of Data Science in Cybersecurity.pptxchristinacammillus20
 
Application of Data Science in the Restaurant Industry.pptx
Application of Data Science in the Restaurant Industry.pptxApplication of Data Science in the Restaurant Industry.pptx
Application of Data Science in the Restaurant Industry.pptxkaustubhan
 
Search Engines Search engine is a software that is designed to information...
Search Engines  Search  engine  is a software that is designed to information...Search Engines  Search  engine  is a software that is designed to information...
Search Engines Search engine is a software that is designed to information...anitakumari533027
 
January 2024 - Top 10 Read Articles in International Journal of Artificial In...
January 2024 - Top 10 Read Articles in International Journal of Artificial In...January 2024 - Top 10 Read Articles in International Journal of Artificial In...
January 2024 - Top 10 Read Articles in International Journal of Artificial In...gerogepatton
 
Top-NO1 Amil Baba In Pakistan Amil Baba In Multan Amil Baba in sindh Amil Bab...
Top-NO1 Amil Baba In Pakistan Amil Baba In Multan Amil Baba in sindh Amil Bab...Top-NO1 Amil Baba In Pakistan Amil Baba In Multan Amil Baba in sindh Amil Bab...
Top-NO1 Amil Baba In Pakistan Amil Baba In Multan Amil Baba in sindh Amil Bab...Amil baba
 
Crunching Popcorn Numbers : How Data Science Shapes Film Franchises
Crunching Popcorn Numbers : How Data Science Shapes Film FranchisesCrunching Popcorn Numbers : How Data Science Shapes Film Franchises
Crunching Popcorn Numbers : How Data Science Shapes Film Franchisesdijalvincent1
 
CREAM - That Conference Austin - January 2024.pptx
CREAM - That Conference Austin - January 2024.pptxCREAM - That Conference Austin - January 2024.pptx
CREAM - That Conference Austin - January 2024.pptxMatthew Groves
 
Explainable Predictive and Prescriptive Process Analytics
Explainable Predictive and Prescriptive Process AnalyticsExplainable Predictive and Prescriptive Process Analytics
Explainable Predictive and Prescriptive Process AnalyticsRiccardoGalanti2
 
Impact of Data Science on the defence sector
Impact of Data Science on the defence sectorImpact of Data Science on the defence sector
Impact of Data Science on the defence sectorpoboxraghav
 
Applications of data science in social media.pptx
Applications of data science in social media.pptxApplications of data science in social media.pptx
Applications of data science in social media.pptxrangavikas1610
 
Wk 5 Comp Sci FINISHED pptx.pptx
Wk 5 Comp Sci FINISHED pptx.pptxWk 5 Comp Sci FINISHED pptx.pptx
Wk 5 Comp Sci FINISHED pptx.pptxanaum7
 

Recently uploaded (20)

Till now I am learning.pptx
Till now I am learning.pptxTill now I am learning.pptx
Till now I am learning.pptx
 
IFM 2024 Consumer Confidence INDEX Report (Jan 15,24).pdf
IFM  2024 Consumer Confidence INDEX Report (Jan 15,24).pdfIFM  2024 Consumer Confidence INDEX Report (Jan 15,24).pdf
IFM 2024 Consumer Confidence INDEX Report (Jan 15,24).pdf
 
Electricity Year 2023_updated_30012024.pptx
Electricity Year 2023_updated_30012024.pptxElectricity Year 2023_updated_30012024.pptx
Electricity Year 2023_updated_30012024.pptx
 
DATA SCIENCE APPLICATION IN TRANSPORTATION.pdf
DATA SCIENCE APPLICATION IN TRANSPORTATION.pdfDATA SCIENCE APPLICATION IN TRANSPORTATION.pdf
DATA SCIENCE APPLICATION IN TRANSPORTATION.pdf
 
Applications of Data Science in YouTube for income generation.pdf
Applications of Data Science in YouTube for income generation.pdfApplications of Data Science in YouTube for income generation.pdf
Applications of Data Science in YouTube for income generation.pdf
 
UNVEILING VALUE-BASED BIDDING SECRETS FOR MAXIMUM EFFICIENCY GAIN.pptx
UNVEILING VALUE-BASED BIDDING SECRETS FOR MAXIMUM EFFICIENCY GAIN.pptxUNVEILING VALUE-BASED BIDDING SECRETS FOR MAXIMUM EFFICIENCY GAIN.pptx
UNVEILING VALUE-BASED BIDDING SECRETS FOR MAXIMUM EFFICIENCY GAIN.pptx
 
Machine Learning with Tree-Based Models in Python
Machine Learning with Tree-Based Models in PythonMachine Learning with Tree-Based Models in Python
Machine Learning with Tree-Based Models in Python
 
Application of Data Science in Cybersecurity.pptx
Application of Data Science in Cybersecurity.pptxApplication of Data Science in Cybersecurity.pptx
Application of Data Science in Cybersecurity.pptx
 
Application of Data Science in the Restaurant Industry.pptx
Application of Data Science in the Restaurant Industry.pptxApplication of Data Science in the Restaurant Industry.pptx
Application of Data Science in the Restaurant Industry.pptx
 
Search Engines Search engine is a software that is designed to information...
Search Engines  Search  engine  is a software that is designed to information...Search Engines  Search  engine  is a software that is designed to information...
Search Engines Search engine is a software that is designed to information...
 
January 2024 - Top 10 Read Articles in International Journal of Artificial In...
January 2024 - Top 10 Read Articles in International Journal of Artificial In...January 2024 - Top 10 Read Articles in International Journal of Artificial In...
January 2024 - Top 10 Read Articles in International Journal of Artificial In...
 
Top-NO1 Amil Baba In Pakistan Amil Baba In Multan Amil Baba in sindh Amil Bab...
Top-NO1 Amil Baba In Pakistan Amil Baba In Multan Amil Baba in sindh Amil Bab...Top-NO1 Amil Baba In Pakistan Amil Baba In Multan Amil Baba in sindh Amil Bab...
Top-NO1 Amil Baba In Pakistan Amil Baba In Multan Amil Baba in sindh Amil Bab...
 
Crunching Popcorn Numbers : How Data Science Shapes Film Franchises
Crunching Popcorn Numbers : How Data Science Shapes Film FranchisesCrunching Popcorn Numbers : How Data Science Shapes Film Franchises
Crunching Popcorn Numbers : How Data Science Shapes Film Franchises
 
CREAM - That Conference Austin - January 2024.pptx
CREAM - That Conference Austin - January 2024.pptxCREAM - That Conference Austin - January 2024.pptx
CREAM - That Conference Austin - January 2024.pptx
 
Explainable Predictive and Prescriptive Process Analytics
Explainable Predictive and Prescriptive Process AnalyticsExplainable Predictive and Prescriptive Process Analytics
Explainable Predictive and Prescriptive Process Analytics
 
Data Compliance
Data ComplianceData Compliance
Data Compliance
 
Impact of Data Science on the defence sector
Impact of Data Science on the defence sectorImpact of Data Science on the defence sector
Impact of Data Science on the defence sector
 
Data Visualization.pptx
Data Visualization.pptxData Visualization.pptx
Data Visualization.pptx
 
Applications of data science in social media.pptx
Applications of data science in social media.pptxApplications of data science in social media.pptx
Applications of data science in social media.pptx
 
Wk 5 Comp Sci FINISHED pptx.pptx
Wk 5 Comp Sci FINISHED pptx.pptxWk 5 Comp Sci FINISHED pptx.pptx
Wk 5 Comp Sci FINISHED pptx.pptx
 

Introduction to Data Science

  • 5. Challenges deep-dive Why the Hype Around Data Science? ● The demand for data scientists will soar by 28% by 2023 ● Data scientist roles have grown over 650% since 2012, but currently, 35,000 people in the US have data science skills, while hundreds of companies are hiring for those roles. ● Software engineering is a common starting point for professionals who are in the top five fasting growing jobs today. ● Data Science gives you career flexibility
  • 6. Who are Data Scientist?
  • 8. Challenges deep-dive What is Machine Learning ? Machine learning teaches computers to do what comes naturally to humans and animals: learn from experience. Machine learning algorithms use computational methods to “learn” information directly from data without relying on a predetermined equation as a model. The algorithms adaptively improve their performance as the number of samples available for learning increases.
  • 9. Challenges deep-dive A Definition A computer program is said to learn from experience E with respect to some task T and some performance measure P if its performance on T, as measured by P, improves with experience E. -Tom Mitchell
  • 10. Challenges deep-dive A Small Question Suppose we feed a learning algorithm a lot of historical weather data, and have it learn to predict weather. In this setting, what is T,P,E?
  • 13. Challenges deep-dive Machine learning teaches computers to do what comes naturally to humans and animals: learn from experience. Machine learning algorithms use computational methods to “learn” information directly from data without relying on a predetermined equation as a model. The algorithms adaptively improve their performance as the number of samples available for learning increases. Real World Applications With the rise in big data, machine learning has become particularly important for solving problems in areas like these: ● Image processing and computer vision,for face recognition, motion detection, and object detection ● Computational biology, for tumor detection, drug discovery, and DNA sequencing ● Energy production, for price and load forecasting ● Automotive, aerospace, and manufacturing, for predictive maintenance ● Natural language processing
  • 14. Challenges deep-dive How Machine Learning Works Machine learning uses two types of techniques: ● Supervised learning, which trains a model on known input and output data so that it can predict future outputs ● Unsupervised learning, which finds hidden patterns or intrinsic structures in input data.
  • 16. Challenges deep-dive Supervised Learning The aim of supervised machine learning is to build a model that makes predictions based on evidence in the presence of uncertainty. A supervised learning algorithm takes a known set of input data and known responses to the data (output) and trains a model to generate reasonable predictions for the response to new data
  • 17. Classification - predict discrete responses Classification models classify input data into categories.for example, whether an email is genuine or spam, or whether a tumor is cancerous or benign. Regression - predict continuous responses for example, changes in temperature or fluctuations in power demand. Typical applications include electricity load forecasting and algorithmic trading.
  • 18. Challenges deep-dive Unsupervised Learning Unsupervised learning finds hidden patterns or intrinsic structures in data. It is used to draw inferences from dataset consisting of input data without labeled responses.
  • 19. Clustering is the most common unsupervised learning technique. It is used for exploratory data analysis to find hidden patterns or groupings in data.Applications for clustering include gene sequence analysis,market research, and object recognition
  • 20. Knowledge Test Which of the following would you apply supervised learning to? 1. Given genetic (DNA) data from a person, predict the odds of him/her developing diabetes over the next 10 years. 2. Given a large dataset of medical records from patients suffering from heart disease, try to learn whether there might be different clusters of such patients for which we might tailor separate treatments. 3. Given data on how 1000 medical patients respond to an experimental drug (such as effectiveness of the treatment, side effects, etc.), discover whether there are different categories or "types" of patients in terms of how they respond to the drug, and if so what these categories are. 4. Have a computer examine an audio clip of a piece of music, and classify whether or not there are vocals (i.e., a human voice singing) in that audio clip, or if it is a clip of only musical instruments (and no vocals).
  • 21. Knowledge Test Which of the following questions can be answered using a classification algorithm? 1. How does the exchange rate depend on the GDP? 2. Does a document contain the handwritten letter S? 3. How can I group supermarket products using purchase frequency?
  • 22. Knowledge Test 1. Suppose you are working on weather prediction, and you would like to predict whether or not it will be raining at 5pm tomorrow. You want to use a learning algorithm for this.Would you treat this as a classification or a regression problem? 2. Suppose you are working on stock market prediction. You would like to predict whether or not a certain company will declare bankruptcy within the next 7 days (by training on data of similar companies that had previously been at risk of bankruptcy). Would you treat this as a classification or a regression problem?
  • 23. How Do You Decide Which Algorithm to Use?
  • 24. Choosing the right algorithm can seem overwhelming There are dozens of supervised and unsupervised machine learning algorithms, and each takes a different approach to learning.
  • 25. There is no best method or one size fits all. Finding the right algorithm is partly just trial and error But algorithm selection also depends on the size and type of data you’re working with, the insights you want to get from the data, and how those insights will be used.
  • 26. Two - Class Classification
  • 27. Multi - Class Classification
  • 31. Challenges deep-dive When should we use Machine Learning Consider using machine learning when you have a complex task or problem involving a large amount of data and lots of variables, but no existing formula or equation.
  • 33. Knowledge Test Have a look at the statements below and identify the one which is not a machine learning problem 1. Given a viewer's shopping habits, recommend a product to purchase the next time she visits your website. 2. Given the symptoms of a patient, identify her illness. 3. Predict the USD/EUR exchange rate for February 2023. 4. Compute the mean wage of 10 employees for your company.
  • 34. Knowledge Test Which of the following statements uses a machine learning model? 1. Determine whether an incoming email is spam or not 2. Obtain the name of last year's FIFIA Ballon d’Or champion 3. Automatically tagging your new Facebook photos 4. Select the student with the highest grade on a statistics course
  • 36. Challenges deep-dive There is NO Straight Line With machine learning there’s rarely a straight line from start to finish. You’ll find yourself constantly iterating and trying different ideas and approaches
  • 37. Challenges deep-dive Machine learning teaches computers to do what comes naturally to humans and animals: learn from experience. Machine learning algorithms use computational methods to “learn” information directly from data without relying on a predetermined equation as a model. The algorithms adaptively improve their performance as the number of samples available for learning increases. Machine Learning Challenges ● Data comes in all shapes and sizes ● Preprocessing your data might require specialized knowledge and tools ● It takes time to find the best model to fit the data.
  • 38. Challenges deep-dive Machine learning teaches computers to do what comes naturally to humans and animals: learn from experience. Machine learning algorithms use computational methods to “learn” information directly from data without relying on a predetermined equation as a model. The algorithms adaptively improve their performance as the number of samples available for learning increases. Questions to Ask Before Starting Every machine learning workflow begins with three questions: ● What kind of data are you working with? ● What insights do you want to get from it? ● How and where will those insights be applied? Your answers to these questions help you decide whether to use supervised or unsupervised learning.
  • 39. Challenges deep-dive Machine learning teaches computers to do what comes naturally to humans and animals: learn from experience. Machine learning algorithms use computational methods to “learn” information directly from data without relying on a predetermined equation as a model. The algorithms adaptively improve their performance as the number of samples available for learning increases. Data Science - Five Questions There are only five questions that data science answers: ● Is this A or B? ● Is this weird? ● How much – or – How many? ● How is this organized? ● What should I do next?
  • 40. Knowledge Test Which of the following questions can be answered using a classification algorithm? 1. How does the exchange rate depend on the GDP? 2. Does a document contain the handwritten letter S? 3. How can I group supermarket products using purchase frequency?
  • 42. Workflow at a Glance
  • 43. Challenges deep-dive Machine learning teaches computers to do what comes naturally to humans and animals: learn from experience. Machine learning algorithms use computational methods to “learn” information directly from data without relying on a predetermined equation as a model. The algorithms adaptively improve their performance as the number of samples available for learning increases. Step 1 - Load the Data We store the labeled data sets in a text file. A flat file format such as text or CSV is easy to work with and makes it straightforward to import data. Machine learning algorithms aren’t smart enough to tell the difference between noise and valuable information. Before using the data for training, we need to make sure it’s clean and complete
  • 44. Challenges deep-dive Machine learning teaches computers to do what comes naturally to humans and animals: learn from experience. Machine learning algorithms use computational methods to “learn” information directly from data without relying on a predetermined equation as a model. The algorithms adaptively improve their performance as the number of samples available for learning increases. Step 2 - Preprocess the Data To preprocess the data we do the following: ● Look for outliers–data points that lie outside the rest of the data ● Check for missing values ● Divide the data into two sets ○ We save part of the data for testing (the test set) and use the rest (the training set) to build models. This is referred to as holdout, and is a useful cross-validation technique
  • 45. Challenges deep-dive Machine learning teaches computers to do what comes naturally to humans and animals: learn from experience. Machine learning algorithms use computational methods to “learn” information directly from data without relying on a predetermined equation as a model. The algorithms adaptively improve their performance as the number of samples available for learning increases. Step 3 - Derive Features Deriving features (also known as feature engineering or feature extraction) turns raw data into information that a machine learning algorithm can use. Use feature selection to: • Improve the accuracy of a machine learning algorithm • Boost model performance for high-dimensional data sets • Improve model interpretability • Prevent overfitting
  • 46. Challenges deep-dive Machine learning teaches computers to do what comes naturally to humans and animals: learn from experience. Machine learning algorithms use computational methods to “learn” information directly from data without relying on a predetermined equation as a model. The algorithms adaptively improve their performance as the number of samples available for learning increases. Step 4 - Build and Train Model ● The predefined algorithms and the test data are used for building the model. ● The training data is used to train and evaluate the model
  • 47. Challenges deep-dive Machine learning teaches computers to do what comes naturally to humans and animals: learn from experience. Machine learning algorithms use computational methods to “learn” information directly from data without relying on a predetermined equation as a model. The algorithms adaptively improve their performance as the number of samples available for learning increases. Step 5 - Improve the Model Improving a model can take two different directions: make the model simpler or add complexity. Simplify - reduce the number of features Add Complexity - make it more fine-tuned
  • 48. Simplify Popular feature reduction techniques include: ● Correlation matrix – shows the relationship between variables, so that variables (or features) that are not highly correlated can be removed. ● Principal component analysis (PCA) - eliminates redundancy by finding a combination of features that captures key distinctions between the original features and brings out strong patterns in the dataset. ● Sequential feature reduction – reduces features iteratively on the model until there is no improvement in performance
  • 49. Add Complexity ● Use model combination – merge multiple simpler models into a larger model that is better able to represent the trends in the data than any of the simpler models could on their own. ● Add more data sources
  • 50. TO DO ● Getting Started ● Familiarize with Maths and Algorithms ● Select the Infrastructure or Tool ● Create your profile and participate in competition
  • 51. Christy Abraham Joy Email - christyabrahamjoy@gmail.com Mob - +91 94000 95273 Feel Free to Contact!