SlideShare a Scribd company logo
Introduction to Data Science
Overview
1
2
Data Source
Evolution of Technology has generated a lot of data, where most of these data are
not structured.
Internet Of Thing (IOT) produces huge amount of data that is measured in
Zettabyte (a unit of information equal to one sextillion (1021) or, strictly, 270 bytes.)
3
Data Source
4
Data Source
Therefore, we can not rely on the traditional data processing systems
What is Data Science?
Data Science is the area of study which involves extracting insights from vast
amounts of data by the use of various scientific methods, algorithms, and
processes. It helps you to discover hidden patterns from the raw data.
The term Data Science has emerged because of the evolution of mathematical
statistics, data analysis, and big data.
Data Science is an interdisciplinary field that allows you to extract knowledge from
structured or unstructured data. Data science enables you to translate a business
problem into a research project and then translate it back into a practical solution.
5
6
Why Data Science?
Here, are significant advantages of using Data Analytics Technology:
1. Data is the oil for today's world. With the right tools, technologies, algorithms,
we can use data and convert it into a distinctive business advantage.
2. Data Science can help you to detect fraud using advanced machine learning
algorithms.
3. It helps you to prevent any significant monetary losses.
4. Allows to build intelligence ability in machines.
5. You can perform sentiment analysis to gauge customer brand loyalty.
6. It enables you to take better and faster decisions.
7. Helps you to recommend the right product to the right customer to enhance
your business.
7
Data Science Components
8
1. Statistics:
Statistics is the most critical unit in Data science. It is the method or science of
collecting and analyzing numerical data in large quantities to get useful insights.
2. Visualization:
Visualization technique helps you to access huge amounts of data in easy to
understand and digestible visuals.
3. domain expert
is a person who is an authority in a particular area or topic. The term domain
expert is frequently used in expert systems software development, and there
the term always refers to the domain other than the software domain. A
domain expert is a person with special knowledge or skills in a particular area of
endeavour (e.g. an accountant is an expert in the domain of accountancy). The
development of accounting software requires knowledge in two different
domains: accounting and software.
Data Science Components
9
6. Machine Learning:
Machine Learning explores the building and study of algorithms which learn to
make predictions about unforeseen/future data.
7. Deep Learning:
Deep Learning method is new machine learning research where the algorithm
selects the analysis model to follow.
4. Data Engineer
A data engineer is a worker whose primary job responsibilities involve preparing
data for analytical or operational uses. The specific tasks handled by data
engineers can vary from organization to organization but typically include
building data pipelines to pull together information from different source
systems; integrating, consolidating and cleansing data; and structuring it for use
in individual analytics applications.
Data Science Components
5. Advanced Computing
The full complement of capabilities that support compute- and data-intensive
research across the entire science and engineering spectrum, which are too
expensive to be purchased by an individual research group or department and
perhaps too expensive even for an individual research institution.
10
11
Most prominent Data Scientist job titles are:
Data Science Jobs Roles
12
Data Scientist:
Role:
A Data Scientist is a professional who manages enormous amounts of data to come
up with compelling business visions by using various tools, techniques,
methodologies, algorithms, etc.
Languages:
R, SAS, Python, SQL, Hive, Matlab, Pig, Spark
13
Data Scientist Skills List
14
Data Engineer:
Role:
The role of data engineer is of working with large amounts of data. He develops,
constructs, tests, and maintains architectures like large scale processing system and
databases.
Languages:
SQL, Hive, R, SAS, Matlab, Python, Java, Ruby, C + +, and Perl
Data Science Jobs Roles
15
Data Analyst:
Role:
A data analyst is responsible for mining vast amounts of data. He or she will look for
relationships, patterns, trends in data. Later he or she will deliver compelling
reporting and visualization for analyzing the data to take the most viable business
decisions.
Languages:
R, Python, HTML, JS, C, C+ + , SQL
Statistician:
Role:
The statistician collects, analyses, understand qualitative and quantitative data by
using statistical theories and methods.
Languages:
SQL, R, Matlab, Tableau, Python, Perl, Spark, and Hive
Data Science Jobs Roles
16
Data Administrator:
Role:
Data admin should ensure that the database is accessible to all relevant users. He
also makes sure that it is performing correctly and is being kept safe from hacking.
Languages:
Ruby on Rails, SQL, Java, C#, and Python
Business Analyst:
Role:
This professional need to improves business processes. He/she as an intermediary
between the business executive team and IT department.
Languages:
SQL, Tableau, Power BI and, Python
Data Science Jobs Roles
17
Tools for Data Science
18
Internet Search:
Google search use Data science technology to search a specific result within a
fraction of a second
Recommendation Systems:
To create a recommendation system. Example, "suggested friends" on Facebook or
suggested videos" on YouTube, everything is done with the help of Data Science.
Image & Speech Recognition:
Speech recognizes system like Siri, Google assistant, Alexa runs on the technique
of Data science. Moreover, Facebook recognizes your friend when you upload a
photo with them, with the help of Data Science.
Gaming world:
EA Sports, Sony, Nintendo, are using Data science technology. This enhances your
gaming experience. Games are now developed using Machine Learning technique.
It can update itself when you move to higher levels.
Online Price Comparison:
PriceRunner, Junglee, Shopzilla work on the Data science mechanism. Here, data is
fetched from the relevant websites using APIs.
Applications of Data science
19
Data Science Life Cycle
20
1. Business Requirements
21
2. Data Acquisition
22
3. Data Processing
23
4. Data Exploration
24
5. Modelling
25
6. Deployment
26
1. High variety of information & data is required for accurate analysis
2. Not adequate data science talent pool available
3. Management does not provide financial support for a data science team
4. Unavailability of/difficult access to data
5. Data Science results not effectively used by business decision makers
6. Explaining data science to others is difficult
7. Privacy issues
8. Lack of significant domain expert
9. If an organization is very small, they can't have a Data Science team
Challenges of Data science Technology
27
Summary
Data Science is the area of study which involves extracting insights from vast amounts
of data by the use of various scientific methods, algorithms, and processes.
Statistics, Visualization, Deep Learning, Machine Learning, are important Data Science
concepts.
Data Science Process goes through Discovery, Data Preparation, Model Planning,
Model Building, Operationalize, Communicate Results.
Important Data Scientist job roles are: 1) Data Scientist 2) Data Engineer 3) Data
Analyst 4) Statistician 5) Data Architect 6) Data Admin 7) Business Analyst 8)
Data/Analytics Manager
R, SQL, Python, SaS, are essential Data science tools
Important applications of Data science are 1) Internet Search 2) Recommendation
Systems 3) Image & Speech Recognition 4) Gaming world 5) Online Price Comparison.
High variety of information & data is the biggest challenge of Data Science technology.

More Related Content

Similar to Ch1IntroductiontoDataScience.pptx

Advanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationAdvanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data Virtualization
Denodo
 
1.-DE-LECTURE-1-INTRO-TO-DATA-ENGG.pptx
1.-DE-LECTURE-1-INTRO-TO-DATA-ENGG.pptx1.-DE-LECTURE-1-INTRO-TO-DATA-ENGG.pptx
1.-DE-LECTURE-1-INTRO-TO-DATA-ENGG.pptx
arpit206900
 
Career in Data Science
Career in Data ScienceCareer in Data Science
Career in Data Science
ActonRoy
 
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
Denodo
 
DATASCIENCE vs BUSINESS INTELLIGENCE.pptx
DATASCIENCE vs BUSINESS INTELLIGENCE.pptxDATASCIENCE vs BUSINESS INTELLIGENCE.pptx
DATASCIENCE vs BUSINESS INTELLIGENCE.pptx
OTA13NayabNakhwa
 
Untitled document.pdf
Untitled document.pdfUntitled document.pdf
Untitled document.pdf
MuhammadTahiriqbal13
 
Data science
Data science Data science
Data science
SouravSadhukhan6
 
Advanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationAdvanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data Virtualization
Denodo
 
Data Science: Unlocking Insights and Transforming Industries
Data Science: Unlocking Insights and Transforming IndustriesData Science: Unlocking Insights and Transforming Industries
Data Science: Unlocking Insights and Transforming Industries
Uncodemy
 
Big data (word file)
Big data  (word file)Big data  (word file)
Big data (word file)
Shahbaz Anjam
 
Paper presentation
Paper presentationPaper presentation
Paper presentation
K.K. Tripathi
 
What is Data Science? |Role of Data Science in Big Data, Hadoop & Machine Lea...
What is Data Science? |Role of Data Science in Big Data, Hadoop & Machine Lea...What is Data Science? |Role of Data Science in Big Data, Hadoop & Machine Lea...
What is Data Science? |Role of Data Science in Big Data, Hadoop & Machine Lea...
vinayiqbusiness
 
1 UNIT-DSP.pptx
1 UNIT-DSP.pptx1 UNIT-DSP.pptx
1 UNIT-DSP.pptx
PothyeswariPothyes
 
The Simple 5-Step Process for Creating a Winning Data Pipeline.pdf
The Simple 5-Step Process for Creating a Winning Data Pipeline.pdfThe Simple 5-Step Process for Creating a Winning Data Pipeline.pdf
The Simple 5-Step Process for Creating a Winning Data Pipeline.pdf
Data Science Council of America
 
Colloquium(7)_DataScience:ShivShaktiGhosh&MohitGarg
Colloquium(7)_DataScience:ShivShaktiGhosh&MohitGargColloquium(7)_DataScience:ShivShaktiGhosh&MohitGarg
Colloquium(7)_DataScience:ShivShaktiGhosh&MohitGarg
Shiv Shakti Ghosh
 
Data science vs. Data scientist by Jothi Periasamy
Data science vs. Data scientist by Jothi PeriasamyData science vs. Data scientist by Jothi Periasamy
Data science vs. Data scientist by Jothi Periasamy
Peter Kua
 
Welcome to Data Science
Welcome to Data ScienceWelcome to Data Science
Welcome to Data Science
NyraSehgal
 
Data Science Introduction: Concepts, lifecycle, applications.pptx
Data Science Introduction: Concepts, lifecycle, applications.pptxData Science Introduction: Concepts, lifecycle, applications.pptx
Data Science Introduction: Concepts, lifecycle, applications.pptx
sumitkumar600840
 
Data Science Unit1 AMET.pdf
Data Science Unit1 AMET.pdfData Science Unit1 AMET.pdf
Data Science Unit1 AMET.pdf
mustaq4
 
Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...
Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...
Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...
Denodo
 

Similar to Ch1IntroductiontoDataScience.pptx (20)

Advanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationAdvanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data Virtualization
 
1.-DE-LECTURE-1-INTRO-TO-DATA-ENGG.pptx
1.-DE-LECTURE-1-INTRO-TO-DATA-ENGG.pptx1.-DE-LECTURE-1-INTRO-TO-DATA-ENGG.pptx
1.-DE-LECTURE-1-INTRO-TO-DATA-ENGG.pptx
 
Career in Data Science
Career in Data ScienceCareer in Data Science
Career in Data Science
 
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
 
DATASCIENCE vs BUSINESS INTELLIGENCE.pptx
DATASCIENCE vs BUSINESS INTELLIGENCE.pptxDATASCIENCE vs BUSINESS INTELLIGENCE.pptx
DATASCIENCE vs BUSINESS INTELLIGENCE.pptx
 
Untitled document.pdf
Untitled document.pdfUntitled document.pdf
Untitled document.pdf
 
Data science
Data science Data science
Data science
 
Advanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationAdvanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data Virtualization
 
Data Science: Unlocking Insights and Transforming Industries
Data Science: Unlocking Insights and Transforming IndustriesData Science: Unlocking Insights and Transforming Industries
Data Science: Unlocking Insights and Transforming Industries
 
Big data (word file)
Big data  (word file)Big data  (word file)
Big data (word file)
 
Paper presentation
Paper presentationPaper presentation
Paper presentation
 
What is Data Science? |Role of Data Science in Big Data, Hadoop & Machine Lea...
What is Data Science? |Role of Data Science in Big Data, Hadoop & Machine Lea...What is Data Science? |Role of Data Science in Big Data, Hadoop & Machine Lea...
What is Data Science? |Role of Data Science in Big Data, Hadoop & Machine Lea...
 
1 UNIT-DSP.pptx
1 UNIT-DSP.pptx1 UNIT-DSP.pptx
1 UNIT-DSP.pptx
 
The Simple 5-Step Process for Creating a Winning Data Pipeline.pdf
The Simple 5-Step Process for Creating a Winning Data Pipeline.pdfThe Simple 5-Step Process for Creating a Winning Data Pipeline.pdf
The Simple 5-Step Process for Creating a Winning Data Pipeline.pdf
 
Colloquium(7)_DataScience:ShivShaktiGhosh&MohitGarg
Colloquium(7)_DataScience:ShivShaktiGhosh&MohitGargColloquium(7)_DataScience:ShivShaktiGhosh&MohitGarg
Colloquium(7)_DataScience:ShivShaktiGhosh&MohitGarg
 
Data science vs. Data scientist by Jothi Periasamy
Data science vs. Data scientist by Jothi PeriasamyData science vs. Data scientist by Jothi Periasamy
Data science vs. Data scientist by Jothi Periasamy
 
Welcome to Data Science
Welcome to Data ScienceWelcome to Data Science
Welcome to Data Science
 
Data Science Introduction: Concepts, lifecycle, applications.pptx
Data Science Introduction: Concepts, lifecycle, applications.pptxData Science Introduction: Concepts, lifecycle, applications.pptx
Data Science Introduction: Concepts, lifecycle, applications.pptx
 
Data Science Unit1 AMET.pdf
Data Science Unit1 AMET.pdfData Science Unit1 AMET.pdf
Data Science Unit1 AMET.pdf
 
Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...
Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...
Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...
 

Recently uploaded

Seamlessly Pay Online, Pay In Stores or Send Money
Seamlessly Pay Online, Pay In Stores or Send MoneySeamlessly Pay Online, Pay In Stores or Send Money
Seamlessly Pay Online, Pay In Stores or Send Money
gargtinna79
 
Machine learning _new.pptx for a presentation
Machine learning _new.pptx for a presentationMachine learning _new.pptx for a presentation
Machine learning _new.pptx for a presentation
RahulS66654
 
Beautiful Girls Call Pune 000XX00000 Provide Best And Top Girl Service And No...
Beautiful Girls Call Pune 000XX00000 Provide Best And Top Girl Service And No...Beautiful Girls Call Pune 000XX00000 Provide Best And Top Girl Service And No...
Beautiful Girls Call Pune 000XX00000 Provide Best And Top Girl Service And No...
birajmohan012
 
bai-tap-tieng-anh-lop-12-unit-4-the-mass-media (1).doc
bai-tap-tieng-anh-lop-12-unit-4-the-mass-media (1).docbai-tap-tieng-anh-lop-12-unit-4-the-mass-media (1).doc
bai-tap-tieng-anh-lop-12-unit-4-the-mass-media (1).doc
PhngThLmHnh
 
University of Wollongong degree offer diploma Transcript
University of Wollongong  degree offer diploma TranscriptUniversity of Wollongong  degree offer diploma Transcript
University of Wollongong degree offer diploma Transcript
taqyea
 
Maruti Wagon R on road price in Faridabad - CarDekho
Maruti Wagon R on road price in Faridabad - CarDekhoMaruti Wagon R on road price in Faridabad - CarDekho
Maruti Wagon R on road price in Faridabad - CarDekho
kamli sharma#S10
 
University of the Sunshine Coast degree offer diploma Transcript
University of the Sunshine Coast  degree offer diploma TranscriptUniversity of the Sunshine Coast  degree offer diploma Transcript
University of the Sunshine Coast degree offer diploma Transcript
taqyea
 
the unexpected potential of Dijkstra's Algorithm
the unexpected potential of Dijkstra's Algorithmthe unexpected potential of Dijkstra's Algorithm
the unexpected potential of Dijkstra's Algorithm
huseindihon
 
Australian Catholic University degree offer diploma Transcript
Australian Catholic University  degree offer diploma TranscriptAustralian Catholic University  degree offer diploma Transcript
Australian Catholic University degree offer diploma Transcript
taqyea
 
Universidad de Alcalá degree offer diploma Transcript
Universidad de Alcalá  degree offer diploma TranscriptUniversidad de Alcalá  degree offer diploma Transcript
Universidad de Alcalá degree offer diploma Transcript
taqyea
 
Beautiful Girls Call 9711199171 9711199171 Provide Best And Top Girl Service ...
Beautiful Girls Call 9711199171 9711199171 Provide Best And Top Girl Service ...Beautiful Girls Call 9711199171 9711199171 Provide Best And Top Girl Service ...
Beautiful Girls Call 9711199171 9711199171 Provide Best And Top Girl Service ...
janvikumar4133
 
Oracle Database Desupported Features on 23ai (Part A)
Oracle Database Desupported Features on 23ai (Part A)Oracle Database Desupported Features on 23ai (Part A)
Oracle Database Desupported Features on 23ai (Part A)
Alireza Kamrani
 
Female Service Girls Call Navi Mumbai 9930245274 Provide Best And Top Girl Se...
Female Service Girls Call Navi Mumbai 9930245274 Provide Best And Top Girl Se...Female Service Girls Call Navi Mumbai 9930245274 Provide Best And Top Girl Se...
Female Service Girls Call Navi Mumbai 9930245274 Provide Best And Top Girl Se...
dizzycaye
 
Artificial Intelligence (AI) Technology Project Proposal _ by Slidesgo.pptx
Artificial Intelligence (AI) Technology Project Proposal _ by Slidesgo.pptxArtificial Intelligence (AI) Technology Project Proposal _ by Slidesgo.pptx
Artificial Intelligence (AI) Technology Project Proposal _ by Slidesgo.pptx
vaishnavisharma877623
 
the potential of the development of the Ford–Fulkerson algorithm to solve the...
the potential of the development of the Ford–Fulkerson algorithm to solve the...the potential of the development of the Ford–Fulkerson algorithm to solve the...
the potential of the development of the Ford–Fulkerson algorithm to solve the...
huseindihon
 
PHENOMENOLOGY and Interpretive phenomenological analysis
PHENOMENOLOGY and Interpretive phenomenological analysisPHENOMENOLOGY and Interpretive phenomenological analysis
PHENOMENOLOGY and Interpretive phenomenological analysis
CharmoliApumKhrime
 
AIRLINE_SATISFACTION_Data Science Solution on Azure
AIRLINE_SATISFACTION_Data Science Solution on AzureAIRLINE_SATISFACTION_Data Science Solution on Azure
AIRLINE_SATISFACTION_Data Science Solution on Azure
SanelaNikodinoska1
 
How We Added Replication to QuestDB - JonTheBeach
How We Added Replication to QuestDB - JonTheBeachHow We Added Replication to QuestDB - JonTheBeach
How We Added Replication to QuestDB - JonTheBeach
javier ramirez
 
ch8_multiplexing cs553 st07 slide share ss
ch8_multiplexing cs553 st07 slide share ssch8_multiplexing cs553 st07 slide share ss
ch8_multiplexing cs553 st07 slide share ss
MinThetLwin1
 
Welcome back to Instagram. Sign in to check out what your
Welcome back to Instagram. Sign in to check out what yourWelcome back to Instagram. Sign in to check out what your
Welcome back to Instagram. Sign in to check out what your
Virni Arrora
 

Recently uploaded (20)

Seamlessly Pay Online, Pay In Stores or Send Money
Seamlessly Pay Online, Pay In Stores or Send MoneySeamlessly Pay Online, Pay In Stores or Send Money
Seamlessly Pay Online, Pay In Stores or Send Money
 
Machine learning _new.pptx for a presentation
Machine learning _new.pptx for a presentationMachine learning _new.pptx for a presentation
Machine learning _new.pptx for a presentation
 
Beautiful Girls Call Pune 000XX00000 Provide Best And Top Girl Service And No...
Beautiful Girls Call Pune 000XX00000 Provide Best And Top Girl Service And No...Beautiful Girls Call Pune 000XX00000 Provide Best And Top Girl Service And No...
Beautiful Girls Call Pune 000XX00000 Provide Best And Top Girl Service And No...
 
bai-tap-tieng-anh-lop-12-unit-4-the-mass-media (1).doc
bai-tap-tieng-anh-lop-12-unit-4-the-mass-media (1).docbai-tap-tieng-anh-lop-12-unit-4-the-mass-media (1).doc
bai-tap-tieng-anh-lop-12-unit-4-the-mass-media (1).doc
 
University of Wollongong degree offer diploma Transcript
University of Wollongong  degree offer diploma TranscriptUniversity of Wollongong  degree offer diploma Transcript
University of Wollongong degree offer diploma Transcript
 
Maruti Wagon R on road price in Faridabad - CarDekho
Maruti Wagon R on road price in Faridabad - CarDekhoMaruti Wagon R on road price in Faridabad - CarDekho
Maruti Wagon R on road price in Faridabad - CarDekho
 
University of the Sunshine Coast degree offer diploma Transcript
University of the Sunshine Coast  degree offer diploma TranscriptUniversity of the Sunshine Coast  degree offer diploma Transcript
University of the Sunshine Coast degree offer diploma Transcript
 
the unexpected potential of Dijkstra's Algorithm
the unexpected potential of Dijkstra's Algorithmthe unexpected potential of Dijkstra's Algorithm
the unexpected potential of Dijkstra's Algorithm
 
Australian Catholic University degree offer diploma Transcript
Australian Catholic University  degree offer diploma TranscriptAustralian Catholic University  degree offer diploma Transcript
Australian Catholic University degree offer diploma Transcript
 
Universidad de Alcalá degree offer diploma Transcript
Universidad de Alcalá  degree offer diploma TranscriptUniversidad de Alcalá  degree offer diploma Transcript
Universidad de Alcalá degree offer diploma Transcript
 
Beautiful Girls Call 9711199171 9711199171 Provide Best And Top Girl Service ...
Beautiful Girls Call 9711199171 9711199171 Provide Best And Top Girl Service ...Beautiful Girls Call 9711199171 9711199171 Provide Best And Top Girl Service ...
Beautiful Girls Call 9711199171 9711199171 Provide Best And Top Girl Service ...
 
Oracle Database Desupported Features on 23ai (Part A)
Oracle Database Desupported Features on 23ai (Part A)Oracle Database Desupported Features on 23ai (Part A)
Oracle Database Desupported Features on 23ai (Part A)
 
Female Service Girls Call Navi Mumbai 9930245274 Provide Best And Top Girl Se...
Female Service Girls Call Navi Mumbai 9930245274 Provide Best And Top Girl Se...Female Service Girls Call Navi Mumbai 9930245274 Provide Best And Top Girl Se...
Female Service Girls Call Navi Mumbai 9930245274 Provide Best And Top Girl Se...
 
Artificial Intelligence (AI) Technology Project Proposal _ by Slidesgo.pptx
Artificial Intelligence (AI) Technology Project Proposal _ by Slidesgo.pptxArtificial Intelligence (AI) Technology Project Proposal _ by Slidesgo.pptx
Artificial Intelligence (AI) Technology Project Proposal _ by Slidesgo.pptx
 
the potential of the development of the Ford–Fulkerson algorithm to solve the...
the potential of the development of the Ford–Fulkerson algorithm to solve the...the potential of the development of the Ford–Fulkerson algorithm to solve the...
the potential of the development of the Ford–Fulkerson algorithm to solve the...
 
PHENOMENOLOGY and Interpretive phenomenological analysis
PHENOMENOLOGY and Interpretive phenomenological analysisPHENOMENOLOGY and Interpretive phenomenological analysis
PHENOMENOLOGY and Interpretive phenomenological analysis
 
AIRLINE_SATISFACTION_Data Science Solution on Azure
AIRLINE_SATISFACTION_Data Science Solution on AzureAIRLINE_SATISFACTION_Data Science Solution on Azure
AIRLINE_SATISFACTION_Data Science Solution on Azure
 
How We Added Replication to QuestDB - JonTheBeach
How We Added Replication to QuestDB - JonTheBeachHow We Added Replication to QuestDB - JonTheBeach
How We Added Replication to QuestDB - JonTheBeach
 
ch8_multiplexing cs553 st07 slide share ss
ch8_multiplexing cs553 st07 slide share ssch8_multiplexing cs553 st07 slide share ss
ch8_multiplexing cs553 st07 slide share ss
 
Welcome back to Instagram. Sign in to check out what your
Welcome back to Instagram. Sign in to check out what yourWelcome back to Instagram. Sign in to check out what your
Welcome back to Instagram. Sign in to check out what your
 

Ch1IntroductiontoDataScience.pptx

  • 1. Introduction to Data Science Overview 1
  • 2. 2 Data Source Evolution of Technology has generated a lot of data, where most of these data are not structured. Internet Of Thing (IOT) produces huge amount of data that is measured in Zettabyte (a unit of information equal to one sextillion (1021) or, strictly, 270 bytes.)
  • 4. 4 Data Source Therefore, we can not rely on the traditional data processing systems
  • 5. What is Data Science? Data Science is the area of study which involves extracting insights from vast amounts of data by the use of various scientific methods, algorithms, and processes. It helps you to discover hidden patterns from the raw data. The term Data Science has emerged because of the evolution of mathematical statistics, data analysis, and big data. Data Science is an interdisciplinary field that allows you to extract knowledge from structured or unstructured data. Data science enables you to translate a business problem into a research project and then translate it back into a practical solution. 5
  • 6. 6
  • 7. Why Data Science? Here, are significant advantages of using Data Analytics Technology: 1. Data is the oil for today's world. With the right tools, technologies, algorithms, we can use data and convert it into a distinctive business advantage. 2. Data Science can help you to detect fraud using advanced machine learning algorithms. 3. It helps you to prevent any significant monetary losses. 4. Allows to build intelligence ability in machines. 5. You can perform sentiment analysis to gauge customer brand loyalty. 6. It enables you to take better and faster decisions. 7. Helps you to recommend the right product to the right customer to enhance your business. 7
  • 9. 1. Statistics: Statistics is the most critical unit in Data science. It is the method or science of collecting and analyzing numerical data in large quantities to get useful insights. 2. Visualization: Visualization technique helps you to access huge amounts of data in easy to understand and digestible visuals. 3. domain expert is a person who is an authority in a particular area or topic. The term domain expert is frequently used in expert systems software development, and there the term always refers to the domain other than the software domain. A domain expert is a person with special knowledge or skills in a particular area of endeavour (e.g. an accountant is an expert in the domain of accountancy). The development of accounting software requires knowledge in two different domains: accounting and software. Data Science Components 9
  • 10. 6. Machine Learning: Machine Learning explores the building and study of algorithms which learn to make predictions about unforeseen/future data. 7. Deep Learning: Deep Learning method is new machine learning research where the algorithm selects the analysis model to follow. 4. Data Engineer A data engineer is a worker whose primary job responsibilities involve preparing data for analytical or operational uses. The specific tasks handled by data engineers can vary from organization to organization but typically include building data pipelines to pull together information from different source systems; integrating, consolidating and cleansing data; and structuring it for use in individual analytics applications. Data Science Components 5. Advanced Computing The full complement of capabilities that support compute- and data-intensive research across the entire science and engineering spectrum, which are too expensive to be purchased by an individual research group or department and perhaps too expensive even for an individual research institution. 10
  • 11. 11 Most prominent Data Scientist job titles are: Data Science Jobs Roles
  • 12. 12 Data Scientist: Role: A Data Scientist is a professional who manages enormous amounts of data to come up with compelling business visions by using various tools, techniques, methodologies, algorithms, etc. Languages: R, SAS, Python, SQL, Hive, Matlab, Pig, Spark
  • 14. 14 Data Engineer: Role: The role of data engineer is of working with large amounts of data. He develops, constructs, tests, and maintains architectures like large scale processing system and databases. Languages: SQL, Hive, R, SAS, Matlab, Python, Java, Ruby, C + +, and Perl Data Science Jobs Roles
  • 15. 15 Data Analyst: Role: A data analyst is responsible for mining vast amounts of data. He or she will look for relationships, patterns, trends in data. Later he or she will deliver compelling reporting and visualization for analyzing the data to take the most viable business decisions. Languages: R, Python, HTML, JS, C, C+ + , SQL Statistician: Role: The statistician collects, analyses, understand qualitative and quantitative data by using statistical theories and methods. Languages: SQL, R, Matlab, Tableau, Python, Perl, Spark, and Hive Data Science Jobs Roles
  • 16. 16 Data Administrator: Role: Data admin should ensure that the database is accessible to all relevant users. He also makes sure that it is performing correctly and is being kept safe from hacking. Languages: Ruby on Rails, SQL, Java, C#, and Python Business Analyst: Role: This professional need to improves business processes. He/she as an intermediary between the business executive team and IT department. Languages: SQL, Tableau, Power BI and, Python Data Science Jobs Roles
  • 17. 17 Tools for Data Science
  • 18. 18 Internet Search: Google search use Data science technology to search a specific result within a fraction of a second Recommendation Systems: To create a recommendation system. Example, "suggested friends" on Facebook or suggested videos" on YouTube, everything is done with the help of Data Science. Image & Speech Recognition: Speech recognizes system like Siri, Google assistant, Alexa runs on the technique of Data science. Moreover, Facebook recognizes your friend when you upload a photo with them, with the help of Data Science. Gaming world: EA Sports, Sony, Nintendo, are using Data science technology. This enhances your gaming experience. Games are now developed using Machine Learning technique. It can update itself when you move to higher levels. Online Price Comparison: PriceRunner, Junglee, Shopzilla work on the Data science mechanism. Here, data is fetched from the relevant websites using APIs. Applications of Data science
  • 26. 26 1. High variety of information & data is required for accurate analysis 2. Not adequate data science talent pool available 3. Management does not provide financial support for a data science team 4. Unavailability of/difficult access to data 5. Data Science results not effectively used by business decision makers 6. Explaining data science to others is difficult 7. Privacy issues 8. Lack of significant domain expert 9. If an organization is very small, they can't have a Data Science team Challenges of Data science Technology
  • 27. 27 Summary Data Science is the area of study which involves extracting insights from vast amounts of data by the use of various scientific methods, algorithms, and processes. Statistics, Visualization, Deep Learning, Machine Learning, are important Data Science concepts. Data Science Process goes through Discovery, Data Preparation, Model Planning, Model Building, Operationalize, Communicate Results. Important Data Scientist job roles are: 1) Data Scientist 2) Data Engineer 3) Data Analyst 4) Statistician 5) Data Architect 6) Data Admin 7) Business Analyst 8) Data/Analytics Manager R, SQL, Python, SaS, are essential Data science tools Important applications of Data science are 1) Internet Search 2) Recommendation Systems 3) Image & Speech Recognition 4) Gaming world 5) Online Price Comparison. High variety of information & data is the biggest challenge of Data Science technology.