SlideShare a Scribd company logo
Data Science
Muhammad Suleman Memon
Assistant Professor
Department of Information Technology,
Dadu Campus,
University of Sindh
What is
Data
Science?
Data science is the domain of
study that deals with vast
volumes.
Find unseen patterns, derive
meaningful information, and
make business decisions.
Data science uses complex
machine learning algorithms to
build predictive models.
Data Science
Applications
Sources of the Data
Data Science Lifecycle
Prerequisites
for Data
Science
1. Machine Learning
2. Modeling
3. Statistics
4. Programming
5. Databases
Who Oversees the Data Science Process?
• Business Managers
• To collaborate with the data science team to characterize the problem and
establish an analytical method.
• IT Managers
• Developing the infrastructure and architecture to enable data science
activities.
• Data Science Managers
• Supervise the working procedures of all data science team members.
• They also manage and keep track of the day-to-day activities of the three data
science teams.
What is a
Data
Scientist?
professionals who have the technical ability
to handle complicated issues as well as the
desire to investigate what questions need to
be answered.
They're a mix of mathematicians, computer
scientists, and trend forecasters.
They're also in high demand and well-paid
because they work in both the business and
IT sectors.
On a daily
basis, a data
scientist
may do the
following
tasks:
Discover patterns and
trends in datasets to get
insights.
Create forecasting
algorithms and data
models.
Improve the quality of
data or product offerings
by utilising machine
learning techniques.
Distribute suggestions to
other teams and top
management.
In data analysis, use data
tools such as R, SAS,
Python, or SQL.
Top the field of data
science innovations.
What Does a
Data Scientist
Do?
Determine the
problem.
Determines the
correct set of
variables and
datasets.
Gather structured
and unstructured
data from many
sources.
Convert raw data
into a suitable
format.
Apply ML
algorithms.
Interpret the data to
find opportunities
and solutions.
Prepare the
results and
insights to share
with stake
holders.
Why Become
a Data
Scientist?
• According to Glassdoor and Forbes,
demand for data scientists will
increase by 28 percent by 2026,
which speaks of the profession’s
durability and longevity, so if you
want a secure career, data science
offers you that chance.
Use of Data
Science
1. Data science may detect patterns in seemingly
unstructured or unconnected data, allowing
conclusions and predictions to be made.
2. Tech businesses that acquire user data can
utilize strategies to transform that data into
valuable or profitable information.
3. Data Science has also made inroads into the
transportation industry, such as with driverless
cars.
4. Data Science applications provide a better level
of therapeutic customization through genetics
and genomics research.
Data Scientist
Job role: Determine what the
problem is, what questions
need answers, and where to
find the data. Also, they mine,
clean, and present the relevant
data.
Skills needed: Programming
skills (SAS, R, Python),
storytelling and data
visualization, statistical and
mathematical skills, knowledge
of Hadoop, SQL, and Machine
Learning.
Data Analyst
Job role: Analysts bridge the gap
between the data scientists and the
business analysts, organizing and
analyzing data to answer the
questions the organization poses.
They take the technical analyses and
turn them into qualitative action
items.
Skills needed: Statistical and
mathematical skills, programming
skills (SAS, R, Python), plus
experience in data wrangling and
data visualization.
Data Engineer
Job role: Data engineers focus on
developing, deploying, managing,
and optimizing the organization’s
data infrastructure and data
pipelines. Engineers support data
scientists by helping to transfer
and transform data for queries.
Skills needed: NoSQL databases
(e.g., MongoDB, Cassandra DB),
programming languages such as
Java and Scala, and frameworks
(Apache Hadoop).
Data
Science
Tools
Data Analysis: SAS, Jupyter, R
Studio, MATLAB, Excel, RapidMiner
Data Warehousing: Informatica/
Talend, AWS Redshift
Data Visualization: Jupyter, Tableau,
Cognos, RAW
Machine Learning: Spark MLib,
Mahout, Azure ML studio
Difference
Between
Business
Intelligence
and Data
Science
BUSINESS INTELLIGENCE DATA SCIENCE
Uses structured data Uses both structured and
unstructured data
Analytical in nature - provides a
historical report of the data
Scientific in nature - perform an in-
depth statistical analysis on the
data
Use of basic statistics with
emphasis on visualization
(dashboards, reports)
Leverages more sophisticated
statistical and predictive analysis
and machine learning (ML)
Compares historical data to current
data to identify trends
Combines historical and current
data to predict future performance
and outcomes
Applications
of Data
Science
1. Healthcare
2. Gaming
3. Image
Recognition
4.
Recommendation
Systems
5. Logistics
6. Fraud
Detection
7. Internet Search
8. Speech
recognition
9. Targeted
Advertising
10. Airline Route
Planning
11. Augmented
Reality
Programming Language
for Data Science
Python
Fundamental
Python
Libraries for
Data
Scientists
Numpy
SciPy
Pandas
Scikit-Learn
IDE
Pycharm
Getting Started
Import pandas as pd
1
Import numpy as np
2
Import
matplotlib.pyplot as
plt
3
Getting Started
data = { ’year ’: [2010 , 2011 , 2012 ,2010 , 2011 , 2012 ,2010 , 2011 , 2012],
’team ’: [’ FCBarcelona ’, ’ FCBarcelona ’,’ FCBarcelona ’, ’ RMadrid ’,’ RMadrid ’, ’ RMadrid ’,’ ValenciaCF ’, ’
ValenciaCF ’,’ ValenciaCF ’
],
’wins ’: [30 , 28 , 32 , 29 , 32 , 26 , 21 , 17 , 19] ,
’ draws ’: [6 , 7, 4, 5, 4, 7, 8, 10 , 8] ,
’ losses ’: [2 , 3, 2, 4, 2, 5, 9, 11 , 11]
}
football = pd . DataFrame ( data , columns = [
’year ’, ’team ’, ’wins ’, ’ draws ’, ’ losses ’
]
)
Output
Read CSV
• Import pandas as pd
• mydata = pd.read_csv(‘data.csv’)
First Five Rows
• mydata.head()
Last Five Rows
• mydata.tail()
Show Statistical Information
• mydata.describe()
Selecting Data
• mydata[‘column’]
Subset of Rows
• mydata[5:10]
Thank You

More Related Content

Similar to Introduction to Data Science.pdf

Data Science course in Hyderabad .
Data Science course in Hyderabad            .Data Science course in Hyderabad            .
Data Science course in Hyderabad .
rajasrichalamala3zen
 
Data Science course in Hyderabad .
Data Science course in Hyderabad         .Data Science course in Hyderabad         .
Data Science course in Hyderabad .
rajasrichalamala3zen
 
data science course in Hyderabad data science course in Hyderabad
data science course in Hyderabad data science course in Hyderabaddata science course in Hyderabad data science course in Hyderabad
data science course in Hyderabad data science course in Hyderabad
akhilamadupativibhin
 
data science course training in Hyderabad
data science course training in Hyderabaddata science course training in Hyderabad
data science course training in Hyderabad
madhupriya3zen
 
data science course training in Hyderabad
data science course training in Hyderabaddata science course training in Hyderabad
data science course training in Hyderabad
madhupriya3zen
 
data science.pptx
data science.pptxdata science.pptx
data science.pptx
shaikruhiarsha3zenco
 
Colloquium(7)_DataScience:ShivShaktiGhosh&MohitGarg
Colloquium(7)_DataScience:ShivShaktiGhosh&MohitGargColloquium(7)_DataScience:ShivShaktiGhosh&MohitGarg
Colloquium(7)_DataScience:ShivShaktiGhosh&MohitGarg
Shiv Shakti Ghosh
 
What is data science ?
What is data science ?What is data science ?
What is data science ?
Bohitesh Misra, PMP
 
Data+Science : A First Course
Data+Science : A First CourseData+Science : A First Course
Data+Science : A First Course
Arnab Majumdar
 
Data science Nagarajan and madhav.pptx
Data science Nagarajan and madhav.pptxData science Nagarajan and madhav.pptx
Data science Nagarajan and madhav.pptx
NagarajanG35
 
Data Scientist Salary, Skills, Jobs And Resume | Data Scientist Career | Data...
Data Scientist Salary, Skills, Jobs And Resume | Data Scientist Career | Data...Data Scientist Salary, Skills, Jobs And Resume | Data Scientist Career | Data...
Data Scientist Salary, Skills, Jobs And Resume | Data Scientist Career | Data...
Simplilearn
 
Data Science: Unlocking Insights and Transforming Industries
Data Science: Unlocking Insights and Transforming IndustriesData Science: Unlocking Insights and Transforming Industries
Data Science: Unlocking Insights and Transforming Industries
Uncodemy
 
DATASCIENCE vs BUSINESS INTELLIGENCE.pptx
DATASCIENCE vs BUSINESS INTELLIGENCE.pptxDATASCIENCE vs BUSINESS INTELLIGENCE.pptx
DATASCIENCE vs BUSINESS INTELLIGENCE.pptx
OTA13NayabNakhwa
 
what is data science
 what is data science what is data science
what is data science
Crampete
 
Welcome to Data Science
Welcome to Data ScienceWelcome to Data Science
Welcome to Data Science
NyraSehgal
 
Learn All about Data Science from the Best Private University in Karnataka
Learn All about Data Science from the Best Private University in KarnatakaLearn All about Data Science from the Best Private University in Karnataka
Learn All about Data Science from the Best Private University in Karnataka
REVA University
 
Who is a data scientist
Who is a data scientist  Who is a data scientist
Who is a data scientist
prateek kumar
 
Data Analytics Course In Surat.pdf
Data Analytics Course In Surat.pdfData Analytics Course In Surat.pdf
Data Analytics Course In Surat.pdf
Sujata Gupta
 
Big data (word file)
Big data  (word file)Big data  (word file)
Big data (word file)
Shahbaz Anjam
 
INTRODUCTION TO DATA SCIENCE -CONCEPTS.pptx
INTRODUCTION TO DATA SCIENCE -CONCEPTS.pptxINTRODUCTION TO DATA SCIENCE -CONCEPTS.pptx
INTRODUCTION TO DATA SCIENCE -CONCEPTS.pptx
Madhumitha N
 

Similar to Introduction to Data Science.pdf (20)

Data Science course in Hyderabad .
Data Science course in Hyderabad            .Data Science course in Hyderabad            .
Data Science course in Hyderabad .
 
Data Science course in Hyderabad .
Data Science course in Hyderabad         .Data Science course in Hyderabad         .
Data Science course in Hyderabad .
 
data science course in Hyderabad data science course in Hyderabad
data science course in Hyderabad data science course in Hyderabaddata science course in Hyderabad data science course in Hyderabad
data science course in Hyderabad data science course in Hyderabad
 
data science course training in Hyderabad
data science course training in Hyderabaddata science course training in Hyderabad
data science course training in Hyderabad
 
data science course training in Hyderabad
data science course training in Hyderabaddata science course training in Hyderabad
data science course training in Hyderabad
 
data science.pptx
data science.pptxdata science.pptx
data science.pptx
 
Colloquium(7)_DataScience:ShivShaktiGhosh&MohitGarg
Colloquium(7)_DataScience:ShivShaktiGhosh&MohitGargColloquium(7)_DataScience:ShivShaktiGhosh&MohitGarg
Colloquium(7)_DataScience:ShivShaktiGhosh&MohitGarg
 
What is data science ?
What is data science ?What is data science ?
What is data science ?
 
Data+Science : A First Course
Data+Science : A First CourseData+Science : A First Course
Data+Science : A First Course
 
Data science Nagarajan and madhav.pptx
Data science Nagarajan and madhav.pptxData science Nagarajan and madhav.pptx
Data science Nagarajan and madhav.pptx
 
Data Scientist Salary, Skills, Jobs And Resume | Data Scientist Career | Data...
Data Scientist Salary, Skills, Jobs And Resume | Data Scientist Career | Data...Data Scientist Salary, Skills, Jobs And Resume | Data Scientist Career | Data...
Data Scientist Salary, Skills, Jobs And Resume | Data Scientist Career | Data...
 
Data Science: Unlocking Insights and Transforming Industries
Data Science: Unlocking Insights and Transforming IndustriesData Science: Unlocking Insights and Transforming Industries
Data Science: Unlocking Insights and Transforming Industries
 
DATASCIENCE vs BUSINESS INTELLIGENCE.pptx
DATASCIENCE vs BUSINESS INTELLIGENCE.pptxDATASCIENCE vs BUSINESS INTELLIGENCE.pptx
DATASCIENCE vs BUSINESS INTELLIGENCE.pptx
 
what is data science
 what is data science what is data science
what is data science
 
Welcome to Data Science
Welcome to Data ScienceWelcome to Data Science
Welcome to Data Science
 
Learn All about Data Science from the Best Private University in Karnataka
Learn All about Data Science from the Best Private University in KarnatakaLearn All about Data Science from the Best Private University in Karnataka
Learn All about Data Science from the Best Private University in Karnataka
 
Who is a data scientist
Who is a data scientist  Who is a data scientist
Who is a data scientist
 
Data Analytics Course In Surat.pdf
Data Analytics Course In Surat.pdfData Analytics Course In Surat.pdf
Data Analytics Course In Surat.pdf
 
Big data (word file)
Big data  (word file)Big data  (word file)
Big data (word file)
 
INTRODUCTION TO DATA SCIENCE -CONCEPTS.pptx
INTRODUCTION TO DATA SCIENCE -CONCEPTS.pptxINTRODUCTION TO DATA SCIENCE -CONCEPTS.pptx
INTRODUCTION TO DATA SCIENCE -CONCEPTS.pptx
 

Recently uploaded

Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*
Frank van Harmelen
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
Thijs Feryn
 
How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...
Product School
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Product School
 
Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........
Alison B. Lowndes
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
Kari Kakkonen
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
Paul Groth
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
Product School
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
ControlCase
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
DanBrown980551
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3
DianaGray10
 
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
Product School
 
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
Prayukth K V
 
Leading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdfLeading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdf
OnBoard
 
Elevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object CalisthenicsElevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object Calisthenics
Dorra BARTAGUIZ
 
Knowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backKnowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and back
Elena Simperl
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Inflectra
 
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Thierry Lestable
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
Safe Software
 
The Future of Platform Engineering
The Future of Platform EngineeringThe Future of Platform Engineering
The Future of Platform Engineering
Jemma Hussein Allen
 

Recently uploaded (20)

Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
 
How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
 
Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3
 
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
 
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
 
Leading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdfLeading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdf
 
Elevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object CalisthenicsElevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object Calisthenics
 
Knowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backKnowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and back
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
 
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
 
The Future of Platform Engineering
The Future of Platform EngineeringThe Future of Platform Engineering
The Future of Platform Engineering
 

Introduction to Data Science.pdf

  • 1. Data Science Muhammad Suleman Memon Assistant Professor Department of Information Technology, Dadu Campus, University of Sindh
  • 2. What is Data Science? Data science is the domain of study that deals with vast volumes. Find unseen patterns, derive meaningful information, and make business decisions. Data science uses complex machine learning algorithms to build predictive models.
  • 6. Prerequisites for Data Science 1. Machine Learning 2. Modeling 3. Statistics 4. Programming 5. Databases
  • 7. Who Oversees the Data Science Process? • Business Managers • To collaborate with the data science team to characterize the problem and establish an analytical method. • IT Managers • Developing the infrastructure and architecture to enable data science activities. • Data Science Managers • Supervise the working procedures of all data science team members. • They also manage and keep track of the day-to-day activities of the three data science teams.
  • 8. What is a Data Scientist? professionals who have the technical ability to handle complicated issues as well as the desire to investigate what questions need to be answered. They're a mix of mathematicians, computer scientists, and trend forecasters. They're also in high demand and well-paid because they work in both the business and IT sectors.
  • 9. On a daily basis, a data scientist may do the following tasks: Discover patterns and trends in datasets to get insights. Create forecasting algorithms and data models. Improve the quality of data or product offerings by utilising machine learning techniques. Distribute suggestions to other teams and top management. In data analysis, use data tools such as R, SAS, Python, or SQL. Top the field of data science innovations.
  • 10. What Does a Data Scientist Do? Determine the problem. Determines the correct set of variables and datasets. Gather structured and unstructured data from many sources. Convert raw data into a suitable format. Apply ML algorithms. Interpret the data to find opportunities and solutions. Prepare the results and insights to share with stake holders.
  • 11. Why Become a Data Scientist? • According to Glassdoor and Forbes, demand for data scientists will increase by 28 percent by 2026, which speaks of the profession’s durability and longevity, so if you want a secure career, data science offers you that chance.
  • 12. Use of Data Science 1. Data science may detect patterns in seemingly unstructured or unconnected data, allowing conclusions and predictions to be made. 2. Tech businesses that acquire user data can utilize strategies to transform that data into valuable or profitable information. 3. Data Science has also made inroads into the transportation industry, such as with driverless cars. 4. Data Science applications provide a better level of therapeutic customization through genetics and genomics research.
  • 13. Data Scientist Job role: Determine what the problem is, what questions need answers, and where to find the data. Also, they mine, clean, and present the relevant data. Skills needed: Programming skills (SAS, R, Python), storytelling and data visualization, statistical and mathematical skills, knowledge of Hadoop, SQL, and Machine Learning.
  • 14. Data Analyst Job role: Analysts bridge the gap between the data scientists and the business analysts, organizing and analyzing data to answer the questions the organization poses. They take the technical analyses and turn them into qualitative action items. Skills needed: Statistical and mathematical skills, programming skills (SAS, R, Python), plus experience in data wrangling and data visualization.
  • 15. Data Engineer Job role: Data engineers focus on developing, deploying, managing, and optimizing the organization’s data infrastructure and data pipelines. Engineers support data scientists by helping to transfer and transform data for queries. Skills needed: NoSQL databases (e.g., MongoDB, Cassandra DB), programming languages such as Java and Scala, and frameworks (Apache Hadoop).
  • 16. Data Science Tools Data Analysis: SAS, Jupyter, R Studio, MATLAB, Excel, RapidMiner Data Warehousing: Informatica/ Talend, AWS Redshift Data Visualization: Jupyter, Tableau, Cognos, RAW Machine Learning: Spark MLib, Mahout, Azure ML studio
  • 17. Difference Between Business Intelligence and Data Science BUSINESS INTELLIGENCE DATA SCIENCE Uses structured data Uses both structured and unstructured data Analytical in nature - provides a historical report of the data Scientific in nature - perform an in- depth statistical analysis on the data Use of basic statistics with emphasis on visualization (dashboards, reports) Leverages more sophisticated statistical and predictive analysis and machine learning (ML) Compares historical data to current data to identify trends Combines historical and current data to predict future performance and outcomes
  • 18. Applications of Data Science 1. Healthcare 2. Gaming 3. Image Recognition 4. Recommendation Systems 5. Logistics 6. Fraud Detection 7. Internet Search 8. Speech recognition 9. Targeted Advertising 10. Airline Route Planning 11. Augmented Reality
  • 22. Getting Started Import pandas as pd 1 Import numpy as np 2 Import matplotlib.pyplot as plt 3
  • 23. Getting Started data = { ’year ’: [2010 , 2011 , 2012 ,2010 , 2011 , 2012 ,2010 , 2011 , 2012], ’team ’: [’ FCBarcelona ’, ’ FCBarcelona ’,’ FCBarcelona ’, ’ RMadrid ’,’ RMadrid ’, ’ RMadrid ’,’ ValenciaCF ’, ’ ValenciaCF ’,’ ValenciaCF ’ ], ’wins ’: [30 , 28 , 32 , 29 , 32 , 26 , 21 , 17 , 19] , ’ draws ’: [6 , 7, 4, 5, 4, 7, 8, 10 , 8] , ’ losses ’: [2 , 3, 2, 4, 2, 5, 9, 11 , 11] } football = pd . DataFrame ( data , columns = [ ’year ’, ’team ’, ’wins ’, ’ draws ’, ’ losses ’ ] )
  • 25. Read CSV • Import pandas as pd • mydata = pd.read_csv(‘data.csv’)
  • 26. First Five Rows • mydata.head()
  • 27. Last Five Rows • mydata.tail()
  • 28. Show Statistical Information • mydata.describe()
  • 30. Subset of Rows • mydata[5:10]