INTRODUCTION TO DATA ANALYTICS
Utkarsh Sharma
Asst. Prof (CSE Dept.)
Jaypee University of Engineering & Technology
CONTENTS
 What is Data Science, Big Data, Data Analytics?
 Roles and Responsibilities of Data Scientist, Big Data Professional and Data Analyst
 Required Skill set.
 Understanding how data science, big data, and data analytics is used to drive the success of Netflix.
ROLE OF STATISTICAL LEARNING
Here are some examples of learning problems:
 Predict whether a patient, hospitalized due to a heart attack, will have a second heart attack. The
prediction is to be based on demo-graphic, diet and clinical measurements for that patient.
 Predict the price of a stock in 6 months from now, on the basis of company performance measures and
economic data.
 Identify the numbers in a handwritten ZIP code, from a digitized image.
 Estimate the amount of glucose in the blood of a diabetic person, from the infrared absorption spectrum
of that person’s blood.
 Identify the risk factors for prostate cancer, based on clinical and demographic variables.
WHAT IS DATA SCIENCE?
 Combination of mathematics, statistics and programming.
 Context of problem being solved.
 Ingenious way of capturing data which is not captured.
 Ability to look at the things differently
WHAT IS BIG DATA?
 Large amount of data from various source.
 Traditional data processing system are incapable to deal.
 In terms of Volume, Variety, Veracity, Velocity and value.
WHAT IS DATA ANALYTICS?
 Discovering useful information from data.
 Supports decision making.
 Involves inspecting, cleansing, transforming and modelling data.
 Uses qualitative and quantitative techniques.
WHAT DO DATA SCIENTISTS DO?
 Predicts future based on past patterns using AI and machine learning.
 Examines data from multiple sources.
 Finding co-relations and hidden patterns from data.
WHAT DOES A BIG DATA PROFESSIONAL DO?
 Architect distributed systems.
 Build large scale data processing system.
 Process the data using various big data tools.
WHAT DOES A DATA ANALYST DO?
 Acquire, analyse and process the data.
 Finding insights of captured data.
 Create data report using various reporting tools.
SKILLS REQUIRED
SALARIES TO EXPECT
Data Scientist
₹610,811
Big Data Professional
₹520,811
Data Analyst
₹390,811
 How Data analytics, Data science and Big data professional used in
Netflix…..
AN EXAMPLE SCENARIO
Big Data Professional
AN EXAMPLE SCENARIO
Data Scientist
Understanding of
the impact of QoE
on User Behavior
Creating
personalized
streaming
experience
Optimizing content
caching
Improving content
quality
AN EXAMPLE SCENARIO
 For any queries kindly reach me at utkarsh.sharma@juet.ac.in.

Introduction to Data Analytics

  • 1.
    INTRODUCTION TO DATAANALYTICS Utkarsh Sharma Asst. Prof (CSE Dept.) Jaypee University of Engineering & Technology
  • 2.
    CONTENTS  What isData Science, Big Data, Data Analytics?  Roles and Responsibilities of Data Scientist, Big Data Professional and Data Analyst  Required Skill set.  Understanding how data science, big data, and data analytics is used to drive the success of Netflix.
  • 3.
    ROLE OF STATISTICALLEARNING Here are some examples of learning problems:  Predict whether a patient, hospitalized due to a heart attack, will have a second heart attack. The prediction is to be based on demo-graphic, diet and clinical measurements for that patient.  Predict the price of a stock in 6 months from now, on the basis of company performance measures and economic data.  Identify the numbers in a handwritten ZIP code, from a digitized image.  Estimate the amount of glucose in the blood of a diabetic person, from the infrared absorption spectrum of that person’s blood.  Identify the risk factors for prostate cancer, based on clinical and demographic variables.
  • 4.
    WHAT IS DATASCIENCE?  Combination of mathematics, statistics and programming.  Context of problem being solved.  Ingenious way of capturing data which is not captured.  Ability to look at the things differently
  • 5.
    WHAT IS BIGDATA?  Large amount of data from various source.  Traditional data processing system are incapable to deal.  In terms of Volume, Variety, Veracity, Velocity and value.
  • 6.
    WHAT IS DATAANALYTICS?  Discovering useful information from data.  Supports decision making.  Involves inspecting, cleansing, transforming and modelling data.  Uses qualitative and quantitative techniques.
  • 7.
    WHAT DO DATASCIENTISTS DO?  Predicts future based on past patterns using AI and machine learning.  Examines data from multiple sources.  Finding co-relations and hidden patterns from data.
  • 8.
    WHAT DOES ABIG DATA PROFESSIONAL DO?  Architect distributed systems.  Build large scale data processing system.  Process the data using various big data tools.
  • 9.
    WHAT DOES ADATA ANALYST DO?  Acquire, analyse and process the data.  Finding insights of captured data.  Create data report using various reporting tools.
  • 10.
  • 11.
    SALARIES TO EXPECT DataScientist ₹610,811 Big Data Professional ₹520,811 Data Analyst ₹390,811
  • 12.
     How Dataanalytics, Data science and Big data professional used in Netflix…..
  • 13.
    AN EXAMPLE SCENARIO BigData Professional
  • 14.
    AN EXAMPLE SCENARIO DataScientist Understanding of the impact of QoE on User Behavior Creating personalized streaming experience Optimizing content caching Improving content quality
  • 15.
  • 16.
     For anyqueries kindly reach me at utkarsh.sharma@juet.ac.in.