Introduction to Data Analysis
Course Notes
02-Feb-2024
by: Grace Jidael
Contents
What is Data and Data Analysis
Types of Data
Types of Data Analysis
Life Cycle of Data Analysis Project
Tools for Data Analysis
Data
Data consists of raw facts and figures that must be processed to
extract meaningful information.
The term big data refers to data sets that are so massive, generated
so quickly, and so varied that they defy traditional analysis
methods, such as those you might perform with a relational database.
Big data is often described in terms of five V's: velocity,
volume, variety, veracity, and value.
Examples of data: numbers, text, images, and sound.
Data Analysis
Data analysis is the systematic process of inspecting, cleaning,
transforming, and modeling data to discover meaningful information,
draw conclusions, and support decision-making.
It is also described as the process and method for extracting
knowledge and insights from large volumes of disparate data. It is an
interdisciplinary field involving probability, programming,
mathematics, statistical analysis, data visualization, and more.
It is what makes it possible for us to make use of information,
see patterns, find meaning in large volumes of data, and use
it to make decisions that drive business.
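The inspect, clean, transform, and model steps named above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the sales records, the field names, and the use of a simple mean as the "model" are all hypothetical and do not come from the course notes.

```python
from statistics import mean

# Hypothetical raw records: daily sales with one missing value,
# and numbers stored as text (a common state for raw data).
raw_records = [
    {"day": "Mon", "sales": "120"},
    {"day": "Tue", "sales": None},
    {"day": "Wed", "sales": "95"},
    {"day": "Thu", "sales": "143"},
]

# Inspect: count how many records have a missing sales figure.
missing_count = sum(1 for record in raw_records if record["sales"] is None)

# Clean: drop the records with missing values.
cleaned = [record for record in raw_records if record["sales"] is not None]

# Transform: convert the sales strings to integers.
sales = [int(record["sales"]) for record in cleaned]

# Model (in the simplest possible sense): summarize typical sales with the mean.
average_sales = mean(sales)

# Support a decision: list the days that fall below the average.
below_average_days = [
    record["day"] for record, value in zip(cleaned, sales) if value < average_sales
]

print(missing_count, sales, average_sales, below_average_days)
```

In a real project each step would be far more involved, but the order of operations is the same.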
Good to Know...
If you have data and curiosity, and you are manipulating and exploring
that data to get some answers from it, then that very exercise
is data analysis.
Data science and data analysis are relevant today because large
amounts of data are available.
We used to worry about a lack of data. Now we have a data deluge.
In the past, we lacked the algorithms; now we have them.
In the past, software was expensive; now much of it is open source and
free.
In the past, we couldn't store large amounts of data; now we can
store vast numbers of datasets at a fraction of the cost.
So the tools to work with data, the data itself, and the ability
to store and analyze it are all cheap, available, and
ubiquitous.
There has never been a better time to be a data scientist or analyst.
A data set with one variable is univariate; a data set with multiple variables is multivariate.
Types of Data
By Structure: Structured, Semi-Structured, Unstructured
By Variable Type: Categorical (Nominal or Ordinal), Numeric (Discrete or Continuous)
By Type: Cross-Sectional, Time-Series
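The categorical versus numeric distinction, and the univariate versus multivariate note, can be illustrated with a small Python sketch. The data set, the variable names, and the rule "numeric if every value is an int or float" are assumptions made for illustration; real data usually needs more careful type checks.

```python
# Hypothetical data set: each row is one observation, each key is a variable.
rows = [
    {"city": "Lagos", "grade": "B", "height_cm": 172.5},
    {"city": "Abuja", "grade": "A", "height_cm": 165.0},
    {"city": "Kano", "grade": "C", "height_cm": 180.2},
]


def variable_kind(values):
    """Label a variable numeric if every value is an int or float, else categorical."""
    is_numeric = all(
        isinstance(v, (int, float)) and not isinstance(v, bool) for v in values
    )
    return "numeric" if is_numeric else "categorical"


variables = list(rows[0].keys())
kinds = {name: variable_kind([row[name] for row in rows]) for name in variables}

# One variable means univariate; more than one means multivariate.
dataset_kind = "univariate" if len(variables) == 1 else "multivariate"

print(kinds)
print(dataset_kind)
```

Here "city" would be nominal and "grade" ordinal, but that distinction depends on meaning and cannot be detected from the values alone.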
Types of Analysis
Inferential Analysis:
Inferential analysis enables an analyst or
researcher to draw
conclusions about a population based on a
sample of data. It aims to make generalizations
or predictions about the larger group from which
the data was sampled.
It uses statistical tests, either to test for
significant relationships among variables or
to find statistical support for hypotheses.
It is based on the laws of probability.
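As one concrete instance of a statistical test, the sketch below runs an exact permutation test on two small samples. The notes do not name a specific test, so the choice of a permutation test, the two groups, and their values are assumptions made for illustration. The idea is to ask how often a difference in means at least as large as the observed one would appear if the group labels were assigned by chance.

```python
from itertools import combinations
from statistics import mean

# Hypothetical samples, e.g. test scores under two teaching methods.
group_a = [12, 14, 15, 16, 18]
group_b = [20, 22, 23, 25, 27]

# Observed absolute difference between the two sample means.
observed_diff = abs(mean(group_b) - mean(group_a))

pooled = group_a + group_b
n_a = len(group_a)
indices = range(len(pooled))

extreme = 0  # relabelings at least as extreme as what was observed
total = 0    # all possible relabelings
for chosen in combinations(indices, n_a):
    chosen_set = set(chosen)
    relabeled_a = [pooled[i] for i in chosen]
    relabeled_b = [pooled[i] for i in indices if i not in chosen_set]
    diff = abs(mean(relabeled_b) - mean(relabeled_a))
    # Small tolerance guards against floating-point rounding.
    if diff >= observed_diff - 1e-12:
        extreme += 1
    total += 1

# Two-sided p-value: share of relabelings as extreme as the observed split.
p_value = extreme / total
print(total, extreme, p_value)
```

A small p-value suggests the difference between the groups is unlikely to be due to chance alone, which is the kind of probability-based conclusion inferential analysis aims for.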
Descriptive Analysis:
Summarizes and organizes data to understand the
sample’s characteristics.
Descriptive statistics are numeric values obtained
from the data that give meaning to the data
collected. They include frequency distributions,
measures of central tendency, measures of
dispersion/variability, and bivariate descriptive
statistics.
It describes the current state of the data.
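The descriptive statistics listed above can be computed with Python's standard library. This is a minimal sketch; the sample values are hypothetical, and bivariate statistics are left out to keep it short.

```python
from collections import Counter
from statistics import mean, median, mode, stdev

# Hypothetical sample: number of items bought by ten customers.
items = [2, 3, 3, 5, 3, 4, 2, 3, 5, 10]

# Frequency distribution: how often each value occurs.
frequency = Counter(items)

# Measures of central tendency.
center = {"mean": mean(items), "median": median(items), "mode": mode(items)}

# Measures of dispersion: range and sample standard deviation.
spread = {"range": max(items) - min(items), "std_dev": stdev(items)}

print(dict(frequency))
print(center)
print(spread)
```

Note how the single large value (10) pulls the mean above the median, which is one reason to report more than one measure of central tendency.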
Other Types of Analysis
Predictive Analysis (Forecasting):
Uses statistical algorithms and machine
learning to make predictions about future
outcomes.
What if these trends continue?
What will happen next?
Exploratory Data Analysis:
Analyzes data sets to uncover patterns,
relationships, or trends.
Diagnostic Analysis:
Focuses on identifying the cause of a particular
problem or issue.
What happened?
Why is it happening?
Prescriptive Analysis:
Recommends actions to optimize or take
advantage of predicted future scenarios.
How do we solve it?
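The predictive question "What if these trends continue?" can be sketched with a straight-line trend fitted by least squares and extended one period ahead. The monthly figures are hypothetical, and a linear trend is only one simple choice assumed here for illustration; the notes do not prescribe a particular forecasting method.

```python
# Hypothetical monthly sales for months 1 to 5.
months = [1, 2, 3, 4, 5]
sales = [10, 12, 14, 16, 18]

mean_x = sum(months) / len(months)
mean_y = sum(sales) / len(sales)

# Least-squares slope and intercept of the line sales = intercept + slope * month.
sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(months, sales))
sxx = sum((x - mean_x) ** 2 for x in months)
slope = sxy / sxx
intercept = mean_y - slope * mean_x


def forecast(month):
    """Predict sales for a month by extending the fitted trend line."""
    return intercept + slope * month


next_month_sales = forecast(6)
print(slope, intercept, next_month_sales)
```

A prescriptive step would then build on such a forecast, for example by deciding how much stock to order for month 6.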
Life Cycle of a Data Science Project
Tools for Data Analysis
1. Business Intelligence (BI) Tools:
Tableau: A powerful BI tool for creating interactive and shareable dashboards.
Power BI: Microsoft's BI tool for data visualization, reporting, and sharing insights.
2. Spreadsheet Software:
Microsoft Excel: Widely used for data analysis, modeling, and visualization.
Google Sheets: Collaborative spreadsheet software with data analysis capabilities.
3. Database Tools:
SQL (Structured Query Language): Essential for querying and managing relational databases.
MongoDB: A NoSQL document database often used for handling semi-structured and unstructured data.
4. Programming Languages:
Python: A versatile language with extensive libraries like NumPy and pandas for data manipulation and
analysis.
R: Specialized for statistical computing and graphics, widely used in academia and research.
5. Statistical Tools:
SPSS (Statistical Package for the Social Sciences): Used for statistical analysis in social science research.
SAS (Statistical Analysis System): A software suite for advanced analytics, business intelligence, and
data management.
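To show how two of the tools above fit together, the sketch below runs a SQL query from Python using the standard-library sqlite3 module. The table name, columns, and figures are hypothetical, and SQLite is used only because it needs no setup; the same query would look much the same on other relational databases.

```python
import sqlite3

# In-memory database, so nothing is written to disk.
connection = sqlite3.connect(":memory:")
connection.execute("CREATE TABLE sales (region TEXT, amount REAL)")
connection.executemany(
    "INSERT INTO sales (region, amount) VALUES (?, ?)",
    [("North", 120.0), ("South", 80.0), ("North", 60.0), ("South", 40.0)],
)

# Total sales per region, a typical descriptive query.
totals = connection.execute(
    "SELECT region, SUM(amount) FROM sales GROUP BY region ORDER BY region"
).fetchall()
connection.close()

print(totals)
```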
Thank You
