SlideShare a Scribd company logo
1 of 26
Data Analysis Lab - 1
Introduction
By
Dr. Abhishek Kumar Singh
Student Introduction
• Name
• City and State
• Education detail (graduation, XII and X)
• PhD (IIT BHU Varanasi)
• M Tech (IIT BHU Varanasi)
• B Tech (GBTU)
• 3 Research Paper in SCOPUS/ABDC Indexed
journals
• 8 papers reviewed as a reviewer
• Six sigma green belt
Content
• Syllabus
• Data Analysis
• Variables
• Univariate
• Bivariate
Univariate Descriptive Analysis
• Measures of Central Tendency- Mean, Median,
Mode
• Measures of Variability- Range, Variance, Standard
Deviation, Co-efficient of Deviation
• Measures of Shape- Skewness and Kurtosis
• Measures of Stability- Standard Error
Bivariate Descriptive Analysis
• Covariance
• Correlation
Data Analysis
• The Process of cleaning, transforming,
interpreting, analyzing and visualizing the data
to extract useful information and gain valuable
insights to make more effective business
decisions is called data analysis.
Variables
• Variables: Any character, characteristics or
quality that varies is termed a variable.
• E.g.: To collect the basic clinical and
demographic information on patients with
particular illness. Variables of interest may
include Gender (M/F), age and height of the
patients.
Variable
Categorical Numerical
Nominal Ordinal Discrete Continuous
Categories are
mutually
exclusive and
unordered.
Eg. Gender (M/F)
Blood Group
(A/B/AB/O)
Categories are
mutually exclusive
and ordered.
Eg. Disease
severity (Mild,
Moderate and
Severe)
Integer values,
typically counts no
notion of
magnitude. Eg. No.
of children
vaccinated, days
sick per year
Takes any value in
a range of values
have a magnitude.
E.g. weight in kg
and Height in cm
Statistics
Descriptive Inferential
• Collecting
• Organizing
• Summarizing
• Presenting Data
• Making inference
• Hypothesis testing
• Determining relationship
• Making Prediction
Three types of analysis
• Univariate analysis: the examination of cases on only
one variable at a time (e.g., weight of college
students).
• Bivariate analysis: the examination of two variables
simultaneously (e.g., the relation between gender
and weight of college students).
• Multivariate analysis: examination of two variables
simultaneously (e.g., the relationship between
gender, race, and weight of college students).
Purpose of different type of analysis
• Univariate analysis: mainly description
• Bivariate analysis: Determining the empirical
relationship between two variables.
• Multivariate analysis: Determining the empirical
relationship among multiple variables.
Univariate
• The objective of univariate analysis is to derive the
data, define and summarize it and analyze the
pattern present in it.
• Univariate techniques are appropriate when there is
a single measurement of each element in the sample
or when there are several measurements of each
element but each variable is analyzed in isolation.
Univariate
Descriptive Inferential
• Measures of Central Tendency- Mean,
Median, Mode
• Measures of Variability- Range,
Variance, Standard Deviation, Co-efficient
of Deviation
• Measures of Shape- Skewness and
Kurtosis
• Measures of Stability- Standard Error
• z test
• t test
• Chi square test
Numerical Methods
• Mean
– Let X1, X2, X3,….Xn be the n data points, then mean
of data is defined as
– Mean provide the central value about which the
data is spread out.
Numerical Methods
• Median
– Median is the value which divide the data in two
halves
– Let X1, X2, X3,….Xn be the n data points
– Order the n data values
– If the number of data points is odd then sample
median is the value in position of (n+1)/2
– If the number of data points is even then sample
median is the average of value in position of n/2
and (n/2+1)
Mean or Median?
• Both the measures provide the “middle” value
of data, so how do they compare?
– Median is robust again extreme values in the data
– While mean is affected by the extreme values
• Example: 8, 9, 10, 11, 12 be the five data
points
– Mean = 10 and Median = 10
– Replace 12 by 18
• Mean = 11.2 but Median =10
Numerical Methods
• Mode
– Mode is the a value in data that occurs with
highest frequency
– It’s the most probable value of the data
– It is possible to have data that has more than one
Mode value. Such data is called multimodal.
Measures of Variability
• Percentile
– Order the data in ascending order
• Then, p1 in called the first percentile if 1% of points lie
below this value
• Similarly pk is called the k% of data points lie below this
value, where 0≤k≤100
• Quartile
– P25 is called the 1st quartile Q1
– P75 is called the 3rd quartile Q3
– P50 is Median
Measure of Dispersion
• Measures the spread of data
– Range
– Variation or standard deviation
• Measures the spread about mean/average value of
data
– Interquartile range
• Measures the spread about median value of the data
Measure of Dispersion
• Range = M-m, where,
– M = Max (x1, x2, ….xn)
– m = Min (x1, x2, ….xn)
• Variance
– S2 =
– Standard deviation = S
• Interquartile range: Q3 - Q1
Standard Deviation
• Standard Deviation is most commonly used
measure of dispersion.
– Under the assumption of normality the range of
Covers 67% of the data.
• Hence, this is commonly used to show possible error in
the observed value of data
Graphical Method
• Histogram or Bar chart
– Frequency Plot
• Pie Chart
• Cumulative frequency plot
• Box and Whisker plot
Bivariate
• Bi means two and variate means variable, so here
there are two variables. The analysisis related to
cause and the relationship between the two
variables.
• Correlation
• Covariance

More Related Content

Similar to Data Analysis Introduction.pptx

Sampling and Data_Update.ppt
Sampling and Data_Update.pptSampling and Data_Update.ppt
Sampling and Data_Update.pptMdShohelRana69
 
3. Statistical Analysis.pptx
3. Statistical Analysis.pptx3. Statistical Analysis.pptx
3. Statistical Analysis.pptxjeyanthisivakumar
 
Introduction to statistics.pptx
Introduction to statistics.pptxIntroduction to statistics.pptx
Introduction to statistics.pptxMuddaAbdo1
 
Biostatistics mean median mode unit 1.pptx
Biostatistics mean median mode unit 1.pptxBiostatistics mean median mode unit 1.pptx
Biostatistics mean median mode unit 1.pptxSailajaReddyGunnam
 
Introduction to Biostatistics_20_4_17.ppt
Introduction to Biostatistics_20_4_17.pptIntroduction to Biostatistics_20_4_17.ppt
Introduction to Biostatistics_20_4_17.pptnyakundi340
 
Chapter 11 quantitative data
Chapter 11 quantitative dataChapter 11 quantitative data
Chapter 11 quantitative datau59
 
CHAPTER 2 - NORM, CORRELATION AND REGRESSION.ppt
CHAPTER 2  - NORM, CORRELATION AND REGRESSION.pptCHAPTER 2  - NORM, CORRELATION AND REGRESSION.ppt
CHAPTER 2 - NORM, CORRELATION AND REGRESSION.pptkriti137049
 
Descriptive_statistics - Sample 1.pptx
Descriptive_statistics - Sample 1.pptxDescriptive_statistics - Sample 1.pptx
Descriptive_statistics - Sample 1.pptxSachinKumar524686
 
Statistical analysis
Statistical analysisStatistical analysis
Statistical analysisXiuxia Du
 
Chapter 6.pptx Data Analysis and processing
Chapter 6.pptx Data Analysis and processingChapter 6.pptx Data Analysis and processing
Chapter 6.pptx Data Analysis and processingetebarkhmichale
 
Introduction to Data Analysis for Nurse Researchers
Introduction to Data Analysis for Nurse ResearchersIntroduction to Data Analysis for Nurse Researchers
Introduction to Data Analysis for Nurse ResearchersRupa Verma
 
PARAMETRIC TESTS.pptx
PARAMETRIC TESTS.pptxPARAMETRIC TESTS.pptx
PARAMETRIC TESTS.pptxDrLasya
 
Multivariate Analysis Techniques
Multivariate Analysis TechniquesMultivariate Analysis Techniques
Multivariate Analysis TechniquesMehul Gondaliya
 
Biostatistics and Research Methodology Semester 8
Biostatistics and Research Methodology Semester 8Biostatistics and Research Methodology Semester 8
Biostatistics and Research Methodology Semester 8ParulSharma130721
 
ANALYSIS OF DATA.pptx
ANALYSIS OF DATA.pptxANALYSIS OF DATA.pptx
ANALYSIS OF DATA.pptxFankstien Tayeng
 

Similar to Data Analysis Introduction.pptx (20)

Sampling and Data_Update.ppt
Sampling and Data_Update.pptSampling and Data_Update.ppt
Sampling and Data_Update.ppt
 
3. Statistical Analysis.pptx
3. Statistical Analysis.pptx3. Statistical Analysis.pptx
3. Statistical Analysis.pptx
 
Introduction to statistics.pptx
Introduction to statistics.pptxIntroduction to statistics.pptx
Introduction to statistics.pptx
 
Biostatistics mean median mode unit 1.pptx
Biostatistics mean median mode unit 1.pptxBiostatistics mean median mode unit 1.pptx
Biostatistics mean median mode unit 1.pptx
 
Introduction to Biostatistics_20_4_17.ppt
Introduction to Biostatistics_20_4_17.pptIntroduction to Biostatistics_20_4_17.ppt
Introduction to Biostatistics_20_4_17.ppt
 
Chapter 11 quantitative data
Chapter 11 quantitative dataChapter 11 quantitative data
Chapter 11 quantitative data
 
BMS.ppt
BMS.pptBMS.ppt
BMS.ppt
 
Analysis
AnalysisAnalysis
Analysis
 
CHAPTER 2 - NORM, CORRELATION AND REGRESSION.ppt
CHAPTER 2  - NORM, CORRELATION AND REGRESSION.pptCHAPTER 2  - NORM, CORRELATION AND REGRESSION.ppt
CHAPTER 2 - NORM, CORRELATION AND REGRESSION.ppt
 
Descriptive_statistics - Sample 1.pptx
Descriptive_statistics - Sample 1.pptxDescriptive_statistics - Sample 1.pptx
Descriptive_statistics - Sample 1.pptx
 
PRESENTATION.pptx
PRESENTATION.pptxPRESENTATION.pptx
PRESENTATION.pptx
 
Statistical analysis
Statistical analysisStatistical analysis
Statistical analysis
 
Chapter 6.pptx Data Analysis and processing
Chapter 6.pptx Data Analysis and processingChapter 6.pptx Data Analysis and processing
Chapter 6.pptx Data Analysis and processing
 
determinatiion of
determinatiion of determinatiion of
determinatiion of
 
Introduction to Data Analysis for Nurse Researchers
Introduction to Data Analysis for Nurse ResearchersIntroduction to Data Analysis for Nurse Researchers
Introduction to Data Analysis for Nurse Researchers
 
PARAMETRIC TESTS.pptx
PARAMETRIC TESTS.pptxPARAMETRIC TESTS.pptx
PARAMETRIC TESTS.pptx
 
Multivariate Analysis Techniques
Multivariate Analysis TechniquesMultivariate Analysis Techniques
Multivariate Analysis Techniques
 
Biostatistics and Research Methodology Semester 8
Biostatistics and Research Methodology Semester 8Biostatistics and Research Methodology Semester 8
Biostatistics and Research Methodology Semester 8
 
ANALYSIS OF DATA.pptx
ANALYSIS OF DATA.pptxANALYSIS OF DATA.pptx
ANALYSIS OF DATA.pptx
 
Biostatistics ppt
Biostatistics  pptBiostatistics  ppt
Biostatistics ppt
 

More from DrAbhishekKumarSingh3

More from DrAbhishekKumarSingh3 (6)

Microsoft word.pptx
Microsoft word.pptxMicrosoft word.pptx
Microsoft word.pptx
 
Data Preparation.pptx
Data Preparation.pptxData Preparation.pptx
Data Preparation.pptx
 
Sorting and Filtering.pptx
Sorting and Filtering.pptxSorting and Filtering.pptx
Sorting and Filtering.pptx
 
BASIC STRUCTURE OF COMPUTERS.pptx
BASIC STRUCTURE OF COMPUTERS.pptxBASIC STRUCTURE OF COMPUTERS.pptx
BASIC STRUCTURE OF COMPUTERS.pptx
 
How to start writing a paper.pptx
How to start writing a paper.pptxHow to start writing a paper.pptx
How to start writing a paper.pptx
 
Optimization using lp.pptx
Optimization using lp.pptxOptimization using lp.pptx
Optimization using lp.pptx
 

Recently uploaded

Do More with Less: Navigating Customer Acquisition Challenges for Today's Ent...
Do More with Less: Navigating Customer Acquisition Challenges for Today's Ent...Do More with Less: Navigating Customer Acquisition Challenges for Today's Ent...
Do More with Less: Navigating Customer Acquisition Challenges for Today's Ent...Search Engine Journal
 
VIP 7001035870 Find & Meet Hyderabad Call Girls Film Nagar high-profile Call ...
VIP 7001035870 Find & Meet Hyderabad Call Girls Film Nagar high-profile Call ...VIP 7001035870 Find & Meet Hyderabad Call Girls Film Nagar high-profile Call ...
VIP 7001035870 Find & Meet Hyderabad Call Girls Film Nagar high-profile Call ...aditipandeya
 
Situation Analysis | Management Company.
Situation Analysis | Management Company.Situation Analysis | Management Company.
Situation Analysis | Management Company.DanielaQuiroz63
 
April 2024 - VBOUT Partners Meeting Group
April 2024 - VBOUT Partners Meeting GroupApril 2024 - VBOUT Partners Meeting Group
April 2024 - VBOUT Partners Meeting GroupVbout.com
 
Unraveling the Mystery of Roanoke Colony: What Really Happened?
Unraveling the Mystery of Roanoke Colony: What Really Happened?Unraveling the Mystery of Roanoke Colony: What Really Happened?
Unraveling the Mystery of Roanoke Colony: What Really Happened?elizabethella096
 
BDSM⚡Call Girls in Sector 150 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 150 Noida Escorts >༒8448380779 Escort ServiceBDSM⚡Call Girls in Sector 150 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 150 Noida Escorts >༒8448380779 Escort ServiceDelhi Call girls
 
Brand experience Peoria City Soccer Presentation.pdf
Brand experience Peoria City Soccer Presentation.pdfBrand experience Peoria City Soccer Presentation.pdf
Brand experience Peoria City Soccer Presentation.pdftbatkhuu1
 
Enjoy Night⚡Call Girls Dlf City Phase 4 Gurgaon >༒8448380779 Escort Service
Enjoy Night⚡Call Girls Dlf City Phase 4 Gurgaon >༒8448380779 Escort ServiceEnjoy Night⚡Call Girls Dlf City Phase 4 Gurgaon >༒8448380779 Escort Service
Enjoy Night⚡Call Girls Dlf City Phase 4 Gurgaon >༒8448380779 Escort ServiceDelhi Call girls
 
Uncover Insightful User Journey Secrets Using GA4 Reports
Uncover Insightful User Journey Secrets Using GA4 ReportsUncover Insightful User Journey Secrets Using GA4 Reports
Uncover Insightful User Journey Secrets Using GA4 ReportsVWO
 
Beyond Resumes_ How Volunteering Shapes Career Trajectories by Kent Kubie
Beyond Resumes_ How Volunteering Shapes Career Trajectories by Kent KubieBeyond Resumes_ How Volunteering Shapes Career Trajectories by Kent Kubie
Beyond Resumes_ How Volunteering Shapes Career Trajectories by Kent KubieKent Kubie
 
Kraft Mac and Cheese campaign presentation
Kraft Mac and Cheese campaign presentationKraft Mac and Cheese campaign presentation
Kraft Mac and Cheese campaign presentationtbatkhuu1
 
Avoid the 2025 web accessibility rush: do not fear WCAG compliance
Avoid the 2025 web accessibility rush: do not fear WCAG complianceAvoid the 2025 web accessibility rush: do not fear WCAG compliance
Avoid the 2025 web accessibility rush: do not fear WCAG complianceDamien ROBERT
 
TOP DUBAI AGENCY OFFERS EXPERT DIGITAL MARKETING SERVICES.pdf
TOP DUBAI AGENCY OFFERS EXPERT DIGITAL MARKETING SERVICES.pdfTOP DUBAI AGENCY OFFERS EXPERT DIGITAL MARKETING SERVICES.pdf
TOP DUBAI AGENCY OFFERS EXPERT DIGITAL MARKETING SERVICES.pdfasiyahanif9977
 
Mastering SEO in the Evolving AI-driven World
Mastering SEO in the Evolving AI-driven WorldMastering SEO in the Evolving AI-driven World
Mastering SEO in the Evolving AI-driven WorldScalenut
 
Call Us ➥9654467111▻Call Girls In Delhi NCR
Call Us ➥9654467111▻Call Girls In Delhi NCRCall Us ➥9654467111▻Call Girls In Delhi NCR
Call Us ➥9654467111▻Call Girls In Delhi NCRSapana Sha
 

Recently uploaded (20)

SEO Master Class - Steve Wiideman, Wiideman Consulting Group
SEO Master Class - Steve Wiideman, Wiideman Consulting GroupSEO Master Class - Steve Wiideman, Wiideman Consulting Group
SEO Master Class - Steve Wiideman, Wiideman Consulting Group
 
Do More with Less: Navigating Customer Acquisition Challenges for Today's Ent...
Do More with Less: Navigating Customer Acquisition Challenges for Today's Ent...Do More with Less: Navigating Customer Acquisition Challenges for Today's Ent...
Do More with Less: Navigating Customer Acquisition Challenges for Today's Ent...
 
How to Create a Social Media Plan Like a Pro - Jordan Scheltgen
How to Create a Social Media Plan Like a Pro - Jordan ScheltgenHow to Create a Social Media Plan Like a Pro - Jordan Scheltgen
How to Create a Social Media Plan Like a Pro - Jordan Scheltgen
 
Turn Digital Reputation Threats into Offense Tactics - Daniel Lemin
Turn Digital Reputation Threats into Offense Tactics - Daniel LeminTurn Digital Reputation Threats into Offense Tactics - Daniel Lemin
Turn Digital Reputation Threats into Offense Tactics - Daniel Lemin
 
VIP 7001035870 Find & Meet Hyderabad Call Girls Film Nagar high-profile Call ...
VIP 7001035870 Find & Meet Hyderabad Call Girls Film Nagar high-profile Call ...VIP 7001035870 Find & Meet Hyderabad Call Girls Film Nagar high-profile Call ...
VIP 7001035870 Find & Meet Hyderabad Call Girls Film Nagar high-profile Call ...
 
Situation Analysis | Management Company.
Situation Analysis | Management Company.Situation Analysis | Management Company.
Situation Analysis | Management Company.
 
April 2024 - VBOUT Partners Meeting Group
April 2024 - VBOUT Partners Meeting GroupApril 2024 - VBOUT Partners Meeting Group
April 2024 - VBOUT Partners Meeting Group
 
Unraveling the Mystery of Roanoke Colony: What Really Happened?
Unraveling the Mystery of Roanoke Colony: What Really Happened?Unraveling the Mystery of Roanoke Colony: What Really Happened?
Unraveling the Mystery of Roanoke Colony: What Really Happened?
 
BDSM⚡Call Girls in Sector 150 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 150 Noida Escorts >༒8448380779 Escort ServiceBDSM⚡Call Girls in Sector 150 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 150 Noida Escorts >༒8448380779 Escort Service
 
Brand experience Peoria City Soccer Presentation.pdf
Brand experience Peoria City Soccer Presentation.pdfBrand experience Peoria City Soccer Presentation.pdf
Brand experience Peoria City Soccer Presentation.pdf
 
The Fandom Dividend - Catalyzing Brand Growth through Cultural Engagement - M...
The Fandom Dividend - Catalyzing Brand Growth through Cultural Engagement - M...The Fandom Dividend - Catalyzing Brand Growth through Cultural Engagement - M...
The Fandom Dividend - Catalyzing Brand Growth through Cultural Engagement - M...
 
Enjoy Night⚡Call Girls Dlf City Phase 4 Gurgaon >༒8448380779 Escort Service
Enjoy Night⚡Call Girls Dlf City Phase 4 Gurgaon >༒8448380779 Escort ServiceEnjoy Night⚡Call Girls Dlf City Phase 4 Gurgaon >༒8448380779 Escort Service
Enjoy Night⚡Call Girls Dlf City Phase 4 Gurgaon >༒8448380779 Escort Service
 
Uncover Insightful User Journey Secrets Using GA4 Reports
Uncover Insightful User Journey Secrets Using GA4 ReportsUncover Insightful User Journey Secrets Using GA4 Reports
Uncover Insightful User Journey Secrets Using GA4 Reports
 
Beyond Resumes_ How Volunteering Shapes Career Trajectories by Kent Kubie
Beyond Resumes_ How Volunteering Shapes Career Trajectories by Kent KubieBeyond Resumes_ How Volunteering Shapes Career Trajectories by Kent Kubie
Beyond Resumes_ How Volunteering Shapes Career Trajectories by Kent Kubie
 
Kraft Mac and Cheese campaign presentation
Kraft Mac and Cheese campaign presentationKraft Mac and Cheese campaign presentation
Kraft Mac and Cheese campaign presentation
 
Avoid the 2025 web accessibility rush: do not fear WCAG compliance
Avoid the 2025 web accessibility rush: do not fear WCAG complianceAvoid the 2025 web accessibility rush: do not fear WCAG compliance
Avoid the 2025 web accessibility rush: do not fear WCAG compliance
 
TOP DUBAI AGENCY OFFERS EXPERT DIGITAL MARKETING SERVICES.pdf
TOP DUBAI AGENCY OFFERS EXPERT DIGITAL MARKETING SERVICES.pdfTOP DUBAI AGENCY OFFERS EXPERT DIGITAL MARKETING SERVICES.pdf
TOP DUBAI AGENCY OFFERS EXPERT DIGITAL MARKETING SERVICES.pdf
 
Mastering SEO in the Evolving AI-driven World
Mastering SEO in the Evolving AI-driven WorldMastering SEO in the Evolving AI-driven World
Mastering SEO in the Evolving AI-driven World
 
Call Us ➥9654467111▻Call Girls In Delhi NCR
Call Us ➥9654467111▻Call Girls In Delhi NCRCall Us ➥9654467111▻Call Girls In Delhi NCR
Call Us ➥9654467111▻Call Girls In Delhi NCR
 
The Future of Brands on LinkedIn - Alison Kaltman
The Future of Brands on LinkedIn - Alison KaltmanThe Future of Brands on LinkedIn - Alison Kaltman
The Future of Brands on LinkedIn - Alison Kaltman
 

Data Analysis Introduction.pptx

  • 1. Data Analysis Lab - 1 Introduction By Dr. Abhishek Kumar Singh
  • 2. Student Introduction • Name • City and State • Education detail (graduation, XII and X)
  • 3. • PhD (IIT BHU Varanasi) • M Tech (IIT BHU Varanasi) • B Tech (GBTU) • 3 Research Paper in SCOPUS/ABDC Indexed journals • 8 papers reviewed as a reviewer • Six sigma green belt
  • 4. Content • Syllabus • Data Analysis • Variables • Univariate • Bivariate
  • 5. Univariate Descriptive Analysis • Measures of Central Tendency- Mean, Median, Mode • Measures of Variability- Range, Variance, Standard Deviation, Co-efficient of Deviation • Measures of Shape- Skewness and Kurtosis • Measures of Stability- Standard Error
  • 6. Bivariate Descriptive Analysis • Covariance • Correlation
  • 7. Data Analysis • The Process of cleaning, transforming, interpreting, analyzing and visualizing the data to extract useful information and gain valuable insights to make more effective business decisions is called data analysis.
  • 8. Variables • Variables: Any character, characteristics or quality that varies is termed a variable. • E.g.: To collect the basic clinical and demographic information on patients with particular illness. Variables of interest may include Gender (M/F), age and height of the patients.
  • 9. Variable Categorical Numerical Nominal Ordinal Discrete Continuous Categories are mutually exclusive and unordered. Eg. Gender (M/F) Blood Group (A/B/AB/O) Categories are mutually exclusive and ordered. Eg. Disease severity (Mild, Moderate and Severe) Integer values, typically counts no notion of magnitude. Eg. No. of children vaccinated, days sick per year Takes any value in a range of values have a magnitude. E.g. weight in kg and Height in cm
  • 10. Statistics Descriptive Inferential • Collecting • Organizing • Summarizing • Presenting Data • Making inference • Hypothesis testing • Determining relationship • Making Prediction
  • 11. Three types of analysis • Univariate analysis: the examination of cases on only one variable at a time (e.g., weight of college students). • Bivariate analysis: the examination of two variables simultaneously (e.g., the relation between gender and weight of college students). • Multivariate analysis: examination of two variables simultaneously (e.g., the relationship between gender, race, and weight of college students).
  • 12. Purpose of different type of analysis • Univariate analysis: mainly description • Bivariate analysis: Determining the empirical relationship between two variables. • Multivariate analysis: Determining the empirical relationship among multiple variables.
  • 13. Univariate • The objective of univariate analysis is to derive the data, define and summarize it and analyze the pattern present in it. • Univariate techniques are appropriate when there is a single measurement of each element in the sample or when there are several measurements of each element but each variable is analyzed in isolation.
  • 14. Univariate Descriptive Inferential • Measures of Central Tendency- Mean, Median, Mode • Measures of Variability- Range, Variance, Standard Deviation, Co-efficient of Deviation • Measures of Shape- Skewness and Kurtosis • Measures of Stability- Standard Error • z test • t test • Chi square test
  • 15. Numerical Methods • Mean – Let X1, X2, X3,….Xn be the n data points, then mean of data is defined as – Mean provide the central value about which the data is spread out.
  • 16. Numerical Methods • Median – Median is the value which divide the data in two halves – Let X1, X2, X3,….Xn be the n data points – Order the n data values – If the number of data points is odd then sample median is the value in position of (n+1)/2 – If the number of data points is even then sample median is the average of value in position of n/2 and (n/2+1)
  • 17. Mean or Median? • Both the measures provide the “middle” value of data, so how do they compare? – Median is robust again extreme values in the data – While mean is affected by the extreme values • Example: 8, 9, 10, 11, 12 be the five data points – Mean = 10 and Median = 10 – Replace 12 by 18 • Mean = 11.2 but Median =10
  • 18. Numerical Methods • Mode – Mode is the a value in data that occurs with highest frequency – It’s the most probable value of the data – It is possible to have data that has more than one Mode value. Such data is called multimodal.
  • 19. Measures of Variability • Percentile – Order the data in ascending order • Then, p1 in called the first percentile if 1% of points lie below this value • Similarly pk is called the k% of data points lie below this value, where 0≤k≤100 • Quartile – P25 is called the 1st quartile Q1 – P75 is called the 3rd quartile Q3 – P50 is Median
  • 20. Measure of Dispersion • Measures the spread of data – Range – Variation or standard deviation • Measures the spread about mean/average value of data – Interquartile range • Measures the spread about median value of the data
  • 21. Measure of Dispersion • Range = M-m, where, – M = Max (x1, x2, ….xn) – m = Min (x1, x2, ….xn) • Variance – S2 = – Standard deviation = S • Interquartile range: Q3 - Q1
  • 22. Standard Deviation • Standard Deviation is most commonly used measure of dispersion. – Under the assumption of normality the range of Covers 67% of the data. • Hence, this is commonly used to show possible error in the observed value of data
  • 23. Graphical Method • Histogram or Bar chart – Frequency Plot • Pie Chart • Cumulative frequency plot • Box and Whisker plot
  • 24.
  • 25.
  • 26. Bivariate • Bi means two and variate means variable, so here there are two variables. The analysisis related to cause and the relationship between the two variables. • Correlation • Covariance