SlideShare a Scribd company logo
1 of 45
Download to read offline
Descriptive Statistics
The farthest most people ever get
Data science and AI Certification Course
Visit: Learnbay.co
Descriptive Statistics
l Descriptive Statistics are Used by Researchers to
Report on Populations and Samples
l In Sociology:
Summary descriptions of measurements (variables)
taken about a group of people
l By Summarizing Information, Descriptive Statistics
Speed Up and Simplify Comprehension of a Group’s
Characteristics
Visit: Learnbay.co
Sample vs. Population
Population Sample
Visit: Learnbay.co
I.Q. Distribution
Visit: Learnbay.co
Source: www.wilderdom.com/.../L2-1UnderstandingIQ.html
Visit: Learnbay.co
Source: http://pse.cs.vt.edu/SoSci/converted/Dispersion_I/box_n_hist.gif
Visit: Learnbay.co
Descriptive Statistics
An Illustration:
Which Group is Smarter?
Class A--IQs of 13 Students Class B--IQs of 13 Students
102 115 127 162
128 109 131 103
131 89 96 111
98 106 80 109
140 119 93 87
93 97 120 105
110 109
Each individual may be different. If you try to understand a group by remembering the
qualities of each member, you become overwhelmed and fail to understand the group.
Visit: Learnbay.co
Descriptive Statistics
Which group is smarter now?
Class A--Average IQ
110.54 110.23
Class B--Average IQ
They’re roughly the same!
With a summary descriptive statistic, it is much easier
to answer our question.
Visit: Learnbay.co
Descriptive Statistics
Types of descriptive statistics:
l Organize Data
l Tables
l Graphs
l Summarize Data
l Central Tendency
l Variation
Visit: Learnbay.co
Descriptive Statistics
Types of descriptive statistics:
l Organize Data
l Tables
l Frequency Distributions
l Relative Frequency Distributions
l Graphs
l Bar Chart or Histogram
l Stem and Leaf Plot
l Frequency Polygon
Visit: Learnbay.co
Descriptive Statistics
Summarizing Data:
l Central Tendency (or Groups’ “Middle Values”)
l Mean
l Median
l Mode
l Variation (or Summary of Differences Within Groups)
l Range
l Interquartile Range
l Variance
l Standard Deviation
Visit: Learnbay.co
Mean
Most commonly called the “average.”
Add up the values for each case and divide by the
total number of cases.
Y-bar = (Y1 + Y2 + . . . + Yn)
n
Y-bar = Σ Yi
n
Visit: Learnbay.co
Mean
What’s up with all those symbols,man?
Y-bar = (Y1 + Y2 + . . . + Yn)
n
Y-bar = ΣYi
n
Some Symbolic Conventions in this Class:
Y = your variable (could be X or Q or J or even “Glitter”)l
l “-bar” or line over symbol of your variable = mean of that
variable
Y1 = first case’s value on variable Y
“. . .” = ellipsis = continue sequentially
Yn = last case’s value on variable Y
n = number of casesin your sample
Σ = Greek letter “sigma” = sum or add up what follows
l
l
l
l
l
l i = a typical case or each case in the sample (1 through n)
Visit: Learnbay.co
Mean
Class A--IQs of 13 Students
102 115
128 109
131 89
98 106
140 119
93 97
110
Σ Yi = 1437
Class B--IQs of 13 Students
127 162
131 103
96 111
80 109
93 87
120 105
109
Σ Yi = 1433
Y-barB = ΣYi = 1433 =
n 13
Y-barA
110.23
= Σ Yi = 1437 = 110.54
n 13
Visit: Learnbay.co
Mean
The mean is the “balance point.”
Each person’s score is like 1 pound placed at the score’s
position on a see-saw. Below, on a 200 cm see-saw, the
mean equals 110, the place on the see-saw where a fulcrum
finds balance:
17
units
below
4
units
below
110 cm
21
units
above
The scale is balanced because…
17 + 4 on the left = 21 on the right
0
units
1 lb at
93 cm
1 lb at
106 cm
1 lb at
131 cm
Visit: Learnbay.co
Mean
1. Means can be badly affected by outliers
(data points with extreme values unlike the
rest)
2. Outliers can make the mean a bad measure
of central tendency or common experience
Income in the U.S.
All of Us
Bill Gates
OutlierMean
Visit: Learnbay.co
Median
The middle value when a variable’s values are ranked
in order; the point that divides a distribution into two
equal halves.
When data are listed in order, the median is the point at
which 50% of the cases are above and 50% below it.
The 50th percentile.
Visit: Learnbay.co
Median
Median = 109
(six cases above, six below)
Class A--IQs of 13 Students
89
93
97
98
102
106
109
110
115
119
128
131 140
Visit: Learnbay.co
Median
If the first student were to drop out of Class A, there
would be a new median:
89
93
97
98
102
106
Median = 109.5
109 + 110 = 219/2 = 109.5
(six cases above, six below)
109
110
115
119
128
131
140
Visit:
Learnbay.co
Median
1. The median is unaffected by outliers,
making it a better measure of central
tendency, better describing the “typical
person” than the mean when data are
skewed.
All of Us
Bill Gates
outlier
Visit: Learnbay.co
Median
2. If the recorded values for a variable form a
symmetric distribution, the median and
mean are identical.
3. In skewed data, the mean lies further
toward the skew than the median.
Mean
Median
Mean
Median
Symmetric Skewed
Visit: Learnbay.co
Median
The middle score or measurement in a set of ranked
scores or measurements; the point that divides a
distribution into two equal halves.
Data are listed in order—the median is the point at
which 50% of the cases are above and 50% below.
The 50th percentile.
Visit: Learnbay.co
Mode
The most common data point is called the
mode.
The combined IQ scores for Classes A &B:
80 87 89 93 93 96 97 98 102 103 105 106 109 109 109 110 111 115 119 120
127 128 131 131 140 162
A la mode!!
BTW, It is possible to have more than one mode!
Visit: Learnbay.co
Mode
It may mot be at the
center of a
distribution.
Data distribution on the
right is “bimodal”
(even statistics can be
open-minded)
8
131.00162.00
82.00 .0014
00.0
1.0
12.
14.
16.
1.8
2.0
Cou
nt
Visit: Learnbay.co
Mode
1. It may give you the most likely experience rather than
the “typical” or “central” experience.
2. In symmetric distributions, the mean, median, and
mode are the same.
3. In skewed data, the mean and median lie further
toward the skew than the mode.
Mean
Median
Median MeanMode Mode
Symmetric Skewed
Visit: Learnbay.co
Descriptive Statistics
Summarizing Data:
ü Central Tendency (or Groups’ “Middle Values”)
ü Mean
ü Median
ü Mode
l Variation (or Summary of Differences Within Groups)
l Range
l Interquartile Range
l Variance
l Standard Deviation
Visit: Learnbay.co
Range
The spread, or the distance, between the lowest and highest
values of a variable.
To get the range for a variable, you subtract its lowest value from
its highest value.
Class A--IQs of 13 Students
102 115
128 109
131 89
98 106
140 119
93 97
110
Class A Range = 140 - 89 = 51
Class B--IQs of 13 Students
127 162
131 103
96 111
80 109
93 87
120 105
109
Class B Range = 162 - 80 = 82
Visit: Learnbay.co
Interquartile Range
A quartile is the value that marks one of the divisions that breaks a series of values into
four equal parts.
The median is a quartile and divides the cases in half.
25th percentile is a quartile that divides the first ¼ of cases from the latter ¾.
75th percentile is a quartile that divides the first ¾ of cases from the latter ¼.
The interquartile range is the distance or range between the 25th percentile and the 75th
percentile. Below, what is the interquartile range?
0 250 500 750 1000
25%
of
cases
25% 25% 25%
of
cases
Visit: Learnbay.co
Variance
A measure of the spread of the recorded values on a variable. A
measure of dispersion.
The larger the variance, the further the individual cases are from the
mean.
Mean
The smaller the variance, the closer the individual scores are to the
mean.
Mean
Visit: Learnbay.co
Variance
Variance is a number that at first seems
complex to calculate.
Calculating variance starts with a “deviation.”
A deviation is the distance away from the mean of a case’s score.
Yi – Y-bar If the average person’s car costs $20,000,
my deviation from the mean is - $14,000!
6K - 20K = -14K
Visit: Learnbay.co
Variance
The deviation of 102 from 110.54 is? Deviation of 115?
Class A--IQs of 13 Students
102 115
128 109
131 89
98106
140 119
9397
110
Y-barA = 110.54
Visit: Learnbay.co
Variance
The deviation of 102 from 110.54 is? Deviation of 115?
102 - 110.54 = -8.54 115 - 110.54 = 4.46
Class A--IQs of 13 Students
102 115
128 109
131 89
98106
140 119
9397
110
Y-barA = 110.54
Visit: Learnbay.co
Variance
l We want to add these to get total deviations, but if we
were to do that, we would get zero every time. Why?
l We need a way to eliminate negative signs.
Squaring the deviations will eliminate negative signs...
A Deviation Squared: (Yi – Y-bar)2
Back to the IQ example,
A deviation squared for 102 is: of 115:
(102 - 110.54)2 = (-8.54)2 = 72.93 (115 - 110.54)2 = (4.46)2 = 19.89
Visit: Learnbay.co
Variance
If you were to add all the squared deviations
together, you’d get what we call the
“Sum of Squares.”
Sum of Squares (SS) = Σ (Yi – Y-bar)2
SS = (Y1 – Y-bar)2 + (Y2 – Y-bar)2 + . . . + (Yn – Y-bar)2
Visit: Learnbay.co
Variance
Class A, sum of squares:
(102 – 110.54)2 + (115 – 110.54)2 +
(126 – 110.54)2 + (109 – 110.54)2 +
(131 – 110.54)2 + (89 – 110.54)2 +
(98 – 110.54)2 + (106 – 110.54)2 +
(140 – 110.54)2 + (119 – 110.54)2 +
(93 – 110.54)2 + (97 – 110.54)2 +
(110 – 110.54) = SS = 2825.39
Class A--IQs of 13 Students
102 115
128 109
131 89
98106
140 119
9397
110
Y-bar = 110.54
Visit: Learnbay.co
Variance
The last step…
The approximate average sum of squares is the variance.
SS/N = Variance for a population.
SS/n-1 = Variance for a sample.
Variance = Σ(Yi – Y-bar)2 / n – 1
Visit: Learnbay.co
Variance
For Class A, Variance = 2825.39 / n - 1
= 2825.39 / 12 = 235.45
How helpful is that???
Visit: Learnbay.co
Standard Deviation
To convert variance into something of meaning, let’s create
standard deviation.
The square root of the variance reveals the average
deviation of the observations from the mean.
s.d. = Σ(Yi – Y-bar)2
n - 1
Visit: Learnbay.co
Standard Deviation
For Class A, the standard deviation is:
235.45 = 15.34
The average of persons’ deviation from the mean IQ of
110.54 is 15.34 IQ points.
Review:
1. Deviation
2. Deviation squared
3. Sum of squares
4. Variance
5. Standard deviation Visit: Learnbay.co
Standard Deviation
1. Larger s.d. = greater amounts of variation around the mean.
For example:
19 25 31
Y = 25
s.d. = 3
13 25 37
Y = 25
s.d. = 6
2. s.d. = 0 only when all values are the same (only when you have a
constant and not a “variable”)
If you were to “rescale” a variable, the s.d. would change by the same
magnitude—if we changed units above so the mean equaled 250, the s.d.
on the left would be 30, and on the right, 60
Like the mean, the s.d. will be inflated by an outlier case value.
3.
4.
Visit: Learnbay.co
Descriptive Statistics
Summarizing Data:
ü Central Tendency (or Groups’ “Middle Values”)
ü
ü
ü
Mean
Median
Mode
ü Variation (or Summary of Differences Within Groups)
ü
ü
ü
ü
Range
Interquartile Range
Variance
Standard Deviation
l …Wait! There’s more
Visit: Learnbay.co
Box-Plots
A way to graphically portray almost all the
descriptive statistics at once is the box-plot.
A box-plot shows: Upper and lower quartiles
Mean
Median
Range
Outliers (1.5 IQR)
Visit: Learnbay.co
Box-PlotsIQ
80.00120.00140.00
123.5
106.5
96.5
82
162
M=110.5
IQR = 27; There
is no outlier.
Visit: Learnbay.co
Descriptive Statistics
l Now you are qualified use descriptive
statistics!
l Questions?
Visit: Learnbay.co
Visit: Learnbay.co
THANK YOU!!!

More Related Content

What's hot

Multiple regression presentation
Multiple regression presentationMultiple regression presentation
Multiple regression presentationCarlo Magno
 
Basic Descriptive statistics
Basic Descriptive statisticsBasic Descriptive statistics
Basic Descriptive statisticsAjendra Sharma
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statisticskemdoby
 
Inferential statistics.ppt
Inferential statistics.pptInferential statistics.ppt
Inferential statistics.pptNursing Path
 
Concept of Inferential statistics
Concept of Inferential statisticsConcept of Inferential statistics
Concept of Inferential statisticsSarfraz Ahmad
 
Simple linear regression
Simple linear regressionSimple linear regression
Simple linear regressionpankaj8108
 
Scales of measurement and presentation of data
Scales of measurement and presentation of dataScales of measurement and presentation of data
Scales of measurement and presentation of dataDr Sithun Kumar Patro
 
Logistic regression
Logistic regressionLogistic regression
Logistic regressionDrZahid Khan
 
ML - Multiple Linear Regression
ML - Multiple Linear RegressionML - Multiple Linear Regression
ML - Multiple Linear RegressionAndrew Ferlitsch
 
Correlation & Regression Analysis using SPSS
Correlation & Regression Analysis  using SPSSCorrelation & Regression Analysis  using SPSS
Correlation & Regression Analysis using SPSSParag Shah
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statisticsMmedsc Hahm
 
Introduction to Statistics - Basic concepts
Introduction to Statistics - Basic conceptsIntroduction to Statistics - Basic concepts
Introduction to Statistics - Basic conceptsDocIbrahimAbdelmonaem
 
Regression diagnostics
Regression diagnosticsRegression diagnostics
Regression diagnosticsdermengles
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statisticsAttaullah Khan
 
INFERENTIAL STATISTICS: AN INTRODUCTION
INFERENTIAL STATISTICS: AN INTRODUCTIONINFERENTIAL STATISTICS: AN INTRODUCTION
INFERENTIAL STATISTICS: AN INTRODUCTIONJohn Labrador
 

What's hot (20)

Multiple regression presentation
Multiple regression presentationMultiple regression presentation
Multiple regression presentation
 
Basic Descriptive statistics
Basic Descriptive statisticsBasic Descriptive statistics
Basic Descriptive statistics
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statistics
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statistics
 
Inferential statistics.ppt
Inferential statistics.pptInferential statistics.ppt
Inferential statistics.ppt
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statistics
 
Descriptive statistics ii
Descriptive statistics iiDescriptive statistics ii
Descriptive statistics ii
 
Concept of Inferential statistics
Concept of Inferential statisticsConcept of Inferential statistics
Concept of Inferential statistics
 
Simple linear regression
Simple linear regressionSimple linear regression
Simple linear regression
 
Scales of measurement and presentation of data
Scales of measurement and presentation of dataScales of measurement and presentation of data
Scales of measurement and presentation of data
 
Logistic regression
Logistic regressionLogistic regression
Logistic regression
 
ML - Multiple Linear Regression
ML - Multiple Linear RegressionML - Multiple Linear Regression
ML - Multiple Linear Regression
 
Correlation & Regression Analysis using SPSS
Correlation & Regression Analysis  using SPSSCorrelation & Regression Analysis  using SPSS
Correlation & Regression Analysis using SPSS
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statistics
 
Normal distribution
Normal distributionNormal distribution
Normal distribution
 
Introduction to Statistics - Basic concepts
Introduction to Statistics - Basic conceptsIntroduction to Statistics - Basic concepts
Introduction to Statistics - Basic concepts
 
Regression diagnostics
Regression diagnosticsRegression diagnostics
Regression diagnostics
 
Regression analysis
Regression analysisRegression analysis
Regression analysis
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statistics
 
INFERENTIAL STATISTICS: AN INTRODUCTION
INFERENTIAL STATISTICS: AN INTRODUCTIONINFERENTIAL STATISTICS: AN INTRODUCTION
INFERENTIAL STATISTICS: AN INTRODUCTION
 

Similar to Descriptive statistics

Descriptive stat
Descriptive statDescriptive stat
Descriptive stato_devinyak
 
asDescriptive_Statistics2.ppt
asDescriptive_Statistics2.pptasDescriptive_Statistics2.ppt
asDescriptive_Statistics2.pptradha91354
 
Basics of Stats (2).pptx
Basics of Stats (2).pptxBasics of Stats (2).pptx
Basics of Stats (2).pptxmadihamaqbool6
 
Chapter2 slides-part 2-harish complete
Chapter2 slides-part 2-harish completeChapter2 slides-part 2-harish complete
Chapter2 slides-part 2-harish completeEasyStudy3
 
m2_2_variation_z_scores.pptx
m2_2_variation_z_scores.pptxm2_2_variation_z_scores.pptx
m2_2_variation_z_scores.pptxMesfinMelese4
 
multiple regression
multiple regressionmultiple regression
multiple regressionPriya Sharma
 
Basic biostatistics dr.eezn
Basic biostatistics dr.eeznBasic biostatistics dr.eezn
Basic biostatistics dr.eeznEhealthMoHS
 
research Correlation_and_Regression-3.ppt
research Correlation_and_Regression-3.pptresearch Correlation_and_Regression-3.ppt
research Correlation_and_Regression-3.pptNehaPandeySharma
 
Frequency distribution, central tendency, measures of dispersion
Frequency distribution, central tendency, measures of dispersionFrequency distribution, central tendency, measures of dispersion
Frequency distribution, central tendency, measures of dispersionDhwani Shah
 
Module 3 statistics
Module 3   statisticsModule 3   statistics
Module 3 statisticsdionesioable
 
Measures of dispersion qt pgdm 1st trisemester
Measures of dispersion qt pgdm 1st trisemester Measures of dispersion qt pgdm 1st trisemester
Measures of dispersion qt pgdm 1st trisemester Karan Kukreja
 
Correlation_and_Regression-3.ppt
Correlation_and_Regression-3.pptCorrelation_and_Regression-3.ppt
Correlation_and_Regression-3.pptRidaIrfan10
 
continuous probability distributions.ppt
continuous probability distributions.pptcontinuous probability distributions.ppt
continuous probability distributions.pptLLOYDARENAS1
 
Module-2_Notes-with-Example for data science
Module-2_Notes-with-Example for data scienceModule-2_Notes-with-Example for data science
Module-2_Notes-with-Example for data sciencepujashri1975
 
Quant Data Analysis
Quant Data AnalysisQuant Data Analysis
Quant Data AnalysisSaad Chahine
 

Similar to Descriptive statistics (20)

Descriptive stat
Descriptive statDescriptive stat
Descriptive stat
 
asDescriptive_Statistics2.ppt
asDescriptive_Statistics2.pptasDescriptive_Statistics2.ppt
asDescriptive_Statistics2.ppt
 
Basics of Stats (2).pptx
Basics of Stats (2).pptxBasics of Stats (2).pptx
Basics of Stats (2).pptx
 
02a one sample_t-test
02a one sample_t-test02a one sample_t-test
02a one sample_t-test
 
Chapter 11
Chapter 11Chapter 11
Chapter 11
 
Chapter2 slides-part 2-harish complete
Chapter2 slides-part 2-harish completeChapter2 slides-part 2-harish complete
Chapter2 slides-part 2-harish complete
 
1. descriptive statistics
1. descriptive statistics1. descriptive statistics
1. descriptive statistics
 
m2_2_variation_z_scores.pptx
m2_2_variation_z_scores.pptxm2_2_variation_z_scores.pptx
m2_2_variation_z_scores.pptx
 
multiple regression
multiple regressionmultiple regression
multiple regression
 
Basic biostatistics dr.eezn
Basic biostatistics dr.eeznBasic biostatistics dr.eezn
Basic biostatistics dr.eezn
 
research Correlation_and_Regression-3.ppt
research Correlation_and_Regression-3.pptresearch Correlation_and_Regression-3.ppt
research Correlation_and_Regression-3.ppt
 
Frequency distribution, central tendency, measures of dispersion
Frequency distribution, central tendency, measures of dispersionFrequency distribution, central tendency, measures of dispersion
Frequency distribution, central tendency, measures of dispersion
 
Module 3 statistics
Module 3   statisticsModule 3   statistics
Module 3 statistics
 
Measures of dispersion qt pgdm 1st trisemester
Measures of dispersion qt pgdm 1st trisemester Measures of dispersion qt pgdm 1st trisemester
Measures of dispersion qt pgdm 1st trisemester
 
Correlation_and_Regression-3.ppt
Correlation_and_Regression-3.pptCorrelation_and_Regression-3.ppt
Correlation_and_Regression-3.ppt
 
lecture6.ppt
lecture6.pptlecture6.ppt
lecture6.ppt
 
continuous probability distributions.ppt
continuous probability distributions.pptcontinuous probability distributions.ppt
continuous probability distributions.ppt
 
Module-2_Notes-with-Example for data science
Module-2_Notes-with-Example for data scienceModule-2_Notes-with-Example for data science
Module-2_Notes-with-Example for data science
 
statistics
statisticsstatistics
statistics
 
Quant Data Analysis
Quant Data AnalysisQuant Data Analysis
Quant Data Analysis
 

More from Learnbay Datascience

Artificial Intelligence- Neural Networks
Artificial Intelligence- Neural NetworksArtificial Intelligence- Neural Networks
Artificial Intelligence- Neural NetworksLearnbay Datascience
 
Artificial intelligence - expert systems
 Artificial intelligence - expert systems Artificial intelligence - expert systems
Artificial intelligence - expert systemsLearnbay Datascience
 
Artificial intelligence - research areas
Artificial intelligence - research areasArtificial intelligence - research areas
Artificial intelligence - research areasLearnbay Datascience
 
Artificial intelligence intelligent systems
Artificial intelligence   intelligent systemsArtificial intelligence   intelligent systems
Artificial intelligence intelligent systemsLearnbay Datascience
 

More from Learnbay Datascience (20)

Top data science projects
Top data science projectsTop data science projects
Top data science projects
 
Python my SQL - create table
Python my SQL - create tablePython my SQL - create table
Python my SQL - create table
 
Python my SQL - create database
Python my SQL - create databasePython my SQL - create database
Python my SQL - create database
 
Python my sql database connection
Python my sql   database connectionPython my sql   database connection
Python my sql database connection
 
Python - mySOL
Python - mySOLPython - mySOL
Python - mySOL
 
AI - Issues and Terminology
AI - Issues and TerminologyAI - Issues and Terminology
AI - Issues and Terminology
 
AI - Fuzzy Logic Systems
AI - Fuzzy Logic SystemsAI - Fuzzy Logic Systems
AI - Fuzzy Logic Systems
 
AI - working of an ns
AI - working of an nsAI - working of an ns
AI - working of an ns
 
Artificial Intelligence- Neural Networks
Artificial Intelligence- Neural NetworksArtificial Intelligence- Neural Networks
Artificial Intelligence- Neural Networks
 
AI - Robotics
AI - RoboticsAI - Robotics
AI - Robotics
 
Applications of expert system
Applications of expert systemApplications of expert system
Applications of expert system
 
Components of expert systems
Components of expert systemsComponents of expert systems
Components of expert systems
 
Artificial intelligence - expert systems
 Artificial intelligence - expert systems Artificial intelligence - expert systems
Artificial intelligence - expert systems
 
AI - natural language processing
AI - natural language processingAI - natural language processing
AI - natural language processing
 
Ai popular search algorithms
Ai   popular search algorithmsAi   popular search algorithms
Ai popular search algorithms
 
AI - Agents & Environments
AI - Agents & EnvironmentsAI - Agents & Environments
AI - Agents & Environments
 
Artificial intelligence - research areas
Artificial intelligence - research areasArtificial intelligence - research areas
Artificial intelligence - research areas
 
Artificial intelligence composed
Artificial intelligence composedArtificial intelligence composed
Artificial intelligence composed
 
Artificial intelligence intelligent systems
Artificial intelligence   intelligent systemsArtificial intelligence   intelligent systems
Artificial intelligence intelligent systems
 
Applications of ai
Applications of aiApplications of ai
Applications of ai
 

Recently uploaded

What is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPWhat is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPCeline George
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxpboyjonauth
 
Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Celine George
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdfssuser54595a
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentInMediaRes1
 
MICROBIOLOGY biochemical test detailed.pptx
MICROBIOLOGY biochemical test detailed.pptxMICROBIOLOGY biochemical test detailed.pptx
MICROBIOLOGY biochemical test detailed.pptxabhijeetpadhi001
 
Capitol Tech U Doctoral Presentation - April 2024.pptx
Capitol Tech U Doctoral Presentation - April 2024.pptxCapitol Tech U Doctoral Presentation - April 2024.pptx
Capitol Tech U Doctoral Presentation - April 2024.pptxCapitolTechU
 
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdfFraming an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdfUjwalaBharambe
 
Pharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdfPharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdfMahmoud M. Sallam
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxiammrhaywood
 
DATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersDATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersSabitha Banu
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon AUnboundStockton
 
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfEnzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfSumit Tiwari
 
Full Stack Web Development Course for Beginners
Full Stack Web Development Course  for BeginnersFull Stack Web Development Course  for Beginners
Full Stack Web Development Course for BeginnersSabitha Banu
 
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxEPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxRaymartEstabillo3
 
Final demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxFinal demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxAvyJaneVismanos
 
Blooming Together_ Growing a Community Garden Worksheet.docx
Blooming Together_ Growing a Community Garden Worksheet.docxBlooming Together_ Growing a Community Garden Worksheet.docx
Blooming Together_ Growing a Community Garden Worksheet.docxUnboundStockton
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17Celine George
 

Recently uploaded (20)

What is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPWhat is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERP
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptx
 
Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media Component
 
MICROBIOLOGY biochemical test detailed.pptx
MICROBIOLOGY biochemical test detailed.pptxMICROBIOLOGY biochemical test detailed.pptx
MICROBIOLOGY biochemical test detailed.pptx
 
Capitol Tech U Doctoral Presentation - April 2024.pptx
Capitol Tech U Doctoral Presentation - April 2024.pptxCapitol Tech U Doctoral Presentation - April 2024.pptx
Capitol Tech U Doctoral Presentation - April 2024.pptx
 
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdfFraming an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
 
Pharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdfPharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdf
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
 
ESSENTIAL of (CS/IT/IS) class 06 (database)
ESSENTIAL of (CS/IT/IS) class 06 (database)ESSENTIAL of (CS/IT/IS) class 06 (database)
ESSENTIAL of (CS/IT/IS) class 06 (database)
 
DATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersDATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginners
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon A
 
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfEnzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
 
Full Stack Web Development Course for Beginners
Full Stack Web Development Course  for BeginnersFull Stack Web Development Course  for Beginners
Full Stack Web Development Course for Beginners
 
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxEPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
 
Final demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxFinal demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptx
 
Blooming Together_ Growing a Community Garden Worksheet.docx
Blooming Together_ Growing a Community Garden Worksheet.docxBlooming Together_ Growing a Community Garden Worksheet.docx
Blooming Together_ Growing a Community Garden Worksheet.docx
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
 
How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17
 

Descriptive statistics

  • 1. Descriptive Statistics The farthest most people ever get Data science and AI Certification Course Visit: Learnbay.co
  • 2. Descriptive Statistics l Descriptive Statistics are Used by Researchers to Report on Populations and Samples l In Sociology: Summary descriptions of measurements (variables) taken about a group of people l By Summarizing Information, Descriptive Statistics Speed Up and Simplify Comprehension of a Group’s Characteristics Visit: Learnbay.co
  • 3. Sample vs. Population Population Sample Visit: Learnbay.co
  • 7. Descriptive Statistics An Illustration: Which Group is Smarter? Class A--IQs of 13 Students Class B--IQs of 13 Students 102 115 127 162 128 109 131 103 131 89 96 111 98 106 80 109 140 119 93 87 93 97 120 105 110 109 Each individual may be different. If you try to understand a group by remembering the qualities of each member, you become overwhelmed and fail to understand the group. Visit: Learnbay.co
  • 8. Descriptive Statistics Which group is smarter now? Class A--Average IQ 110.54 110.23 Class B--Average IQ They’re roughly the same! With a summary descriptive statistic, it is much easier to answer our question. Visit: Learnbay.co
  • 9. Descriptive Statistics Types of descriptive statistics: l Organize Data l Tables l Graphs l Summarize Data l Central Tendency l Variation Visit: Learnbay.co
  • 10. Descriptive Statistics Types of descriptive statistics: l Organize Data l Tables l Frequency Distributions l Relative Frequency Distributions l Graphs l Bar Chart or Histogram l Stem and Leaf Plot l Frequency Polygon Visit: Learnbay.co
  • 11. Descriptive Statistics Summarizing Data: l Central Tendency (or Groups’ “Middle Values”) l Mean l Median l Mode l Variation (or Summary of Differences Within Groups) l Range l Interquartile Range l Variance l Standard Deviation Visit: Learnbay.co
  • 12. Mean Most commonly called the “average.” Add up the values for each case and divide by the total number of cases. Y-bar = (Y1 + Y2 + . . . + Yn) n Y-bar = Σ Yi n Visit: Learnbay.co
  • 13. Mean What’s up with all those symbols,man? Y-bar = (Y1 + Y2 + . . . + Yn) n Y-bar = ΣYi n Some Symbolic Conventions in this Class: Y = your variable (could be X or Q or J or even “Glitter”)l l “-bar” or line over symbol of your variable = mean of that variable Y1 = first case’s value on variable Y “. . .” = ellipsis = continue sequentially Yn = last case’s value on variable Y n = number of casesin your sample Σ = Greek letter “sigma” = sum or add up what follows l l l l l l i = a typical case or each case in the sample (1 through n) Visit: Learnbay.co
  • 14. Mean Class A--IQs of 13 Students 102 115 128 109 131 89 98 106 140 119 93 97 110 Σ Yi = 1437 Class B--IQs of 13 Students 127 162 131 103 96 111 80 109 93 87 120 105 109 Σ Yi = 1433 Y-barB = ΣYi = 1433 = n 13 Y-barA 110.23 = Σ Yi = 1437 = 110.54 n 13 Visit: Learnbay.co
  • 15. Mean The mean is the “balance point.” Each person’s score is like 1 pound placed at the score’s position on a see-saw. Below, on a 200 cm see-saw, the mean equals 110, the place on the see-saw where a fulcrum finds balance: 17 units below 4 units below 110 cm 21 units above The scale is balanced because… 17 + 4 on the left = 21 on the right 0 units 1 lb at 93 cm 1 lb at 106 cm 1 lb at 131 cm Visit: Learnbay.co
  • 16. Mean 1. Means can be badly affected by outliers (data points with extreme values unlike the rest) 2. Outliers can make the mean a bad measure of central tendency or common experience Income in the U.S. All of Us Bill Gates OutlierMean Visit: Learnbay.co
  • 17. Median The middle value when a variable’s values are ranked in order; the point that divides a distribution into two equal halves. When data are listed in order, the median is the point at which 50% of the cases are above and 50% below it. The 50th percentile. Visit: Learnbay.co
  • 18. Median Median = 109 (six cases above, six below) Class A--IQs of 13 Students 89 93 97 98 102 106 109 110 115 119 128 131 140 Visit: Learnbay.co
  • 19. Median If the first student were to drop out of Class A, there would be a new median: 89 93 97 98 102 106 Median = 109.5 109 + 110 = 219/2 = 109.5 (six cases above, six below) 109 110 115 119 128 131 140 Visit: Learnbay.co
  • 20. Median 1. The median is unaffected by outliers, making it a better measure of central tendency, better describing the “typical person” than the mean when data are skewed. All of Us Bill Gates outlier Visit: Learnbay.co
  • 21. Median 2. If the recorded values for a variable form a symmetric distribution, the median and mean are identical. 3. In skewed data, the mean lies further toward the skew than the median. Mean Median Mean Median Symmetric Skewed Visit: Learnbay.co
  • 22. Median The middle score or measurement in a set of ranked scores or measurements; the point that divides a distribution into two equal halves. Data are listed in order—the median is the point at which 50% of the cases are above and 50% below. The 50th percentile. Visit: Learnbay.co
  • 23. Mode The most common data point is called the mode. The combined IQ scores for Classes A &B: 80 87 89 93 93 96 97 98 102 103 105 106 109 109 109 110 111 115 119 120 127 128 131 131 140 162 A la mode!! BTW, It is possible to have more than one mode! Visit: Learnbay.co
  • 24. Mode It may mot be at the center of a distribution. Data distribution on the right is “bimodal” (even statistics can be open-minded) 8 131.00162.00 82.00 .0014 00.0 1.0 12. 14. 16. 1.8 2.0 Cou nt Visit: Learnbay.co
  • 25. Mode 1. It may give you the most likely experience rather than the “typical” or “central” experience. 2. In symmetric distributions, the mean, median, and mode are the same. 3. In skewed data, the mean and median lie further toward the skew than the mode. Mean Median Median MeanMode Mode Symmetric Skewed Visit: Learnbay.co
  • 26. Descriptive Statistics Summarizing Data: ü Central Tendency (or Groups’ “Middle Values”) ü Mean ü Median ü Mode l Variation (or Summary of Differences Within Groups) l Range l Interquartile Range l Variance l Standard Deviation Visit: Learnbay.co
  • 27. Range The spread, or the distance, between the lowest and highest values of a variable. To get the range for a variable, you subtract its lowest value from its highest value. Class A--IQs of 13 Students 102 115 128 109 131 89 98 106 140 119 93 97 110 Class A Range = 140 - 89 = 51 Class B--IQs of 13 Students 127 162 131 103 96 111 80 109 93 87 120 105 109 Class B Range = 162 - 80 = 82 Visit: Learnbay.co
  • 28. Interquartile Range A quartile is the value that marks one of the divisions that breaks a series of values into four equal parts. The median is a quartile and divides the cases in half. 25th percentile is a quartile that divides the first ¼ of cases from the latter ¾. 75th percentile is a quartile that divides the first ¾ of cases from the latter ¼. The interquartile range is the distance or range between the 25th percentile and the 75th percentile. Below, what is the interquartile range? 0 250 500 750 1000 25% of cases 25% 25% 25% of cases Visit: Learnbay.co
  • 29. Variance A measure of the spread of the recorded values on a variable. A measure of dispersion. The larger the variance, the further the individual cases are from the mean. Mean The smaller the variance, the closer the individual scores are to the mean. Mean Visit: Learnbay.co
  • 30. Variance Variance is a number that at first seems complex to calculate. Calculating variance starts with a “deviation.” A deviation is the distance away from the mean of a case’s score. Yi – Y-bar If the average person’s car costs $20,000, my deviation from the mean is - $14,000! 6K - 20K = -14K Visit: Learnbay.co
  • 31. Variance The deviation of 102 from 110.54 is? Deviation of 115? Class A--IQs of 13 Students 102 115 128 109 131 89 98106 140 119 9397 110 Y-barA = 110.54 Visit: Learnbay.co
  • 32. Variance The deviation of 102 from 110.54 is? Deviation of 115? 102 - 110.54 = -8.54 115 - 110.54 = 4.46 Class A--IQs of 13 Students 102 115 128 109 131 89 98106 140 119 9397 110 Y-barA = 110.54 Visit: Learnbay.co
  • 33. Variance l We want to add these to get total deviations, but if we were to do that, we would get zero every time. Why? l We need a way to eliminate negative signs. Squaring the deviations will eliminate negative signs... A Deviation Squared: (Yi – Y-bar)2 Back to the IQ example, A deviation squared for 102 is: of 115: (102 - 110.54)2 = (-8.54)2 = 72.93 (115 - 110.54)2 = (4.46)2 = 19.89 Visit: Learnbay.co
  • 34. Variance If you were to add all the squared deviations together, you’d get what we call the “Sum of Squares.” Sum of Squares (SS) = Σ (Yi – Y-bar)2 SS = (Y1 – Y-bar)2 + (Y2 – Y-bar)2 + . . . + (Yn – Y-bar)2 Visit: Learnbay.co
  • 35. Variance Class A, sum of squares: (102 – 110.54)2 + (115 – 110.54)2 + (126 – 110.54)2 + (109 – 110.54)2 + (131 – 110.54)2 + (89 – 110.54)2 + (98 – 110.54)2 + (106 – 110.54)2 + (140 – 110.54)2 + (119 – 110.54)2 + (93 – 110.54)2 + (97 – 110.54)2 + (110 – 110.54) = SS = 2825.39 Class A--IQs of 13 Students 102 115 128 109 131 89 98106 140 119 9397 110 Y-bar = 110.54 Visit: Learnbay.co
  • 36. Variance The last step… The approximate average sum of squares is the variance. SS/N = Variance for a population. SS/n-1 = Variance for a sample. Variance = Σ(Yi – Y-bar)2 / n – 1 Visit: Learnbay.co
  • 37. Variance For Class A, Variance = 2825.39 / n - 1 = 2825.39 / 12 = 235.45 How helpful is that??? Visit: Learnbay.co
  • 38. Standard Deviation To convert variance into something of meaning, let’s create standard deviation. The square root of the variance reveals the average deviation of the observations from the mean. s.d. = Σ(Yi – Y-bar)2 n - 1 Visit: Learnbay.co
  • 39. Standard Deviation For Class A, the standard deviation is: 235.45 = 15.34 The average of persons’ deviation from the mean IQ of 110.54 is 15.34 IQ points. Review: 1. Deviation 2. Deviation squared 3. Sum of squares 4. Variance 5. Standard deviation Visit: Learnbay.co
  • 40. Standard Deviation 1. Larger s.d. = greater amounts of variation around the mean. For example: 19 25 31 Y = 25 s.d. = 3 13 25 37 Y = 25 s.d. = 6 2. s.d. = 0 only when all values are the same (only when you have a constant and not a “variable”) If you were to “rescale” a variable, the s.d. would change by the same magnitude—if we changed units above so the mean equaled 250, the s.d. on the left would be 30, and on the right, 60 Like the mean, the s.d. will be inflated by an outlier case value. 3. 4. Visit: Learnbay.co
  • 41. Descriptive Statistics Summarizing Data: ü Central Tendency (or Groups’ “Middle Values”) ü ü ü Mean Median Mode ü Variation (or Summary of Differences Within Groups) ü ü ü ü Range Interquartile Range Variance Standard Deviation l …Wait! There’s more Visit: Learnbay.co
  • 42. Box-Plots A way to graphically portray almost all the descriptive statistics at once is the box-plot. A box-plot shows: Upper and lower quartiles Mean Median Range Outliers (1.5 IQR) Visit: Learnbay.co
  • 44. Descriptive Statistics l Now you are qualified use descriptive statistics! l Questions? Visit: Learnbay.co