SlideShare a Scribd company logo
Introduction
Summary Statistics
Definition
Summary statistics are used to summarize a
set of observations in order to
communicate as much as information
about the data as possible. It is part of
descriptive statistics and are used to
basically summarize or describe a set of
observations.
Rupak Roy
Example
The weight of the population are
45 kg
57kg
72 kg
52 kg
Now what we want here is the summary of
weight of the population , we can say it is the
average weight of the population is 56.5 kg and
now we can describe the population in the
simplest way as possible.
Rupak Roy
Types
Summary statistics
Measures of Central
Tendency
1 . Mean
2 . Median
3 . Mode
5 . Geometric Mean
Measures of
Dispersion
1. Standard
Deviation
2. Variance
3. Interquartile
Range
Others
1. Co efficient
2. Skewness
3. Kurtosis
4. Probability
Distributions.
5. Distribution plot
Rupak Roy
Definition
 Measures of central tendency : is the value that describes
which group of data clusters around a central value. In
simple words , it is a way to describe the center of a data
set. Again what is center of data ? A single number that
summarizes the entire dataset using techniques such as
mean/average or median of the dataset.
 Measures of Dispersion: “dispersion (also
called variability, scatter, or spread) is the extent to which
a distribution of data is stretched or squeezed.”
Here in the graph we can see the
distribution of data (assume population)
is more stretched at the right side
ranging from 50 to 80
Measures of Central Tendency
1. Mean : is the average of observations. Most effective
when data is not heavily skewed.
2. Median: represents the middle value of the dataset.
Useful for skewed data.
We will talk about skewed data in the upcoming
slides.
3. Mode: means max no of times the data has occurred.
4. Geometric mean: nth root of a product of n numbers.
It is used when we want to get the average rate of the
event and the event rate is determined by multiplication.
For example growth of a bank account per year in a
ABC bank is calculated by geometric mean since the
growth event rate is determined by multiplying the
amount of a bank account by the percentage of growth.
then we use geometric mean.
Rupak Roy
 Formula for calculating Geometric Mean
GM =
example: Geometric Mean of 23,56,66 ?
3 23 * 56*66
3 85008 = 43.9696761which means 3times of 43.9696761
is 85008
Note:
if one of the observation in the event is zero , Geometric
Mean becomes Zero and also it doesn’t works with
negative numbers like -1 , -4 , -5 and so on.
Rupak Roy
Calculation of Mode ; <- Delta
For ungrouped data = Max no of items
Example : 23,45,76,33,54,33,76,33 Therefore Mode = 33
For grouped data = = {(L + Delta 1) / Delta 1+Detal2 } * i
Where Delta 1 = f1 +f0
and Delta 2 = f1- f2
Nowadays, we don’t have to worry about the calculation, as in
any statistical software's like R, excel it will automatically calculate
the intense calculation for large amount of data but
for more in-depth information you can visit this website.
https://www.mathsisfun.com/data/frequency-grouped-mean-median-mode.html
Measures of dispersion
Standard Deviation is basically a measure of how near or far the
observations are from the mean.
Variance: the fact or quality of being different , divergent or
inconsistent. A value of zero means that there is no variability , all the
values in the data set are the same.
Interquartile Range: is a measure of variability ,
by dividing a data set into parts that is quartiles .
Say
Q1 is the middle value in the first half of the data set.
Q2 is the median value .
Q3 is the middle value in the second half of the
rank-ordered data set.
There interquartile range = Q3 – Q1
Skewness – refers to the lack of symmetry or imbalance in data
distribution.
In a symmetric distribution the data is
normally distributed where mean,
median, mode is at the same point.
However in real life data is never perfectly
distributed, hence we call it skewed data.
If the Left side has longer tail then the mass
distribution of data is concentrated on the right
side which is known as negatively skewed.
If the Right side has longer tail then the
mass distribution of data is
concentrated on the left side is
known as positive skewed.
Here is the summary of all the skewness as shown in the figure below.
Example (skewed data)
Temp(*c)
10
40
35
33
35
Mean = 153/5 = 30.6, if we apply mean is 30.6
which is incorrect since we can see maximum
number of values are above 35.
So we have to use median For Ungrouped
data ((n+1)/2)th
That will be ((5+1)/2)th = 6/2 = 3
i.e. 3th term ie 35.
For grouped data:
where L, lower class boundary of the group containing the group.
B, Cumulative frequency of the groups
G , Frequency of the median group
W , width/Range of the group
Again, we don’t have to worry about the calculation, as in any statistical software's like R
, excel it will automatically calculate the intense calculation for large amount of data
but for more in-depth information you can visit this website.
https://www.mathsisfun.com/data/frequency-grouped-mean-median-mode.html
Kurtosis : is a measure of whether data are peaked or flat relative to
normal distribution
(+) Leptokurtic
(-) PlatyKurtic
(0) Meskurtic
(+) Leptokurtic
This means the distribution is more clustered near the mean and has a
relativity less standard deviation
(-) PlatyKurtic
Where the distribution is less clustered around the mean and a standard
deviation more then Leptokurtic
(0) Meskurtic is typically measured with respect to the normal
distribution. Meskurtic has tails similar to normal distribution i.e neither
high nor low, rather it is consider to be a baseline for the other two’s.
 Now how to check the data is skewed or not
in Excel:
=skew(select the range of values/numbers)
=skew(10.24,9.48……….-0.42,-0.95)
= - 0.27 means Negatively skewed.
And to check the Kurtosis in Excel
=kurt(select the values/numbers)
=kurt(10.24,9.48……….-0.42,-0.95)
= -1.6 means it is PlatyKurtic
Recap
What we have learned ?
Measures of central tendency,
Measures of dispersion,
Measure of risk,
Next we will see how to compute this theory in
practical and analyze any data using our
everyday simple tools like Excel.
Rupak Roy
To be continued ………

More Related Content

What's hot

Central limit theorem
Central limit theoremCentral limit theorem
Central limit theorem
Nadeem Uddin
 
Frequency distribution 6
Frequency distribution 6Frequency distribution 6
Frequency distribution 6
NadeemShoukat3
 
Basics stat ppt-types of data
Basics stat ppt-types of dataBasics stat ppt-types of data
Basics stat ppt-types of data
Farhana Shaheen
 
Standard Deviation and Variance
Standard Deviation and VarianceStandard Deviation and Variance
Standard Deviation and Variance
Robbie Jule
 
Dr digs central tendency
Dr digs central tendencyDr digs central tendency
Dr digs central tendencydrdig
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statistics
Attaullah Khan
 
Inferential statistics.ppt
Inferential statistics.pptInferential statistics.ppt
Inferential statistics.pptNursing Path
 
Statistics "Descriptive & Inferential"
Statistics "Descriptive & Inferential"Statistics "Descriptive & Inferential"
Statistics "Descriptive & Inferential"
Dalia El-Shafei
 
Measures of dispersion
Measures of dispersionMeasures of dispersion
Measures of dispersion
Nilanjan Bhaumik
 
Measures of Central Tendency
Measures of Central TendencyMeasures of Central Tendency
Measures of Central Tendency
Nida Nafees
 
Measures of Relative Standing and Boxplots
Measures of Relative Standing and BoxplotsMeasures of Relative Standing and Boxplots
Measures of Relative Standing and Boxplots
Long Beach City College
 
Basic statistics for algorithmic trading
Basic statistics for algorithmic tradingBasic statistics for algorithmic trading
Basic statistics for algorithmic trading
QuantInsti
 
Normality
NormalityNormality
Statistics-Measures of dispersions
Statistics-Measures of dispersionsStatistics-Measures of dispersions
Statistics-Measures of dispersionsCapricorn
 
Basic Descriptive statistics
Basic Descriptive statisticsBasic Descriptive statistics
Basic Descriptive statistics
Ajendra Sharma
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statistics
Murugesan Kandan
 
Introduction to Statistics
Introduction to StatisticsIntroduction to Statistics
Introduction to Statistics
Robert Tinaro
 
Meaning and Importance of Statistics
Meaning and Importance of StatisticsMeaning and Importance of Statistics
Meaning and Importance of Statistics
Flipped Channel
 
Basic Statistical Concepts and Methods
Basic Statistical Concepts and MethodsBasic Statistical Concepts and Methods
Basic Statistical Concepts and Methods
Ahmed-Refat Refat
 
Hypothesis testing
Hypothesis testingHypothesis testing
Hypothesis testing
Kaori Kubo Germano, PhD
 

What's hot (20)

Central limit theorem
Central limit theoremCentral limit theorem
Central limit theorem
 
Frequency distribution 6
Frequency distribution 6Frequency distribution 6
Frequency distribution 6
 
Basics stat ppt-types of data
Basics stat ppt-types of dataBasics stat ppt-types of data
Basics stat ppt-types of data
 
Standard Deviation and Variance
Standard Deviation and VarianceStandard Deviation and Variance
Standard Deviation and Variance
 
Dr digs central tendency
Dr digs central tendencyDr digs central tendency
Dr digs central tendency
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statistics
 
Inferential statistics.ppt
Inferential statistics.pptInferential statistics.ppt
Inferential statistics.ppt
 
Statistics "Descriptive & Inferential"
Statistics "Descriptive & Inferential"Statistics "Descriptive & Inferential"
Statistics "Descriptive & Inferential"
 
Measures of dispersion
Measures of dispersionMeasures of dispersion
Measures of dispersion
 
Measures of Central Tendency
Measures of Central TendencyMeasures of Central Tendency
Measures of Central Tendency
 
Measures of Relative Standing and Boxplots
Measures of Relative Standing and BoxplotsMeasures of Relative Standing and Boxplots
Measures of Relative Standing and Boxplots
 
Basic statistics for algorithmic trading
Basic statistics for algorithmic tradingBasic statistics for algorithmic trading
Basic statistics for algorithmic trading
 
Normality
NormalityNormality
Normality
 
Statistics-Measures of dispersions
Statistics-Measures of dispersionsStatistics-Measures of dispersions
Statistics-Measures of dispersions
 
Basic Descriptive statistics
Basic Descriptive statisticsBasic Descriptive statistics
Basic Descriptive statistics
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statistics
 
Introduction to Statistics
Introduction to StatisticsIntroduction to Statistics
Introduction to Statistics
 
Meaning and Importance of Statistics
Meaning and Importance of StatisticsMeaning and Importance of Statistics
Meaning and Importance of Statistics
 
Basic Statistical Concepts and Methods
Basic Statistical Concepts and MethodsBasic Statistical Concepts and Methods
Basic Statistical Concepts and Methods
 
Hypothesis testing
Hypothesis testingHypothesis testing
Hypothesis testing
 

Similar to Summary statistics

Types of Statistics
Types of Statistics Types of Statistics
Types of Statistics
Rupak Roy
 
1 descriptive statistics
1 descriptive statistics1 descriptive statistics
1 descriptive statisticsSanu Kumar
 
Basic statistics
Basic statisticsBasic statistics
Descriptive Statistics.pptx
Descriptive Statistics.pptxDescriptive Statistics.pptx
Descriptive Statistics.pptx
Shashank Mishra
 
Descriptive Statistics: Measures of Central Tendency - Measures of Dispersion...
Descriptive Statistics: Measures of Central Tendency - Measures of Dispersion...Descriptive Statistics: Measures of Central Tendency - Measures of Dispersion...
Descriptive Statistics: Measures of Central Tendency - Measures of Dispersion...
EqraBaig
 
Statistics (GE 4 CLASS).pptx
Statistics (GE 4 CLASS).pptxStatistics (GE 4 CLASS).pptx
Statistics (GE 4 CLASS).pptx
YollyCalamba
 
Measures of central tendency
Measures of central tendencyMeasures of central tendency
Measures of central tendency
Mmedsc Hahm
 
Graphical presentation of data
Graphical presentation of dataGraphical presentation of data
Graphical presentation of datadrasifk
 
Lect 3 background mathematics
Lect 3 background mathematicsLect 3 background mathematics
Lect 3 background mathematics
hktripathy
 
QT1 - 03 - Measures of Central Tendency
QT1 - 03 - Measures of Central TendencyQT1 - 03 - Measures of Central Tendency
QT1 - 03 - Measures of Central Tendency
Prithwis Mukerjee
 
QT1 - 03 - Measures of Central Tendency
QT1 - 03 - Measures of Central TendencyQT1 - 03 - Measures of Central Tendency
QT1 - 03 - Measures of Central Tendency
Prithwis Mukerjee
 
Basic Statistical Concepts in Machine Learning.pptx
Basic Statistical Concepts in Machine Learning.pptxBasic Statistical Concepts in Machine Learning.pptx
Basic Statistical Concepts in Machine Learning.pptx
bajajrishabh96tech
 
Measures of Dispersion.pptx
Measures of Dispersion.pptxMeasures of Dispersion.pptx
Measures of Dispersion.pptx
Vanmala Buchke
 
3. measures of central tendency
3. measures of central tendency3. measures of central tendency
3. measures of central tendency
renz50
 
Lect 3 background mathematics for Data Mining
Lect 3 background mathematics for Data MiningLect 3 background mathematics for Data Mining
Lect 3 background mathematics for Data Mining
hktripathy
 
Central tendancy 4
Central tendancy 4Central tendancy 4
Central tendancy 4
Sundar B N
 
Basic Statistical Descriptions of Data.pptx
Basic Statistical Descriptions of Data.pptxBasic Statistical Descriptions of Data.pptx
Basic Statistical Descriptions of Data.pptx
Anusuya123
 
SUMMARY MEASURES.pdf
SUMMARY MEASURES.pdfSUMMARY MEASURES.pdf
SUMMARY MEASURES.pdf
GillaMarieLeopardas1
 
Basic statisctis -Anandh Shankar
Basic statisctis -Anandh ShankarBasic statisctis -Anandh Shankar
Basic statisctis -Anandh Shankar
Anandh Shankar Sundararajan
 
Chapter 4 MMW.pdf
Chapter 4 MMW.pdfChapter 4 MMW.pdf
Chapter 4 MMW.pdf
RaRaRamirez
 

Similar to Summary statistics (20)

Types of Statistics
Types of Statistics Types of Statistics
Types of Statistics
 
1 descriptive statistics
1 descriptive statistics1 descriptive statistics
1 descriptive statistics
 
Basic statistics
Basic statisticsBasic statistics
Basic statistics
 
Descriptive Statistics.pptx
Descriptive Statistics.pptxDescriptive Statistics.pptx
Descriptive Statistics.pptx
 
Descriptive Statistics: Measures of Central Tendency - Measures of Dispersion...
Descriptive Statistics: Measures of Central Tendency - Measures of Dispersion...Descriptive Statistics: Measures of Central Tendency - Measures of Dispersion...
Descriptive Statistics: Measures of Central Tendency - Measures of Dispersion...
 
Statistics (GE 4 CLASS).pptx
Statistics (GE 4 CLASS).pptxStatistics (GE 4 CLASS).pptx
Statistics (GE 4 CLASS).pptx
 
Measures of central tendency
Measures of central tendencyMeasures of central tendency
Measures of central tendency
 
Graphical presentation of data
Graphical presentation of dataGraphical presentation of data
Graphical presentation of data
 
Lect 3 background mathematics
Lect 3 background mathematicsLect 3 background mathematics
Lect 3 background mathematics
 
QT1 - 03 - Measures of Central Tendency
QT1 - 03 - Measures of Central TendencyQT1 - 03 - Measures of Central Tendency
QT1 - 03 - Measures of Central Tendency
 
QT1 - 03 - Measures of Central Tendency
QT1 - 03 - Measures of Central TendencyQT1 - 03 - Measures of Central Tendency
QT1 - 03 - Measures of Central Tendency
 
Basic Statistical Concepts in Machine Learning.pptx
Basic Statistical Concepts in Machine Learning.pptxBasic Statistical Concepts in Machine Learning.pptx
Basic Statistical Concepts in Machine Learning.pptx
 
Measures of Dispersion.pptx
Measures of Dispersion.pptxMeasures of Dispersion.pptx
Measures of Dispersion.pptx
 
3. measures of central tendency
3. measures of central tendency3. measures of central tendency
3. measures of central tendency
 
Lect 3 background mathematics for Data Mining
Lect 3 background mathematics for Data MiningLect 3 background mathematics for Data Mining
Lect 3 background mathematics for Data Mining
 
Central tendancy 4
Central tendancy 4Central tendancy 4
Central tendancy 4
 
Basic Statistical Descriptions of Data.pptx
Basic Statistical Descriptions of Data.pptxBasic Statistical Descriptions of Data.pptx
Basic Statistical Descriptions of Data.pptx
 
SUMMARY MEASURES.pdf
SUMMARY MEASURES.pdfSUMMARY MEASURES.pdf
SUMMARY MEASURES.pdf
 
Basic statisctis -Anandh Shankar
Basic statisctis -Anandh ShankarBasic statisctis -Anandh Shankar
Basic statisctis -Anandh Shankar
 
Chapter 4 MMW.pdf
Chapter 4 MMW.pdfChapter 4 MMW.pdf
Chapter 4 MMW.pdf
 

More from Rupak Roy

Hierarchical Clustering - Text Mining/NLP
Hierarchical Clustering - Text Mining/NLPHierarchical Clustering - Text Mining/NLP
Hierarchical Clustering - Text Mining/NLP
Rupak Roy
 
Clustering K means and Hierarchical - NLP
Clustering K means and Hierarchical - NLPClustering K means and Hierarchical - NLP
Clustering K means and Hierarchical - NLP
Rupak Roy
 
Network Analysis - NLP
Network Analysis  - NLPNetwork Analysis  - NLP
Network Analysis - NLP
Rupak Roy
 
Topic Modeling - NLP
Topic Modeling - NLPTopic Modeling - NLP
Topic Modeling - NLP
Rupak Roy
 
Sentiment Analysis Practical Steps
Sentiment Analysis Practical StepsSentiment Analysis Practical Steps
Sentiment Analysis Practical Steps
Rupak Roy
 
NLP - Sentiment Analysis
NLP - Sentiment AnalysisNLP - Sentiment Analysis
NLP - Sentiment Analysis
Rupak Roy
 
Text Mining using Regular Expressions
Text Mining using Regular ExpressionsText Mining using Regular Expressions
Text Mining using Regular Expressions
Rupak Roy
 
Introduction to Text Mining
Introduction to Text Mining Introduction to Text Mining
Introduction to Text Mining
Rupak Roy
 
Apache Hbase Architecture
Apache Hbase ArchitectureApache Hbase Architecture
Apache Hbase Architecture
Rupak Roy
 
Introduction to Hbase
Introduction to Hbase Introduction to Hbase
Introduction to Hbase
Rupak Roy
 
Apache Hive Table Partition and HQL
Apache Hive Table Partition and HQLApache Hive Table Partition and HQL
Apache Hive Table Partition and HQL
Rupak Roy
 
Installing Apache Hive, internal and external table, import-export
Installing Apache Hive, internal and external table, import-export Installing Apache Hive, internal and external table, import-export
Installing Apache Hive, internal and external table, import-export
Rupak Roy
 
Introductive to Hive
Introductive to Hive Introductive to Hive
Introductive to Hive
Rupak Roy
 
Scoop Job, import and export to RDBMS
Scoop Job, import and export to RDBMSScoop Job, import and export to RDBMS
Scoop Job, import and export to RDBMS
Rupak Roy
 
Apache Scoop - Import with Append mode and Last Modified mode
Apache Scoop - Import with Append mode and Last Modified mode Apache Scoop - Import with Append mode and Last Modified mode
Apache Scoop - Import with Append mode and Last Modified mode
Rupak Roy
 
Introduction to scoop and its functions
Introduction to scoop and its functionsIntroduction to scoop and its functions
Introduction to scoop and its functions
Rupak Roy
 
Introduction to Flume
Introduction to FlumeIntroduction to Flume
Introduction to Flume
Rupak Roy
 
Apache Pig Relational Operators - II
Apache Pig Relational Operators - II Apache Pig Relational Operators - II
Apache Pig Relational Operators - II
Rupak Roy
 
Passing Parameters using File and Command Line
Passing Parameters using File and Command LinePassing Parameters using File and Command Line
Passing Parameters using File and Command Line
Rupak Roy
 
Apache PIG Relational Operations
Apache PIG Relational Operations Apache PIG Relational Operations
Apache PIG Relational Operations
Rupak Roy
 

More from Rupak Roy (20)

Hierarchical Clustering - Text Mining/NLP
Hierarchical Clustering - Text Mining/NLPHierarchical Clustering - Text Mining/NLP
Hierarchical Clustering - Text Mining/NLP
 
Clustering K means and Hierarchical - NLP
Clustering K means and Hierarchical - NLPClustering K means and Hierarchical - NLP
Clustering K means and Hierarchical - NLP
 
Network Analysis - NLP
Network Analysis  - NLPNetwork Analysis  - NLP
Network Analysis - NLP
 
Topic Modeling - NLP
Topic Modeling - NLPTopic Modeling - NLP
Topic Modeling - NLP
 
Sentiment Analysis Practical Steps
Sentiment Analysis Practical StepsSentiment Analysis Practical Steps
Sentiment Analysis Practical Steps
 
NLP - Sentiment Analysis
NLP - Sentiment AnalysisNLP - Sentiment Analysis
NLP - Sentiment Analysis
 
Text Mining using Regular Expressions
Text Mining using Regular ExpressionsText Mining using Regular Expressions
Text Mining using Regular Expressions
 
Introduction to Text Mining
Introduction to Text Mining Introduction to Text Mining
Introduction to Text Mining
 
Apache Hbase Architecture
Apache Hbase ArchitectureApache Hbase Architecture
Apache Hbase Architecture
 
Introduction to Hbase
Introduction to Hbase Introduction to Hbase
Introduction to Hbase
 
Apache Hive Table Partition and HQL
Apache Hive Table Partition and HQLApache Hive Table Partition and HQL
Apache Hive Table Partition and HQL
 
Installing Apache Hive, internal and external table, import-export
Installing Apache Hive, internal and external table, import-export Installing Apache Hive, internal and external table, import-export
Installing Apache Hive, internal and external table, import-export
 
Introductive to Hive
Introductive to Hive Introductive to Hive
Introductive to Hive
 
Scoop Job, import and export to RDBMS
Scoop Job, import and export to RDBMSScoop Job, import and export to RDBMS
Scoop Job, import and export to RDBMS
 
Apache Scoop - Import with Append mode and Last Modified mode
Apache Scoop - Import with Append mode and Last Modified mode Apache Scoop - Import with Append mode and Last Modified mode
Apache Scoop - Import with Append mode and Last Modified mode
 
Introduction to scoop and its functions
Introduction to scoop and its functionsIntroduction to scoop and its functions
Introduction to scoop and its functions
 
Introduction to Flume
Introduction to FlumeIntroduction to Flume
Introduction to Flume
 
Apache Pig Relational Operators - II
Apache Pig Relational Operators - II Apache Pig Relational Operators - II
Apache Pig Relational Operators - II
 
Passing Parameters using File and Command Line
Passing Parameters using File and Command LinePassing Parameters using File and Command Line
Passing Parameters using File and Command Line
 
Apache PIG Relational Operations
Apache PIG Relational Operations Apache PIG Relational Operations
Apache PIG Relational Operations
 

Recently uploaded

Chapter -12, Antibiotics (One Page Notes).pdf
Chapter -12, Antibiotics (One Page Notes).pdfChapter -12, Antibiotics (One Page Notes).pdf
Chapter -12, Antibiotics (One Page Notes).pdf
Kartik Tiwari
 
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
Levi Shapiro
 
Unit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdfUnit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdf
Thiyagu K
 
Unit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdfUnit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdf
Thiyagu K
 
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
Dr. Vinod Kumar Kanvaria
 
Normal Labour/ Stages of Labour/ Mechanism of Labour
Normal Labour/ Stages of Labour/ Mechanism of LabourNormal Labour/ Stages of Labour/ Mechanism of Labour
Normal Labour/ Stages of Labour/ Mechanism of Labour
Wasim Ak
 
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
Nguyen Thanh Tu Collection
 
The approach at University of Liverpool.pptx
The approach at University of Liverpool.pptxThe approach at University of Liverpool.pptx
The approach at University of Liverpool.pptx
Jisc
 
Advantages and Disadvantages of CMS from an SEO Perspective
Advantages and Disadvantages of CMS from an SEO PerspectiveAdvantages and Disadvantages of CMS from an SEO Perspective
Advantages and Disadvantages of CMS from an SEO Perspective
Krisztián Száraz
 
The Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official PublicationThe Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official Publication
Delapenabediema
 
"Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe..."Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe...
SACHIN R KONDAGURI
 
How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17
Celine George
 
How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...
Jisc
 
Model Attribute Check Company Auto Property
Model Attribute  Check Company Auto PropertyModel Attribute  Check Company Auto Property
Model Attribute Check Company Auto Property
Celine George
 
Supporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptxSupporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptx
Jisc
 
Pride Month Slides 2024 David Douglas School District
Pride Month Slides 2024 David Douglas School DistrictPride Month Slides 2024 David Douglas School District
Pride Month Slides 2024 David Douglas School District
David Douglas School District
 
Overview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with MechanismOverview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with Mechanism
DeeptiGupta154
 
2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...
Sandy Millin
 
Biological Screening of Herbal Drugs in detailed.
Biological Screening of Herbal Drugs in detailed.Biological Screening of Herbal Drugs in detailed.
Biological Screening of Herbal Drugs in detailed.
Ashokrao Mane college of Pharmacy Peth-Vadgaon
 
Acetabularia Information For Class 9 .docx
Acetabularia Information For Class 9  .docxAcetabularia Information For Class 9  .docx
Acetabularia Information For Class 9 .docx
vaibhavrinwa19
 

Recently uploaded (20)

Chapter -12, Antibiotics (One Page Notes).pdf
Chapter -12, Antibiotics (One Page Notes).pdfChapter -12, Antibiotics (One Page Notes).pdf
Chapter -12, Antibiotics (One Page Notes).pdf
 
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
 
Unit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdfUnit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdf
 
Unit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdfUnit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdf
 
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
 
Normal Labour/ Stages of Labour/ Mechanism of Labour
Normal Labour/ Stages of Labour/ Mechanism of LabourNormal Labour/ Stages of Labour/ Mechanism of Labour
Normal Labour/ Stages of Labour/ Mechanism of Labour
 
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
 
The approach at University of Liverpool.pptx
The approach at University of Liverpool.pptxThe approach at University of Liverpool.pptx
The approach at University of Liverpool.pptx
 
Advantages and Disadvantages of CMS from an SEO Perspective
Advantages and Disadvantages of CMS from an SEO PerspectiveAdvantages and Disadvantages of CMS from an SEO Perspective
Advantages and Disadvantages of CMS from an SEO Perspective
 
The Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official PublicationThe Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official Publication
 
"Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe..."Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe...
 
How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17
 
How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...
 
Model Attribute Check Company Auto Property
Model Attribute  Check Company Auto PropertyModel Attribute  Check Company Auto Property
Model Attribute Check Company Auto Property
 
Supporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptxSupporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptx
 
Pride Month Slides 2024 David Douglas School District
Pride Month Slides 2024 David Douglas School DistrictPride Month Slides 2024 David Douglas School District
Pride Month Slides 2024 David Douglas School District
 
Overview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with MechanismOverview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with Mechanism
 
2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...
 
Biological Screening of Herbal Drugs in detailed.
Biological Screening of Herbal Drugs in detailed.Biological Screening of Herbal Drugs in detailed.
Biological Screening of Herbal Drugs in detailed.
 
Acetabularia Information For Class 9 .docx
Acetabularia Information For Class 9  .docxAcetabularia Information For Class 9  .docx
Acetabularia Information For Class 9 .docx
 

Summary statistics

  • 2. Definition Summary statistics are used to summarize a set of observations in order to communicate as much as information about the data as possible. It is part of descriptive statistics and are used to basically summarize or describe a set of observations. Rupak Roy
  • 3. Example The weight of the population are 45 kg 57kg 72 kg 52 kg Now what we want here is the summary of weight of the population , we can say it is the average weight of the population is 56.5 kg and now we can describe the population in the simplest way as possible. Rupak Roy
  • 4. Types Summary statistics Measures of Central Tendency 1 . Mean 2 . Median 3 . Mode 5 . Geometric Mean Measures of Dispersion 1. Standard Deviation 2. Variance 3. Interquartile Range Others 1. Co efficient 2. Skewness 3. Kurtosis 4. Probability Distributions. 5. Distribution plot Rupak Roy
  • 5. Definition  Measures of central tendency : is the value that describes which group of data clusters around a central value. In simple words , it is a way to describe the center of a data set. Again what is center of data ? A single number that summarizes the entire dataset using techniques such as mean/average or median of the dataset.  Measures of Dispersion: “dispersion (also called variability, scatter, or spread) is the extent to which a distribution of data is stretched or squeezed.” Here in the graph we can see the distribution of data (assume population) is more stretched at the right side ranging from 50 to 80
  • 6. Measures of Central Tendency 1. Mean : is the average of observations. Most effective when data is not heavily skewed. 2. Median: represents the middle value of the dataset. Useful for skewed data. We will talk about skewed data in the upcoming slides. 3. Mode: means max no of times the data has occurred. 4. Geometric mean: nth root of a product of n numbers. It is used when we want to get the average rate of the event and the event rate is determined by multiplication. For example growth of a bank account per year in a ABC bank is calculated by geometric mean since the growth event rate is determined by multiplying the amount of a bank account by the percentage of growth. then we use geometric mean. Rupak Roy
  • 7.  Formula for calculating Geometric Mean GM = example: Geometric Mean of 23,56,66 ? 3 23 * 56*66 3 85008 = 43.9696761which means 3times of 43.9696761 is 85008 Note: if one of the observation in the event is zero , Geometric Mean becomes Zero and also it doesn’t works with negative numbers like -1 , -4 , -5 and so on. Rupak Roy
  • 8. Calculation of Mode ; <- Delta For ungrouped data = Max no of items Example : 23,45,76,33,54,33,76,33 Therefore Mode = 33 For grouped data = = {(L + Delta 1) / Delta 1+Detal2 } * i Where Delta 1 = f1 +f0 and Delta 2 = f1- f2 Nowadays, we don’t have to worry about the calculation, as in any statistical software's like R, excel it will automatically calculate the intense calculation for large amount of data but for more in-depth information you can visit this website. https://www.mathsisfun.com/data/frequency-grouped-mean-median-mode.html
  • 9. Measures of dispersion Standard Deviation is basically a measure of how near or far the observations are from the mean. Variance: the fact or quality of being different , divergent or inconsistent. A value of zero means that there is no variability , all the values in the data set are the same. Interquartile Range: is a measure of variability , by dividing a data set into parts that is quartiles . Say Q1 is the middle value in the first half of the data set. Q2 is the median value . Q3 is the middle value in the second half of the rank-ordered data set. There interquartile range = Q3 – Q1
  • 10. Skewness – refers to the lack of symmetry or imbalance in data distribution. In a symmetric distribution the data is normally distributed where mean, median, mode is at the same point. However in real life data is never perfectly distributed, hence we call it skewed data. If the Left side has longer tail then the mass distribution of data is concentrated on the right side which is known as negatively skewed.
  • 11. If the Right side has longer tail then the mass distribution of data is concentrated on the left side is known as positive skewed. Here is the summary of all the skewness as shown in the figure below.
  • 12. Example (skewed data) Temp(*c) 10 40 35 33 35 Mean = 153/5 = 30.6, if we apply mean is 30.6 which is incorrect since we can see maximum number of values are above 35. So we have to use median For Ungrouped data ((n+1)/2)th That will be ((5+1)/2)th = 6/2 = 3 i.e. 3th term ie 35. For grouped data: where L, lower class boundary of the group containing the group. B, Cumulative frequency of the groups G , Frequency of the median group W , width/Range of the group Again, we don’t have to worry about the calculation, as in any statistical software's like R , excel it will automatically calculate the intense calculation for large amount of data but for more in-depth information you can visit this website. https://www.mathsisfun.com/data/frequency-grouped-mean-median-mode.html
  • 13. Kurtosis : is a measure of whether data are peaked or flat relative to normal distribution (+) Leptokurtic (-) PlatyKurtic (0) Meskurtic (+) Leptokurtic This means the distribution is more clustered near the mean and has a relativity less standard deviation (-) PlatyKurtic Where the distribution is less clustered around the mean and a standard deviation more then Leptokurtic (0) Meskurtic is typically measured with respect to the normal distribution. Meskurtic has tails similar to normal distribution i.e neither high nor low, rather it is consider to be a baseline for the other two’s.
  • 14.  Now how to check the data is skewed or not in Excel: =skew(select the range of values/numbers) =skew(10.24,9.48……….-0.42,-0.95) = - 0.27 means Negatively skewed. And to check the Kurtosis in Excel =kurt(select the values/numbers) =kurt(10.24,9.48……….-0.42,-0.95) = -1.6 means it is PlatyKurtic
  • 15. Recap What we have learned ? Measures of central tendency, Measures of dispersion, Measure of risk, Next we will see how to compute this theory in practical and analyze any data using our everyday simple tools like Excel. Rupak Roy
  • 16. To be continued ………