SlideShare a Scribd company logo
Research needs good understanding of data analysis
Vikash Raj Satyal
(Vikash@kusom.edu.np)
Summarize your Data:
What to look in the dataset?
If our study have a large data set, we
(researcher) are interested to know :-
• What the central value is,
• What is the spread from center,
• What is the shape & size of data
distribution
Major economic dataset
Questions
• What is percapita GDP?
• Whose percapita GDP is this?
• Did you earn $1191 in this FY
142920(Rs.126,018)? (Rs11,910monthly)
• Nepali people earn about
55 times low percapita GDP
than USA, and
165 times lower than
Monaco people
Nepali Database
Research Paradigm
3. Survey(Collect data)
4. Statistical analysis
5. There is not enough evidence to
support research(alternative)
hypothesis(HA)
6. Res. Hypo accepted
HA is true
= Failure of research hypo.
7. Report writing
1. Setup research
hypo/Refine(Lit Review)
2. Develop instruments
5a. Report writing
7
Why Dolpa &
Mugu also have
highest annual
growth rate?
Why Achham,
Palpa has one of
the lowest
growth rate?
Mugu
Dolpa
• What is the general IQ of US university students?
• In the US the mean IQ for persons completing no more than a…..
• Bachelor’s degree 113 (80th centile)
• Master’s degree 117 (87th centile)
• PhD, LLD, MD 124 (95th centile)
Central Tendency in large sample data
In any large data set, data are
clustered around center. So
researchers focus to find out
that central value.
Depending on the shape of the
data distribution center is
calculated differently using
different statistical formula
When….
Statistical way of measuring
the center of a data set
•Mean(AM, GM, HM, Weighted mean)
•Median
•Mode
•Partition values
Median not mean, for:
(i) Open End Classes.
(ii) unequal class interval data table.
(ii) When data has several extreme values(outliers).
(iii) qualitative data( in frequency).
(IV) When data strongly lack normality
Quantiles
or Partition values
•Quartiles (3)
•Deciles (9)
•Percentiles (99)
• Quintile(5) (Not Quantile)
Mode is most frequently occurring value
• Less used
• Popular in business and industry
• Only way to locate central value when data is nominal
(How many type A sold? most preferred flavor of ice cream)
Mode & symmetry
Which Average is better?
AM is best for interval data, however it should not be used :
• For highly skewed data
• in open end classes.
• When there are very large and very small items(outliers).
• In case of average ratio and rate of change.
Median is the best average for:
• open end classes
• Skewed data or in presence of outliers
• For ordinal qualitative data eg.: less honest, honest, very honest
Mode is used for qualitative nominal data frequently used in
business and industry
Does Shape and Size of the data matters?
• Elongation of left or right tail is Skewness
• skewness described dataset’s symmetry – or lack of
symmetry.
• A perfectly symmetrical data set will have a skewness of
0.
Skewness • Negative (left) skewness indicates more small values(on left tail)
• Positive (right) skewness indicates more large values(on right tail)
• kurtosis measures extreme values in either tail.
• Normal curve has no Kurtosis
• Kurtosis is measured comparing
the Normal curve
Calculation in EXCEL:
Statistics for lexp(life expectancy) using NHDR2014.xlsx
Calculation in EXCEL:
Statistics for price using auto.xlsx
Use data, nhdr2014 to calculate the following
1. Average life expectancy (‘life’)
2. Average gdp percapita (‘income’)
3. Average life expectancy (‘life’) of 3 ecologies (eg, average life(mountain)= …. )
4. Calculate Q1, Q2, Q3 of ‘income’
5. Using 3 quartiles of ‘income’ we can divide any other data in 4 equal parts.
Make a new variable, call it ‘groups’, that will have 4 value-labels according to
below criteria:
‘poor’ if below Q1
‘below average’, if between Q1 to Q2,
‘above average’, if between Q2 to Q3
‘rich’ if above Q3
6. Find the average of ‘life’ & ‘hdi’ for this newly created variable with 4 groups
7. How many ‘districts’ falls in each of these ‘groups’? And which district has the
highest & lowest ‘life’ value that falls in each of these 4 ‘groups’?
8. Save this data for your future use
Dispersion in data is meaningful
Central value alone can disguise the picture
Variability is beauty of the wild nature
•Geographical variation generates
variety in species of flora and fauna
•Ethnography –cultural diversity
•Epidemiology treats variation in
disease
How to measure data dispersion?
Range
Standard Deviation
Quartile Deviation
Coefficient of variation
1. Range
Range= Largest value – Smallest value
•High Range in temperature acts for desertification
•Range of mobile sets
•Range of social disparity
2. Quartile deviation (semi inter quartile range)
• Inter-quartile Range = Q3 – Q1
• Quartile deviation (Q.D.) :
𝑸. 𝑫. =
𝟑𝟑. 𝟐𝟓 − 𝟏𝟑. 𝟖𝟔
𝟐
= 𝟗. 𝟔𝟗
3. Variance & Standard Deviation
•Most popular measure of variation
•It uses all observations
•Std(standard deviation) is the square root of variance
•Std = 𝑣𝑎𝑟𝑖𝑎𝑛𝑐𝑒
Sample VS population VARIANCE
For Papulation
s2 =
(𝑋−𝑋 )²
𝑛
=
𝑋²
𝑛
−
𝑋
𝑛
2
(individual data)
s2 =
𝑓(𝑋−𝑋 )²
𝑁
=
𝑓𝑋²
𝑁
−
𝑓𝑋
𝑁
2
Grouped data
For sample
S2 =
(𝑋−𝑋 )²
𝑛−1
Also, S2 =
𝑛
𝑛−1
s2
S2 =
𝑛
𝑛−1
s2
=
𝑛
𝑛 − 1
𝑓𝑋2
𝑁
−
𝑓𝑋
𝑁
2
When n  ∞ , sample mean  population mean
Example: Variance and std of the life
of electric bulbs(in hours)
Length of life No. of bulbs
500–700 5
700–900 11
900–1100 26
1100–1300 10
1300–1500 8
Length of
life
No. of
bulbs
mid-
value
f X fx fx2
500–700 5 600 3000 1800000
700–900 11 800 8800 7040000
900–1100 26 1000 26000 26000000
1100–1300 10 1200 12000 14400000
1300–1500 8 1400 11200 15680000
SUM 60 61000 64920000
Mean = 1016.67
Variance = 48388.89
Std = 219.9747
4. Coefficient of Variation(C.V.)
The co-efficient of variation is the relative measure based on the
standard deviation and is defined as the ratio of the standard
deviation to the mean expressed in percent.
C.V. =
𝜎
μ
x100%
It is used to compare the compactness of two or more data
Smaller C.V. indicates consistent or less variable data
C.V. is unit-less so data in same or different units can be compared
by it. eg. Weights in KG and in Pounds
Which type of electric bulbs has better consistency in life span?
Length of life
No. of
bulbs(alpha, a)
No. of
bulbs(beta, b)
fa fb
500–700 5 4
700–900 11 30
900–1100 26 12
1100–1300 10 8
1300–1500 8 6
Length of life
# bulbs
(alpha, a)
# bulbs
(beta, b)
Mid-value
fa fb X Xfa Xfb X2fa X2fb
500–700 5 4 600 3000 2400 1800000 1440000
700–900 11 30 800 8800 24000 7040000 19200000
900–1100 26 12 1000 26000 12000 26000000 12000000
1100–1300 10 8 1200 12000 9600 14400000 11520000
1300–1500 8 6 1400 11200 8400 15680000 11760000
SUM 60 60 61000 56400 64920000 55920000
mean(a) 1016.7 mean(b) 940.0
std(a)= 220.0 std(b)= 220.0
CV(a) 21.64% CV(b) 23.4%
Hans Rosling
(27 July 1948 – 7 February 2017)
most admired TED shows
Swedish epidemiologist with high data exploratory power
Gapminder foundation
2014 second time in Nepal from UNESCO
How not to be ignorant /The Joy of Statistics
( first 5 minutes of the total 1 hours Video)
http://www.gapminder.org/videos/the-joy-of-stats/
Thanks

More Related Content

Similar to Slide 1zjzckkasjasfjsajkfakjlasfasajfdfjaksdffj.pdf

Lec 3.pptx
Lec 3.pptxLec 3.pptx
Lec 3.pptx
AliAkbar99386
 
Intro to Statistics.pptx
Intro to Statistics.pptxIntro to Statistics.pptx
Intro to Statistics.pptx
Elyada Wigati Pramaresti
 
Working with Numerical Data
Working with  Numerical DataWorking with  Numerical Data
Working with Numerical Data
Global Polis
 
Statistics
StatisticsStatistics
Statistics
Eran Earland
 
Descriptive Statistics and Data Visualization
Descriptive Statistics and Data VisualizationDescriptive Statistics and Data Visualization
Descriptive Statistics and Data Visualization
Douglas Joubert
 
Making sense of numbers - a half-day workshop
Making sense of numbers - a half-day workshopMaking sense of numbers - a half-day workshop
Making sense of numbers - a half-day workshop
Improvement Skills Consulting Ltd.
 
2. basic statistics
2. basic statistics2. basic statistics
2. basic statistics
Steve Saffhill
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statistics
Anand Thokal
 
Sampling Size
Sampling SizeSampling Size
Sampling Size
Daniel Moore
 
Statistics final seminar
Statistics final seminarStatistics final seminar
Statistics final seminar
Tejas Jagtap
 
Quantitative Methods
Quantitative MethodsQuantitative Methods
Quantitative Methods
atrantham
 
Standard deviation
Standard deviationStandard deviation
Standard deviation
M K
 
data analysis in Statistics-2023 guide 2023
data analysis in Statistics-2023 guide 2023data analysis in Statistics-2023 guide 2023
data analysis in Statistics-2023 guide 2023
ayesha455941
 
Statistics (GE 4 CLASS).pptx
Statistics (GE 4 CLASS).pptxStatistics (GE 4 CLASS).pptx
Statistics (GE 4 CLASS).pptx
YollyCalamba
 
Torturing numbers - Descriptive Statistics for Growers (2013)
Torturing numbers - Descriptive Statistics for Growers (2013)Torturing numbers - Descriptive Statistics for Growers (2013)
Torturing numbers - Descriptive Statistics for Growers (2013)
jasondeveau
 
EDUCATIONAL STATISTICS_Unit_I.ppt
EDUCATIONAL STATISTICS_Unit_I.pptEDUCATIONAL STATISTICS_Unit_I.ppt
EDUCATIONAL STATISTICS_Unit_I.ppt
Sasi Kumar
 
Spss basic Dr Marwa Zalat
Spss basic Dr Marwa ZalatSpss basic Dr Marwa Zalat
Spss basic Dr Marwa Zalat
Marwa Zalat
 
Intro to Biostat. ppt
Intro to Biostat. pptIntro to Biostat. ppt
Intro to Biostat. ppt
AhmadYarSukhera
 
Confidence Intervals in the Life Sciences PresentationNamesS.docx
Confidence Intervals in the Life Sciences PresentationNamesS.docxConfidence Intervals in the Life Sciences PresentationNamesS.docx
Confidence Intervals in the Life Sciences PresentationNamesS.docx
maxinesmith73660
 
Statistics and Public Health. Curso de Inglés Técnico para profesionales de S...
Statistics and Public Health. Curso de Inglés Técnico para profesionales de S...Statistics and Public Health. Curso de Inglés Técnico para profesionales de S...
Statistics and Public Health. Curso de Inglés Técnico para profesionales de S...
Universidad Particular de Loja
 

Similar to Slide 1zjzckkasjasfjsajkfakjlasfasajfdfjaksdffj.pdf (20)

Lec 3.pptx
Lec 3.pptxLec 3.pptx
Lec 3.pptx
 
Intro to Statistics.pptx
Intro to Statistics.pptxIntro to Statistics.pptx
Intro to Statistics.pptx
 
Working with Numerical Data
Working with  Numerical DataWorking with  Numerical Data
Working with Numerical Data
 
Statistics
StatisticsStatistics
Statistics
 
Descriptive Statistics and Data Visualization
Descriptive Statistics and Data VisualizationDescriptive Statistics and Data Visualization
Descriptive Statistics and Data Visualization
 
Making sense of numbers - a half-day workshop
Making sense of numbers - a half-day workshopMaking sense of numbers - a half-day workshop
Making sense of numbers - a half-day workshop
 
2. basic statistics
2. basic statistics2. basic statistics
2. basic statistics
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statistics
 
Sampling Size
Sampling SizeSampling Size
Sampling Size
 
Statistics final seminar
Statistics final seminarStatistics final seminar
Statistics final seminar
 
Quantitative Methods
Quantitative MethodsQuantitative Methods
Quantitative Methods
 
Standard deviation
Standard deviationStandard deviation
Standard deviation
 
data analysis in Statistics-2023 guide 2023
data analysis in Statistics-2023 guide 2023data analysis in Statistics-2023 guide 2023
data analysis in Statistics-2023 guide 2023
 
Statistics (GE 4 CLASS).pptx
Statistics (GE 4 CLASS).pptxStatistics (GE 4 CLASS).pptx
Statistics (GE 4 CLASS).pptx
 
Torturing numbers - Descriptive Statistics for Growers (2013)
Torturing numbers - Descriptive Statistics for Growers (2013)Torturing numbers - Descriptive Statistics for Growers (2013)
Torturing numbers - Descriptive Statistics for Growers (2013)
 
EDUCATIONAL STATISTICS_Unit_I.ppt
EDUCATIONAL STATISTICS_Unit_I.pptEDUCATIONAL STATISTICS_Unit_I.ppt
EDUCATIONAL STATISTICS_Unit_I.ppt
 
Spss basic Dr Marwa Zalat
Spss basic Dr Marwa ZalatSpss basic Dr Marwa Zalat
Spss basic Dr Marwa Zalat
 
Intro to Biostat. ppt
Intro to Biostat. pptIntro to Biostat. ppt
Intro to Biostat. ppt
 
Confidence Intervals in the Life Sciences PresentationNamesS.docx
Confidence Intervals in the Life Sciences PresentationNamesS.docxConfidence Intervals in the Life Sciences PresentationNamesS.docx
Confidence Intervals in the Life Sciences PresentationNamesS.docx
 
Statistics and Public Health. Curso de Inglés Técnico para profesionales de S...
Statistics and Public Health. Curso de Inglés Técnico para profesionales de S...Statistics and Public Health. Curso de Inglés Técnico para profesionales de S...
Statistics and Public Health. Curso de Inglés Técnico para profesionales de S...
 

More from BirBetalMatketing

01 Chapterrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrr.ppt
01 Chapterrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrr.ppt01 Chapterrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrr.ppt
01 Chapterrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrr.ppt
BirBetalMatketing
 
Chapter 11 Case Solutionnnnnnnnnnnn.pptx
Chapter 11 Case Solutionnnnnnnnnnnn.pptxChapter 11 Case Solutionnnnnnnnnnnn.pptx
Chapter 11 Case Solutionnnnnnnnnnnn.pptx
BirBetalMatketing
 
Chapter 2 Literature reviewwwwwww OK.pptx
Chapter 2 Literature reviewwwwwww OK.pptxChapter 2 Literature reviewwwwwww OK.pptx
Chapter 2 Literature reviewwwwwww OK.pptx
BirBetalMatketing
 
Chapter 1 The Nature and Context of Social Research.pptx
Chapter 1 The Nature and Context of Social Research.pptxChapter 1 The Nature and Context of Social Research.pptx
Chapter 1 The Nature and Context of Social Research.pptx
BirBetalMatketing
 
fina.state.Numericallllllllllllllllll.ppt
fina.state.Numericallllllllllllllllll.pptfina.state.Numericallllllllllllllllll.ppt
fina.state.Numericallllllllllllllllll.ppt
BirBetalMatketing
 
2080-12-255555555555555555555555555555.pptx
2080-12-255555555555555555555555555555.pptx2080-12-255555555555555555555555555555.pptx
2080-12-255555555555555555555555555555.pptx
BirBetalMatketing
 
discret_distributionssssssssssssssssss.pdf
discret_distributionssssssssssssssssss.pdfdiscret_distributionssssssssssssssssss.pdf
discret_distributionssssssssssssssssss.pdf
BirBetalMatketing
 
conti_distributionsssssssssssssssssssssssssssssssssss.pdf
conti_distributionsssssssssssssssssssssssssssssssssss.pdfconti_distributionsssssssssssssssssssssssssssssssssss.pdf
conti_distributionsssssssssssssssssssssssssssssssssss.pdf
BirBetalMatketing
 
Slide 2 (Basic probabilityjhhhhhhhhhhh).pdf
Slide 2 (Basic probabilityjhhhhhhhhhhh).pdfSlide 2 (Basic probabilityjhhhhhhhhhhh).pdf
Slide 2 (Basic probabilityjhhhhhhhhhhh).pdf
BirBetalMatketing
 
Research Thinking research methodology.ppt
Research Thinking research methodology.pptResearch Thinking research methodology.ppt
Research Thinking research methodology.ppt
BirBetalMatketing
 
Academic Writing.pptxjiasdaidiAIIAIUHASIDUFASOIUIFAOSIUFOIASUFIASDF
Academic Writing.pptxjiasdaidiAIIAIUHASIDUFASOIUIFAOSIUFOIASUFIASDFAcademic Writing.pptxjiasdaidiAIIAIUHASIDUFASOIUIFAOSIUFOIASUFIASDF
Academic Writing.pptxjiasdaidiAIIAIUHASIDUFASOIUIFAOSIUFOIASUFIASDF
BirBetalMatketing
 

More from BirBetalMatketing (11)

01 Chapterrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrr.ppt
01 Chapterrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrr.ppt01 Chapterrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrr.ppt
01 Chapterrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrr.ppt
 
Chapter 11 Case Solutionnnnnnnnnnnn.pptx
Chapter 11 Case Solutionnnnnnnnnnnn.pptxChapter 11 Case Solutionnnnnnnnnnnn.pptx
Chapter 11 Case Solutionnnnnnnnnnnn.pptx
 
Chapter 2 Literature reviewwwwwww OK.pptx
Chapter 2 Literature reviewwwwwww OK.pptxChapter 2 Literature reviewwwwwww OK.pptx
Chapter 2 Literature reviewwwwwww OK.pptx
 
Chapter 1 The Nature and Context of Social Research.pptx
Chapter 1 The Nature and Context of Social Research.pptxChapter 1 The Nature and Context of Social Research.pptx
Chapter 1 The Nature and Context of Social Research.pptx
 
fina.state.Numericallllllllllllllllll.ppt
fina.state.Numericallllllllllllllllll.pptfina.state.Numericallllllllllllllllll.ppt
fina.state.Numericallllllllllllllllll.ppt
 
2080-12-255555555555555555555555555555.pptx
2080-12-255555555555555555555555555555.pptx2080-12-255555555555555555555555555555.pptx
2080-12-255555555555555555555555555555.pptx
 
discret_distributionssssssssssssssssss.pdf
discret_distributionssssssssssssssssss.pdfdiscret_distributionssssssssssssssssss.pdf
discret_distributionssssssssssssssssss.pdf
 
conti_distributionsssssssssssssssssssssssssssssssssss.pdf
conti_distributionsssssssssssssssssssssssssssssssssss.pdfconti_distributionsssssssssssssssssssssssssssssssssss.pdf
conti_distributionsssssssssssssssssssssssssssssssssss.pdf
 
Slide 2 (Basic probabilityjhhhhhhhhhhh).pdf
Slide 2 (Basic probabilityjhhhhhhhhhhh).pdfSlide 2 (Basic probabilityjhhhhhhhhhhh).pdf
Slide 2 (Basic probabilityjhhhhhhhhhhh).pdf
 
Research Thinking research methodology.ppt
Research Thinking research methodology.pptResearch Thinking research methodology.ppt
Research Thinking research methodology.ppt
 
Academic Writing.pptxjiasdaidiAIIAIUHASIDUFASOIUIFAOSIUFOIASUFIASDF
Academic Writing.pptxjiasdaidiAIIAIUHASIDUFASOIUIFAOSIUFOIASUFIASDFAcademic Writing.pptxjiasdaidiAIIAIUHASIDUFASOIUIFAOSIUFOIASUFIASDF
Academic Writing.pptxjiasdaidiAIIAIUHASIDUFASOIUIFAOSIUFOIASUFIASDF
 

Recently uploaded

Palestine last event orientationfvgnh .pptx
Palestine last event orientationfvgnh .pptxPalestine last event orientationfvgnh .pptx
Palestine last event orientationfvgnh .pptx
RaedMohamed3
 
Thesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.pptThesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.ppt
EverAndrsGuerraGuerr
 
Synthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptxSynthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptx
Pavel ( NSTU)
 
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
Nguyen Thanh Tu Collection
 
Lapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdfLapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdf
Jean Carlos Nunes Paixão
 
CACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdfCACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdf
camakaiclarkmusic
 
Acetabularia Information For Class 9 .docx
Acetabularia Information For Class 9  .docxAcetabularia Information For Class 9  .docx
Acetabularia Information For Class 9 .docx
vaibhavrinwa19
 
A Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in EducationA Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in Education
Peter Windle
 
1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx
JosvitaDsouza2
 
Embracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic ImperativeEmbracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic Imperative
Peter Windle
 
678020731-Sumas-y-Restas-Para-Colorear.pdf
678020731-Sumas-y-Restas-Para-Colorear.pdf678020731-Sumas-y-Restas-Para-Colorear.pdf
678020731-Sumas-y-Restas-Para-Colorear.pdf
CarlosHernanMontoyab2
 
Francesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptxFrancesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptx
EduSkills OECD
 
Unit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdfUnit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdf
Thiyagu K
 
The Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptxThe Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptx
DhatriParmar
 
Overview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with MechanismOverview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with Mechanism
DeeptiGupta154
 
The basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptxThe basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptx
heathfieldcps1
 
2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...
Sandy Millin
 
How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17
Celine George
 
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
EugeneSaldivar
 
Language Across the Curriculm LAC B.Ed.
Language Across the  Curriculm LAC B.Ed.Language Across the  Curriculm LAC B.Ed.
Language Across the Curriculm LAC B.Ed.
Atul Kumar Singh
 

Recently uploaded (20)

Palestine last event orientationfvgnh .pptx
Palestine last event orientationfvgnh .pptxPalestine last event orientationfvgnh .pptx
Palestine last event orientationfvgnh .pptx
 
Thesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.pptThesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.ppt
 
Synthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptxSynthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptx
 
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
 
Lapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdfLapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdf
 
CACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdfCACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdf
 
Acetabularia Information For Class 9 .docx
Acetabularia Information For Class 9  .docxAcetabularia Information For Class 9  .docx
Acetabularia Information For Class 9 .docx
 
A Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in EducationA Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in Education
 
1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx
 
Embracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic ImperativeEmbracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic Imperative
 
678020731-Sumas-y-Restas-Para-Colorear.pdf
678020731-Sumas-y-Restas-Para-Colorear.pdf678020731-Sumas-y-Restas-Para-Colorear.pdf
678020731-Sumas-y-Restas-Para-Colorear.pdf
 
Francesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptxFrancesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptx
 
Unit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdfUnit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdf
 
The Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptxThe Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptx
 
Overview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with MechanismOverview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with Mechanism
 
The basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptxThe basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptx
 
2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...
 
How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17
 
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
 
Language Across the Curriculm LAC B.Ed.
Language Across the  Curriculm LAC B.Ed.Language Across the  Curriculm LAC B.Ed.
Language Across the Curriculm LAC B.Ed.
 

Slide 1zjzckkasjasfjsajkfakjlasfasajfdfjaksdffj.pdf

  • 1. Research needs good understanding of data analysis Vikash Raj Satyal (Vikash@kusom.edu.np) Summarize your Data:
  • 2. What to look in the dataset? If our study have a large data set, we (researcher) are interested to know :- • What the central value is, • What is the spread from center, • What is the shape & size of data distribution
  • 3. Major economic dataset Questions • What is percapita GDP? • Whose percapita GDP is this? • Did you earn $1191 in this FY 142920(Rs.126,018)? (Rs11,910monthly)
  • 4.
  • 5. • Nepali people earn about 55 times low percapita GDP than USA, and 165 times lower than Monaco people
  • 7. Research Paradigm 3. Survey(Collect data) 4. Statistical analysis 5. There is not enough evidence to support research(alternative) hypothesis(HA) 6. Res. Hypo accepted HA is true = Failure of research hypo. 7. Report writing 1. Setup research hypo/Refine(Lit Review) 2. Develop instruments 5a. Report writing 7
  • 8. Why Dolpa & Mugu also have highest annual growth rate? Why Achham, Palpa has one of the lowest growth rate? Mugu Dolpa
  • 9. • What is the general IQ of US university students? • In the US the mean IQ for persons completing no more than a….. • Bachelor’s degree 113 (80th centile) • Master’s degree 117 (87th centile) • PhD, LLD, MD 124 (95th centile)
  • 10. Central Tendency in large sample data In any large data set, data are clustered around center. So researchers focus to find out that central value. Depending on the shape of the data distribution center is calculated differently using different statistical formula
  • 12. Statistical way of measuring the center of a data set •Mean(AM, GM, HM, Weighted mean) •Median •Mode •Partition values
  • 13. Median not mean, for: (i) Open End Classes. (ii) unequal class interval data table. (ii) When data has several extreme values(outliers). (iii) qualitative data( in frequency). (IV) When data strongly lack normality
  • 14. Quantiles or Partition values •Quartiles (3) •Deciles (9) •Percentiles (99) • Quintile(5) (Not Quantile)
  • 15. Mode is most frequently occurring value • Less used • Popular in business and industry • Only way to locate central value when data is nominal (How many type A sold? most preferred flavor of ice cream)
  • 17. Which Average is better? AM is best for interval data, however it should not be used : • For highly skewed data • in open end classes. • When there are very large and very small items(outliers). • In case of average ratio and rate of change. Median is the best average for: • open end classes • Skewed data or in presence of outliers • For ordinal qualitative data eg.: less honest, honest, very honest Mode is used for qualitative nominal data frequently used in business and industry
  • 18. Does Shape and Size of the data matters? • Elongation of left or right tail is Skewness • skewness described dataset’s symmetry – or lack of symmetry. • A perfectly symmetrical data set will have a skewness of 0.
  • 19. Skewness • Negative (left) skewness indicates more small values(on left tail) • Positive (right) skewness indicates more large values(on right tail)
  • 20. • kurtosis measures extreme values in either tail. • Normal curve has no Kurtosis • Kurtosis is measured comparing the Normal curve
  • 21. Calculation in EXCEL: Statistics for lexp(life expectancy) using NHDR2014.xlsx
  • 22. Calculation in EXCEL: Statistics for price using auto.xlsx
  • 23. Use data, nhdr2014 to calculate the following 1. Average life expectancy (‘life’) 2. Average gdp percapita (‘income’) 3. Average life expectancy (‘life’) of 3 ecologies (eg, average life(mountain)= …. ) 4. Calculate Q1, Q2, Q3 of ‘income’ 5. Using 3 quartiles of ‘income’ we can divide any other data in 4 equal parts. Make a new variable, call it ‘groups’, that will have 4 value-labels according to below criteria: ‘poor’ if below Q1 ‘below average’, if between Q1 to Q2, ‘above average’, if between Q2 to Q3 ‘rich’ if above Q3 6. Find the average of ‘life’ & ‘hdi’ for this newly created variable with 4 groups 7. How many ‘districts’ falls in each of these ‘groups’? And which district has the highest & lowest ‘life’ value that falls in each of these 4 ‘groups’? 8. Save this data for your future use
  • 24. Dispersion in data is meaningful
  • 25. Central value alone can disguise the picture
  • 26. Variability is beauty of the wild nature •Geographical variation generates variety in species of flora and fauna •Ethnography –cultural diversity •Epidemiology treats variation in disease
  • 27. How to measure data dispersion? Range Standard Deviation Quartile Deviation Coefficient of variation
  • 28. 1. Range Range= Largest value – Smallest value •High Range in temperature acts for desertification •Range of mobile sets •Range of social disparity
  • 29. 2. Quartile deviation (semi inter quartile range) • Inter-quartile Range = Q3 – Q1 • Quartile deviation (Q.D.) : 𝑸. 𝑫. = 𝟑𝟑. 𝟐𝟓 − 𝟏𝟑. 𝟖𝟔 𝟐 = 𝟗. 𝟔𝟗
  • 30. 3. Variance & Standard Deviation •Most popular measure of variation •It uses all observations •Std(standard deviation) is the square root of variance •Std = 𝑣𝑎𝑟𝑖𝑎𝑛𝑐𝑒
  • 31. Sample VS population VARIANCE For Papulation s2 = (𝑋−𝑋 )² 𝑛 = 𝑋² 𝑛 − 𝑋 𝑛 2 (individual data) s2 = 𝑓(𝑋−𝑋 )² 𝑁 = 𝑓𝑋² 𝑁 − 𝑓𝑋 𝑁 2 Grouped data For sample S2 = (𝑋−𝑋 )² 𝑛−1 Also, S2 = 𝑛 𝑛−1 s2 S2 = 𝑛 𝑛−1 s2 = 𝑛 𝑛 − 1 𝑓𝑋2 𝑁 − 𝑓𝑋 𝑁 2 When n  ∞ , sample mean  population mean
  • 32. Example: Variance and std of the life of electric bulbs(in hours) Length of life No. of bulbs 500–700 5 700–900 11 900–1100 26 1100–1300 10 1300–1500 8 Length of life No. of bulbs mid- value f X fx fx2 500–700 5 600 3000 1800000 700–900 11 800 8800 7040000 900–1100 26 1000 26000 26000000 1100–1300 10 1200 12000 14400000 1300–1500 8 1400 11200 15680000 SUM 60 61000 64920000 Mean = 1016.67 Variance = 48388.89 Std = 219.9747
  • 33. 4. Coefficient of Variation(C.V.) The co-efficient of variation is the relative measure based on the standard deviation and is defined as the ratio of the standard deviation to the mean expressed in percent. C.V. = 𝜎 μ x100% It is used to compare the compactness of two or more data Smaller C.V. indicates consistent or less variable data C.V. is unit-less so data in same or different units can be compared by it. eg. Weights in KG and in Pounds
  • 34. Which type of electric bulbs has better consistency in life span? Length of life No. of bulbs(alpha, a) No. of bulbs(beta, b) fa fb 500–700 5 4 700–900 11 30 900–1100 26 12 1100–1300 10 8 1300–1500 8 6 Length of life # bulbs (alpha, a) # bulbs (beta, b) Mid-value fa fb X Xfa Xfb X2fa X2fb 500–700 5 4 600 3000 2400 1800000 1440000 700–900 11 30 800 8800 24000 7040000 19200000 900–1100 26 12 1000 26000 12000 26000000 12000000 1100–1300 10 8 1200 12000 9600 14400000 11520000 1300–1500 8 6 1400 11200 8400 15680000 11760000 SUM 60 60 61000 56400 64920000 55920000 mean(a) 1016.7 mean(b) 940.0 std(a)= 220.0 std(b)= 220.0 CV(a) 21.64% CV(b) 23.4%
  • 35. Hans Rosling (27 July 1948 – 7 February 2017) most admired TED shows Swedish epidemiologist with high data exploratory power Gapminder foundation 2014 second time in Nepal from UNESCO How not to be ignorant /The Joy of Statistics ( first 5 minutes of the total 1 hours Video) http://www.gapminder.org/videos/the-joy-of-stats/