SlideShare a Scribd company logo
1 of 38
Data analysis: Explore
GAP Toolkit 5
Training in basic drug abuse data management
and analysis
Training session 9
Objectives
• To define a standard set of descriptive statistics
used to analyse continuous variables
• To examine the Explore facility in SPSS
• To introduce the analysis of a continuous variable
according to values of a categorical variable, an
example of bivariate analysis
• To introduce further SPSS Help options
• To reinforce the use of SPSS syntax
SPSS Descriptive Statistics
• Analyse/Descriptive Statistics/Frequencies
• Analyse/Descriptive Statistics/Explore
• Analyse/Descriptive Statistics/Descriptives
Exercise: continuous variable
• Generate a set of standard summary statistics for the
continuous variable Age
Explore: Age
Explore: Descriptive Statistics
Statistic Std. Error
AGE Mean 31.78 .315
95% Confidence Interval for
Mean
Lower Bound 31.16
Upper Bound 32.40
5% Trimmed Mean 31.31
Median 31.00
Variance 154.614
Std. Deviation 12.434
Minimum 1
Maximum 77
Range 76
Interquartile Range 20.00
Skewness .427 .062
Kurtosis -.503 .124
Descriptives
Exercise: Help
• What’s This?
• Results Coach
• Case Studies
Measures of central tendency
• Most commonly:
– Mode
– Median
– Mean
• 5 per cent trimmed mean
The mode
• The mode is the most frequently occurring value in a
dataset
• Suitable for nominal data and above
• Example:
– The mode of the first most frequently used drug is Alcohol,
with 717 cases, approximately 46 per cent of valid responses
Bimodal
• Describes a distribution
• Two categories have a large number of cases
• Example:
– The distribution of Employment is bimodal, employment and
unemployment having a similar number of cases and more
cases than the other categories
The median
• The middle value when the data are ordered from low to
high is the median
• Half the data values lie below the median and half
above
• The data have to be ordered so the median is not
suitable for nominal data, but is suitable for ordinal
levels of measurement and above
Example: median
• Seizures of opium in Germany, 1994-1998
(Kilograms)
• Source: United Nations (2000). World Drug Report 2000 (United Nations publication,
Sales No. GV.E.00.0.10).
Year 1994 1995 1996 1997 1998
Seizure 36 15 45 42 286
• Sort the seizure data in ascending order
• The middle value is the median; the median annual
seizures of opium for Germany between 1994 and 1998
was 42 kilograms
Year 1995 1994 1997 1996 1998
Seizure 15 36 42 45 286
Ranked: 1 2 3 4 5
The mean
• Add the values in the data set and divide by the number
of values
• The mean is only truly applicable to interval and ratio
data, as it involves adding the variables
• It is sometimes applied to ordinal data or ordinal scales
constructed from a number of Likert scales, but this
requires the assumption that the difference between the
values in the scale is the same, e.g. between 1 and 2 is
the same as between 5 and 6
Example: mean
• Seizures of opium in Germany, 1994-1998
• Sample size = 5
• 36 + 15 + 45 + 42 + 286 = 424
• 424/5 = 84.8
Year 1994 1995 1996 1997 1998
Seizure 36 15 45 42 286
The 5 per cent trimmed mean
• The 5 per cent trimmed mean is the mean calculated on
the data set with the top 5 per cent and bottom 5 per
cent of values removed
• An estimator that is more resistant to outliers than the
mean
95 per cent confidence interval for the mean
• An indication of the expected error (precision) when
estimating the population mean with the sample mean
• In repeated sampling, the equation used to calculate the
confidence interval around the sample mean will contain
the population mean 95 times out of 100
Measures of dispersion
• The range
• The inter-quartile range
• The variance
• The standard deviation
The range
• A measure of the spread of the data
• Range = maximum – minimum
Quartiles
• 1st quartile: 25 per cent of the values lie below the value
of the 1st quartile and 75 per cent above
• 2nd quartile: the median: 50 per cent of values below
and 50 per cent of values above
• 3rd quartile: 75 per cent of values below and 25 per
cent of the values above
Inter-quartile range
• IQR = 3rd Quartile – 1st Quartile
• The inter-quartile range measures the spread or range
of the mid 50 per cent of the data
• Ordinal level of measurement or above
Variance
• The average squared difference from the mean
• Measured in units squared
• Requires interval or ratio levels of measurement
 
1
2



n
X
Xi
Standard deviation
• The square root of the variance
• Returns the units to those of the original variable
 
1
2



n
X
Xi
Example: standard deviation and variance
Seizures of opium in Germany, 1994-1998
Year Seizure Deviations Squared
deviations
1994 36 -48.8 2381.44
1995 15 -69.8 4872.04
1996 45 -39.8 1584.04
1997 42 -42.8 1831.84
1998 286 201.2 40481.44
Total 424 0 51150.8
Count 5 5
Mean 84.8 Variance 10230
Standard
deviation
101
Distribution or shape of the data
• The normal distribution
• Skewness:
– Positive or right-hand skewed
– Negative or left-hand skewed
• Kurtosis:
– Platykurtic
– Mesokurtic
– Leptokurtic
• Symmetrical data: the mean, the median and the mode
coincide
Mean
Median
Mode
f(X)
X
The normal distribution
Right-hand skew (+)
• Right-hand skew: the extreme large values drag the
mean towards them
f(X)
X
Mode Median Mean
Left-hand skew (-)
• Left-hand skew: the extreme small values drag the
mean towards them
Mode
Mean Median X
f(X)
Bivariate analysis
• Continuous Dependent Variable
• Categorical Independent Variable
Explore
Explore: Options button
Explore: Plots button
Explore: Statistics button
Gender Statistic Std. Error
AGE Male Mean 31.43 .340
95% Confidence Interval for
Mean
Lower Bound 30.76
Upper Bound 32.09
5% Trimmed Mean 31.03
Median 30.00
Variance 144.286
Std. Deviation 12.012
Minimum 1
Maximum 70
Range 69
Interquartile Range 19.00
Skewness .370 .069
Kurtosis -.573 .138
Female Mean 33.39 .789
95% Confidence Interval for
Mean
Lower Bound 31.84
Upper Bound 34.94
5% Trimmed Mean 32.77
Median 33.00
Variance 193.593
Std. Deviation 13.914
Minimum 14
Maximum 77
Range 63
Interquartile Range 23.00
Skewness .472 .138
Kurtosis -.602 .376
Descriptives
Male Female
Age
70.0
65.0
60.0
55.0
50.0
45.0
40.0
35.0
30.0
25.0
20.0
15.0
10.0
5.0
0.0
Histogram
Frequency
300
200
100
0
Std. Dev = 12.01
Mean = 31.4
N = 1247.00
Age
75.0
70.0
65.0
60.0
55.0
50.0
45.0
40.0
35.0
30.0
25.0
20.0
15.0
Histogram
Frequency
60
50
40
30
20
10
0
Std. Dev = 13.91
Mean = 33.4
N = 311.00
Boxplot of Age vs Gender
311
1247
N =
Gender
Female
Male
Age
100
80
60
40
20
0
-20
183
Median
Inter-quartile range
Outlier
Syntax: Explore
EXAMINE
VARIABLES=age BY gender /ID=id
/PLOT BOXPLOT HISTOGRAM
/COMPARE GROUP
/STATISTICS DESCRIPTIVES
/CINTERVAL 95
/MISSING LISTWISE
/NOTOTAL.
Summary
• Measures of central
tendency
• Measures of variation
• Quantiles
• Measures of shape
• Bivariate analysis for a
categorical independent
variable and continuous
dependent variable
• Histograms
• Boxplots

More Related Content

Similar to trs-9.ppt

Introduction to statistics RSS6 2014
Introduction to statistics RSS6 2014Introduction to statistics RSS6 2014
Introduction to statistics RSS6 2014
RSS6
 
statisticsintroductionofbusinessstats.ppt
statisticsintroductionofbusinessstats.pptstatisticsintroductionofbusinessstats.ppt
statisticsintroductionofbusinessstats.ppt
voore ajay
 
Research methodology3
Research methodology3Research methodology3
Research methodology3
Tosif Ahmad
 
01 introduction stat
01 introduction stat01 introduction stat
01 introduction stat
antaraar2009
 
Upload 140103034715-phpapp01 (1)
Upload 140103034715-phpapp01 (1)Upload 140103034715-phpapp01 (1)
Upload 140103034715-phpapp01 (1)
captaininfantry
 

Similar to trs-9.ppt (20)

Dscriptive statistics
Dscriptive statisticsDscriptive statistics
Dscriptive statistics
 
LECTURE 3 - inferential statistics bmaths
LECTURE 3 - inferential statistics bmathsLECTURE 3 - inferential statistics bmaths
LECTURE 3 - inferential statistics bmaths
 
Introduction to statistics RSS6 2014
Introduction to statistics RSS6 2014Introduction to statistics RSS6 2014
Introduction to statistics RSS6 2014
 
Bst322week1
Bst322week1Bst322week1
Bst322week1
 
statisticsintroductionofbusinessstats.ppt
statisticsintroductionofbusinessstats.pptstatisticsintroductionofbusinessstats.ppt
statisticsintroductionofbusinessstats.ppt
 
3. Statistical Analysis.pptx
3. Statistical Analysis.pptx3. Statistical Analysis.pptx
3. Statistical Analysis.pptx
 
Research methodology3
Research methodology3Research methodology3
Research methodology3
 
Introduction to Statistics2312.ppt
Introduction to Statistics2312.pptIntroduction to Statistics2312.ppt
Introduction to Statistics2312.ppt
 
Introduction to Statistics23122223.ppt
Introduction to Statistics23122223.pptIntroduction to Statistics23122223.ppt
Introduction to Statistics23122223.ppt
 
Measure of dispersion 10321
Measure of dispersion 10321Measure of dispersion 10321
Measure of dispersion 10321
 
Hm306 week 4
Hm306 week 4Hm306 week 4
Hm306 week 4
 
Hm306 week 4
Hm306 week 4Hm306 week 4
Hm306 week 4
 
Business statistics
Business statisticsBusiness statistics
Business statistics
 
01 introduction stat
01 introduction stat01 introduction stat
01 introduction stat
 
Biostatistics mean median mode unit 1.pptx
Biostatistics mean median mode unit 1.pptxBiostatistics mean median mode unit 1.pptx
Biostatistics mean median mode unit 1.pptx
 
Upload 140103034715-phpapp01 (1)
Upload 140103034715-phpapp01 (1)Upload 140103034715-phpapp01 (1)
Upload 140103034715-phpapp01 (1)
 
Introduction to statistics.pptx
Introduction to statistics.pptxIntroduction to statistics.pptx
Introduction to statistics.pptx
 
descriptive data analysis
 descriptive data analysis descriptive data analysis
descriptive data analysis
 
Week 2 measures of disease occurence
Week 2  measures of disease occurenceWeek 2  measures of disease occurence
Week 2 measures of disease occurence
 
Stats-Review-Maie-St-John-5-20-2009.ppt
Stats-Review-Maie-St-John-5-20-2009.pptStats-Review-Maie-St-John-5-20-2009.ppt
Stats-Review-Maie-St-John-5-20-2009.ppt
 

Recently uploaded

Obat aborsi Jakarta Timur Wa 081225888346 Jual Obat aborsi Cytotec asli Di Ja...
Obat aborsi Jakarta Timur Wa 081225888346 Jual Obat aborsi Cytotec asli Di Ja...Obat aborsi Jakarta Timur Wa 081225888346 Jual Obat aborsi Cytotec asli Di Ja...
Obat aborsi Jakarta Timur Wa 081225888346 Jual Obat aborsi Cytotec asli Di Ja...
icha27638
 
Healthcare Market Overview, May 2024: Funding, Financing and M&A, from Oppenh...
Healthcare Market Overview, May 2024: Funding, Financing and M&A, from Oppenh...Healthcare Market Overview, May 2024: Funding, Financing and M&A, from Oppenh...
Healthcare Market Overview, May 2024: Funding, Financing and M&A, from Oppenh...
Levi Shapiro
 
Tortora PRINCIPLES OF ANATOMY AND PHYSIOLOGY - Tortora - 14th Ed.pdf
Tortora PRINCIPLES OF ANATOMY AND PHYSIOLOGY - Tortora - 14th Ed.pdfTortora PRINCIPLES OF ANATOMY AND PHYSIOLOGY - Tortora - 14th Ed.pdf
Tortora PRINCIPLES OF ANATOMY AND PHYSIOLOGY - Tortora - 14th Ed.pdf
Dr. Afreen Nasir
 

Recently uploaded (20)

Session-17-KANGAROO-MOTHER-CARE_final-blue.pptx
Session-17-KANGAROO-MOTHER-CARE_final-blue.pptxSession-17-KANGAROO-MOTHER-CARE_final-blue.pptx
Session-17-KANGAROO-MOTHER-CARE_final-blue.pptx
 
Session-5-Birthing-Practices-Breastfeeding (1).ppt
Session-5-Birthing-Practices-Breastfeeding (1).pptSession-5-Birthing-Practices-Breastfeeding (1).ppt
Session-5-Birthing-Practices-Breastfeeding (1).ppt
 
Obat aborsi Jakarta Timur Wa 081225888346 Jual Obat aborsi Cytotec asli Di Ja...
Obat aborsi Jakarta Timur Wa 081225888346 Jual Obat aborsi Cytotec asli Di Ja...Obat aborsi Jakarta Timur Wa 081225888346 Jual Obat aborsi Cytotec asli Di Ja...
Obat aborsi Jakarta Timur Wa 081225888346 Jual Obat aborsi Cytotec asli Di Ja...
 
Anthony Edwards We Want Dallas T-shirtsAnthony Edwards We Want Dallas T-shirts
Anthony Edwards We Want Dallas T-shirtsAnthony Edwards We Want Dallas T-shirtsAnthony Edwards We Want Dallas T-shirtsAnthony Edwards We Want Dallas T-shirts
Anthony Edwards We Want Dallas T-shirtsAnthony Edwards We Want Dallas T-shirts
 
Etiology for RRT and Code Blue Workshop.
Etiology for RRT and Code Blue Workshop.Etiology for RRT and Code Blue Workshop.
Etiology for RRT and Code Blue Workshop.
 
Leading large scale change: a life at the interface between theory and practice
Leading large scale change: a life at the interface between theory and practiceLeading large scale change: a life at the interface between theory and practice
Leading large scale change: a life at the interface between theory and practice
 
Session-1-MBFHI-A-part-of-the-Global-Strategy.ppt
Session-1-MBFHI-A-part-of-the-Global-Strategy.pptSession-1-MBFHI-A-part-of-the-Global-Strategy.ppt
Session-1-MBFHI-A-part-of-the-Global-Strategy.ppt
 
Antiepileptic-Drugs-and-Congenital-Anomalies copy.pptx
Antiepileptic-Drugs-and-Congenital-Anomalies copy.pptxAntiepileptic-Drugs-and-Congenital-Anomalies copy.pptx
Antiepileptic-Drugs-and-Congenital-Anomalies copy.pptx
 
Coach Dan Quinn Commanders Feather T Shirts
Coach Dan Quinn Commanders Feather T ShirtsCoach Dan Quinn Commanders Feather T Shirts
Coach Dan Quinn Commanders Feather T Shirts
 
Young & Hot ℂall Girls Goa 8250077686 WhatsApp Number Best Rates of Goa ℂall ...
Young & Hot ℂall Girls Goa 8250077686 WhatsApp Number Best Rates of Goa ℂall ...Young & Hot ℂall Girls Goa 8250077686 WhatsApp Number Best Rates of Goa ℂall ...
Young & Hot ℂall Girls Goa 8250077686 WhatsApp Number Best Rates of Goa ℂall ...
 
VIP ℂall Girls Panaji 8250077686 WhatsApp: Me All Time Serviℂe Available Day ...
VIP ℂall Girls Panaji 8250077686 WhatsApp: Me All Time Serviℂe Available Day ...VIP ℂall Girls Panaji 8250077686 WhatsApp: Me All Time Serviℂe Available Day ...
VIP ℂall Girls Panaji 8250077686 WhatsApp: Me All Time Serviℂe Available Day ...
 
Healthcare Market Overview, May 2024: Funding, Financing and M&A, from Oppenh...
Healthcare Market Overview, May 2024: Funding, Financing and M&A, from Oppenh...Healthcare Market Overview, May 2024: Funding, Financing and M&A, from Oppenh...
Healthcare Market Overview, May 2024: Funding, Financing and M&A, from Oppenh...
 
The Power of Technology and Collaboration in Research - Rheumatology Research...
The Power of Technology and Collaboration in Research - Rheumatology Research...The Power of Technology and Collaboration in Research - Rheumatology Research...
The Power of Technology and Collaboration in Research - Rheumatology Research...
 
clostridiumbotulinum- BY Muzammil Ahmed Siddiqui.pptx
clostridiumbotulinum- BY Muzammil Ahmed Siddiqui.pptxclostridiumbotulinum- BY Muzammil Ahmed Siddiqui.pptx
clostridiumbotulinum- BY Muzammil Ahmed Siddiqui.pptx
 
Tortora PRINCIPLES OF ANATOMY AND PHYSIOLOGY - Tortora - 14th Ed.pdf
Tortora PRINCIPLES OF ANATOMY AND PHYSIOLOGY - Tortora - 14th Ed.pdfTortora PRINCIPLES OF ANATOMY AND PHYSIOLOGY - Tortora - 14th Ed.pdf
Tortora PRINCIPLES OF ANATOMY AND PHYSIOLOGY - Tortora - 14th Ed.pdf
 
GENETICS and KIDNEY DISEASES /
GENETICS and KIDNEY DISEASES            /GENETICS and KIDNEY DISEASES            /
GENETICS and KIDNEY DISEASES /
 
Young & Hot ℂall Girls Mumbai 8250077686 WhatsApp Number Best Rates of Mumbai...
Young & Hot ℂall Girls Mumbai 8250077686 WhatsApp Number Best Rates of Mumbai...Young & Hot ℂall Girls Mumbai 8250077686 WhatsApp Number Best Rates of Mumbai...
Young & Hot ℂall Girls Mumbai 8250077686 WhatsApp Number Best Rates of Mumbai...
 
Making change happen: learning from "positive deviancts"
Making change happen: learning from "positive deviancts"Making change happen: learning from "positive deviancts"
Making change happen: learning from "positive deviancts"
 
End of Response issues - Code and Rapid Response Workshop
End of Response issues - Code and Rapid Response WorkshopEnd of Response issues - Code and Rapid Response Workshop
End of Response issues - Code and Rapid Response Workshop
 
Lactation Mraining Management Session-2-Comm-Building-Conf.ppt
Lactation Mraining Management  Session-2-Comm-Building-Conf.pptLactation Mraining Management  Session-2-Comm-Building-Conf.ppt
Lactation Mraining Management Session-2-Comm-Building-Conf.ppt
 

trs-9.ppt

  • 1. Data analysis: Explore GAP Toolkit 5 Training in basic drug abuse data management and analysis Training session 9
  • 2. Objectives • To define a standard set of descriptive statistics used to analyse continuous variables • To examine the Explore facility in SPSS • To introduce the analysis of a continuous variable according to values of a categorical variable, an example of bivariate analysis • To introduce further SPSS Help options • To reinforce the use of SPSS syntax
  • 3. SPSS Descriptive Statistics • Analyse/Descriptive Statistics/Frequencies • Analyse/Descriptive Statistics/Explore • Analyse/Descriptive Statistics/Descriptives
  • 4. Exercise: continuous variable • Generate a set of standard summary statistics for the continuous variable Age
  • 6. Explore: Descriptive Statistics Statistic Std. Error AGE Mean 31.78 .315 95% Confidence Interval for Mean Lower Bound 31.16 Upper Bound 32.40 5% Trimmed Mean 31.31 Median 31.00 Variance 154.614 Std. Deviation 12.434 Minimum 1 Maximum 77 Range 76 Interquartile Range 20.00 Skewness .427 .062 Kurtosis -.503 .124 Descriptives
  • 7. Exercise: Help • What’s This? • Results Coach • Case Studies
  • 8. Measures of central tendency • Most commonly: – Mode – Median – Mean • 5 per cent trimmed mean
  • 9. The mode • The mode is the most frequently occurring value in a dataset • Suitable for nominal data and above • Example: – The mode of the first most frequently used drug is Alcohol, with 717 cases, approximately 46 per cent of valid responses
  • 10. Bimodal • Describes a distribution • Two categories have a large number of cases • Example: – The distribution of Employment is bimodal, employment and unemployment having a similar number of cases and more cases than the other categories
  • 11. The median • The middle value when the data are ordered from low to high is the median • Half the data values lie below the median and half above • The data have to be ordered so the median is not suitable for nominal data, but is suitable for ordinal levels of measurement and above
  • 12. Example: median • Seizures of opium in Germany, 1994-1998 (Kilograms) • Source: United Nations (2000). World Drug Report 2000 (United Nations publication, Sales No. GV.E.00.0.10). Year 1994 1995 1996 1997 1998 Seizure 36 15 45 42 286
  • 13. • Sort the seizure data in ascending order • The middle value is the median; the median annual seizures of opium for Germany between 1994 and 1998 was 42 kilograms Year 1995 1994 1997 1996 1998 Seizure 15 36 42 45 286 Ranked: 1 2 3 4 5
  • 14. The mean • Add the values in the data set and divide by the number of values • The mean is only truly applicable to interval and ratio data, as it involves adding the variables • It is sometimes applied to ordinal data or ordinal scales constructed from a number of Likert scales, but this requires the assumption that the difference between the values in the scale is the same, e.g. between 1 and 2 is the same as between 5 and 6
  • 15. Example: mean • Seizures of opium in Germany, 1994-1998 • Sample size = 5 • 36 + 15 + 45 + 42 + 286 = 424 • 424/5 = 84.8 Year 1994 1995 1996 1997 1998 Seizure 36 15 45 42 286
  • 16. The 5 per cent trimmed mean • The 5 per cent trimmed mean is the mean calculated on the data set with the top 5 per cent and bottom 5 per cent of values removed • An estimator that is more resistant to outliers than the mean
  • 17. 95 per cent confidence interval for the mean • An indication of the expected error (precision) when estimating the population mean with the sample mean • In repeated sampling, the equation used to calculate the confidence interval around the sample mean will contain the population mean 95 times out of 100
  • 18. Measures of dispersion • The range • The inter-quartile range • The variance • The standard deviation
  • 19. The range • A measure of the spread of the data • Range = maximum – minimum
  • 20. Quartiles • 1st quartile: 25 per cent of the values lie below the value of the 1st quartile and 75 per cent above • 2nd quartile: the median: 50 per cent of values below and 50 per cent of values above • 3rd quartile: 75 per cent of values below and 25 per cent of the values above
  • 21. Inter-quartile range • IQR = 3rd Quartile – 1st Quartile • The inter-quartile range measures the spread or range of the mid 50 per cent of the data • Ordinal level of measurement or above
  • 22. Variance • The average squared difference from the mean • Measured in units squared • Requires interval or ratio levels of measurement   1 2    n X Xi
  • 23. Standard deviation • The square root of the variance • Returns the units to those of the original variable   1 2    n X Xi
  • 24. Example: standard deviation and variance Seizures of opium in Germany, 1994-1998 Year Seizure Deviations Squared deviations 1994 36 -48.8 2381.44 1995 15 -69.8 4872.04 1996 45 -39.8 1584.04 1997 42 -42.8 1831.84 1998 286 201.2 40481.44 Total 424 0 51150.8 Count 5 5 Mean 84.8 Variance 10230 Standard deviation 101
  • 25. Distribution or shape of the data • The normal distribution • Skewness: – Positive or right-hand skewed – Negative or left-hand skewed • Kurtosis: – Platykurtic – Mesokurtic – Leptokurtic
  • 26. • Symmetrical data: the mean, the median and the mode coincide Mean Median Mode f(X) X The normal distribution
  • 27. Right-hand skew (+) • Right-hand skew: the extreme large values drag the mean towards them f(X) X Mode Median Mean
  • 28. Left-hand skew (-) • Left-hand skew: the extreme small values drag the mean towards them Mode Mean Median X f(X)
  • 29. Bivariate analysis • Continuous Dependent Variable • Categorical Independent Variable
  • 34. Gender Statistic Std. Error AGE Male Mean 31.43 .340 95% Confidence Interval for Mean Lower Bound 30.76 Upper Bound 32.09 5% Trimmed Mean 31.03 Median 30.00 Variance 144.286 Std. Deviation 12.012 Minimum 1 Maximum 70 Range 69 Interquartile Range 19.00 Skewness .370 .069 Kurtosis -.573 .138 Female Mean 33.39 .789 95% Confidence Interval for Mean Lower Bound 31.84 Upper Bound 34.94 5% Trimmed Mean 32.77 Median 33.00 Variance 193.593 Std. Deviation 13.914 Minimum 14 Maximum 77 Range 63 Interquartile Range 23.00 Skewness .472 .138 Kurtosis -.602 .376 Descriptives
  • 35. Male Female Age 70.0 65.0 60.0 55.0 50.0 45.0 40.0 35.0 30.0 25.0 20.0 15.0 10.0 5.0 0.0 Histogram Frequency 300 200 100 0 Std. Dev = 12.01 Mean = 31.4 N = 1247.00 Age 75.0 70.0 65.0 60.0 55.0 50.0 45.0 40.0 35.0 30.0 25.0 20.0 15.0 Histogram Frequency 60 50 40 30 20 10 0 Std. Dev = 13.91 Mean = 33.4 N = 311.00
  • 36. Boxplot of Age vs Gender 311 1247 N = Gender Female Male Age 100 80 60 40 20 0 -20 183 Median Inter-quartile range Outlier
  • 37. Syntax: Explore EXAMINE VARIABLES=age BY gender /ID=id /PLOT BOXPLOT HISTOGRAM /COMPARE GROUP /STATISTICS DESCRIPTIVES /CINTERVAL 95 /MISSING LISTWISE /NOTOTAL.
  • 38. Summary • Measures of central tendency • Measures of variation • Quantiles • Measures of shape • Bivariate analysis for a categorical independent variable and continuous dependent variable • Histograms • Boxplots