SlideShare a Scribd company logo
Data analysis: Explore
GAP Toolkit 5
Training in basic drug abuse data management
and analysis
Training session 9
Objectives
• To define a standard set of descriptive statistics
used to analyse continuous variables
• To examine the Explore facility in SPSS
• To introduce the analysis of a continuous variable
according to values of a categorical variable, an
example of bivariate analysis
• To introduce further SPSS Help options
• To reinforce the use of SPSS syntax
SPSS Descriptive Statistics
• Analyse/Descriptive Statistics/Frequencies
• Analyse/Descriptive Statistics/Explore
• Analyse/Descriptive Statistics/Descriptives
Exercise: continuous variable
• Generate a set of standard summary statistics for the
continuous variable Age
Explore: Age
Explore: Descriptive Statistics
Statistic Std. Error
AGE Mean 31.78 .315
95% Confidence Interval for
Mean
Lower Bound 31.16
Upper Bound 32.40
5% Trimmed Mean 31.31
Median 31.00
Variance 154.614
Std. Deviation 12.434
Minimum 1
Maximum 77
Range 76
Interquartile Range 20.00
Skewness .427 .062
Kurtosis -.503 .124
Descriptives
Exercise: Help
• What’s This?
• Results Coach
• Case Studies
Measures of central tendency
• Most commonly:
– Mode
– Median
– Mean
• 5 per cent trimmed mean
The mode
• The mode is the most frequently occurring value in a
dataset
• Suitable for nominal data and above
• Example:
– The mode of the first most frequently used drug is Alcohol,
with 717 cases, approximately 46 per cent of valid responses
Bimodal
• Describes a distribution
• Two categories have a large number of cases
• Example:
– The distribution of Employment is bimodal, employment and
unemployment having a similar number of cases and more
cases than the other categories
The median
• The middle value when the data are ordered from low to
high is the median
• Half the data values lie below the median and half
above
• The data have to be ordered so the median is not
suitable for nominal data, but is suitable for ordinal
levels of measurement and above
Example: median
• Seizures of opium in Germany, 1994-1998
(Kilograms)
• Source: United Nations (2000). World Drug Report 2000 (United Nations publication,
Sales No. GV.E.00.0.10).
Year 1994 1995 1996 1997 1998
Seizure 36 15 45 42 286
• Sort the seizure data in ascending order
• The middle value is the median; the median annual
seizures of opium for Germany between 1994 and 1998
was 42 kilograms
Year 1995 1994 1997 1996 1998
Seizure 15 36 42 45 286
Ranked: 1 2 3 4 5
The mean
• Add the values in the data set and divide by the number
of values
• The mean is only truly applicable to interval and ratio
data, as it involves adding the variables
• It is sometimes applied to ordinal data or ordinal scales
constructed from a number of Likert scales, but this
requires the assumption that the difference between the
values in the scale is the same, e.g. between 1 and 2 is
the same as between 5 and 6
Example: mean
• Seizures of opium in Germany, 1994-1998
• Sample size = 5
• 36 + 15 + 45 + 42 + 286 = 424
• 424/5 = 84.8
Year 1994 1995 1996 1997 1998
Seizure 36 15 45 42 286
The 5 per cent trimmed mean
• The 5 per cent trimmed mean is the mean calculated on
the data set with the top 5 per cent and bottom 5 per
cent of values removed
• An estimator that is more resistant to outliers than the
mean
95 per cent confidence interval for the mean
• An indication of the expected error (precision) when
estimating the population mean with the sample mean
• In repeated sampling, the equation used to calculate the
confidence interval around the sample mean will contain
the population mean 95 times out of 100
Measures of dispersion
• The range
• The inter-quartile range
• The variance
• The standard deviation
The range
• A measure of the spread of the data
• Range = maximum – minimum
Quartiles
• 1st quartile: 25 per cent of the values lie below the value
of the 1st quartile and 75 per cent above
• 2nd quartile: the median: 50 per cent of values below
and 50 per cent of values above
• 3rd quartile: 75 per cent of values below and 25 per
cent of the values above
Inter-quartile range
• IQR = 3rd Quartile – 1st Quartile
• The inter-quartile range measures the spread or range
of the mid 50 per cent of the data
• Ordinal level of measurement or above
Variance
• The average squared difference from the mean
• Measured in units squared
• Requires interval or ratio levels of measurement
 
1
2



n
X
Xi
Standard deviation
• The square root of the variance
• Returns the units to those of the original variable
 
1
2



n
X
Xi
Example: standard deviation and variance
Seizures of opium in Germany, 1994-1998
Year Seizure Deviations Squared
deviations
1994 36 -48.8 2381.44
1995 15 -69.8 4872.04
1996 45 -39.8 1584.04
1997 42 -42.8 1831.84
1998 286 201.2 40481.44
Total 424 0 51150.8
Count 5 5
Mean 84.8 Variance 10230
Standard
deviation
101
Distribution or shape of the data
• The normal distribution
• Skewness:
– Positive or right-hand skewed
– Negative or left-hand skewed
• Kurtosis:
– Platykurtic
– Mesokurtic
– Leptokurtic
• Symmetrical data: the mean, the median and the mode
coincide
Mean
Median
Mode
f(X)
X
The normal distribution
Right-hand skew (+)
• Right-hand skew: the extreme large values drag the
mean towards them
f(X)
X
Mode Median Mean
Left-hand skew (-)
• Left-hand skew: the extreme small values drag the
mean towards them
Mode
Mean Median X
f(X)
Bivariate analysis
• Continuous Dependent Variable
• Categorical Independent Variable
Explore
Explore: Options button
Explore: Plots button
Explore: Statistics button
Gender Statistic Std. Error
AGE Male Mean 31.43 .340
95% Confidence Interval for
Mean
Lower Bound 30.76
Upper Bound 32.09
5% Trimmed Mean 31.03
Median 30.00
Variance 144.286
Std. Deviation 12.012
Minimum 1
Maximum 70
Range 69
Interquartile Range 19.00
Skewness .370 .069
Kurtosis -.573 .138
Female Mean 33.39 .789
95% Confidence Interval for
Mean
Lower Bound 31.84
Upper Bound 34.94
5% Trimmed Mean 32.77
Median 33.00
Variance 193.593
Std. Deviation 13.914
Minimum 14
Maximum 77
Range 63
Interquartile Range 23.00
Skewness .472 .138
Kurtosis -.602 .376
Descriptives
Male Female
Age
70.0
65.0
60.0
55.0
50.0
45.0
40.0
35.0
30.0
25.0
20.0
15.0
10.0
5.0
0.0
Histogram
Frequency
300
200
100
0
Std. Dev = 12.01
Mean = 31.4
N = 1247.00
Age
75.0
70.0
65.0
60.0
55.0
50.0
45.0
40.0
35.0
30.0
25.0
20.0
15.0
Histogram
Frequency
60
50
40
30
20
10
0
Std. Dev = 13.91
Mean = 33.4
N = 311.00
Boxplot of Age vs Gender
311
1247
N =
Gender
Female
Male
Age
100
80
60
40
20
0
-20
183
Median
Inter-quartile range
Outlier
Syntax: Explore
EXAMINE
VARIABLES=age BY gender /ID=id
/PLOT BOXPLOT HISTOGRAM
/COMPARE GROUP
/STATISTICS DESCRIPTIVES
/CINTERVAL 95
/MISSING LISTWISE
/NOTOTAL.
Summary
• Measures of central
tendency
• Measures of variation
• Quantiles
• Measures of shape
• Bivariate analysis for a
categorical independent
variable and continuous
dependent variable
• Histograms
• Boxplots

More Related Content

Similar to trs-9.ppt

Dscriptive statistics
Dscriptive statisticsDscriptive statistics
Dscriptive statistics
Dr. Anees Alyafei
 
LECTURE 3 - inferential statistics bmaths
LECTURE 3 - inferential statistics bmathsLECTURE 3 - inferential statistics bmaths
LECTURE 3 - inferential statistics bmaths
jafari12
 
Introduction to statistics RSS6 2014
Introduction to statistics RSS6 2014Introduction to statistics RSS6 2014
Introduction to statistics RSS6 2014
RSS6
 
Bst322week1
Bst322week1Bst322week1
Bst322week1
Howard Realubit
 
statisticsintroductionofbusinessstats.ppt
statisticsintroductionofbusinessstats.pptstatisticsintroductionofbusinessstats.ppt
statisticsintroductionofbusinessstats.ppt
voore ajay
 
3. Statistical Analysis.pptx
3. Statistical Analysis.pptx3. Statistical Analysis.pptx
3. Statistical Analysis.pptx
jeyanthisivakumar
 
Research methodology3
Research methodology3Research methodology3
Research methodology3
Tosif Ahmad
 
Introduction to Statistics23122223.ppt
Introduction to Statistics23122223.pptIntroduction to Statistics23122223.ppt
Introduction to Statistics23122223.ppt
pathianithanaidu
 
Introduction to Statistics2312.ppt
Introduction to Statistics2312.pptIntroduction to Statistics2312.ppt
Introduction to Statistics2312.ppt
pathianithanaidu
 
Measure of dispersion 10321
Measure of dispersion 10321Measure of dispersion 10321
Measure of dispersion 10321
gopinathannsriramachandraeduin
 
Hm306 week 4
Hm306 week 4Hm306 week 4
Hm306 week 4
BHUOnlineDepartment
 
Hm306 week 4
Hm306 week 4Hm306 week 4
Hm306 week 4
BealCollegeOnline
 
Business statistics
Business statisticsBusiness statistics
Business statistics
Ravi Prakash
 
01 introduction stat
01 introduction stat01 introduction stat
01 introduction stat
antaraar2009
 
Biostatistics mean median mode unit 1.pptx
Biostatistics mean median mode unit 1.pptxBiostatistics mean median mode unit 1.pptx
Biostatistics mean median mode unit 1.pptx
SailajaReddyGunnam
 
Upload 140103034715-phpapp01 (1)
Upload 140103034715-phpapp01 (1)Upload 140103034715-phpapp01 (1)
Upload 140103034715-phpapp01 (1)
captaininfantry
 
Introduction to statistics.pptx
Introduction to statistics.pptxIntroduction to statistics.pptx
Introduction to statistics.pptx
MuddaAbdo1
 
descriptive data analysis
 descriptive data analysis descriptive data analysis
descriptive data analysis
gnanasarita1
 
Week 2 measures of disease occurence
Week 2  measures of disease occurenceWeek 2  measures of disease occurence
Week 2 measures of disease occurence
Hamdi Alhakimi
 
Stats-Review-Maie-St-John-5-20-2009.ppt
Stats-Review-Maie-St-John-5-20-2009.pptStats-Review-Maie-St-John-5-20-2009.ppt
Stats-Review-Maie-St-John-5-20-2009.ppt
DiptoKumerSarker1
 

Similar to trs-9.ppt (20)

Dscriptive statistics
Dscriptive statisticsDscriptive statistics
Dscriptive statistics
 
LECTURE 3 - inferential statistics bmaths
LECTURE 3 - inferential statistics bmathsLECTURE 3 - inferential statistics bmaths
LECTURE 3 - inferential statistics bmaths
 
Introduction to statistics RSS6 2014
Introduction to statistics RSS6 2014Introduction to statistics RSS6 2014
Introduction to statistics RSS6 2014
 
Bst322week1
Bst322week1Bst322week1
Bst322week1
 
statisticsintroductionofbusinessstats.ppt
statisticsintroductionofbusinessstats.pptstatisticsintroductionofbusinessstats.ppt
statisticsintroductionofbusinessstats.ppt
 
3. Statistical Analysis.pptx
3. Statistical Analysis.pptx3. Statistical Analysis.pptx
3. Statistical Analysis.pptx
 
Research methodology3
Research methodology3Research methodology3
Research methodology3
 
Introduction to Statistics23122223.ppt
Introduction to Statistics23122223.pptIntroduction to Statistics23122223.ppt
Introduction to Statistics23122223.ppt
 
Introduction to Statistics2312.ppt
Introduction to Statistics2312.pptIntroduction to Statistics2312.ppt
Introduction to Statistics2312.ppt
 
Measure of dispersion 10321
Measure of dispersion 10321Measure of dispersion 10321
Measure of dispersion 10321
 
Hm306 week 4
Hm306 week 4Hm306 week 4
Hm306 week 4
 
Hm306 week 4
Hm306 week 4Hm306 week 4
Hm306 week 4
 
Business statistics
Business statisticsBusiness statistics
Business statistics
 
01 introduction stat
01 introduction stat01 introduction stat
01 introduction stat
 
Biostatistics mean median mode unit 1.pptx
Biostatistics mean median mode unit 1.pptxBiostatistics mean median mode unit 1.pptx
Biostatistics mean median mode unit 1.pptx
 
Upload 140103034715-phpapp01 (1)
Upload 140103034715-phpapp01 (1)Upload 140103034715-phpapp01 (1)
Upload 140103034715-phpapp01 (1)
 
Introduction to statistics.pptx
Introduction to statistics.pptxIntroduction to statistics.pptx
Introduction to statistics.pptx
 
descriptive data analysis
 descriptive data analysis descriptive data analysis
descriptive data analysis
 
Week 2 measures of disease occurence
Week 2  measures of disease occurenceWeek 2  measures of disease occurence
Week 2 measures of disease occurence
 
Stats-Review-Maie-St-John-5-20-2009.ppt
Stats-Review-Maie-St-John-5-20-2009.pptStats-Review-Maie-St-John-5-20-2009.ppt
Stats-Review-Maie-St-John-5-20-2009.ppt
 

Recently uploaded

Rate Controlled Drug Delivery Systems.pdf
Rate Controlled Drug Delivery Systems.pdfRate Controlled Drug Delivery Systems.pdf
Rate Controlled Drug Delivery Systems.pdf
Rajarambapu College of Pharmacy Kasegaon Dist Sangli
 
Can Allopathy and Homeopathy Be Used Together in India.pdf
Can Allopathy and Homeopathy Be Used Together in India.pdfCan Allopathy and Homeopathy Be Used Together in India.pdf
Can Allopathy and Homeopathy Be Used Together in India.pdf
Dharma Homoeopathy
 
RECENT ADVANCES IN BREAST CANCER RADIOTHERAPY
RECENT ADVANCES IN BREAST CANCER RADIOTHERAPYRECENT ADVANCES IN BREAST CANCER RADIOTHERAPY
RECENT ADVANCES IN BREAST CANCER RADIOTHERAPY
Isha Jaiswal
 
Michigan HealthTech Market Map 2024 with Policy Makers, Academic Innovation C...
Michigan HealthTech Market Map 2024 with Policy Makers, Academic Innovation C...Michigan HealthTech Market Map 2024 with Policy Makers, Academic Innovation C...
Michigan HealthTech Market Map 2024 with Policy Makers, Academic Innovation C...
Levi Shapiro
 
定制(wsu毕业证书)美国华盛顿州立大学毕业证学位证书实拍图原版一模一样
定制(wsu毕业证书)美国华盛顿州立大学毕业证学位证书实拍图原版一模一样定制(wsu毕业证书)美国华盛顿州立大学毕业证学位证书实拍图原版一模一样
定制(wsu毕业证书)美国华盛顿州立大学毕业证学位证书实拍图原版一模一样
khvdq584
 
Dr. David Greene R3 stem cell Breakthroughs: Stem Cell Therapy in Cardiology
Dr. David Greene R3 stem cell Breakthroughs: Stem Cell Therapy in CardiologyDr. David Greene R3 stem cell Breakthroughs: Stem Cell Therapy in Cardiology
Dr. David Greene R3 stem cell Breakthroughs: Stem Cell Therapy in Cardiology
R3 Stem Cell
 
CANSA support - Caring for Cancer Patients' Caregivers
CANSA support - Caring for Cancer Patients' CaregiversCANSA support - Caring for Cancer Patients' Caregivers
CANSA support - Caring for Cancer Patients' Caregivers
CANSA The Cancer Association of South Africa
 
NKTI Annual Report - Annual Report FY 2022
NKTI Annual Report - Annual Report FY 2022NKTI Annual Report - Annual Report FY 2022
NKTI Annual Report - Annual Report FY 2022
nktiacc3
 
Vicarious movements or trick movements_AB.pdf
Vicarious movements or trick movements_AB.pdfVicarious movements or trick movements_AB.pdf
Vicarious movements or trick movements_AB.pdf
Arunima620542
 
CCSN_June_06 2024_jones. Cancer Rehabpptx
CCSN_June_06 2024_jones. Cancer RehabpptxCCSN_June_06 2024_jones. Cancer Rehabpptx
CCSN_June_06 2024_jones. Cancer Rehabpptx
Canadian Cancer Survivor Network
 
DELIRIUM BY DR JAGMOHAN PRAJAPATI.......
DELIRIUM BY DR JAGMOHAN PRAJAPATI.......DELIRIUM BY DR JAGMOHAN PRAJAPATI.......
DELIRIUM BY DR JAGMOHAN PRAJAPATI.......
DR Jag Mohan Prajapati
 
PrudentRx: A Resource for Patient Education and Engagement
PrudentRx: A Resource for Patient Education and EngagementPrudentRx: A Resource for Patient Education and Engagement
PrudentRx: A Resource for Patient Education and Engagement
PrudentRx Program
 
How Effective is Homeopathic Medicine for Anxiety and Stress Relief.pdf
How Effective is Homeopathic Medicine for Anxiety and Stress Relief.pdfHow Effective is Homeopathic Medicine for Anxiety and Stress Relief.pdf
How Effective is Homeopathic Medicine for Anxiety and Stress Relief.pdf
Dharma Homoeopathy
 
Bringing AI into a Mid-Sized Company: A structured Approach
Bringing AI into a Mid-Sized Company: A structured ApproachBringing AI into a Mid-Sized Company: A structured Approach
Bringing AI into a Mid-Sized Company: A structured Approach
Brian Frerichs
 
Top massage center in ajman chandrima Spa
Top massage center in ajman chandrima  SpaTop massage center in ajman chandrima  Spa
Top massage center in ajman chandrima Spa
Chandrima Spa Ajman
 
Professional Secrecy: Forensic Medicine Lecture
Professional Secrecy: Forensic Medicine LectureProfessional Secrecy: Forensic Medicine Lecture
Professional Secrecy: Forensic Medicine Lecture
DIVYANSHU740006
 
NEEDLE STICK INJURY - JOURNAL CLUB PRESENTATION - DR SHAMIN EABENSON
NEEDLE STICK INJURY - JOURNAL CLUB PRESENTATION - DR SHAMIN EABENSONNEEDLE STICK INJURY - JOURNAL CLUB PRESENTATION - DR SHAMIN EABENSON
NEEDLE STICK INJURY - JOURNAL CLUB PRESENTATION - DR SHAMIN EABENSON
SHAMIN EABENSON
 
Can coffee help me lose weight? Yes, 25,422 users in the USA use it for that ...
Can coffee help me lose weight? Yes, 25,422 users in the USA use it for that ...Can coffee help me lose weight? Yes, 25,422 users in the USA use it for that ...
Can coffee help me lose weight? Yes, 25,422 users in the USA use it for that ...
nirahealhty
 
Let's Talk About It: Breast Cancer (What is Mindset and Does it Really Matter?)
Let's Talk About It: Breast Cancer (What is Mindset and Does it Really Matter?)Let's Talk About It: Breast Cancer (What is Mindset and Does it Really Matter?)
Let's Talk About It: Breast Cancer (What is Mindset and Does it Really Matter?)
bkling
 
Pediatric Emergency Care for Children | Apollo Hospital
Pediatric Emergency Care for Children | Apollo HospitalPediatric Emergency Care for Children | Apollo Hospital
Pediatric Emergency Care for Children | Apollo Hospital
Apollo 24/7 Adult & Paediatric Emergency Services
 

Recently uploaded (20)

Rate Controlled Drug Delivery Systems.pdf
Rate Controlled Drug Delivery Systems.pdfRate Controlled Drug Delivery Systems.pdf
Rate Controlled Drug Delivery Systems.pdf
 
Can Allopathy and Homeopathy Be Used Together in India.pdf
Can Allopathy and Homeopathy Be Used Together in India.pdfCan Allopathy and Homeopathy Be Used Together in India.pdf
Can Allopathy and Homeopathy Be Used Together in India.pdf
 
RECENT ADVANCES IN BREAST CANCER RADIOTHERAPY
RECENT ADVANCES IN BREAST CANCER RADIOTHERAPYRECENT ADVANCES IN BREAST CANCER RADIOTHERAPY
RECENT ADVANCES IN BREAST CANCER RADIOTHERAPY
 
Michigan HealthTech Market Map 2024 with Policy Makers, Academic Innovation C...
Michigan HealthTech Market Map 2024 with Policy Makers, Academic Innovation C...Michigan HealthTech Market Map 2024 with Policy Makers, Academic Innovation C...
Michigan HealthTech Market Map 2024 with Policy Makers, Academic Innovation C...
 
定制(wsu毕业证书)美国华盛顿州立大学毕业证学位证书实拍图原版一模一样
定制(wsu毕业证书)美国华盛顿州立大学毕业证学位证书实拍图原版一模一样定制(wsu毕业证书)美国华盛顿州立大学毕业证学位证书实拍图原版一模一样
定制(wsu毕业证书)美国华盛顿州立大学毕业证学位证书实拍图原版一模一样
 
Dr. David Greene R3 stem cell Breakthroughs: Stem Cell Therapy in Cardiology
Dr. David Greene R3 stem cell Breakthroughs: Stem Cell Therapy in CardiologyDr. David Greene R3 stem cell Breakthroughs: Stem Cell Therapy in Cardiology
Dr. David Greene R3 stem cell Breakthroughs: Stem Cell Therapy in Cardiology
 
CANSA support - Caring for Cancer Patients' Caregivers
CANSA support - Caring for Cancer Patients' CaregiversCANSA support - Caring for Cancer Patients' Caregivers
CANSA support - Caring for Cancer Patients' Caregivers
 
NKTI Annual Report - Annual Report FY 2022
NKTI Annual Report - Annual Report FY 2022NKTI Annual Report - Annual Report FY 2022
NKTI Annual Report - Annual Report FY 2022
 
Vicarious movements or trick movements_AB.pdf
Vicarious movements or trick movements_AB.pdfVicarious movements or trick movements_AB.pdf
Vicarious movements or trick movements_AB.pdf
 
CCSN_June_06 2024_jones. Cancer Rehabpptx
CCSN_June_06 2024_jones. Cancer RehabpptxCCSN_June_06 2024_jones. Cancer Rehabpptx
CCSN_June_06 2024_jones. Cancer Rehabpptx
 
DELIRIUM BY DR JAGMOHAN PRAJAPATI.......
DELIRIUM BY DR JAGMOHAN PRAJAPATI.......DELIRIUM BY DR JAGMOHAN PRAJAPATI.......
DELIRIUM BY DR JAGMOHAN PRAJAPATI.......
 
PrudentRx: A Resource for Patient Education and Engagement
PrudentRx: A Resource for Patient Education and EngagementPrudentRx: A Resource for Patient Education and Engagement
PrudentRx: A Resource for Patient Education and Engagement
 
How Effective is Homeopathic Medicine for Anxiety and Stress Relief.pdf
How Effective is Homeopathic Medicine for Anxiety and Stress Relief.pdfHow Effective is Homeopathic Medicine for Anxiety and Stress Relief.pdf
How Effective is Homeopathic Medicine for Anxiety and Stress Relief.pdf
 
Bringing AI into a Mid-Sized Company: A structured Approach
Bringing AI into a Mid-Sized Company: A structured ApproachBringing AI into a Mid-Sized Company: A structured Approach
Bringing AI into a Mid-Sized Company: A structured Approach
 
Top massage center in ajman chandrima Spa
Top massage center in ajman chandrima  SpaTop massage center in ajman chandrima  Spa
Top massage center in ajman chandrima Spa
 
Professional Secrecy: Forensic Medicine Lecture
Professional Secrecy: Forensic Medicine LectureProfessional Secrecy: Forensic Medicine Lecture
Professional Secrecy: Forensic Medicine Lecture
 
NEEDLE STICK INJURY - JOURNAL CLUB PRESENTATION - DR SHAMIN EABENSON
NEEDLE STICK INJURY - JOURNAL CLUB PRESENTATION - DR SHAMIN EABENSONNEEDLE STICK INJURY - JOURNAL CLUB PRESENTATION - DR SHAMIN EABENSON
NEEDLE STICK INJURY - JOURNAL CLUB PRESENTATION - DR SHAMIN EABENSON
 
Can coffee help me lose weight? Yes, 25,422 users in the USA use it for that ...
Can coffee help me lose weight? Yes, 25,422 users in the USA use it for that ...Can coffee help me lose weight? Yes, 25,422 users in the USA use it for that ...
Can coffee help me lose weight? Yes, 25,422 users in the USA use it for that ...
 
Let's Talk About It: Breast Cancer (What is Mindset and Does it Really Matter?)
Let's Talk About It: Breast Cancer (What is Mindset and Does it Really Matter?)Let's Talk About It: Breast Cancer (What is Mindset and Does it Really Matter?)
Let's Talk About It: Breast Cancer (What is Mindset and Does it Really Matter?)
 
Pediatric Emergency Care for Children | Apollo Hospital
Pediatric Emergency Care for Children | Apollo HospitalPediatric Emergency Care for Children | Apollo Hospital
Pediatric Emergency Care for Children | Apollo Hospital
 

trs-9.ppt

  • 1. Data analysis: Explore GAP Toolkit 5 Training in basic drug abuse data management and analysis Training session 9
  • 2. Objectives • To define a standard set of descriptive statistics used to analyse continuous variables • To examine the Explore facility in SPSS • To introduce the analysis of a continuous variable according to values of a categorical variable, an example of bivariate analysis • To introduce further SPSS Help options • To reinforce the use of SPSS syntax
  • 3. SPSS Descriptive Statistics • Analyse/Descriptive Statistics/Frequencies • Analyse/Descriptive Statistics/Explore • Analyse/Descriptive Statistics/Descriptives
  • 4. Exercise: continuous variable • Generate a set of standard summary statistics for the continuous variable Age
  • 6. Explore: Descriptive Statistics Statistic Std. Error AGE Mean 31.78 .315 95% Confidence Interval for Mean Lower Bound 31.16 Upper Bound 32.40 5% Trimmed Mean 31.31 Median 31.00 Variance 154.614 Std. Deviation 12.434 Minimum 1 Maximum 77 Range 76 Interquartile Range 20.00 Skewness .427 .062 Kurtosis -.503 .124 Descriptives
  • 7. Exercise: Help • What’s This? • Results Coach • Case Studies
  • 8. Measures of central tendency • Most commonly: – Mode – Median – Mean • 5 per cent trimmed mean
  • 9. The mode • The mode is the most frequently occurring value in a dataset • Suitable for nominal data and above • Example: – The mode of the first most frequently used drug is Alcohol, with 717 cases, approximately 46 per cent of valid responses
  • 10. Bimodal • Describes a distribution • Two categories have a large number of cases • Example: – The distribution of Employment is bimodal, employment and unemployment having a similar number of cases and more cases than the other categories
  • 11. The median • The middle value when the data are ordered from low to high is the median • Half the data values lie below the median and half above • The data have to be ordered so the median is not suitable for nominal data, but is suitable for ordinal levels of measurement and above
  • 12. Example: median • Seizures of opium in Germany, 1994-1998 (Kilograms) • Source: United Nations (2000). World Drug Report 2000 (United Nations publication, Sales No. GV.E.00.0.10). Year 1994 1995 1996 1997 1998 Seizure 36 15 45 42 286
  • 13. • Sort the seizure data in ascending order • The middle value is the median; the median annual seizures of opium for Germany between 1994 and 1998 was 42 kilograms Year 1995 1994 1997 1996 1998 Seizure 15 36 42 45 286 Ranked: 1 2 3 4 5
  • 14. The mean • Add the values in the data set and divide by the number of values • The mean is only truly applicable to interval and ratio data, as it involves adding the variables • It is sometimes applied to ordinal data or ordinal scales constructed from a number of Likert scales, but this requires the assumption that the difference between the values in the scale is the same, e.g. between 1 and 2 is the same as between 5 and 6
  • 15. Example: mean • Seizures of opium in Germany, 1994-1998 • Sample size = 5 • 36 + 15 + 45 + 42 + 286 = 424 • 424/5 = 84.8 Year 1994 1995 1996 1997 1998 Seizure 36 15 45 42 286
  • 16. The 5 per cent trimmed mean • The 5 per cent trimmed mean is the mean calculated on the data set with the top 5 per cent and bottom 5 per cent of values removed • An estimator that is more resistant to outliers than the mean
  • 17. 95 per cent confidence interval for the mean • An indication of the expected error (precision) when estimating the population mean with the sample mean • In repeated sampling, the equation used to calculate the confidence interval around the sample mean will contain the population mean 95 times out of 100
  • 18. Measures of dispersion • The range • The inter-quartile range • The variance • The standard deviation
  • 19. The range • A measure of the spread of the data • Range = maximum – minimum
  • 20. Quartiles • 1st quartile: 25 per cent of the values lie below the value of the 1st quartile and 75 per cent above • 2nd quartile: the median: 50 per cent of values below and 50 per cent of values above • 3rd quartile: 75 per cent of values below and 25 per cent of the values above
  • 21. Inter-quartile range • IQR = 3rd Quartile – 1st Quartile • The inter-quartile range measures the spread or range of the mid 50 per cent of the data • Ordinal level of measurement or above
  • 22. Variance • The average squared difference from the mean • Measured in units squared • Requires interval or ratio levels of measurement   1 2    n X Xi
  • 23. Standard deviation • The square root of the variance • Returns the units to those of the original variable   1 2    n X Xi
  • 24. Example: standard deviation and variance Seizures of opium in Germany, 1994-1998 Year Seizure Deviations Squared deviations 1994 36 -48.8 2381.44 1995 15 -69.8 4872.04 1996 45 -39.8 1584.04 1997 42 -42.8 1831.84 1998 286 201.2 40481.44 Total 424 0 51150.8 Count 5 5 Mean 84.8 Variance 10230 Standard deviation 101
  • 25. Distribution or shape of the data • The normal distribution • Skewness: – Positive or right-hand skewed – Negative or left-hand skewed • Kurtosis: – Platykurtic – Mesokurtic – Leptokurtic
  • 26. • Symmetrical data: the mean, the median and the mode coincide Mean Median Mode f(X) X The normal distribution
  • 27. Right-hand skew (+) • Right-hand skew: the extreme large values drag the mean towards them f(X) X Mode Median Mean
  • 28. Left-hand skew (-) • Left-hand skew: the extreme small values drag the mean towards them Mode Mean Median X f(X)
  • 29. Bivariate analysis • Continuous Dependent Variable • Categorical Independent Variable
  • 34. Gender Statistic Std. Error AGE Male Mean 31.43 .340 95% Confidence Interval for Mean Lower Bound 30.76 Upper Bound 32.09 5% Trimmed Mean 31.03 Median 30.00 Variance 144.286 Std. Deviation 12.012 Minimum 1 Maximum 70 Range 69 Interquartile Range 19.00 Skewness .370 .069 Kurtosis -.573 .138 Female Mean 33.39 .789 95% Confidence Interval for Mean Lower Bound 31.84 Upper Bound 34.94 5% Trimmed Mean 32.77 Median 33.00 Variance 193.593 Std. Deviation 13.914 Minimum 14 Maximum 77 Range 63 Interquartile Range 23.00 Skewness .472 .138 Kurtosis -.602 .376 Descriptives
  • 35. Male Female Age 70.0 65.0 60.0 55.0 50.0 45.0 40.0 35.0 30.0 25.0 20.0 15.0 10.0 5.0 0.0 Histogram Frequency 300 200 100 0 Std. Dev = 12.01 Mean = 31.4 N = 1247.00 Age 75.0 70.0 65.0 60.0 55.0 50.0 45.0 40.0 35.0 30.0 25.0 20.0 15.0 Histogram Frequency 60 50 40 30 20 10 0 Std. Dev = 13.91 Mean = 33.4 N = 311.00
  • 36. Boxplot of Age vs Gender 311 1247 N = Gender Female Male Age 100 80 60 40 20 0 -20 183 Median Inter-quartile range Outlier
  • 37. Syntax: Explore EXAMINE VARIABLES=age BY gender /ID=id /PLOT BOXPLOT HISTOGRAM /COMPARE GROUP /STATISTICS DESCRIPTIVES /CINTERVAL 95 /MISSING LISTWISE /NOTOTAL.
  • 38. Summary • Measures of central tendency • Measures of variation • Quantiles • Measures of shape • Bivariate analysis for a categorical independent variable and continuous dependent variable • Histograms • Boxplots