This document provides an introduction to applied statistics and statistical methods. It discusses objectives such as going beyond the mean and tests of differences like t-tests and ANOVA. It also covers descriptive statistics such as measures of central tendency and dispersion, inferential statistics like z-scores and confidence intervals, and tests of differences including independent and paired t-tests and nonparametric alternatives.
The presentation slides describes about the analysis of data. The presentation slides deals about scales of measurement, t test, ANOVA, ANCOVA, MANOVA, regression and SPSS help desk.
It is very difficult to distinguish the differences between ANOVA and regression. This is because both terms have more similarities than differences. It can be said that ANOVA and regression are the two sides of the same coin.
UNIVARIATE & BIVARIATE ANALYSIS
UNIVARIATE BIVARIATE & MULTIVARIATE
UNIVARIATE ANALYSIS
-One variable analysed at a time
BIVARIATE ANALYSIS
-Two variable analysed at a time
MULTIVARIATE ANALYSIS
-More than two variables analysed at a time
TYPES OF ANALYSIS
DESCRIPTIVE ANALYSIS
INFERENTIAL ANALYSIS
DESCRIPTIVE ANALYSIS
Transformation of raw data
Facilitate easy understanding and interpretation
Deals with summary measures relating to sample data
Eg-what is the average age of the sample?
INFERENTIAL ANALYSIS
Carried out after descriptive analysis
Inferences drawn on population parameters based on sample results
Generalizes results to the population based on sample results
Eg-is the average age of population different from 35?
DESCRIPTIVE ANALYSIS OF UNIVARIATE DATA
1. Prepare frequency distribution of each variable
Missing Data
Situation where certain questions are left unanswered
Analysis of multiple responses
Measures of central tendency
3 measures of central tendency
1.Mean
2.Median
3.Mode
MEAN
Arithmetic average of a variable
Appropriate for interval and ratio scale data
x
MEDIAN
Calculates the middle value of the data
Computed for ratio, interval or ordinal scale.
Data needs to be arranged in ascending or descending order
MODE
Point of maximum frequency
Should not be computed for ordinal or interval data unless grouped.
Widely used in business
MEASURE OF DISPERSION
Measures of central tendency do not explain distribution of variables
4 measures of dispersion
1.Range
2.Variance and standard deviation
3.Coefficient of variation
4.Relative and absolute frequencies
DESCRIPTIVE ANALYSIS OF BIVARIATE DATA
There are three types of measure used.
1.Cross tabulation
2.Spearmans rank correlation coefficient
3.Pearsons linear correlation coefficient
Cross Tabulation
Responses of two questions are combined
Spearman’s rank order correlation coefficient.
Used in case of ordinal data
Data Processing and Statistical Treatment: Spreads and CorrelationJanet Penilla
A hyperlinked presentation. The objectives of the topic were written. The presentation was started with the variance and then the standard deviation provided with examples. It also answers on when to use the sample standard deviation and the population standard deviation or what type of data should we use when we calculate a standard deviation. The presentation also includes Correlations and other correlation techniques(Pearson-product moment correlation; Spearman - rank order correlation coefficient; t-test for correlation).
THIS IS MY PH.D., VIVA VOCE POWERPOINT. MY THESIS TITLE IS "EFFECTIVENESS OF E-LEARNING MODULES IN TEACHING MATHEMATICS AMONG SECONDARY TEACHER EDUCATION LEVEL"
The presentation slides describes about the analysis of data. The presentation slides deals about scales of measurement, t test, ANOVA, ANCOVA, MANOVA, regression and SPSS help desk.
It is very difficult to distinguish the differences between ANOVA and regression. This is because both terms have more similarities than differences. It can be said that ANOVA and regression are the two sides of the same coin.
UNIVARIATE & BIVARIATE ANALYSIS
UNIVARIATE BIVARIATE & MULTIVARIATE
UNIVARIATE ANALYSIS
-One variable analysed at a time
BIVARIATE ANALYSIS
-Two variable analysed at a time
MULTIVARIATE ANALYSIS
-More than two variables analysed at a time
TYPES OF ANALYSIS
DESCRIPTIVE ANALYSIS
INFERENTIAL ANALYSIS
DESCRIPTIVE ANALYSIS
Transformation of raw data
Facilitate easy understanding and interpretation
Deals with summary measures relating to sample data
Eg-what is the average age of the sample?
INFERENTIAL ANALYSIS
Carried out after descriptive analysis
Inferences drawn on population parameters based on sample results
Generalizes results to the population based on sample results
Eg-is the average age of population different from 35?
DESCRIPTIVE ANALYSIS OF UNIVARIATE DATA
1. Prepare frequency distribution of each variable
Missing Data
Situation where certain questions are left unanswered
Analysis of multiple responses
Measures of central tendency
3 measures of central tendency
1.Mean
2.Median
3.Mode
MEAN
Arithmetic average of a variable
Appropriate for interval and ratio scale data
x
MEDIAN
Calculates the middle value of the data
Computed for ratio, interval or ordinal scale.
Data needs to be arranged in ascending or descending order
MODE
Point of maximum frequency
Should not be computed for ordinal or interval data unless grouped.
Widely used in business
MEASURE OF DISPERSION
Measures of central tendency do not explain distribution of variables
4 measures of dispersion
1.Range
2.Variance and standard deviation
3.Coefficient of variation
4.Relative and absolute frequencies
DESCRIPTIVE ANALYSIS OF BIVARIATE DATA
There are three types of measure used.
1.Cross tabulation
2.Spearmans rank correlation coefficient
3.Pearsons linear correlation coefficient
Cross Tabulation
Responses of two questions are combined
Spearman’s rank order correlation coefficient.
Used in case of ordinal data
Data Processing and Statistical Treatment: Spreads and CorrelationJanet Penilla
A hyperlinked presentation. The objectives of the topic were written. The presentation was started with the variance and then the standard deviation provided with examples. It also answers on when to use the sample standard deviation and the population standard deviation or what type of data should we use when we calculate a standard deviation. The presentation also includes Correlations and other correlation techniques(Pearson-product moment correlation; Spearman - rank order correlation coefficient; t-test for correlation).
THIS IS MY PH.D., VIVA VOCE POWERPOINT. MY THESIS TITLE IS "EFFECTIVENESS OF E-LEARNING MODULES IN TEACHING MATHEMATICS AMONG SECONDARY TEACHER EDUCATION LEVEL"
This powerpoint presentation gives a brief explanation about the biostatic data .this is quite helpful to individuals to understand the basic research methodology terminologys
When you are working on the Inferential Statistics Paper I want yo.docxalanfhall8953
When you are working on the Inferential Statistics Paper I want you to format your paper with the following information
I. Introduction – What are inferential statistics and what is the research problem and hypothesis of the article?
II. Methods – Who are the subjects and variables within the article?
III. Results – What is the statistical analysis used, why were these tests chosen? What were the results of these tests and what do they mean?
IV. Discussion – What were the strengths of this article? What would you have done differently in terms of variables and statistical analysis? Why?
V. Conclusion – Reiterate the introduction and include relevant information that answers the questions regarding the hypothesis.
`
Read: Chapter 3 and 4 of Statistics for the Behavioral and Social Sciences.
Participate in One discussion.
Discussion 1 –Standard Normal Distribution– This allows you to look at any data set into the standard distribution form.
Quiz – Hypothesis testing
Submit your Inferential Statics Article Critique – Read Differential Effects of a Body Image Exposure Session on Smoking Urge Between Physically Active and Sedentary Female Smokers. What is the research question and hypothesis? Identify what variables were present, what inferential statistics were used and why, and if proper research methods were used. See grading rubric for full details.
Discussion Post Expectations:
Your initial post (your answer) is due by Day 3 (Thursday) of this week for Discussion 1.
When grading the Standard Normative Distribution discussion I will be looking for your answer to contain:
Week 2 Discussion 1 Board Rubric
Earned
Weight
Content Criteria
0.5
Student identifies and defines what Standard Normative Distribution (SND) is.
Student explains why it is needed to use a SND to compare two data sets.
0.5
Student identifies the purpose of a z-score in a SND.
0.5
Student identifies the purpose of a percentage in a SND.
0.25
Student explains whether a z-score or a percentage does a better job of identifying proportion of a SND.
0.25
The student responds to at least two classmates’ initial posts by Day 7.
1
Student uses correct spelling, grammar and sentence structure.
2
5
Grading - The discussions are both worth a total of 5 points. The breakdown of the grading for this week’s assignment (per discussion assignment) will be as follows:
Posting your answer by the due date (Day 3, Thursday) is worth 4 points. These five points will be based on the information outlined within the Discussion Assignment Expectations. Content will be worth 2 points and format; spelling and grammar will be worth 2 points.
Responding to two of your classmates (for each assignment) is worth 1 point. The answers must be substantive and go beyond “I agree” or “Good job” to qualify for this point.
Intellectual Elaboration:
In Wee.
This presentation consist of analysis of data in education aspects. This presentation deals about bivariate, multivariate anlaysis and it is also describes the descriptive and inferential statistics.
The presentation slides describes about the analysis of data. The presentation slides deals about scales of measurement, t test, ANOVA, ANCOVA, MANOVA, regression and SPSS help desk.
3. homogeneity of variance test
the Levene’s test
Analyse > Compare Means > One-way ANOVA
p > .05: equality of variance
p <.05 : no equality of variance
Test of Homogeneity of Variances
Levene
Statistic Df1 Df2 Sig.
3.766 1 56 .057
4. descriptive statistic
• measures of central tendency
(mean, median, mode)
• measures of spread or dispersion
(range, variance, standard deviation)
show the characteristics of the sample,
and the population as well
6. descriptive statistic
standard deviation between the sample means:
standard error (SE)
Central limit theorem (Field, 2009): The
sampling distribution (the frequency
distribution of sample means) has a
standard deviation calculated as:
s: standard deviation of the sample
n: the sample size
7. descriptive statistic
• In a normal distribution with a mean of 0 (zero) and
a standard deviation of 1, we can calculate the
probability of a score occurring.
by looking at the probability table
(SPSS will do this for us)
8. z-scores
• We can convert a raw score into a z-score (a
score in a normal distribution with a mean of 0,
SD = 1):
Important:
95% of z-scores lie between -1.96 and 1.96
or 95% of the scores will be within the limit +/- 2
SD from the mean in a normal distribution.
9. z-scores
Convert a raw score into z-score, given that:
raw score: = 7
mean: = 4
std. deviation = 3
z = 2 a person who gave a score of 7 is quite above than the average,
having 2 standard deviations from the mean.
10. the confidence interval
Based on the SE, we can calculate the boundaries
within with the population mean will fall.
• upper bound: mean + 1.96*SE
• lower bound: mean – 1.96*SE
confidence interval: a range of scores the
population mean will fall within
12. the confidence interval by error bars
The two groups differ in their means and confidence
intervals, hence they are likely to come from different
populations.
13. Gender N Mean Variance Std. Deviation
Stress at the
start of the
week
Female 16 14.81 28.16 5.307
Male 16 18.94 63.26 7.954
?
tests of differences
14. independent samples test
SPSS output
Analyse>Compare Means > Independent-Samples Test
t-test for Equality of Means
t df
Sig.
(2-tailed)
Mean
Difference
Std. Error
Difference
95% Confidence
Interval of the
Difference
Lower Upper
-1.726 30 .095 -4.125 2.390 -9.007 .757
-1.726 26.146 .096 -4.125 2.390 -9.037 .787
t = mean difference/Std. Error difference
15. independent samples t-tests
1. Do male and female participants have
different levels of computer use?
2. Does age group 1 have a higher level of
computer use than age group 2?
What do we need to find out?
How many groups involved in the study?
15
16. paired samples t-tests
• Are the mean exam score in September
different to their exam score in November?
• Are the deep learning approach of students
reported at the beginning of the study
significantly different to the deep learning
approach of students reported at the end of
the study?
What do we need to find out?
How many groups involved in the study?
16
17. test of differences
The t-test assesses whether the means of two
groups are statistically different from each other.
Assumptions:
• normal distribution
• homogeneity of variance (equal variances)
• data measured at interval level (scale)
violated? non-parametric equivalent tests
18. nonparametric tests of differences
A group of students were asked to rank the extent to which they fear of
statistics from 1 (scared) to 7 (not scared at all) at 2 different times
time 1 1 3 5 2 7 3 3 4 6
time 2 5 2 5 2 6 1 7 6 2
difference in ranks -4 1 0 0 1 2 -4 -2 4
ties positive negative
19. parametric tests nonparametric tests objectives
the single sample t-test whether the observed
mean is different from a
set value
the independent t-test the Wilcoxon rank-sum
test and the Mann
Whitney test
comparing means from
two independent groups of
individuals
the paired t-test the Wilcoxon signed
rank test
comparing the means of
two sets of observations
from the same individuals
or from pairs of individuals
tests of differences
20. Mean (SD) Mean
difference
t Sig.
score or
measurement at
time point 1
score or
measurement at
time point 2
Paired sample t-test results
reporting the results
23. what tests to use?
A TV company have started a reality TV show where 32 members of the public are
left to fend for themselves on a desert island. They have asked a psychologist to
monitor the psychological well-being of the contestants and he records a number of
indices of mental health. He is initially interested in the amount of stress
experienced by the contestants during their first week on the island and
hypothesises that:
(1)the females will report higher levels of stress than the males at the start as well
as at the end of the week (H1)
(2)the level of stress experienced by all the participants is increased by the end of
the week of the reality TV show (H2)
The data is named TVshow.sav
H1: Analyze > Compare Means > Independent-Samples Test
H2: Analyze > Compare Means > Paired-Samples T-Test
24. what tests to use?
We want to know if people who intend to get a Ph.D. or Psychology Doctor (PhD
holder) in psychology are more likely to rely on a calendar or day-planner to
remember what they are supposed to be doing (i.e., are people who might become
professors more absent minded than other people).
The ordinal variable planner measures the extent to which a person relies on a
calendar/day planner, ranging from 1 (strongly agree) to 5 strongly disagree).
The data file is named planner_use.sav.
Analyse > Nonparametric Tests > Legacy Dialogs >
2 Independent-Samples
25. Practice 1
- use the data file TVshow.sav
• Did females experience a higher level of stress at the
start of the week than males?
• Did females experience a higher level of stress at the
end of the week than males?
Which test should you use?
independent t-test
Practical guidelines page 2
Analyze > Compare Means > Independent-Samples Test
26. Practice 1
Independent Samples Test
Levene's Test for
Equality of
Variances t-test for Equality of Means
F Sig. t df
Sig.
(2-tailed)
Mean
Difference
Std. Error
Difference
95% Confidence
Interval of the
Difference
Lower Upper
Stress at the start of the
week
Equal variances assumed 3.211 .083 -1.726 30 .095 -4.125 2.390 -9.007 .757
Equal variances not
assumed
-1.726 26.146 .096 -4.125 2.390 -9.037 .787
Stress at the end of the
week
Equal variances assumed .038 .847 .522 30 .606 2.188 4.193 -6.376 10.751
Equal variances not
assumed
.522 29.673 .606 2.188 4.193 -6.380 10.755
Analyse>Compare Means > Independent-Samples Test
27. Practice 1
Conclusion?
At the start of the week, the male participants experienced a
higher level of stress (M= 18.94, SE = 1.99) than the females
(M=14.81, SE = 1.32). This difference was significant t(30) = -
1.73, p < .05. Therefore, hypothesis 1 is not supported
because the psychologist assumed that the females
experienced a higher level of stress than males.
(Practical guidelines page 3)
28. Practice 2
- use the data file planner_use.sav
• Do people who intend to do a PhD degree or PhD
holders are more likely to use a calendar or day
planner?
Which test to use?
independent t-test (nonparametric)
Practical guidelines page 6
Analyse > Nonparametric Tests > Legacy Dialogs >
2 Independent-Samples
29. Practice 2
Ranks
Intend To Get PhD or PsyD N Mean Rank Sum of Ranks
I rely on a calendar / day-planner
to remember what I am supposed
to do.
Intend to do a PhD 11 27.32 300.50
PhD holder 35 22.30 780.50
Total 46
Analyse > Nonparametric Tests > Legacy Dialogs >
2 Independent-Samples
30. Practice 2
Analyse > Nonparametric Tests > Legacy Dialogs >
2 Independent-Samples
Test Statisticsb
I rely on a calendar / day-planner to remember what I am
supposed to do.
Mann-Whitney U 150.500
Wilcoxon W 780.500
Z -1.169
Asymp. Sig. (2-tailed) .242
Exact Sig. [2*(1-tailed
Sig.)]
.284a
Exact Sig. (2-tailed) .252
Exact Sig. (1-tailed) .127
Point Probability .006
a. Not corrected for ties.
b. Grouping Variable: Intend To Get PhD or PsyD
31. Practice 2
Conclusion?
People who intend to do a PhD do not differ significantly from
PhD degree holders with regard to the use of day planner to
remember what they are supposed to be doing , U = 150.50, z =
-1.169, p > .05, ns.
(Practical guidelines page 7)