2. SPSS
2
(Statistical package for social sciences)
Presented by:
Kabir Khan
LIMS-1911
Department of Library and Information Management
Superior University , Lahore
3. CONTENT
Introduction
General Capabilities
Data Measurement & Data Analysis
Types of Variables
Level of measurement
SPSS Tabs
Graphical representation of data
Descriptive measures
Measure of Central Tendency/Location
Measure of Dispersion
3
4. INTRODUCTION
It was Introduced in 1968
It was originally developed to facilitate statistical analysis in the social
sciences.
Purchased by IBM in 2009 for more than $1 billion dollars.
4
5. GENERAL CAPABILITIES
Comprehensive software for data management and data analysis
General tabulated reports
Produce charts
Plot distributions and trends
Conduct descriptive statistics
Perform complex statistical analysis
* If you are good in formulation, then you can do most of the data
analysis in MS Excel
5
6. DATA MANAGEMENT
Defining variables
Coding variables
Entering & editing data
Creating new variables
Representation of data into
graphs & diagrams
Descriptive measures e.g central
tendency, variation
Inferential measures e.g
hypothesis testing, regression,
ANOVA etc.
6
Data Analysis
7. TYPES OF VARIABLES
Quantitative Variables: which are further divided into discrete and
continuous variables.
I. Discrete variable can assume only certain values. There are gaps
between the values, e.g no. of children in the families of a certain
locality
II. Continuous variable can assume any value within a specific
range, e.g amount of rainfall.
Qualitative variables, which are also known as categorical or non
numerical variables.
7
8. LEVEL OF MEASUREMENT
Nominal
Ordinal
Scale (ratio & interval)
* It should be noted that data can be entered manually and
data files can also be exported.
8
13. 1. SCATTER PLOT
In data analysis, the first step is finding the relationship
between the two variables. Scatter plot provide great help in
this context.
13
15. 2. BAR DIAGRAM
A simple bar chart consist of horizontal or vertical bars of
equal widths and lengths proportional to the values they
represent.
15
Simple bar diagram showing
the % of patients with
different clinical symptoms.
19. Training Workshop by PASTIC
19
Box-whisker plot is
showing that the
distribution is
symmetric.
20. 5. HISTOGRAM
Histogram consists of a set of adjacent rectangles.
Class boundaries are taken along x-axis.
Frequencies are taken along y-axis.
It should not be confused with HISTORIGRAM which is a graph of time series.
20
23. MEASURE OF CENTRAL TENDENCY/LOCATION
When two or more data sets are to be compared, the visual
representation is not enough.
So, a data set should be summarized in a single value.
Such single value is known as AVERAGE or Measure of
central tendency.
The most common types of averages are arithmetic
mean/mean, median, and mode.
23
24. 24
Continued…
1. Mean
The marks obtained by 9 students are 45, 32, 37, 46, 39, 36, 41, 48, 36. The
mean mark is given by 40 marks. What will happened if the marks of 10th student
90 is included?
25. 25
2. Median
The midpoint of the values after they have been arranged in ascending or descending
order.
Median is not affected by the extreme values.
26. 26
If the number of observations is odd:
If the number of observations is even:
27. 27
Median divides the ordered data into two equal parts. So, we have
Median = Second quartile Q2
Median can be calculated for all levels of data except nominal.
28. MEASURE OF DISPERSION
To summarize a data set, measure of central tendency is not enough.
Dispersion shows the variation in the data.
A small value for a measure of dispersion indicates that the data are
clustered closely around the average.
It means that, used type of average is reliable measure.
We will discuss range, mean deviation, variance, and standard
deviation in this section.
28