3. Data Measurement Scales
Nominal
• Used only for
labeling data
Ordinal
• Numbers to rank
objects or
attributes
• Distance between
objects/attributes
cannot be
measured
Interval
• Distance between
objects/attributes
can be measured
• Have arbitrary
zero point
• Basic arithmetic
operations
possible
Ratio
• Highest level of
measurement
scale
• Have fixed zero
point
• All arithmetic
operations
possible
3
4. Agenda
4
Descriptive Statistics
• Graphical Representation
• Tabular Representation
• Summarization
Inferential Statistics
• Conditional Probability
• Bayes’ Theorem
• Random Variables: Mean
and Variance
• Binominal Distribution
• Poisson Distribution
• Normal Distribution
6. Tabular representation
Freq
distribution
Table that displays the frequency of outcomes in a data
sample.
Each entry in the table contains frequency or count of
occurrences of values within a particular group or
interval.
Cross
tab
Allows comparison between two or more variables
basis multiple parameters.
For example, Pivot table in excel.
6
8. Roadmap for descriptive measures
Type of analysis Numerical data Categorical data
Tabulate, organize,
graphically present values
of a variable
Freq distribution,
histogram
Summary table, bar chart
Graphically represent
relationship between 3
variables
Scatter plot Contingency table
8
9. Summarization
Central tendency
• Mean
• Median
• Mode
Variation
• Range
• Interquartile
range
• Variance
• Standard
deviation
• Coef of variation
Skewness
• Symmetrical
• Left skewed
• Right skewed
9
14. Significance of SD compared to variance
• Why do we take sum of squares for variance and then find
sqrt for SD?
• Why is SD more useful than variance?
14