The document discusses different categories of methods for analyzing data distributions: central tendency, spread, and symmetry. Central tendency describes where data are clustered around the average or center. Spread describes how far data are from the middle. Symmetry describes the overall shape of the distribution. The document provides examples of questions that would require analyzing the central tendency, spread, or symmetry and instructs the reader to identify which one is most relevant. It emphasizes that every distribution has all three characteristics but the analysis may focus on one aspect.
Normal or skewed distributions (descriptive both2)Ken Plummer
The document discusses normal and skewed distributions and how to identify them. It provides examples of normal and skewed distributions based on golf player forearm circumference data and cat and dog IQ data. The golf player data is normally distributed while the dog IQ data is skewed left based on the skewness statistics. Therefore, at least one of the distributions is skewed.
Normal or skewed distributions (inferential)Ken Plummer
- The document discusses determining whether distributions are normal or skewed
- A distribution is considered skewed if the skewness value divided by the standard error of skewness is less than -2 or greater than 2
- For the old car data set in the example, the skewness value of -4.26 divided by the standard error is less than -2, so this distribution is negatively skewed
- The new car data set skewness value of -1.69 divided by the standard error is between -2 and 2, so this distribution is normal
Infernetial vs desctiptive (jejit + indepth)Ken Plummer
The document discusses the differences between descriptive and inferential statistics. Descriptive statistics are used to describe characteristics of a whole population, while inferential statistics are used when the whole population cannot be measured and conclusions are drawn from a sample to make generalizations about the larger population. Examples are provided to illustrate when each type of statistic would be used.
Normal or skewed distributions (descriptive both2) - Copyright updatedKen Plummer
The document discusses normal and skewed distributions and how to identify them. It provides examples of measuring forearm circumference of golf players and IQs of cats and dogs. The forearm circumference data is normally distributed while the dog IQ data is left skewed based on the skewness statistics provided. Therefore, at least one of the distributions (dog IQs) is skewed.
The document discusses different types of data:
- Scaled data provides exact amounts like miles per hour.
- Ordinal or ranked data provides comparative amounts like 1st, 2nd, 3rd place.
- Nominal data names or categorizes values like Republican or Democrat.
- Nominal proportional data are percentages like 45% Republican, 55% Democrat.
The key is determining what type of data is being influenced or is the "outcome" in questions being analyzed.
The document discusses scaled and ordinal data. Scaled data can be measured in exact amounts like distances, temperatures, and speeds. Ordinal data provides comparative amounts by ranking items from first to last place without exact measures between ranks. Examples provided determine whether data about driving speeds, state well-being rankings, giraffe heights, and baby weight percentiles are scaled or ordinal.
The document discusses different types of data, including scaled and ordinal data. Scaled data can be measured in exact amounts with equal intervals between values. Ordinal or ranked data provides comparative amounts but not necessarily equal intervals. Examples are provided of different data types, such as speed which is scaled, states ranked by well-being which is ordinal, and elephant weights which are scaled. The document concludes with practice questions to determine if data is scaled or ordinal.
Nature of the data practice - Copyright updatedKen Plummer
The document discusses different types of data:
- Scaled data provides exact amounts like 12.5 feet or 140 miles per hour.
- Ordinal or ranked data provides comparative amounts like 1st, 2nd, 3rd place.
- Nominal data names or categorizes values like Republican or Democrat.
- Nominal proportional data are simply percentages like Republican 45% or Democrat 55%.
Normal or skewed distributions (descriptive both2)Ken Plummer
The document discusses normal and skewed distributions and how to identify them. It provides examples of normal and skewed distributions based on golf player forearm circumference data and cat and dog IQ data. The golf player data is normally distributed while the dog IQ data is skewed left based on the skewness statistics. Therefore, at least one of the distributions is skewed.
Normal or skewed distributions (inferential)Ken Plummer
- The document discusses determining whether distributions are normal or skewed
- A distribution is considered skewed if the skewness value divided by the standard error of skewness is less than -2 or greater than 2
- For the old car data set in the example, the skewness value of -4.26 divided by the standard error is less than -2, so this distribution is negatively skewed
- The new car data set skewness value of -1.69 divided by the standard error is between -2 and 2, so this distribution is normal
Infernetial vs desctiptive (jejit + indepth)Ken Plummer
The document discusses the differences between descriptive and inferential statistics. Descriptive statistics are used to describe characteristics of a whole population, while inferential statistics are used when the whole population cannot be measured and conclusions are drawn from a sample to make generalizations about the larger population. Examples are provided to illustrate when each type of statistic would be used.
Normal or skewed distributions (descriptive both2) - Copyright updatedKen Plummer
The document discusses normal and skewed distributions and how to identify them. It provides examples of measuring forearm circumference of golf players and IQs of cats and dogs. The forearm circumference data is normally distributed while the dog IQ data is left skewed based on the skewness statistics provided. Therefore, at least one of the distributions (dog IQs) is skewed.
The document discusses different types of data:
- Scaled data provides exact amounts like miles per hour.
- Ordinal or ranked data provides comparative amounts like 1st, 2nd, 3rd place.
- Nominal data names or categorizes values like Republican or Democrat.
- Nominal proportional data are percentages like 45% Republican, 55% Democrat.
The key is determining what type of data is being influenced or is the "outcome" in questions being analyzed.
The document discusses scaled and ordinal data. Scaled data can be measured in exact amounts like distances, temperatures, and speeds. Ordinal data provides comparative amounts by ranking items from first to last place without exact measures between ranks. Examples provided determine whether data about driving speeds, state well-being rankings, giraffe heights, and baby weight percentiles are scaled or ordinal.
The document discusses different types of data, including scaled and ordinal data. Scaled data can be measured in exact amounts with equal intervals between values. Ordinal or ranked data provides comparative amounts but not necessarily equal intervals. Examples are provided of different data types, such as speed which is scaled, states ranked by well-being which is ordinal, and elephant weights which are scaled. The document concludes with practice questions to determine if data is scaled or ordinal.
Nature of the data practice - Copyright updatedKen Plummer
The document discusses different types of data:
- Scaled data provides exact amounts like 12.5 feet or 140 miles per hour.
- Ordinal or ranked data provides comparative amounts like 1st, 2nd, 3rd place.
- Nominal data names or categorizes values like Republican or Democrat.
- Nominal proportional data are simply percentages like Republican 45% or Democrat 55%.
The Interpretation Of Quartiles And Percentiles July 2009Maggie Verster
The document discusses measures of central tendency and dispersion used to summarize data, including the mean, median, mode, range, quartiles, and percentiles. It explains that the mean is best for values representing magnitudes, the median for ranking, and the mode for popularity. Quartiles and percentiles divide a sorted data set into four and one hundred equal parts respectively. A high percentile is desirable for exams but not waiting times, while a low percentile is preferable for race times. Together these statistics provide a concise overview of a data set.
The Normal Distribution is a symmetrical probability distribution where most results are located in the middle and few are spread on both sides. It has the shape of a bell and can entirely be described by its mean and standard deviation.
Standard deviationnormal distributionshowBiologyIB
This tutorial provides information about the normal curve and normal distributions. It discusses key characteristics of the normal curve including that most values fall in the middle and fewer values fall at the extremes. It also discusses how to calculate percentages of values that fall within a certain number of standard deviations from the mean. Additional topics covered include using z-scores to standardize values, types of normal distributions that vary in spread, and how data is not always normally distributed if skewed to one side.
MBA Super Notes: If you are doing MBA or planning to do MBA sometime in the near-future, these are a must-have.
Visit http://SirfBusiness.blogspot.com for more info.
Statistics for machine learning shifa noorulainShifaNoorUlAin1
Introduction to Statistics
Descriptive Statistics
Inferential Statistics
Categories in Statistics
Descriptive Vs Inferential Statistics
Descritive statistics Topics
-Measures of Central Tendency
-Measures of the Spread
-Measures of Asymmetry(Skewness)
Types of analytics & the structures of dataRupak Roy
Get to know more about Prescriptive, Predictive analytics like market basket analytics plus the data structure and variables to apply the analytics.
for more info you can ping me at google #bobrupakroy
This document defines and explains key statistical concepts including the mean, standard deviation, normal distribution, coefficient of variation, error bars, and t-test. The mean is the average value and is calculated by adding all values and dividing by the total number. The standard deviation measures how spread out values are from the mean. A t-test determines if there is a significant difference between the means of two populations by generating a t-score and probability based on degrees of freedom. A probability above 0.05 means the difference is not statistically significant, while below 0.05 means it is significant.
This document provides an overview of key statistical concepts for a regression analysis course, including:
- Distributions of sample data and how to visualize them with histograms.
- The difference between populations and samples, and how samples are used to make inferences about populations.
- Key terms like parameters, statistics, and estimators - where parameters are population values, statistics are calculated from samples, and estimators are used to estimate parameters.
- The importance of random sampling and how properties of estimators like unbiasedness, consistency, and minimum variance relate to how accurately they estimate population parameters.
Here are the modes for the three examples:
1. The mode is 3. This value occurs most frequently among the number of errors committed by the typists.
2. The mode is 82. This value occurs most frequently among the number of fruits yielded by the mango trees.
3. The mode is 12 and 15. These values occur most frequently among the students' quiz scores.
introduction to biostat, standard deviation and varianceamol askar
The document discusses standard deviation and variance in statistics. It defines standard deviation as a measure of how far data points are spread from the mean. A lower standard deviation indicates data points are close to the mean, while a higher standard deviation indicates data points are more spread out. It provides the formula for calculating standard deviation and explains the steps. Variance is defined as the average of the squared deviations from the mean and the formula is given. Grouped data and calculating variance from grouped data is also covered. Applications of standard deviation are listed.
Reporting chi square goodness of fit test of independence in apaKen Plummer
A chi-square goodness of fit test was used to analyze data from a public opinion poll of 1000 voters in Connecticut on their party affiliation. The expected distribution was 40% Republican and 60% Democrat, but the observed results were 32% Republican and 68% Democrat. A sample report in APA style for these results includes the chi-square value, degrees of freedom, and p-value to determine if there is a significant deviation from the expected distribution.
Descriptive statistics is used to describe and summarize key characteristics of a data set. Commonly used measures include central tendency, such as the mean, median, and mode, and measures of dispersion like range, interquartile range, standard deviation, and variance. The mean is the average value calculated by summing all values and dividing by the number of values. The median is the middle value when data is arranged in order. The mode is the most frequently occurring value. Measures of dispersion describe how spread out the data is, such as the difference between highest and lowest values (range) or how close values are to the average (standard deviation).
This document discusses descriptive statistics for one variable. Descriptive statistics summarize and describe data through measures of central tendency (mean, median, mode), variability (variance, standard deviation), and relative standing (percentiles). The mean is the average value, the median is the middle value, and the mode is the most frequent value. Variance and standard deviation describe how spread out the data is. Percentiles indicate what percentage of values are below a given number. Examples are provided to demonstrate calculating and interpreting these common descriptive statistics.
Central tendency spread - symmetry (4.0)Ken Plummer
The document discusses different types of descriptions for distributions including central tendency, spread, and symmetry. It provides examples of questions that would require analyzing the central tendency, spread, or symmetry of a distribution to answer. The document is intended to help the reader identify which type of distribution description is most relevant for answering different types of questions.
The document discusses the three main aspects used to describe data distributions: central tendency, spread, and symmetry. It defines each one as follows:
- Central tendency tells you where the data are clustering or centering, and looks for words like average, mean, median, most common, or center point.
- Spread tells you how spread out the data are from one another, and looks for words like spread, difference, deviation, range, or variation.
- Symmetry tells you the shape of the distribution, and looks for words describing the shape or whether most data are in the center, off to one side, or evenly distributed.
The document presents 7 practice problems about calculating different statistics from data sets. The problems involve comparing average cyberbullying incidents between grade levels, comparing test score ranges between therapy groups, calculating average blood pressure, examining exam score distributions, determining income variation across a school district, and describing comfort level distributions for different faculty groups. Central tendency, spread, and symmetry are the key statistical concepts addressed.
This document discusses statistical concepts for summarizing data, including:
1. Prevalence refers to existing cases of a condition in a population at a given time, while incidence is the number of new cases over a period.
2. Location measures like mode, median, and mean summarize the central tendency of data. The mean uses all data values, while the median is not affected by outliers.
3. Spread measures like range, interquartile range, and standard deviation describe how dispersed data values are. The standard deviation is the most common measure of spread.
4. Choosing the appropriate summary measure depends on the type of variable (nominal, ordinal, or continuous) and whether the data is ske
Descriptive statistics are used to summarize large datasets and communicate findings. There are measures of central tendency like mean, median, and mode to describe typical values. Measures of dispersion like range and standard deviation quantify how spread out the data is. Skewness measures describe the symmetry of distributions. Together these statistics condense complex data into clear high-level insights.
Survey Methodology and Questionnaire Design Theory Part IQualtrics
Do you know what's going on in your respondents' heads as they take your survey? How can you design your questionnaire to collect better data? Understanding the answers to these questions can help you design surveys that collect high quality insights you can depend on.
Dave Vannette, principal research scientist at Qualtrics, shares his best hacks for designing surveys that will help you get quality data. In this presentation, Dave also highlights what your respondents are thinking when they take your surveys, and how your survey design can affect the responses you collect.
The Interpretation Of Quartiles And Percentiles July 2009Maggie Verster
The document discusses measures of central tendency and dispersion used to summarize data, including the mean, median, mode, range, quartiles, and percentiles. It explains that the mean is best for values representing magnitudes, the median for ranking, and the mode for popularity. Quartiles and percentiles divide a sorted data set into four and one hundred equal parts respectively. A high percentile is desirable for exams but not waiting times, while a low percentile is preferable for race times. Together these statistics provide a concise overview of a data set.
The Normal Distribution is a symmetrical probability distribution where most results are located in the middle and few are spread on both sides. It has the shape of a bell and can entirely be described by its mean and standard deviation.
Standard deviationnormal distributionshowBiologyIB
This tutorial provides information about the normal curve and normal distributions. It discusses key characteristics of the normal curve including that most values fall in the middle and fewer values fall at the extremes. It also discusses how to calculate percentages of values that fall within a certain number of standard deviations from the mean. Additional topics covered include using z-scores to standardize values, types of normal distributions that vary in spread, and how data is not always normally distributed if skewed to one side.
MBA Super Notes: If you are doing MBA or planning to do MBA sometime in the near-future, these are a must-have.
Visit http://SirfBusiness.blogspot.com for more info.
Statistics for machine learning shifa noorulainShifaNoorUlAin1
Introduction to Statistics
Descriptive Statistics
Inferential Statistics
Categories in Statistics
Descriptive Vs Inferential Statistics
Descritive statistics Topics
-Measures of Central Tendency
-Measures of the Spread
-Measures of Asymmetry(Skewness)
Types of analytics & the structures of dataRupak Roy
Get to know more about Prescriptive, Predictive analytics like market basket analytics plus the data structure and variables to apply the analytics.
for more info you can ping me at google #bobrupakroy
This document defines and explains key statistical concepts including the mean, standard deviation, normal distribution, coefficient of variation, error bars, and t-test. The mean is the average value and is calculated by adding all values and dividing by the total number. The standard deviation measures how spread out values are from the mean. A t-test determines if there is a significant difference between the means of two populations by generating a t-score and probability based on degrees of freedom. A probability above 0.05 means the difference is not statistically significant, while below 0.05 means it is significant.
This document provides an overview of key statistical concepts for a regression analysis course, including:
- Distributions of sample data and how to visualize them with histograms.
- The difference between populations and samples, and how samples are used to make inferences about populations.
- Key terms like parameters, statistics, and estimators - where parameters are population values, statistics are calculated from samples, and estimators are used to estimate parameters.
- The importance of random sampling and how properties of estimators like unbiasedness, consistency, and minimum variance relate to how accurately they estimate population parameters.
Here are the modes for the three examples:
1. The mode is 3. This value occurs most frequently among the number of errors committed by the typists.
2. The mode is 82. This value occurs most frequently among the number of fruits yielded by the mango trees.
3. The mode is 12 and 15. These values occur most frequently among the students' quiz scores.
introduction to biostat, standard deviation and varianceamol askar
The document discusses standard deviation and variance in statistics. It defines standard deviation as a measure of how far data points are spread from the mean. A lower standard deviation indicates data points are close to the mean, while a higher standard deviation indicates data points are more spread out. It provides the formula for calculating standard deviation and explains the steps. Variance is defined as the average of the squared deviations from the mean and the formula is given. Grouped data and calculating variance from grouped data is also covered. Applications of standard deviation are listed.
Reporting chi square goodness of fit test of independence in apaKen Plummer
A chi-square goodness of fit test was used to analyze data from a public opinion poll of 1000 voters in Connecticut on their party affiliation. The expected distribution was 40% Republican and 60% Democrat, but the observed results were 32% Republican and 68% Democrat. A sample report in APA style for these results includes the chi-square value, degrees of freedom, and p-value to determine if there is a significant deviation from the expected distribution.
Descriptive statistics is used to describe and summarize key characteristics of a data set. Commonly used measures include central tendency, such as the mean, median, and mode, and measures of dispersion like range, interquartile range, standard deviation, and variance. The mean is the average value calculated by summing all values and dividing by the number of values. The median is the middle value when data is arranged in order. The mode is the most frequently occurring value. Measures of dispersion describe how spread out the data is, such as the difference between highest and lowest values (range) or how close values are to the average (standard deviation).
This document discusses descriptive statistics for one variable. Descriptive statistics summarize and describe data through measures of central tendency (mean, median, mode), variability (variance, standard deviation), and relative standing (percentiles). The mean is the average value, the median is the middle value, and the mode is the most frequent value. Variance and standard deviation describe how spread out the data is. Percentiles indicate what percentage of values are below a given number. Examples are provided to demonstrate calculating and interpreting these common descriptive statistics.
Central tendency spread - symmetry (4.0)Ken Plummer
The document discusses different types of descriptions for distributions including central tendency, spread, and symmetry. It provides examples of questions that would require analyzing the central tendency, spread, or symmetry of a distribution to answer. The document is intended to help the reader identify which type of distribution description is most relevant for answering different types of questions.
The document discusses the three main aspects used to describe data distributions: central tendency, spread, and symmetry. It defines each one as follows:
- Central tendency tells you where the data are clustering or centering, and looks for words like average, mean, median, most common, or center point.
- Spread tells you how spread out the data are from one another, and looks for words like spread, difference, deviation, range, or variation.
- Symmetry tells you the shape of the distribution, and looks for words describing the shape or whether most data are in the center, off to one side, or evenly distributed.
The document presents 7 practice problems about calculating different statistics from data sets. The problems involve comparing average cyberbullying incidents between grade levels, comparing test score ranges between therapy groups, calculating average blood pressure, examining exam score distributions, determining income variation across a school district, and describing comfort level distributions for different faculty groups. Central tendency, spread, and symmetry are the key statistical concepts addressed.
This document discusses statistical concepts for summarizing data, including:
1. Prevalence refers to existing cases of a condition in a population at a given time, while incidence is the number of new cases over a period.
2. Location measures like mode, median, and mean summarize the central tendency of data. The mean uses all data values, while the median is not affected by outliers.
3. Spread measures like range, interquartile range, and standard deviation describe how dispersed data values are. The standard deviation is the most common measure of spread.
4. Choosing the appropriate summary measure depends on the type of variable (nominal, ordinal, or continuous) and whether the data is ske
Descriptive statistics are used to summarize large datasets and communicate findings. There are measures of central tendency like mean, median, and mode to describe typical values. Measures of dispersion like range and standard deviation quantify how spread out the data is. Skewness measures describe the symmetry of distributions. Together these statistics condense complex data into clear high-level insights.
Survey Methodology and Questionnaire Design Theory Part IQualtrics
Do you know what's going on in your respondents' heads as they take your survey? How can you design your questionnaire to collect better data? Understanding the answers to these questions can help you design surveys that collect high quality insights you can depend on.
Dave Vannette, principal research scientist at Qualtrics, shares his best hacks for designing surveys that will help you get quality data. In this presentation, Dave also highlights what your respondents are thinking when they take your surveys, and how your survey design can affect the responses you collect.
Quick reminder is this a central tendency - spread - symmetry questionKen Plummer
Central Tendency, Spread, and Distribution Shape are three types of questions related to how data is organized in a distribution. Central Tendency represents the middle point where data scores cluster, Spread refers to the dispersion of scores around the central point, and Distribution Shape describes whether the data is normally distributed, skewed, or another pattern.
The document discusses central tendency and skewness. In Demo #1, it explains that the median is the best measure of central tendency for a positively skewed distribution because it is not influenced by outliers. In Demo #2, it states the mode is best for a multimodal distribution because it indicates the most frequent values. Demo #3 explains that if the mean is lower than the median, the distribution is negatively skewed.
This document provides an introduction to measures of central tendency and dispersion used in descriptive statistics. It defines and explains key terms including mean, median, mode, range, standard deviation, variance, percentiles, and distributions. Examples are given using a fictional dataset on professors' weights to demonstrate how to calculate and interpret these descriptive statistics. Different ways of organizing and visually presenting data through tables, graphs, histograms, pie charts and scatter plots are also outlined.
This document provides an introduction to measures of central tendency and dispersion used in descriptive statistics. It defines and explains key terms including mean, median, mode, range, standard deviation, variance, percentiles, and distributions. Examples are given using a fictional dataset on professors' weights to demonstrate how to calculate and interpret these descriptive statistics. Different ways of organizing and visually presenting data through tables, graphs, histograms, pie charts and scatter plots are also outlined.
QUESTION 1Question 1 Describe the purpose of ecumenical servic.docxmakdul
This document contains a summary of a research article that examines the relationship between patient satisfaction scores and inpatient admission volumes at teaching and non-teaching hospitals. The study found a statistically significant positive correlation between patient satisfaction and admissions at teaching hospitals, but a non-significant negative correlation at non-teaching hospitals. When combined, teaching and non-teaching hospitals showed a statistically significant negative correlation. The findings suggest patient satisfaction may impact admissions more at teaching hospitals. The conclusion provides recommendations for healthcare organizations to strategically focus on patient satisfaction to strengthen performance.
These introductory statistics slides will give you a basic understanding of statistics, types of statistics, variable and its types, the levels of measurements, data collection techniques, and types of sampling.
This document outlines the syllabus for a statistics and probabilities course, which covers topics such as descriptive statistics like measures of central tendency and dispersion, probability distributions, hypothesis testing, regression, and experimental design. It provides definitions and examples of key statistical concepts like populations, samples, variables, measures of central tendency including mean, median and mode, and measures of dispersion like range, mean deviation, variance and standard deviation. The course aims to teach students how to make informed judgments and decisions using statistical methods.
This document provides an introduction to descriptive statistics including measures of central tendency (mean, median, mode) and measures of dispersion (range, standard deviation, variance). It explains how to calculate and interpret these statistics. Examples are provided using data on professors' weights to demonstrate calculating mean, median, mode, standard deviation, and using percentiles. Different types of graphs are introduced for organizing data such as histograms, bar graphs, pie charts, line graphs and scatter plots.
BUS308 – Week 1 Lecture 2 Describing Data Expected Out.docxcurwenmichaela
BUS308 – Week 1 Lecture 2
Describing Data
Expected Outcomes
After reading this lecture, the student should be familiar with:
1. Basic descriptive statistics for data location
2. Basic descriptive statistics for data consistency
3. Basic descriptive statistics for data position
4. Basic approaches for describing likelihood
5. Difference between descriptive and inferential statistics
What this lecture covers
This lecture focuses on describing data and how these descriptions can be used in an
analysis. It also introduces and defines some specific descriptive statistical tools and results.
Even if we never become a data detective or do statistical tests, we will be exposed and
bombarded with statistics and statistical outcomes. We need to understand what they are telling
us and how they help uncover what the data means on the “crime,” AKA research question/issue.
How we obtain these results will be covered in lecture 1-3.
Detecting
In our favorite detective shows, starting out always seems difficult. They have a crime,
but no real clues or suspects, no idea of what happened, no “theory of the crime,” etc. Much as
we are at this point with our question on equal pay for equal work.
The process followed is remarkably similar across the different shows. First, a case or
situation presents itself. The heroes start by understanding the background of the situation and
those involved. They move on to collecting clues and following hints, some of which do not pan
out to be helpful. They then start to build relationships between and among clues and facts,
tossing out ideas that seemed good but lead to dead-ends or non-helpful insights (false leads,
etc.). Finally, a conclusion is reached and the initial question of “who done it” is solved.
Data analysis, and specifically statistical analysis, is done quite the same way as we will
see.
Descriptive Statistics
Week 1 Clues
We are interested in whether or not males and females are paid the same for doing equal
work. So, how do we go about answering this question? The “victim” in this question could be
considered the difference in pay between males and females, specifically when they are doing
equal work. An initial examination (Doc, was it murder or an accident?) involves obtaining
basic information to see if we even have cause to worry.
The first action in any analysis involves collecting the data. This generally involves
conducting a random sample from the population of employees so that we have a manageable
data set to operate from. In this case, our sample, presented in Lecture 1, gave us 25 males and
25 females spread throughout the company. A quick look at the sample by HR provided us with
assurance that the group looked representative of the company workforce we are concerned with
as a whole. Now we can confidently collect clues to see if we should be concerned or not.
As with any detective, the first issue is to understand the.
This document discusses various statistical concepts for summarizing and analyzing quantitative data, including:
- Descriptive statistics like mean, median, mode, range, and standard deviation to summarize central tendency and variability.
- Different measurement scales for data like nominal, ordinal, interval, and ratio scales.
- Graphical representations of data like histograms, bar graphs, and scatterplots.
- Correlational research which investigates relationships between two variables using the Pearson correlation coefficient.
ANALYSIS ANDINTERPRETATION OF DATA Analysis and Interpr.docxcullenrjzsme
ANALYSIS AND
INTERPRETATION
OF DATA
Analysis and Interpretation of Data
https://my.visme.co/render/1454658672/www.erau.edu
Slide 1 Transcript
In a qualitative design, the information gathered and studied often is nominal or narrative in form. Finding trends, patterns, and relationships is discovered inductively and upon
reflection. Some describe this as an intuitive process. In Module 4, qualitative research designs were explained along with the process of how information gained shape the inquiry as it
progresses. For the most part, qualitative designs do not use numerical data, unless a mixed approach is adopted. So, in this module the focus is on how numerical data collected in either
a qualitative mixed design or a quantitative research design are evaluated. In quantitative studies, typically there is a hypothesis or particular research question. Measures used to assess
the value of the hypothesis involve numerical data, usually organized in sets and analyzed using various statistical approaches. Which statistical applications are appropriate for the data of
interest will be the focus for this module.
Data and Statistics
Match the data with an
appropriate statistic
Approaches based on data
characteristics
Collected for single or multiple
groups
Involve continuous or discrete
variables
Data are nominal, ordinal,
interval, or ratio
Normal or non-normal distribution
Statistics serve two
functions
Descriptive: Describe what
data look like
Inferential: Use samples
to estimate population
characteristics
Slide 3 Transcript
There are, of course, far too many statistical concepts to consider than time allows for us here. So, we will limit ourselves to just a few basic ones and a brief overview of the more
common applications in use. It is vitally important to select the proper statistical tool for analysis, otherwise, interpretation of the data is incomplete or inaccurate. Since different
statistics are suitable for different kinds of data, we can begin sorting out which approach to use by considering four characteristics:
1. Have data been collected for a single group or multiple groups
2. Do the data involve continuous or discrete variables
3. Are the data nominal, ordinal, interval, or ratio, and
4. Do the data represent a normal or non-normal distribution.
We will address each of these approaches in the slides that follow. Statistics can serve two main functions – one is to describe what the data look like, which is called descriptive statistics.
The other is known as inferential statistics which typically uses a small sample to estimate characteristics of the larger population. Let’s begin with descriptive statistics and the measures
of central tendency.
Descriptive Statistics and Central Measures
Descriptive statistics
organize and present data
Mode
The number occurring most
frequently; nominal data
Quickest or rough estimate
Most typical value
Measures of central
tendenc.
The document discusses descriptive statistical analysis techniques used in marketing research such as measures of central tendency, variability, frequency distributions, and hypothesis testing. It provides examples of how to calculate the mean, median, mode, and range of a data set and construct a frequency distribution table. The document also demonstrates how to conduct a hypothesis test to determine if a sample provides sufficient evidence to support or reject a hypothesized population parameter value.
Statistical concepts and their applications in various fields:
- Statistics involves collecting and analyzing numerical data to draw valid conclusions. It requires careful research planning and design.
- Descriptive statistics summarize data through measures of central tendency (mean, median, mode) and variability (range, standard deviation).
- Inferential statistics test hypotheses and make estimates about populations based on samples.
- Biostatistics is applied in community medicine, public health, cancer research, pharmacology, and demography to study disease trends, treatment effectiveness, and population attributes. It is also used in advanced biomedical technologies and ecology.
Similar to Central spread - symmetry (jejit + indepth) (20)
Diff rel gof-fit - jejit - practice (5)Ken Plummer
The document discusses the differences between questions of difference, relationship, and goodness of fit. It provides examples to illustrate each type of question. A question of difference compares two or more groups on some outcome, like comparing younger and older drivers' average driving speeds. A question of relationship examines whether a change in one variable causes a change in another, such as the relationship between age and flexibility. A question of goodness of fit assesses how well a claim matches reality, such as whether a salesman's claim of software effectiveness fits the results of user testing.
This document provides examples of questions that ask for the lowest and highest number in a set of data. The questions ask for the difference between the state with the lowest and highest church attendance, the students with the highest and lowest test scores, and the slowest and fastest versions of a vehicle model.
Inferential vs descriptive tutorial of when to use - Copyright UpdatedKen Plummer
The document discusses the differences between descriptive and inferential statistics. Descriptive statistics are used to describe characteristics of a whole population, while inferential statistics are used when the whole population cannot be measured and conclusions are drawn from a sample to generalize to the larger population. Examples are provided to illustrate when each type of statistic would be used. Key differences include descriptive statistics examining entire populations while inferential statistics examine samples that aim to infer conclusions about populations.
Diff rel ind-fit practice - Copyright UpdatedKen Plummer
The document provides explanations and examples for different types of statistical questions:
- Difference questions compare two or more groups on an outcome.
- Relationship questions examine if a change in one variable is associated with a change in another variable.
- Independence questions determine if two variables with multiple levels are independent of each other.
- Goodness of fit questions assess how well a claim matches reality.
Examples are given for each type of question to illustrate key concepts like comparing groups, examining associations between variables, assessing independence, and evaluating how a claim fits observed data.
Normal or skewed distributions (inferential) - Copyright updatedKen Plummer
- The document discusses determining whether distributions are normal or skewed
- A distribution is considered skewed if the skewness value divided by the standard error of skewness is less than -2 or greater than 2
- For the old car data set in the example, the skewness value of -4.26 divided by the standard error is less than -2, so this distribution is negatively skewed
- The new car data set skewness value of -1.69 divided by the standard error is between -2 and 2, so this distribution is normal
Nature of the data (spread) - Copyright updatedKen Plummer
The document discusses scaled and ordinal data. Scaled data can be measured in exact amounts like distances and speeds. Ordinal data provides comparative amounts by ranking items, like the top 3 states in terms of well-being. Examples ask the reader to identify if data is scaled or ordinal, like driving speeds which are scaled, or baby weight percentiles which are ordinal as they compare weights.
The document is a series of questions and examples that explain what it means for a question to ask about the "most frequent response". It provides examples of questions asking about the highest/most number of something based on data in tables or lists. It then asks a series of questions to determine if they are asking about the most frequent/common response based on the data given.
Nature of the data (descriptive) - Copyright updatedKen Plummer
The document discusses two types of data: scaled data and ordinal data. Scaled data can be measured in exact amounts with equal intervals between values. Ordinal or ranked data provides comparative amounts but not necessarily equal intervals. Several examples are provided to illustrate the difference, including driving speed, states ranked by well-being, and elephant weights. Practice questions are also included for the reader to determine if data examples provided are scaled or ordinal.
The document discusses whether variables are dichotomous or scaled when calculating correlations. It provides examples of correlations between ACT scores and whether students attended private or public school. One example has ACT scores as a scaled variable and school type as dichotomous. Another has lower and higher ACT scores as dichotomous and school type as dichotomous. It emphasizes determining if variables are both dichotomous, or if one is dichotomous and one is scaled.
The document discusses the correlation between ACT scores and a measure of school belongingness. It determines that one of the variables, which has a sample size less than 30, is skewed and has many ties. As a result, a non-parametric test should be used to analyze the relationship between the two variables.
The document discusses using parametric versus non-parametric tests based on sample size for skewed distributions. For skewed distributions with a sample size less than 30, a non-parametric test is recommended. For skewed distributions with a sample size greater than or equal to 30, a parametric test is recommended. It provides examples analyzing the correlation between ACT scores and sense of school belongingness using both approaches.
The document discusses whether there are many ties or few/no ties within the variables of the relationship question "What is the correlation between ACT rankings (ordinal) and sense of school belongingness (scaled 1-10)?". It determines that ACT rankings, being ordinal, have many ties, while sense of school belongingness, being on a scale of 1-10, may have many or few ties depending on how scores are distributed.
The document discusses identifying whether variables in statistical analyses are ordinal or nominal. It provides examples of relationships between variables such as ACT rankings and sense of school belongingness, daily social media use and sense of well-being, and private/public school enrollment and sense of well-being. It asks the reader to identify if variables in examples like running speed and shoe/foot size or LSAT scores and test anxiety are ordinal or nominal.
The document discusses covariates and their impact on relationships between variables. It defines a covariate as a variable that is controlled for or eliminated from a study. It explains that if a covariate is related to one of the variables in the relationship being examined, it can impact the strength of that relationship. Examples are provided to demonstrate when a question involves a covariate or not.
This document discusses the nature of variables in relationship questions. It can be determined that the variables are either both scaled, at least one is ordinal, or at least one is nominal. Examples of different relationship questions are provided that fall into each of these categories. The document also provides practice questions for the user to determine which category the variables fall into.
The document discusses the number of variables involved in research questions. It explains that many relationship questions deal with two variables, such as gender predicting driving speed. However, some questions deal with three or more variables, for example gender and age predicting driving speed. The document asks the reader to identify whether example research questions involve two or three or more variables.
The document discusses independent and dependent variables in research questions. It provides examples to illustrate that an independent variable has at least two levels and may have more, such as religious affiliation having two levels (Western religion and Eastern religion) or company type having three levels (Company X, Company Y, Company Z). It then provides a practice example about employee satisfaction rates among morning, afternoon, and evening shifts, identifying shift status as the independent variable with three levels.
The document discusses independent variables and how they relate to research questions. It provides examples of questions with one independent variable, two independent variables, and zero independent variables. An independent variable influences or impacts a dependent variable. Questions are presented about employee satisfaction rates, agent commissions, training proficiency, and cyberbullying incidents to illustrate different numbers of independent variables.
This document discusses dependent and explanatory variables in research questions. It provides examples of questions with one and two dependent variables. A dependent variable is the thing being influenced or measured in a research study. An explanatory variable is what does the influencing. Good research questions will have one or more clearly defined dependent variables and one or more explanatory variables. The document uses examples like shifts' influence on employee satisfaction and training methods' influence on proficiency to illustrate dependent and explanatory variables.
The document discusses predictive versus arbitrary relationships. A predictive relationship is when one variable clearly influences or predicts another variable, like daily temperature predicting sunscreen sales. An arbitrary relationship is when it is unclear which variable causes or predicts the other, like whether wealth impacts education or education impacts wealth. Two examples are then given of problems asking whether relationships between variables are predictive or arbitrary.
Level 3 NCEA - NZ: A Nation In the Making 1872 - 1900 SML.pptHenry Hollis
The History of NZ 1870-1900.
Making of a Nation.
From the NZ Wars to Liberals,
Richard Seddon, George Grey,
Social Laboratory, New Zealand,
Confiscations, Kotahitanga, Kingitanga, Parliament, Suffrage, Repudiation, Economic Change, Agriculture, Gold Mining, Timber, Flax, Sheep, Dairying,
This document provides an overview of wound healing, its functions, stages, mechanisms, factors affecting it, and complications.
A wound is a break in the integrity of the skin or tissues, which may be associated with disruption of the structure and function.
Healing is the body’s response to injury in an attempt to restore normal structure and functions.
Healing can occur in two ways: Regeneration and Repair
There are 4 phases of wound healing: hemostasis, inflammation, proliferation, and remodeling. This document also describes the mechanism of wound healing. Factors that affect healing include infection, uncontrolled diabetes, poor nutrition, age, anemia, the presence of foreign bodies, etc.
Complications of wound healing like infection, hyperpigmentation of scar, contractures, and keloid formation.
A Visual Guide to 1 Samuel | A Tale of Two HeartsSteve Thomason
These slides walk through the story of 1 Samuel. Samuel is the last judge of Israel. The people reject God and want a king. Saul is anointed as the first king, but he is not a good king. David, the shepherd boy is anointed and Saul is envious of him. David shows honor while Saul continues to self destruct.
Gender and Mental Health - Counselling and Family Therapy Applications and In...PsychoTech Services
A proprietary approach developed by bringing together the best of learning theories from Psychology, design principles from the world of visualization, and pedagogical methods from over a decade of training experience, that enables you to: Learn better, faster!
Temple of Asclepius in Thrace. Excavation resultsKrassimira Luka
The temple and the sanctuary around were dedicated to Asklepios Zmidrenus. This name has been known since 1875 when an inscription dedicated to him was discovered in Rome. The inscription is dated in 227 AD and was left by soldiers originating from the city of Philippopolis (modern Plovdiv).
1. What category of method is appropriate for the
research question? Methods dealing with …
Central Tendency? Spread? Symmetry?
2. Central Tendency Tells you about where the data are
clustering or centered in the
distribution (e.g., average, center, middle).
Spread Tells you about how spread out the
data are from the middle. (e.g., spread,
difference, deviation, range, variation, vary
etc.)
Symmetry Tells you about the shape of the
distribution.
3. Note – if before going on you would like to
brush up on what a distribution is – click the link
below:
Link to instruction on distributions
Return to this slide when you are done.
5. Central Tendency
Central Tendency, Spread, or Distribution Shape?
Tells you about where the data are
clustering or centering.
Look for words like, average, mean,
median, most common, center point
etc.
Note – to review what a distribution is go to slide 44 and then return to this slide.
6. Central Tendency
Central Tendency, Spread, or Distribution Shape?
Tells you about where the data are
clustering or centering.
Look for words like, average, mean,
median, most common, center point
etc.
Note – to review what a distribution is go to slide 44 and then return to this slide.
7. Central Tendency
Central Tendency, Spread, or Distribution Shape?
Tells you about where the data are
clustering or centering.
Look for words like, average, mean,
median, most common, center point
etc.
Examples:
What is the average test score?
What is the most frequent score?
What is the median income?
9. Central Tendency, Spread, or Distribution Shape?
Spread Tells you about how spread out the
data are from one another.
Look for words like spread,
difference, deviation, range,
variation, vary, etc.
10. Central Tendency, Spread, or Distribution Shape?
Examples:
How spread out are the test results?
How much does income vary in community X?
What is the difference between the highest and lowest scores?
Spread Tells you about how spread out the
data are from one another.
Look for words like spread,
difference, deviation, range,
variation, vary, etc.
12. Central Tendency, Spread, or Distribution Shape?
Symmetry Tells you about the shape of the
distribution.
Look for words like shape or
expressions like
• Most of the data in the center.
• A few data points to the left or right.
• All data evenly distributed.
• Shape of the distribution is very
peaked.
13. Central Tendency, Spread, or Distribution Shape?
Example:
Are the lab results mostly on the high or low end?
Symmetry Tells you about the shape of the
distribution.
Look for words like shape or
expressions like
• Most of the data in the center.
• A few data points to the left or right.
• All data evenly distributed.
• Shape of the distribution is very
peaked.
14. Important Note
Every data set has central tendency, spread,
and symmetry.
Student Test Score
Anne 35
Bob 31
Carla 43
Daniel 29
Eva 30
15. Important Note
Every data set has central tendency, spread, and
symmetry.
Student Test Score
Anne 35
Bob 31
Carla 43
Daniel 29
Eva 30
In the practices that follow, you
will be directed to focus on one
of these at a time.
17. Central Tendency Tells you about where the data are
clustering or centered in the
distribution (e.g., average, center, middle).
Spread Tells you about how spread out the
data are from the middle. (e.g., spread,
difference, deviation, range, variation, vary
etc.)
Symmetry Tells you about the shape of the
distribution.
19. Problem #1
Question: What is the average number of cyberbullying incidents
among freshmen, sophomores, juniors, and seniors?
20. Problem #1
Question: What is the average number of cyberbullying incidents
among freshmen, sophomores, juniors, and seniors?
Instructions: Which type of description of the distribution
(below) is most relevant to the question (above)?
Central Tendency Spread Symmetry
21. Problem #1
Question: What is the average number of cyberbullying incidents
among freshmen, sophomores, juniors, and seniors?
Central Tendency Spread Symmetry
22. Problem #1
Question: What is the average number of cyberbullying incidents
among freshmen, sophomores, juniors, and seniors?
Central Tendency Spread Symmetry
Tells you about where the data are clustering or centering.
Look for words like, average, mean, median, most common, center point etc.
23. Problem #1
Question: What is the average number of cyberbullying
incidents among freshmen, sophomores, juniors, and seniors?
Central Tendency Spread Symmetry
Tells you about where the data are clustering or centering.
Look for words like, average, mean, median, most common, center point etc.
24. Problem #2
Question: What is the difference between the lowest and
highest score on a survey designed to test the impact of a new
therapy?
25. Problem #2
Question: What is the difference between the lowest and
highest score on a survey designed to test the impact of a new
therapy?
Instructions: Which type of description of the distribution
(below) is most relevant to the question (above)?
Central Tendency Spread Symmetry
26. Problem #2
Question: What is the difference between the lowest and
highest score on a survey designed to test the impact of a new
therapy?
Central Tendency Spread Symmetry
27. Problem #2
Question: What is the difference between the lowest and
highest score on a survey designed to test the impact of a new
therapy?
Central Tendency Spread Symmetry
Tells you about how spread out the data are from one another.
Look for words like spread, difference, deviation, range, variation, etc.
28. Problem #2
Question: What is the difference between the lowest and
highest score on a survey designed to test the impact of a new
therapy?
Central Tendency Spread Symmetry
Tells you about how spread out the data are from one another.
Look for words like spread, difference, deviation, range, variation, etc.
29. Problem #3
Question: The director of a health clinic has asked you to help
her analyze data from the results of patient systolic blood
pressure readings. You decide to compute the mean systolic
pressure to see where the patient results cluster.
30. Problem #3
Question: The director of a health clinic has asked you to help
her analyze data from the results of patient systolic blood
pressure readings. You decide to compute the mean systolic
pressure to see where the patient results cluster.
Instructions: Which type of description of the distribution
(below) is most relevant to the question (above)?
Central Tendency Spread Symmetry
31. Problem #3
Question: The director of a health clinic has asked you to help
her analyze data from the results of patient systolic blood
pressure readings. You decide to compute the mean systolic
pressure to see where the patient results cluster.
Central Tendency Spread Symmetry
32. Problem #3
Question: The director of a health clinic has asked you to help
her analyze data from the results of patient systolic blood
pressure readings. You decide to compute the mean systolic
pressure to see where the patient results cluster.
Central Tendency Spread Symmetry
Tells you about where the data are clustering or centering.
Look for words like, average, mean, median, most common, center point etc.
33. Problem #3
Question: The director of a health clinic has asked you to help
her analyze data from the results of patient systolic blood
pressure readings. You decide to compute the mean systolic
pressure to see where the patient results cluster.
Central Tendency Spread Symmetry
Tells you about where the data are clustering or centering
Look for words like, average, mean, median, most common, center point etc.
34. Problem #4
Question: An entrance exam for mechanical engineers is very
difficult. If it is very difficult you would expect most of the scores
to bunched up on the lower end of the score distribution with a
few high scores. What is the shape of this distribution?
35. Problem #4
Question: An entrance exam for mechanical engineers is very
difficult. If it is very difficult you would expect most of the scores
to bunched up on the lower end of the score distribution with a
few high scores. What is the shape of this distribution?
Instructions: Which type of description of the distribution
(below) is most relevant to the question (above)?
Central Tendency Spread Symmetry
36. Problem #4
Question: An entrance exam for mechanical engineers is very
difficult. If it is very difficult you would expect most of the scores
to bunched up on the lower end of the score distribution with a
few high scores. What is the shape of this distribution?
Central Tendency Spread Symmetry
37. Problem #4
Question: An entrance exam for mechanical engineers is very
difficult. If it is very difficult you would expect most of the scores
to bunched up on the lower end of the score distribution with a
few high scores. What is the shape of this distribution?
Tells you about the shape of the distribution.
Look for words like shape or expressions like
• Most of the data in the center.
• A few data points to the left or right.
• All data evenly distributed.
• Shape of the distribution is very peaked.
Central Tendency Spread Symmetry
38. Problem #4
Question: An entrance exam for mechanical engineers is very
difficult. If it is very difficult you would expect most of the scores
to bunched up on the lower end of the score distribution with a
few high scores. What is the shape of this distribution?
Central Tendency Spread Symmetry
Tells you about the shape of the distribution.
Look for words like shape or expressions like
• Most of the data in the center.
• A few data points to the left or right.
• All data evenly distributed.
• Shape of the distribution is very peaked.
39. What category of method is appropriate for the
research question? Methods dealing with …
Central Tendency? Spread? Symmetry?
Editor's Notes
Symmetry has to do with he shape of a distribution
When the distribution is symmetrical it has most of the values in the middle with equally decreasing values to the left and right of the distribution (as shown to the left).
A distribution is asymmetrical when it does not follow this pattern (see the bottom two images to the left).
You normally will not be asked to assess skew directly but it is an important step in determining the type of spread or central tendency statistics you will run.
Symmetry has to do with he shape of a distribution
When the distribution is symmetrical it has most of the values in the middle with equally decreasing values to the left and right of the distribution (as shown to the left).
A distribution is asymmetrical when it does not follow this pattern (see the bottom two images to the left).
You normally will not be asked to assess skew directly but it is an important step in determining the type of spread or central tendency statistics you will run.
Symmetry has to do with he shape of a distribution
When the distribution is symmetrical it has most of the values in the middle with equally decreasing values to the left and right of the distribution (as shown to the left).
A distribution is asymmetrical when it does not follow this pattern (see the bottom two images to the left).
You normally will not be asked to assess skew directly but it is an important step in determining the type of spread or central tendency statistics you will run.
Symmetry has to do with he shape of a distribution
When the distribution is symmetrical it has most of the values in the middle with equally decreasing values to the left and right of the distribution (as shown to the left).
A distribution is asymmetrical when it does not follow this pattern (see the bottom two images to the left).
You normally will not be asked to assess skew directly but it is an important step in determining the type of spread or central tendency statistics you will run.
Symmetry has to do with he shape of a distribution
When the distribution is symmetrical it has most of the values in the middle with equally decreasing values to the left and right of the distribution (as shown to the left).
A distribution is asymmetrical when it does not follow this pattern (see the bottom two images to the left).
You normally will not be asked to assess skew directly but it is an important step in determining the type of spread or central tendency statistics you will run.
Symmetry has to do with he shape of a distribution
When the distribution is symmetrical it has most of the values in the middle with equally decreasing values to the left and right of the distribution (as shown to the left).
A distribution is asymmetrical when it does not follow this pattern (see the bottom two images to the left).
You normally will not be asked to assess skew directly but it is an important step in determining the type of spread or central tendency statistics you will run.
Symmetry has to do with he shape of a distribution
When the distribution is symmetrical it has most of the values in the middle with equally decreasing values to the left and right of the distribution (as shown to the left).
A distribution is asymmetrical when it does not follow this pattern (see the bottom two images to the left).
You normally will not be asked to assess skew directly but it is an important step in determining the type of spread or central tendency statistics you will run.
Symmetry has to do with he shape of a distribution
When the distribution is symmetrical it has most of the values in the middle with equally decreasing values to the left and right of the distribution (as shown to the left).
A distribution is asymmetrical when it does not follow this pattern (see the bottom two images to the left).
You normally will not be asked to assess skew directly but it is an important step in determining the type of spread or central tendency statistics you will run.
Symmetry has to do with he shape of a distribution
When the distribution is symmetrical it has most of the values in the middle with equally decreasing values to the left and right of the distribution (as shown to the left).
A distribution is asymmetrical when it does not follow this pattern (see the bottom two images to the left).
You normally will not be asked to assess skew directly but it is an important step in determining the type of spread or central tendency statistics you will run.
Symmetry has to do with he shape of a distribution
When the distribution is symmetrical it has most of the values in the middle with equally decreasing values to the left and right of the distribution (as shown to the left).
A distribution is asymmetrical when it does not follow this pattern (see the bottom two images to the left).
You normally will not be asked to assess skew directly but it is an important step in determining the type of spread or central tendency statistics you will run.