At the end of this lecture, the student should be able to:
1. understand structure of research study appropriate for independent-measures t hypothesis test
2. test between two populations or two treatments using independent measures t statistics
3. understand how to evaluate the assumptions underlying this test
The document discusses the paired samples t-test, which is used to compare two sets of measurements made on the same individuals. It notes that this test is appropriate when there are two correlated distributions, such as pre-test and post-test scores from the same people. The null hypothesis is that there is no difference between the pairs. The test calculates the differences between pairs, sums them, and divides this by the standard error of the differences to obtain a t-value, which can be compared to critical values to determine if the null hypothesis can be rejected.
This document discusses inferential statistics, which uses sample data to make inferences about populations. It explains that inferential statistics is based on probability and aims to determine if observed differences between groups are dependable or due to chance. The key purposes of inferential statistics are estimating population parameters from samples and testing hypotheses. It discusses important concepts like sampling distributions, confidence intervals, null hypotheses, levels of significance, type I and type II errors, and choosing appropriate statistical tests.
Statistical inference involves drawing conclusions about a population based on a sample. It has two main areas: estimation and hypothesis testing. Estimation uses sample data to obtain point or interval estimates of unknown population parameters. Hypothesis testing determines whether to accept or reject statements about population parameters. Confidence intervals give a range of values that are likely to contain the true population parameter, with a specified level of confidence such as 90% or 95%.
T test, independant sample, paired sample and anovaQasim Raza
The document discusses various statistical analyses that can be performed in SPSS, including t-tests, ANOVA, and post-hoc tests. It provides details on one-sample t-tests, independent t-tests, paired t-tests, one-way ANOVA tests, and evaluating assumptions like normality. Examples are given on how to conduct these tests in SPSS and how to interpret the output. Guidance is provided on follow-up post-hoc tests that can be used after ANOVA to examine differences between specific groups.
This document discusses the importance of evaluating effect size, not just statistical significance (p-value), when analyzing differences between groups. It provides examples to illustrate effect size and defines some common effect size measures, such as Cohen's d and Hedges' g. The document also discusses how effect size can be used to determine required sample sizes needed to achieve a desired p-value threshold and level of confidence in the results.
This document provides an overview of various statistical analysis techniques used in inferential statistics, including t-tests, ANOVA, ANCOVA, chi-square, regression analysis, and interpreting null hypotheses. It defines key terms like alpha levels, effect sizes, and interpreting graphs. The overall purpose is to explain common statistical methods for analyzing data and determining the probability that results occurred by chance or were statistically significant.
The document provides information about the Chi Square test, including:
- It is one of the most widely used statistical tests in research.
- It compares observed frequencies to expected frequencies to test hypotheses about categorical variables.
- The key steps are defining hypotheses, calculating the test statistic, determining the degrees of freedom, finding the critical value, and making a conclusion by comparing the test statistic to the critical value.
- It can be used for goodness of fit tests, tests of homogeneity of proportions, and tests of independence between categorical variables. Examples of applications in cohort studies, case-control studies, and matched case-control studies are provided.
The document discusses the paired samples t-test, which is used to compare two sets of measurements made on the same individuals. It notes that this test is appropriate when there are two correlated distributions, such as pre-test and post-test scores from the same people. The null hypothesis is that there is no difference between the pairs. The test calculates the differences between pairs, sums them, and divides this by the standard error of the differences to obtain a t-value, which can be compared to critical values to determine if the null hypothesis can be rejected.
This document discusses inferential statistics, which uses sample data to make inferences about populations. It explains that inferential statistics is based on probability and aims to determine if observed differences between groups are dependable or due to chance. The key purposes of inferential statistics are estimating population parameters from samples and testing hypotheses. It discusses important concepts like sampling distributions, confidence intervals, null hypotheses, levels of significance, type I and type II errors, and choosing appropriate statistical tests.
Statistical inference involves drawing conclusions about a population based on a sample. It has two main areas: estimation and hypothesis testing. Estimation uses sample data to obtain point or interval estimates of unknown population parameters. Hypothesis testing determines whether to accept or reject statements about population parameters. Confidence intervals give a range of values that are likely to contain the true population parameter, with a specified level of confidence such as 90% or 95%.
T test, independant sample, paired sample and anovaQasim Raza
The document discusses various statistical analyses that can be performed in SPSS, including t-tests, ANOVA, and post-hoc tests. It provides details on one-sample t-tests, independent t-tests, paired t-tests, one-way ANOVA tests, and evaluating assumptions like normality. Examples are given on how to conduct these tests in SPSS and how to interpret the output. Guidance is provided on follow-up post-hoc tests that can be used after ANOVA to examine differences between specific groups.
This document discusses the importance of evaluating effect size, not just statistical significance (p-value), when analyzing differences between groups. It provides examples to illustrate effect size and defines some common effect size measures, such as Cohen's d and Hedges' g. The document also discusses how effect size can be used to determine required sample sizes needed to achieve a desired p-value threshold and level of confidence in the results.
This document provides an overview of various statistical analysis techniques used in inferential statistics, including t-tests, ANOVA, ANCOVA, chi-square, regression analysis, and interpreting null hypotheses. It defines key terms like alpha levels, effect sizes, and interpreting graphs. The overall purpose is to explain common statistical methods for analyzing data and determining the probability that results occurred by chance or were statistically significant.
The document provides information about the Chi Square test, including:
- It is one of the most widely used statistical tests in research.
- It compares observed frequencies to expected frequencies to test hypotheses about categorical variables.
- The key steps are defining hypotheses, calculating the test statistic, determining the degrees of freedom, finding the critical value, and making a conclusion by comparing the test statistic to the critical value.
- It can be used for goodness of fit tests, tests of homogeneity of proportions, and tests of independence between categorical variables. Examples of applications in cohort studies, case-control studies, and matched case-control studies are provided.
This document provides information on calculating effect sizes when comparing two means. It defines effect size as the extent to which a phenomenon is present in a population or how false the null hypothesis is. It lists several common effect size measures for different statistical tests, including Cohen's d for independent groups t-tests, correlation coefficients for correlational analyses, and eta squared and omega squared for ANOVA. An example is given of computing Cohen's d to compare study habits between public and private school students using t-test results.
This document discusses measures of variability, or dispersion, which describe how spread out values are in a distribution. The five most commonly used measures are the range, interquartile range, variance, standard deviation, and coefficient of variation. The standard deviation is the most widely used measure for interval or ratio data, as it indicates the average deviation of values from the mean and describes whether a sample is heterogeneous or homogeneous. The standard deviation is calculated by computing the sum of squared deviations from the mean, dividing by the sample size minus one, and taking the square root.
The document discusses various measures of central tendency, dispersion, and shape used to describe data numerically. It defines terms like mean, median, mode, variance, standard deviation, coefficient of variation, range, interquartile range, skewness, and quartiles. It provides formulas and examples of how to calculate these measures from data sets. The document also discusses concepts like normal distribution, empirical rule, and how measures of central tendency and dispersion do not provide information about the shape or symmetry of a distribution.
The document discusses validity and reliability in research. It defines validity as the degree to which a study accurately reflects the concept being measured. There are several types of validity discussed, including content validity, construct validity, and criterion-related validity. Reliability refers to the consistency of measurements. Rater reliability and instrument reliability are examined. Methods for establishing reliability include test-retest analysis, equivalence of test forms, and measures of internal consistency such as Cronbach's alpha. Generalizability and sampling methods are also summarized.
The document discusses the chi-square test, which is a non-parametric statistical test used to compare observed data with expected data in one or more categories. It does not assume an underlying distribution and can be applied to contingency tables with multiple classes. The chi-square test statistic follows a chi-square distribution, and the test determines if there is a significant difference between observed and expected frequencies.
The document provides an overview of inferential statistics. It defines inferential statistics as making generalizations about a larger population based on a sample. Key topics covered include hypothesis testing, types of hypotheses, significance tests, critical values, p-values, confidence intervals, z-tests, t-tests, ANOVA, chi-square tests, correlation, and linear regression. The document aims to explain these statistical concepts and techniques at a high level.
This document discusses various statistical techniques used for inferential statistics, including parametric and non-parametric techniques. Parametric techniques make assumptions about the population and can determine relationships, while non-parametric techniques make few assumptions and are useful for nominal and ordinal data. Commonly used parametric tests are t-tests, ANOVA, MANOVA, and correlation analysis. Non-parametric tests mentioned include Chi-square, Wilcoxon, and Friedman tests. Examples are provided to illustrate the appropriate uses of each technique.
This document discusses various quantitative data analysis techniques for research. It covers describing and summarizing data, identifying relationships between variables, comparing variables, and forecasting outcomes. The five most important methods are identified as mean, standard deviation, regression, sample size determination, and hypothesis testing. Parametric and non-parametric techniques are also discussed. Four levels of data measurement are defined: nominal, ordinal, interval, and ratio data. Examples are provided for coding nominal/ordinal data and visualizing data through graphs and charts. Statistical tests like the t-test, ANOVA, and chi-square are also summarized.
This document discusses various statistical methods used to organize and interpret data. It describes descriptive statistics, which summarize and simplify data through measures of central tendency like mean, median, and mode, and measures of variability like range and standard deviation. Frequency distributions are presented through tables, graphs, and other visual displays to organize raw data into meaningful categories.
INFERENTIAL STATISTICS: AN INTRODUCTIONJohn Labrador
For instance, we use inferential statistics to try to infer from the sample data what the population might think. Or, we use inferential statistics to make judgments of the probability that an observed difference between groups is a dependable one or one that might have happened by chance in this study.
This document summarizes key concepts from an introduction to statistics textbook. It covers types of data (quantitative, qualitative, levels of measurement), sampling (population, sample, randomization), experimental design (observational studies, experiments, controlling variables), and potential misuses of statistics (bad samples, misleading graphs, distorted percentages). The goal is to illustrate how common sense is needed to properly interpret data and statistics.
Inferential statistics use samples to make generalizations about populations. It allows researchers to test theories designed to apply to entire populations even though samples are used. The goal is to determine if sample characteristics differ enough from the null hypothesis, which states there is no difference or relationship, to justify rejecting the null in favor of the research hypothesis. All inferential tests examine the size of differences or relationships in a sample compared to variability and sample size to evaluate how deviant the results are from what would be expected by chance alone.
This presentation is for educational purpose only. I do not own the rights to written material or pictures or illustrations used.
This is being uploaded for students who are in search of, or trying to understand how a quasi-experimental research design should look like.
This document discusses frequency distributions and methods for graphically presenting frequency distribution data. It defines a frequency distribution as a tabulation or grouping of data into categories showing the number of observations in each group. The document outlines the parts of a frequency table as class limits, class size, class boundaries, and class marks. It then provides steps for constructing a frequency distribution table from a set of data. Finally, it discusses histograms and frequency polygons as methods for graphically presenting frequency distribution data, and provides examples of how to construct these graphs in Excel.
This document discusses various measures of dispersion used to describe how varied or spread out a set of data values are from the average. It describes range, interquartile range, mean deviation, standard deviation, and the Lorenz curve. Standard deviation is highlighted as the most important measure, being easy to calculate, taking all data points into account equally, and indicating how far values typically are from the average in a normal distribution. The document provides formulas and explains properties and limitations of each measure.
Topic: Population And Sample
Student Name: Sidera Saleem
Class: B.Ed. 2.5
Project Name: “Young Teachers' Professional Development (TPD)"
"Project Founder: Prof. Dr. Amjad Ali Arain
Faculty of Education, University of Sindh, Pakistan
This document discusses descriptive statistics used in research. It defines descriptive statistics as procedures used to organize, interpret, and communicate numeric data. Key aspects covered include frequency distributions, measures of central tendency (mode, median, mean), measures of variability, bivariate descriptive statistics using contingency tables and correlation, and describing risk to facilitate evidence-based decision making. The overall purpose of descriptive statistics is to synthesize and summarize quantitative data for analysis in research.
This document discusses measures of dispersion, or variation, in data sets. It describes absolute and relative measures of dispersion. Absolute measures give the dispersion in the same units as the original data but cannot be used to compare different data sets. Relative measures are dimensionless and can be used to compare dispersion across data sets.
The document then discusses the range as the simplest measure of absolute dispersion. The range is defined as the difference between the maximum and minimum values in a data set. It also defines the coefficient of range, which normalizes the range as a percentage to allow comparison between data sets. Two examples are given to demonstrate calculating the range and coefficient of range.
Experimental Research Design (True, Quasi and Pre Experimental Design)Alam Nuzhathalam
The document discusses research design and experimental research. It defines research design as a broad framework that specifies objectives, methods, timelines and responsibilities for conducting a research project. Experimental research involves manipulating independent variables and measuring their effect on dependent variables. The key aspects of experimental research design discussed are random assignment to experimental and control groups, manipulation of the independent variable, use of control groups, pre-tests and post-tests, and measuring the independent variable's effect on the dependent variable. Strengths of experimental research noted are strong control over variables and ability to identify cause-and-effect relationships.
The document describes how to conduct an independent samples t-test. It explains that the t-test is used to compare differences between separate groups. An example is provided where participants are randomly assigned to either a pizza or beer diet for a week, and their weight gain is measured. Calculations are shown to find the variance, mean, and t-value for each group. The results indicate participants on the beer diet gained significantly more weight than those on the pizza diet, t(8) = 4.47, p < .05. Instructions are also provided for conducting this analysis in SPSS.
This document discusses correlation analysis and its use in SPSS. It defines correlation as a measurement of the relationship between two variables. Pearson's r and Spearman's rho correlations are described, along with their assumptions and interpretations. The document provides examples of positive, negative, and no correlation. It also demonstrates how to conduct and interpret correlation analysis in SPSS, including checking assumptions and presenting results. The goal is for students to understand different types of correlation and how to perform correlations in SPSS.
This document provides information on calculating effect sizes when comparing two means. It defines effect size as the extent to which a phenomenon is present in a population or how false the null hypothesis is. It lists several common effect size measures for different statistical tests, including Cohen's d for independent groups t-tests, correlation coefficients for correlational analyses, and eta squared and omega squared for ANOVA. An example is given of computing Cohen's d to compare study habits between public and private school students using t-test results.
This document discusses measures of variability, or dispersion, which describe how spread out values are in a distribution. The five most commonly used measures are the range, interquartile range, variance, standard deviation, and coefficient of variation. The standard deviation is the most widely used measure for interval or ratio data, as it indicates the average deviation of values from the mean and describes whether a sample is heterogeneous or homogeneous. The standard deviation is calculated by computing the sum of squared deviations from the mean, dividing by the sample size minus one, and taking the square root.
The document discusses various measures of central tendency, dispersion, and shape used to describe data numerically. It defines terms like mean, median, mode, variance, standard deviation, coefficient of variation, range, interquartile range, skewness, and quartiles. It provides formulas and examples of how to calculate these measures from data sets. The document also discusses concepts like normal distribution, empirical rule, and how measures of central tendency and dispersion do not provide information about the shape or symmetry of a distribution.
The document discusses validity and reliability in research. It defines validity as the degree to which a study accurately reflects the concept being measured. There are several types of validity discussed, including content validity, construct validity, and criterion-related validity. Reliability refers to the consistency of measurements. Rater reliability and instrument reliability are examined. Methods for establishing reliability include test-retest analysis, equivalence of test forms, and measures of internal consistency such as Cronbach's alpha. Generalizability and sampling methods are also summarized.
The document discusses the chi-square test, which is a non-parametric statistical test used to compare observed data with expected data in one or more categories. It does not assume an underlying distribution and can be applied to contingency tables with multiple classes. The chi-square test statistic follows a chi-square distribution, and the test determines if there is a significant difference between observed and expected frequencies.
The document provides an overview of inferential statistics. It defines inferential statistics as making generalizations about a larger population based on a sample. Key topics covered include hypothesis testing, types of hypotheses, significance tests, critical values, p-values, confidence intervals, z-tests, t-tests, ANOVA, chi-square tests, correlation, and linear regression. The document aims to explain these statistical concepts and techniques at a high level.
This document discusses various statistical techniques used for inferential statistics, including parametric and non-parametric techniques. Parametric techniques make assumptions about the population and can determine relationships, while non-parametric techniques make few assumptions and are useful for nominal and ordinal data. Commonly used parametric tests are t-tests, ANOVA, MANOVA, and correlation analysis. Non-parametric tests mentioned include Chi-square, Wilcoxon, and Friedman tests. Examples are provided to illustrate the appropriate uses of each technique.
This document discusses various quantitative data analysis techniques for research. It covers describing and summarizing data, identifying relationships between variables, comparing variables, and forecasting outcomes. The five most important methods are identified as mean, standard deviation, regression, sample size determination, and hypothesis testing. Parametric and non-parametric techniques are also discussed. Four levels of data measurement are defined: nominal, ordinal, interval, and ratio data. Examples are provided for coding nominal/ordinal data and visualizing data through graphs and charts. Statistical tests like the t-test, ANOVA, and chi-square are also summarized.
This document discusses various statistical methods used to organize and interpret data. It describes descriptive statistics, which summarize and simplify data through measures of central tendency like mean, median, and mode, and measures of variability like range and standard deviation. Frequency distributions are presented through tables, graphs, and other visual displays to organize raw data into meaningful categories.
INFERENTIAL STATISTICS: AN INTRODUCTIONJohn Labrador
For instance, we use inferential statistics to try to infer from the sample data what the population might think. Or, we use inferential statistics to make judgments of the probability that an observed difference between groups is a dependable one or one that might have happened by chance in this study.
This document summarizes key concepts from an introduction to statistics textbook. It covers types of data (quantitative, qualitative, levels of measurement), sampling (population, sample, randomization), experimental design (observational studies, experiments, controlling variables), and potential misuses of statistics (bad samples, misleading graphs, distorted percentages). The goal is to illustrate how common sense is needed to properly interpret data and statistics.
Inferential statistics use samples to make generalizations about populations. It allows researchers to test theories designed to apply to entire populations even though samples are used. The goal is to determine if sample characteristics differ enough from the null hypothesis, which states there is no difference or relationship, to justify rejecting the null in favor of the research hypothesis. All inferential tests examine the size of differences or relationships in a sample compared to variability and sample size to evaluate how deviant the results are from what would be expected by chance alone.
This presentation is for educational purpose only. I do not own the rights to written material or pictures or illustrations used.
This is being uploaded for students who are in search of, or trying to understand how a quasi-experimental research design should look like.
This document discusses frequency distributions and methods for graphically presenting frequency distribution data. It defines a frequency distribution as a tabulation or grouping of data into categories showing the number of observations in each group. The document outlines the parts of a frequency table as class limits, class size, class boundaries, and class marks. It then provides steps for constructing a frequency distribution table from a set of data. Finally, it discusses histograms and frequency polygons as methods for graphically presenting frequency distribution data, and provides examples of how to construct these graphs in Excel.
This document discusses various measures of dispersion used to describe how varied or spread out a set of data values are from the average. It describes range, interquartile range, mean deviation, standard deviation, and the Lorenz curve. Standard deviation is highlighted as the most important measure, being easy to calculate, taking all data points into account equally, and indicating how far values typically are from the average in a normal distribution. The document provides formulas and explains properties and limitations of each measure.
Topic: Population And Sample
Student Name: Sidera Saleem
Class: B.Ed. 2.5
Project Name: “Young Teachers' Professional Development (TPD)"
"Project Founder: Prof. Dr. Amjad Ali Arain
Faculty of Education, University of Sindh, Pakistan
This document discusses descriptive statistics used in research. It defines descriptive statistics as procedures used to organize, interpret, and communicate numeric data. Key aspects covered include frequency distributions, measures of central tendency (mode, median, mean), measures of variability, bivariate descriptive statistics using contingency tables and correlation, and describing risk to facilitate evidence-based decision making. The overall purpose of descriptive statistics is to synthesize and summarize quantitative data for analysis in research.
This document discusses measures of dispersion, or variation, in data sets. It describes absolute and relative measures of dispersion. Absolute measures give the dispersion in the same units as the original data but cannot be used to compare different data sets. Relative measures are dimensionless and can be used to compare dispersion across data sets.
The document then discusses the range as the simplest measure of absolute dispersion. The range is defined as the difference between the maximum and minimum values in a data set. It also defines the coefficient of range, which normalizes the range as a percentage to allow comparison between data sets. Two examples are given to demonstrate calculating the range and coefficient of range.
Experimental Research Design (True, Quasi and Pre Experimental Design)Alam Nuzhathalam
The document discusses research design and experimental research. It defines research design as a broad framework that specifies objectives, methods, timelines and responsibilities for conducting a research project. Experimental research involves manipulating independent variables and measuring their effect on dependent variables. The key aspects of experimental research design discussed are random assignment to experimental and control groups, manipulation of the independent variable, use of control groups, pre-tests and post-tests, and measuring the independent variable's effect on the dependent variable. Strengths of experimental research noted are strong control over variables and ability to identify cause-and-effect relationships.
The document describes how to conduct an independent samples t-test. It explains that the t-test is used to compare differences between separate groups. An example is provided where participants are randomly assigned to either a pizza or beer diet for a week, and their weight gain is measured. Calculations are shown to find the variance, mean, and t-value for each group. The results indicate participants on the beer diet gained significantly more weight than those on the pizza diet, t(8) = 4.47, p < .05. Instructions are also provided for conducting this analysis in SPSS.
This document discusses correlation analysis and its use in SPSS. It defines correlation as a measurement of the relationship between two variables. Pearson's r and Spearman's rho correlations are described, along with their assumptions and interpretations. The document provides examples of positive, negative, and no correlation. It also demonstrates how to conduct and interpret correlation analysis in SPSS, including checking assumptions and presenting results. The goal is for students to understand different types of correlation and how to perform correlations in SPSS.
An independent t-test is used to compare the means of two independent groups on a continuous dependent variable. It tests if there is a statistically significant difference between the population means of the two groups. The test assumes the groups are independent, the dependent variable is normally distributed for each group, and the groups have equal variances. To perform the test, the researcher states the hypotheses, sets an alpha level, calculates the t-statistic and degrees of freedom, and determines whether to reject or fail to reject the null hypothesis by comparing the t-statistic to the critical value.
This document outlines a lecture on non-parametric statistics. It begins by defining parametric and non-parametric tests, noting that non-parametric tests do not require assumptions of normality and can be used when data is not of sufficient quality for parametric tests. It then reviews the assumptions of common parametric t-tests and how to determine if the data violates assumptions. The document introduces several non-parametric tests: the Mann-Whitney U test for comparing two independent groups, the Wilcoxon test for comparing two related groups, and the Kruskal-Wallis H test for comparing three or more independent groups. It provides examples and outlines how to perform each test in SPSS.
At the end of this lecture, the students should be able to
1.Understand structure of research study appropriate for ANOVA test
2.Understand how to evaluate the assumptions underlying this test
3. interpret SPSS outputs and report the results
An independent samples t-test evaluates whether the means of two independent groups on a single dependent variable are significantly different. It requires interval/ratio data, a normal distribution, one independent variable with two levels, and a single dependent variable. The null hypothesis states there is no significant difference between the two groups on the dependent variable.
The document discusses different types of t-tests used to determine if the means of two samples are statistically significantly different from each other. It describes paired sample t-tests used to compare means when the same subjects are measured before and after a treatment. It also describes two-sample t-tests used to compare independent samples that may have equal or unequal variances, and whether the tests are one-tailed or two-tailed. Examples are provided of interpreting t-test output and determining if differences are statistically significant based on the t-statistic and p-values. Non-parametric alternatives like the Mann-Whitney U test are also briefly mentioned.
The t-test is used to compare the means of two groups and has three main applications:
1) Compare a sample mean to a population mean.
2) Compare the means of two independent samples.
3) Compare the values of one sample at two different time points.
There are two main types: the independent-measures t-test for samples not matched, and the matched-pair t-test for samples in pairs. The t-test assumes normal distributions and equal variances between groups. Examples are provided to demonstrate hypothesis testing for each application.
Hypothesis is usually considered as the principal instrument in research and quality control. Its main function is to suggest new experiments and observations. In fact, many experiments are carried out with the deliberate object of testing hypothesis. Decision makers often face situations wherein they are interested in testing hypothesis on the basis of available information and then take decisions on the basis of such testing. In Six –Sigma methodology, hypothesis testing is a tool of substance and used in analysis phase of the six sigma project so that improvement can be done in right direction
Quantitative method compare means test (independent and paired)Keiko Ono
The document describes the paired samples t-test, which is used to compare means when observations are paired or related. It explains that the paired t-test accounts for natural variability between observational units by looking at the differences between each pair of observations, rather than comparing independent groups. An example compares the mileage from two types of gasoline in the same taxis to control for differences between vehicles. The test determines if the average difference between the paired observations is significantly different from zero.
The document discusses comparing measurements from two physicians who measured tumor volumes. A paired t-test was used to test if the measurements significantly differed. The t-test showed no significant difference between physicians. Additionally, two sample t-tests were used to compare tumor volumes between different cancer types, finding significant differences between brain and breast cancers and brain and liver cancers.
Null hypothesis for an independent-sample t-testKen Plummer
The document discusses the null hypothesis for an independent-samples t-test. The null hypothesis states that there is no effect or relationship between the independent and dependent variables. It provides a template for writing the null hypothesis: "There is no significant difference in [dependent variable] between [level 1 of independent variable] and [level 2 of independent variable]." Two examples applying this template are given, comparing ACT scores between students who eat different foods and comparing IQ scores between teenagers listening to different music types.
Student's T-test, Paired T-Test, ANOVA & Proportionate TestAzmi Mohd Tamil
This document discusses various statistical tests including the T-test, ANOVA, and proportionate tests. It provides details on the independent T-test, paired T-test, ANOVA, and examples of using each test. Key concepts covered include the Student's T-test, its assumptions, and how to perform manual calculations and analyze data using SPSS.
The document provides a template for reporting the results of an independent samples t-test in APA format. It demonstrates how to write a sentence summarizing that there was a significant/non-significant difference between two groups by including the group means, standard deviations, t-statistic, and p-value filled in from a sample SPSS output.
The document describes how to perform a student's t-test to compare two samples. It provides steps for both a matched pairs t-test and an independent samples t-test. For a matched pairs t-test, the steps are: 1) state the null and alternative hypotheses, 2) calculate the differences between pairs, 3) calculate the mean difference, 4) calculate the standard deviation of the differences, 5) calculate the standard error, 6) calculate the t value, 7) determine the degrees of freedom, 8) find the critical t value, and 9) determine if there is a statistically significant difference. For an independent samples t-test, similar steps are followed to calculate means, standard deviations, the difference between
The document discusses small sample tests of hypotheses. It explains that for small sample sizes (n<30), a t-distribution is used instead of the normal distribution to account for the small sample size. There are three cases discussed for small sample tests: testing a population mean, comparing the means of two independent samples, and comparing the means of two paired samples. For each case, the assumptions, test statistic (involving a t-distribution), and an example are provided.
This document provides an overview of multivariate statistics techniques, including factor analysis, multidimensional scaling, and cluster analysis. [1] Factor analysis aims to reduce the number of variables through principal component analysis or explanatory factor analysis. [2] Multidimensional scaling maps objects into a k-dimensional space to approximate given distance matrices between objects. [3] Cluster analysis groups similar objects into clusters using partitioning or hierarchical methods.
Research method ch08 statistical methods 2 anovanaranbatn
1) The document discusses various statistical methods including one-way ANOVA, repeated measures ANOVA, and ANCOVA.
2) One-way ANOVA is used to compare the means of three or more independent groups when you have one independent variable with three or more categories and one continuous dependent variable.
3) Repeated measures ANOVA is used when the same subjects are measured under different conditions to assess for main effects and interactions while accounting for the dependency of measurements within subjects.
The document describes a study that examined the effects of four different diets on weight gain in rats. Twenty-four rats were given one of four diets that varied in vitamin and protein content over two weeks. The weights were recorded and analyzed using one-way ANOVA and post hoc tests. The analysis found that diet 1 (0.1% vitamin, 10% protein) resulted in significantly greater weight gain than diets 2-4. Diets 2, 3 and 4 were not significantly different from each other in terms of weight gain. Therefore, diet 1 was determined to be the optimal diet for promoting weight gain in rats based on the statistical analysis.
OBJECTIVES:
Recognize the differences between categorical data and continuous data
Discuss assumptions of chi square distribution
Correctly interpret and use the terms:
chi-square test of independence,
contingency table
degrees of freedom,
“2x2” and “r x c” table.
Calculate expected numbers of the cells of a contingency table .
Calculate chi-square test statistic and its appropriate degrees of freedom.
Refer the chi-square table to obtain tabulated value.
Categorical variables take on values that are names or labels, such as ethnicity (e.g., Sindhi, Punjabi, Balochi etc.) and methods of teaching (e.g. lecture, discussion, activity based etc.)
Quantitative variables are numerical. They represent a measurable quantity. For example, the number of students taking Biostatistics Supplementary classes .
CHI-SQUARE TEST:
It is used to determine whether there is a significant association between the two categorical variables from a single population.
CHI-SQUARE DISTRIBUTION PROPERTIES:
As the degrees of freedom increases, the chi-square
curve approaches a normal distribution
It has many shapes which are based on its degree of freedom (df)
Distribution is skewed to the right
A chi-square distribution takes positive values only.
Commonly used approaches are:
Test for independence
Test of homogeneity
CHI-SQUARE TEST OF INDEPENDENCE:
A chi-square test of independence is used when we want to see if there is a relationship/association between two categorical variables.
EXAMPLES OF RELATIONSHIPS
BETWEEN QUALITATIVE VARIABLES:
Qualitative variables are either ordinal or nominal.
Examples:
Do the nurses feel differently about a new postoperative procedure than doctors?
Preference (Old/New) Subjects (Nurses/ Doctors)
Is there any relationship between Soya Use & Lung cancer?
Soya Intake (yes/no) Lung cancer (yes/no)
Is there any relationship between parent’s and their children Children’s Education (Illiterate/Up to Intermediate/Graduate)
education?
Parent’s Education (Illiterate/Up to Intermediate/Graduate)
CONTINGENCY TABLE:
The table which classifies categories of the qualitative
variable.
The number of individuals or items assigned to each category is called the frequency.
WHAT INFORMATION DOES CONTINGENCY TABLE REVEAL?
When we consider two categorical variables at a time, then an observation will belong to a particular category of variable one as well as a particular category of variable two. This type of table is referred as contingency table.
The simplest form of contingency table is a 2x2 contingency table with both
variables having exactly two categories.
WHAT OTHER INFORMATION DOES
CONTINGENCY TABLE REVEAL?
In this table Two independent categorical variables that
form a “r x c” contingency table, where “r” is the number of rows (number of categories in first variable e.g. helmet used at the time of accident or not?) and “c” is the number of columns (number of categories in the second variable e.g. got severe brain injury.
This document provides an overview of guidelines for designing a healthy diet, including key principles for diet planning and evaluating nutritional status. It discusses balancing food intake, incorporating variety, choosing nutrient-dense foods, controlling energy intake, and eating foods in moderation. Recommendations for nutritional intake come from scientific research on levels of nutritional status and deficiencies. Dietary guidelines include the USDA's MyPlate, which emphasizes fruits, vegetables, whole grains, proteins and dairy while limiting empty calories. Food labels provide information to help follow nutritional recommendations.
univariate and bivariate analysis in spss Subodh Khanal
this slide will help to perform various tests in spss targeting univariate and bivariate analysis along with the way of entering and analyzing multiple responses.
Data.savQuestion.docxOver the same period, determine wheth.docxtheodorelove43763
This document provides instructions for conducting three chi-square tests of independence using SPSS on data about students' conflict resolution styles and suspensions from school. It describes entering the data, selecting the appropriate tests and variables, and interpreting the output. Students are asked to conduct the chi-square tests following the five steps of hypothesis testing, calculate effect sizes, and explain the results to someone unfamiliar with statistics.
The document provides an overview of key concepts in biostatistics including:
- Common approaches like using common sense, drawing pictures, and being skeptical of small sample sizes and strong conclusions.
- Examples of potential issues like overinterpreting a study with a sample of 10 subjects and reporting 100% improvement.
- Key topics covered include how to think about, visualize, and describe quantitative data through measures of center, spread, and distribution.
This study evaluated the efficacy and safety of lisdexamfetamine dimesylate (LDX) for binge eating disorder (BED) in adults. Participants were randomized to placebo or LDX doses of 30, 50, or 70 mg/day for 11 weeks. The primary outcome was reduction in binge eating days per week. Secondary outcomes included reductions in binge episodes, improvements on the Binge Eating Scale and Three-Factor Eating Questionnaire. LDX at 50 and 70 mg/day significantly reduced binge eating days and episodes compared to placebo. LDX also improved binge eating behaviors and attitudes on secondary measures. The most common side effects with LDX were decreased appetite, dry mouth, headache,
A New Approach to Presenting Health States in Stated Preference Valuation Stu...Office of Health Economics
This study tested presenting health states from the EQ-5D-5L descriptive system in the context of the full system, compared to the standard presentation of individual states. Respondents who saw states in context of the full system had more ordered preferences between levels, fewer logical inconsistencies, and better distinguished between severe and extreme levels. Presenting health states in context of the full descriptive system improves valuation data quality.
This document discusses various research designs used in nursing research. It defines research design as the plan or blueprint for conducting a study. Experimental designs aim to identify cause-effect relationships through manipulation of independent variables and use of control groups. True experiments allow the highest level of control but quasi-experiments and pre-experimental designs are also used when true experiments are not possible. Non-experimental designs observe variables without manipulation and are used when variables cannot be manipulated or experiments would be unethical.
This document provides an overview of basic statistics, including descriptive and inferential statistics. Descriptive statistics such as frequencies, percentages, means and standard deviations are used to summarize single variables. Inferential statistics such as correlation, t-tests, chi-square, and logistic regression are used to test hypotheses, determine associations between variables, and make predictions. The document explains when each statistical test is appropriate and how to interpret and report the results.
Nutritionists work in healthcare facilities like hospitals and nursing homes to assess patients' nutritional needs, develop nutrition programs, and treat diseases through diet. They can expect to earn between $47,522 to $69,957 annually. To become a nutritionist requires obtaining at least a bachelor's degree by taking courses in subjects like nutrition, foods, diet therapy, biology, and chemistry over the course of 4-5 years of full-time study.
Early Patient approach with feedback Improves Clinical Skills of Medical Stud...meducationdotnet
This study evaluated the impact of early patient approach with feedback on the clinical skills of medical students. Students who received early patient approach and feedback scored significantly higher on measures of clinical skills like history taking for fever and physical examination of the chest compared to students who received conventional teaching. However, there was no significant difference in scores on overall long case examinations between the two groups. The study concluded that early patient contact with feedback can improve some specific clinical skills but did not find clear evidence of improved performance on overall comprehensive exams.
This document summarizes a symposium on evidence-based medicine (EBM) focusing on diabetes care, screening, and prevention. It provides an overview of key EBM principles and resources, including the Cochrane Database of Systematic Reviews, ACP Journal Club, and MEDLINE. Measures of risk reduction like relative risk reduction, absolute risk reduction, and number needed to treat are discussed. The document also presents a hypothetical patient case of a woman newly diagnosed with diabetes and addresses related questions on screening, treatment, and prevention from an EBM perspective.
Acetyl-L-carnitine supplementation was evaluated in two randomized controlled trials involving over 1,200 patients with diabetic neuropathy. The studies found that 500-1000 mg doses taken three times daily resulted in:
1) Increased myelinated nerve fiber numbers and regenerating clusters on nerve biopsy.
2) Improved vibration sensation in fingers and toes.
3) Greater benefits seen in subgroups under age 55, BMI under 30, type 2 diabetes, and HbA1c under 8.5%.
While results suggest acetyl-L-carnitine may provide symptomatic relief for diabetic neuropathy, limitations included short trial duration and lack of data on important outcomes like long-term nerve regeneration. The assistant recommended
The document provides an overview of statistical hypothesis testing and various statistical tests used to analyze quantitative and qualitative data. It discusses types of data, key terms like null hypothesis and p-value. It then outlines the steps in hypothesis testing and describes different tests of significance including standard error of difference between proportions, chi-square test, student's t-test, paired t-test, and ANOVA. Examples are provided to demonstrate how to apply these statistical tests to determine if differences observed in sample data are statistically significant.
Ct lecture 4. descriptive analysis of cont variablesHau Pham
The document summarizes key concepts in descriptive analysis of continuous variables, including:
- Mean, variance, and standard deviation are used to describe central tendency and variation in data.
- The normal distribution and z-scores allow relating individual values to the whole population distribution based on mean and standard deviation.
- Confidence intervals describe the range of values where we are confident the population mean lies, based on the sample mean, standard deviation, and size.
Ct lecture 12. simple linear regression analysisHau Pham
This document summarizes a workshop on simple linear regression analysis. It discusses using linear regression to model the relationship between femoral neck bone density (BMD) and age or weight in women. Examples are analyzed using R to estimate regression parameters, interpret results, and check assumptions. Linear regression can be used to understand risk factor effects, predict outcomes, and its assumptions like linearity and normal errors should always be checked.
This document provides an overview of basic statistical analyses that are commonly used for research projects, including descriptive and inferential statistics. Descriptive statistics like frequencies, percentages, means and standard deviations are used to summarize single variables. Inferential statistics like correlation, t-tests, chi-square, and logistic regression are used to determine relationships between variables and make inferences about populations. The document outlines when each statistical test is appropriate, how to interpret results, and how to report findings for common analyses like correlation, t-tests, chi-square, and logistic regression.
Overview and food in historical perspectivewajihahwafa
This document provides an overview of a university course on Nutritional Sociology and Anthropology. It summarizes key topics that will be covered in the course, including the Agricultural Revolution of the Neolithic Era, the search for spices, the Industrial Revolution, and modern-day adaptations. The document focuses on the Neolithic Revolution in more detail, explaining how the domestication of plants and animals around 12,000 years ago led to major changes such as increased and more stable food production, larger and settled human populations, and the development of civilization.
Overview and food in historical perspectivewajihahwafa
This document provides an overview of food and nutrition from a historical perspective, covering major dietary revolutions such as the Agricultural Revolution, Industrial Revolution, and Scientific Revolution. It discusses how early humans transitioned from foraging to food production around 12,000 years ago, settling into villages and domesticating plants and animals. This allowed populations to grow but led to less diverse diets and increased risks of famine and malnutrition. The document examines archaeological evidence of domestication and the biocultural consequences of the shift to agriculture.
This document provides an overview of nutrition for sport and exercise, focusing on fats. It defines different types of fatty acids and lipids, and explains how the body digests, absorbs, transports, stores, and uses fat for energy. Key points covered include the essential roles of fats, recommendations for athlete fat intake, sources of healthy fats, and how fat metabolism compares to carbohydrate metabolism in providing fuel for exercise. The document also discusses ketosis and the multi-step process the body undergoes to break down and oxidize fat for energy production.
Here is a carbohydrate intake plan for the runner:
- 2-3 hours before the race, consume 1.5g/kg of high-glycemic carbohydrates such as pasta, bread, cereal or sports bars. Drink 500ml water.
- 1 hour before the race, consume 0.5g/kg of high-glycemic carbohydrates and 500ml water.
- During the race:
- Drink 200-300ml of Gatorade at every aid station (every 2-4km). This provides 30-45g carbohydrates per hour.
- Take 1 banana between 40-48km to provide extra carbohydrates and electrolytes to prevent fatigue.
This document discusses nutrition policies in various countries. It covers topics like nutrition labels and claims, food marketing restrictions, taxes and subsidies on unhealthy foods, standards for public institutions and food supplies, and initiatives to increase public awareness of healthy eating. Examples provided include salt warning labels in Finland and the UK, sugary drink taxes in Mexico, trans fat bans in Denmark and Austria, and programs to promote fruit and vegetable consumption in Western Australia.
The document summarizes key points from a presentation about nutrition policy and food security in relation to the Sustainable Development Goals (SDGs). It discusses the SDGs which were adopted in 2015 and include 17 goals to end poverty, hunger, and promote health, education, gender equality, and more. The presentation focuses on SDG 2 to end hunger and promote nutrition, the indicators FAO is responsible for monitoring related to agriculture and food security, and Malaysia's commitment and progress toward implementing the 2030 SDG agenda.
NDD30503 NUTRITION FOR SPORTS AND EXERCISEwajihahwafa
This document provides information on nutrition for resistance athletes. It discusses that resistance training causes muscle breakdown and repair through catabolism and anabolism. It recommends resistance athletes consume moderate carbohydrate intake before, during and after exercise to fuel workouts and speed recovery. It also recommends resistance athletes consume approximately 1.6-1.7 grams of protein per kilogram of body weight daily, with 20-30 grams of high quality protein after workouts to stimulate muscle protein synthesis and aid recovery. It emphasizes the importance of hydration and carbohydrate intake before and after resistance exercise.
NDD30503: NUTRITION FOR SPORTS AND EXERCISEwajihahwafa
This document discusses nutrition for sport and exercise. It defines physical fitness, health, and wellness and their interrelationships. It describes the six components of health-related physical fitness - cardiovascular endurance, muscular strength, muscular endurance, flexibility, body composition, and power. It also describes the five components of skill-related physical fitness - agility, balance, coordination, speed, and reaction time. The document provides examples of tests to measure each component and discusses the importance of self-assessment for maintaining good fitness, health, and wellness.
NDD30503: NUTRITION FOR SPORTS AND EXERCISEwajihahwafa
Nutrition is important for combat athletes to maximize performance and support training. Carbohydrates provide 10-12 g/kg daily and should be consumed before and during exercise to fuel activity. Protein intake of 1.8-2.4 g/kg daily ingested several times helps maintain muscle mass. Hydration is also key, with fluid losses over 2% of body weight to be avoided. Rapid weight cutting can negatively impact health and is not recommended.
NDD30503: NUTRITION FOR SPORTS AND EXERCISEwajihahwafa
1. The document discusses nutritional demands and recommendations for endurance athletes during training and competition periods. Carbohydrate needs vary from 3-12g/kg depending on intensity, and adequate hydration and sodium intake are important.
2. Post-exercise, rehydration with sodium-containing fluids and carbohydrate intake of 1-1.5g/kg in the first few hours aids recovery.
3. Issues like hyponatremia from overdrinking and low energy availability increasing health risks are addressed. Meeting energy needs while avoiding deficiencies is the primary nutritional goal.
NDD30503: NUTRITION FOR SPORTS AND EXERCISEwajihahwafa
This document discusses weight and body composition issues in athletes. It covers the following key points:
1. It defines various components of body composition, including body mass, body weight, body fat percentage, fat-free mass, and lean body mass.
2. It describes several methods for measuring body composition, such as hydrostatic weighing, DEXA scans, skinfold tests, and bioelectric impedance. The results must be interpreted appropriately based on an individual's sport.
3. The optimal body composition can vary significantly between sports, depending on the relative importance of factors like size, strength, power, or endurance. Weight standards must be used carefully to avoid unhealthy weight loss or eating disorders.
NDD30503: NUTRITION FOR SPORTS AND EXERCISEwajihahwafa
This document discusses water and electrolyte balance during exercise. It covers the distribution and roles of body water, processes of water movement, and definitions of hydration states like hypohydration and dehydration. Exercise increases fluid loss through sweating and impacts fluid balance, temperature regulation and performance. Maintaining fluid balance requires strategies for fluid intake before, during and after exercise. Large sweat losses can also result in electrolyte imbalances, especially of sodium.
NDD30503: NUTRITION FOR SPORTS AND EXERCISEwajihahwafa
This document discusses vitamins and their roles for athletes. It begins by classifying vitamins based on their solubility, functionality, and sources from food, fortification, or supplements. Several B vitamins support energy metabolism as coenzymes. Vitamins C, E, and A act as antioxidants. Vitamins B12 and folate are important for red blood cell formation. Vitamins D and A also support growth and development. The document emphasizes obtaining vitamins from a variety of whole foods and only using supplements when needed to treat deficiencies.
NDD30503: NUTRITION FOR SPORTS AND EXERCISEwajihahwafa
This document discusses minerals and their roles for athletes. It covers several key points:
1. Minerals are classified as macrominerals or microminerals based on the amount found in the body. Moderate to rigorous exercise can increase mineral loss through sweat and urine.
2. Both mineral deficiencies and excesses can negatively impact performance and health. Factors like diet, supplementation, and exercise can increase or decrease mineral absorption.
3. Minerals like calcium, phosphorus, and magnesium play important roles in bone formation and density. Achieving peak bone mineral density by age 30-35 is important for long-term health. Exercise and adequate intake of calcium, vitamin D, and protein can promote
NDD30503: NUTRITION FOR SPORTS AND EXERCISEwajihahwafa
This document discusses a lecture on protein for athletes. It covers the structure and functions of proteins, digestion and metabolism of amino acids, protein recommendations for athletes, and translating these recommendations into daily food choices. Some key points include: proteins are made of amino acids which have various roles like enzymes and hormones; timing protein intake after exercise can aid recovery; recommended protein intake ranges from 1.0-2.0 g/kg daily for athletes depending on sport; and both animal and plant sources can provide sufficient protein.
NDD30503: NUTRITION FOR SPORTS AND EXERCISEwajihahwafa
This document discusses nutrition for sports and exercise, specifically focusing on fats. It begins by outlining the learning objectives which are to classify and distinguish different types of fats, describe fat digestion and metabolism, explain how the body uses fat for fuel during exercise, and identify fat recommendations and sources. The document then defines different types of fats like saturated, unsaturated, omega-3/6/9 fatty acids. It explains the digestion, absorption, transportation and storage of fats. Key points are made about fat metabolism and using fat as an energy source during exercise.
NDD30503: NUTRITION FOR SPORTS AND EXERCISEwajihahwafa
This document discusses carbohydrate recommendations for athletes. It covers classifying carbohydrates, how they are digested and metabolized, and their role as an energy source for exercise. The key recommendations are that athletes need 5-10 grams of carbohydrates per kilogram of body weight daily. Proper carbohydrate intake before, during, and after exercise is also important to replenish muscle glycogen stores and maintain blood glucose levels. The document provides guidelines on calculating daily carbohydrate needs and selecting foods to meet those needs.
NDD30503: NUTRITION FOR SPORTS AND EXERCISEwajihahwafa
The document summarizes three major energy systems - creatine phosphate, anaerobic glycolysis, and oxidative phosphorylation - that provide energy for exercise. It describes the specific characteristics of each system, including their speed of action, amount of ATP replenished, and duration of action. The creatine phosphate system is very fast acting but can only replenish a small amount of ATP for a very short duration. The anaerobic glycolysis system is faster than oxidative phosphorylation but produces lactic acid as a byproduct. Oxidative phosphorylation can replenish large amounts of ATP but is the slowest acting of the three systems.
NDD30503: NUTRITION FOR SPORTS AND EXERCISEwajihahwafa
This document discusses nutrition for sports and exercise. It defines key terms and concepts in sports nutrition and exercise physiology. It outlines the basic principles of training and nutrition periodization to support athletic performance and recovery goals. The document also discusses dietary supplement use among athletes, noting concerns regarding purity, safety, and effectiveness of supplements given the lack of regulation in the industry. The overall goals of sports nutrition are to support training, performance, and recovery through an evidence-based nutrition plan.
This document outlines a lecture on child and preadolescent nutrition. It discusses normal growth and development in children, including adiposity rebound. It also covers energy and nutrient needs, common nutrition problems like iron deficiency and dental caries, and childhood obesity including predictors, assessment, and treatment approaches. The goal of obesity treatment is weight maintenance or gradual weight loss until a healthy BMI is achieved.
"Financial Odyssey: Navigating Past Performance Through Diverse Analytical Lens"sameer shah
Embark on a captivating financial journey with 'Financial Odyssey,' our hackathon project. Delve deep into the past performance of two companies as we employ an array of financial statement analysis techniques. From ratio analysis to trend analysis, uncover insights crucial for informed decision-making in the dynamic world of finance."
Codeless Generative AI Pipelines
(GenAI with Milvus)
https://ml.dssconf.pl/user.html#!/lecture/DSSML24-041a/rate
Discover the potential of real-time streaming in the context of GenAI as we delve into the intricacies of Apache NiFi and its capabilities. Learn how this tool can significantly simplify the data engineering workflow for GenAI applications, allowing you to focus on the creative aspects rather than the technical complexities. I will guide you through practical examples and use cases, showing the impact of automation on prompt building. From data ingestion to transformation and delivery, witness how Apache NiFi streamlines the entire pipeline, ensuring a smooth and hassle-free experience.
Timothy Spann
https://www.youtube.com/@FLaNK-Stack
https://medium.com/@tspann
https://www.datainmotion.dev/
milvus, unstructured data, vector database, zilliz, cloud, vectors, python, deep learning, generative ai, genai, nifi, kafka, flink, streaming, iot, edge
Build applications with generative AI on Google CloudMárton Kodok
We will explore Vertex AI - Model Garden powered experiences, we are going to learn more about the integration of these generative AI APIs. We are going to see in action what the Gemini family of generative models are for developers to build and deploy AI-driven applications. Vertex AI includes a suite of foundation models, these are referred to as the PaLM and Gemini family of generative ai models, and they come in different versions. We are going to cover how to use via API to: - execute prompts in text and chat - cover multimodal use cases with image prompts. - finetune and distill to improve knowledge domains - run function calls with foundation models to optimize them for specific tasks. At the end of the session, developers will understand how to innovate with generative AI and develop apps using the generative ai industry trends.
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...Social Samosa
The Modern Marketing Reckoner (MMR) is a comprehensive resource packed with POVs from 60+ industry leaders on how AI is transforming the 4 key pillars of marketing – product, place, price and promotions.
1. HFS3283
INDEPENDENT T-TEST
DR. SHARIFAH WAJIHAH WAFA BTE SST WAFA
School of Nutrition and Dietetics
Faculty of Health Sciences
sharifahwajihah@unisza.edu.my
KNOWLEDGE FOR THE BENEFIT OF HUMANITY
2. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Topic Learning Outcomes
At the end of this lecture, the student should be
able to:
1
• Understand structure of research study appropriate
for independent-measures t hypothesis test
2
• Test between two populations or two treatments
using independent-measures t statistics
3
• Understand how to evaluate the assumptions
underlying this test
3. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Independent-measures Design
Introduction
• Most research studies compare two (or more)
sets of data
– Data from two completely different, independent
participant groups (an independent-measures or
between-subjects design)
– Data from the same or related participant group(s)
(a within-subjects or repeated-measures design)
4. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Independent-Measures Design
Introduction (continued)
• Computational procedures are considerably
different for the two designs
• Each design has different strengths and
weaknesses
• Consequently, only between-subjects designs
are considered in this lecture; repeated-
measures designs will be reserved for
discussion in Next Lecture
5. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
When to use the independent
samples t-test
• It is used to compare differences between separate
groups.
• This test can be used to explore differences in
naturally occurring groups.
• For example, we may be interested in differences of
IQ between males and females.
6. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
When to use the independent
samples t-test (cont.)
• Any differences between groups can be
explored with the independent t-test, as long
as the tested members of each group are
reasonably representative of the population.
[1]
[1] There are some technical
requirements as well. Principally,
each variable must come from a
normal (or nearly normal) distribution.
7. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Independent-Measures Design t
Statistic
• Null hypothesis for independent-measures
test
• Alternative hypothesis for the independent-
measures test
0: 210 H
0: 211 H
8. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Example 1
• Suppose we put people on 2 diets:
the nasi lemak diet and the roti canai diet.
• Participants are randomly assigned to either 1-
week of breakfast eating exclusively nasi (NL)
lemak or 1-week of exclusively roti canai (RC).
9. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Example 1-con’t
• At the end of the week, we measure
weight gain by each participant.
• Which diet causes more weight gain?
• In other words, the null hypothesis is:
Ho: wt. gain NL diet =wt. gain RC diet.
10. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Example 1-con’t
• Why?
• The null hypothesis is the opposite of what we
hope to find.
• In this case, our research hypothesis is that
there ARE differences between the 2 diets.
• Therefore, our null hypothesis is that there are
NO differences between these 2 diets.
11. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Example 1-con’t
Column 3 Column 4
X1 : NL X2 : RC
1 3 1 1
2 4 0 0
2 4 0 0
2 4 0 0
3 5 1 1
2 4
0.4 0.4
2
11 )( 2
22 )(
1
2
n
sx
2
2
)(
12. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Example 1-con’t
• The first step in calculating the independent
samples t-test is to calculate the variance and
mean in each condition.
• In the previous example, there are a total of
10 people, with 5 in each condition.
• Since there are different people in each
condition, these “samples” are “independent”
of one another;
giving rise to the name of the test.
13. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Example 1-con’t
• The variances and means are calculated
separately for each condition
(NL and RC).
• A streamlined calculation of the variance for
each condition was illustrated previously. (See
Slide 11.)
• In short, we take each observed weight gain
for the NL condition, subtract it from the
mean gain of the NL dieters ( 2) and
square the result (see column 3).
1
14. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Example 1-con’t
• Next, add up column 3 and divide by the
number of participants in that condition (n1 =
5) to get the sample variance,
• The same calculations are repeated for the
“RC” condition.
4.02
xs
15. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Formula
The formula for
the independent samples t-test is:
, df = (n1-1) + (n2-1)
11 2
2
1
2
21
21
n
S
n
S
t
xx
16. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Example 1 (cont.)
• Where
= Mean of first sample
= Mean of second sample
n1 = Sample size (i.e., number of observations) of
first sample
n2 = Sample size (i.e., number of observations) of
second sample
= sample variance of first sample
= sample variance of second sample
sp = Pooled standard deviation
2
1xs
2
2xs
1
2
17. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Example 1 (cont.)
• From the calculations previously, we have
everything that is needed to find the “t.”
47.4
4
4.
4
4.
42
t , df = (5-1) + (5-1) = 8
• After calculating the “t” value, we need to
know if it is large enough to reject the null
hypothesis.
18. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
• The “t” is calculated under the assumption,
called the null hypothesis,
“that there are no differences between
the NL and RC diet”.
• If this were true, when we repeatedly sample
10 people from the population and put them
in our 2 diets, most often we would calculate
a “t” of “0.”
Some theory
19. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
• Look again at the formula for the “t”.
• Most often the numerator (X1-X2) will be “0,”
because the mean of the two conditions
should be the same under the null hypothesis.
• That is, weight gain is the same under both
the NL and RC diet.
Some theory - Why?
20. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
• Sometimes the weight gain might be a bit
higher under the NL diet, leading to a positive
“t” value.
• In other samples of 10 people, weight gain
might be a little higher under the RC diet,
leading to a negative “t” value.
• The important point, however, is that under
the null hypothesis we should expect that
most “t” values that we compute are close to
“0.”
Some theory - Why (cont.)
21. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
• Our computed t-value is not “0,” but it is in fact
negative (t(8) = -4.47).
• Although the t-value is negative, this should not
bother us.
• Remember that the t-value is only - 4.47 because we
named the NL diet X1 and the RC diet X2.
–This is, of course, completely arbitrary.
• If we had reversed our order of calculation, with the
NL diet as X2 and the RC diet as X1, then our
calculated t-value would be positive 4.47.
Some theory - Why (cont.)
22. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
• The calculated t-value is 4.47 (notice, I’ve
eliminated the unnecessary “-“ sign), and the
degrees of freedom are 8.
• In the research question we did not specify
which diet should cause more weight gain,
therefore this t-test is a so-called “2-tailed t.”
Example 1 (again) Calculations
23. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Degrees of freedom
• Degrees of freedom (df) for t statistic is
df for first sample + df for second sample
)1()1( 2121 nndfdfdf
Note: this term is the same as the denominator of the pooled
variance
24. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
• In the last step, we need to find the critical
value for a 2-tailed “t” with 8 degrees of
freedom.
• This is available from tables that are in the
back of any Statistics textbook.
• Look in the back for “Critical Values of the t-
distribution,” or something similar.
• The value you should find is:
C.V. t(8), 2-tailed = 2.31.
Example 1 (again) Calculations
25. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
• The calculated t-value of 4.47 is larger in magnitude
than the C.V. of 2.31, therefore we can reject the null
hypothesis.
• Even for a results section of journal article, this
language is a bit too formal and general. It is more
important to state the research result, namely:
Participants on the RC diet (M = 4.00) gained
significantly more weight than those on the NL diet
(M = 2.00), t(8) = 4.47, p < .05 (two-tailed).
Example 1 (again) Calculations
26. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Repeat from previous slide:
Participants on the RC diet (M = 4.00) gained
significantly more weight than those on the
NL diet (M = 2.00), t(8) = 4.47, p < .05 (two-
tailed).
• Making this conclusion requires inspection of
the mean scores for each condition (NL and
RC).
Example 1 (concluding comment)
27. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Example 1 Using SPSS
• First, the variables must be setup in the SPSS data
editor.
• We need to include both the independent and
dependent variables.
• Although it is not strictly necessary, it is good
practice to give each person a unique code
(e.g., personid):
28. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Example 1 Using SPSS (cont.)
• In the previous example:
– Dependent Variable
= wtgain (or weight gain)
– Independent Variable = diet
• Why?
• The independent variable (diet) causes
changes in the dependent variable (weight
gain).
29. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Example 1 Using SPSS (cont.)
• Next, we need to provide “codes” that
distinguish between the 2 types of diets.
• By clicking in the grey box of the “values” field
in the row containing the “diet” variable, we
get a pop-up dialog that allows us to code the
diet variable.
• Arbitrarily, the NL diet is coded as diet “1” and
the RC diet is diet “2.”
• Any other 2 codes would work, but these
suffice
See next slide.
30. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Example 1 Using SPSS (cont.)
.
31. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Example 1 Using SPSS (cont.)
• Moving to the data view
tab of the SPSS editor,
the data is entered.
• Each participant is
entered on a separate
line; a code is entered
for the diet they were
on (1 = NL, 2 = RC); and
the weight gain of each
is entered, as follows
32. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Example 1 Using SPSS (cont.)
• Moving to the data view
tab of the SPSS editor,
the data is entered.
• Each participant is
entered on a separate
line; a code is entered
for the diet they were
on (1 = NL, 2 = RC); and
the weight gain of each
is entered, as follows
33. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Example 1 Using SPSS (cont.)
• Moving to the data view
tab of the SPSS editor,
the data is entered.
• Each participant is
entered on a separate
line; a code is entered
for the diet they were
on (1 = NL, 2 = RC); and
the weight gain of each
is entered, as follows
34. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Example 1 Using SPSS (cont.)
• Click Analyze > Compare Means > Independent-
Samples T Test... on the top menu, as shown below:
35. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Example 1 Using SPSS (cont.)
• You will be presented with
the Independent-Samples T
Test dialogue box.
• Transfer the dependent
variable, wtgain, into the Test
Variable(s): box, and transfer
the independent variable, diet,
into the Grouping Variable:
box, by highlighting the
relevant variables and pressing
the SPSS Right Arrow Button
buttons. You will end up with
the following screen in the
next slide
36. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Example 1 Using SPSS (cont.)
You then need to define the groups (diet). Click on
the button. You will be presented with the Define
Groups dialogue box, as shown above:
37. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Example 1 Using SPSS (cont.)
• Enter 1 into
the Group 1: box and
enter 2 into
the Group 2: box.
Remember that we
labelled the Nasi Lemak
group as 1 and the Roti
Canai group as 2.
• Click the button.
38. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Example 1 Using SPSS (cont.)
• Two sections (boxes) appear in the output: Group
Statistics
– provides basic information about the group comparisons,
including the sample size (n), mean, standard deviation,
and standard error for weight gain by group.
39. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Example 1 Using SPSS (cont.)
• SPSS gives the means for each of the conditions
(NL Mean = 2 and RC Mean = 4).
• In addition, SPSS provides a t-value of -4.47 with
8 degrees of freedom.
• These are the same figures that were computed
“by hand” previously.
• However, SPSS does not provide a critical value.
• Instead, an exact probability is provided (p =
.002).
40. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Example 1 Using SPSS (cont.)
• The second section, Independent Samples Test, displays the
results most relevant to the Independent Samples t Test.
There are two parts that provide different pieces of
information: (A) Levene’s Test for Equality of Variances and (B)
t-test for Equality of Means.
41. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Example 1 Using SPSS (cont.)
Levene's Test for Equality of of Variances: This section has the test results for
Levene's Test. From left to right:
• F is the test statistic of Levene's test
• Sig. is the p-value corresponding to this test statistic.
The p-value of Levene's test is printed as “1.000“, so we ACCEPT the null of Levene's
test and conclude that the variance in weight gain of NL diet is identical to the RC
diet .This tells us that we should look at the "Equal variances assumed" row for the
t-test results. (If this test result had been significant -- that is, if we had
observed p < α -- then we would have used the "Equal variances not assumed"
output.)
42. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Example 1 Using SPSS (cont.)
t-test for Equality of Means provides the results for the actual Independent
Samples t Test. From left to right:
• t is the computed test statistic
• df is the degrees of freedom
• Sig (2-tailed) is the p-value corresponding to the given test statistic and degrees
of freedom
• Mean Difference is the difference between the sample means; it also
corresponds to the numerator of the test statistic
• Std. Error Difference is the standard error; it also corresponds to the
denominator of the test statistic
43. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Example 1 Using SPSS (cont.)
• The mean difference is calculated by subtracting the mean of the second group
from the mean of the first group. In this example, the mean weight gain for RC
diet was subtracted from the mean weight gain for NL diet (2.00- 4.00 = -2.00).
The sign of the mean difference corresponds to the sign of the t value.
• The negative t value in this example indicates that the mean weight gain for the
first group, NL diet, is significantly lower than the mean for the second group, RC
diet.
• The associated p value is printed as ".002".
44. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Example 1 Using SPSS (cont.)
• Confidence Interval of the Difference: This part of the t-test output
complements the significance test results. Typically, if the CI for the mean
difference contains 0, the results are not significant at the chosen significance
level. In this example, the CI is [-3.03128, -0.96872], which does not contain
zero; this agrees with the small p-value of the significance test.
45. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
In the Literature
• Report whether the difference between the
two groups was significant or not
• Report descriptive statistics (M and SD) for
each group
• Report t statistic and df
• Report p-value
• Report CI immediately after t
46. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
In the Literature
Participants on the RC diet (M = 4.00) gained
significantly more weight than
those on the NL diet (M = 2.00),
t(8) = 4.47, p < .01 (two-tailed).
Variables NL diet
(n=5)
Mean (SD)
RC diet
(n=5)
Mean (SD)
Mean diff (95%
CI)
t statistics
(df)
P-
value
Weight gain 2.00 (0.71) 4.00 (0.71) -2.00(-3.03,-0.97) -4.47(8) <0.01
Table 1: Type of diet associated with weight gain
47. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Example 1 Using SPSS (cont.)
Repeat from previous slide:
Participants on the RC diet (M = 4.00) gained
significantly more weight than those on the NL diet
(M = 2.00), t(8) = 4.47, p < .01 (two-tailed).
• In APA style we normally only display significance to
2 significant digits.
• Therefore, the probability is displayed as “p<.01,”
which is the smallest probability within this range of
accuracy.
48. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Sample size??
• In each group, sample size is BIG (>30)
• In case, the sample size is less than 30 in one group,
so that we need to check for normality assumption.
• How to check?
49. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Normality assumption
50. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Normality assumption
51. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
recap
• FOUR assumptions
1. Random sample
2. Observations are independent
3. In each group, data are normally distributed or
sample size is big (>30)
4. Population variances are not different between
2 groups Levene’s test
52. SCHOOL OF NUTRITION AND DIETETICS . FACULTY OF HEALTH SCIENCES
Any
Questio
ns?
Conce
pts?
Equations?