SlideShare a Scribd company logo
1 of 21
Correlation
Inferential statistics
• Comparison of two (or more)
variables
• Qual. Vs Qual eg. htn vs smoking__
(count/proportions)
• Quant. Vs Qual eg. BP vs sex
• Quant. Vs quant. eg. BP vs weight__
(metric/interval data)
• Drawing inference from the sample
for our population of interest
Scatter plots
• A way of portraying a
relationship between two
quantitative variables
Linear
Non-linear
No relationship
• Correlation and regression
Regression and correlation
• Analyze the association between two quantitative variables
• Assume independent observations
• Assume a linear relationship
• Allow hypothesis testing of relationship– drawing inferences on the
population
• Regression: gives the ‘best-fit’ line to the data
• Correlation: gives a measure of scatter of data points around this line
Regression line: y = bx + a
Least squares method: line is fitted to minimize the sum of the squares of vertical distances of the
observed values from the line
The regression equation
• Gives the ‘best fit’ line to the data
• The regression coefficient ‘b’:
• Measures the relationship
between variables
• ‘amount of change in the
variable y for a unit change in
x’
• Positive for a direct
relationship and negative for
an inverse one
Correlation
• While regression equation
measures the average
relationship between two
variables
• Correlation gives the
strength or goodness of fit
of the relationship
• Correlation coefficient
(Pearson’s) r : lies
between -1 to +1
Coefficient of determination : r2
• Interpreted as the percentage of total variation in the dependent
variable (y) explained by the regression line or just alone by the
variation in the particular independent variable (x)
• r2 of 1 would imply that 100 percent of variation is explained by
variation in x
• Values less than 1 imply that other ‘unknown’ variables exist which
can explain y to a certain extent
Hypothesis testing
• The sample statistics b and r used to make inferences on the
population parameters
• Assumptions for valid inferences:
o Independent data (two scatter points are independent)
o Linear relationship in mean of y vs x
o Distribution of y normal for each x
o Variances the same at each x
• Confidence intervals and p values are obtained based on t distribution
When the assumptions
do not hold
• Residual analysis
• Polynomial regression:
y = a + bx + cx2
• Data transformations
• Rank correlation: if
data transformation
fails
Spearman rank correlation coefficient rs/ρ
Rank data/ ordinal data
Significance test on Spearman’s ρ
• The test statistic is ρ/rs itself
• If the calculated coefficient is within the limits +/- rc (critical value)
given in the table for
• ‘n’ pairs (10)
• two sided significance level α (5 %)
• then the null hypothesis (that there is no actual correlation) can’t be
rejected
For the example the value is +/- 0.6485, so the its concluded that there
is no difference between the ranks assigned by the two assessors
Non-parametric methods
Wilcoxon rank-sum test/ Mann-Whitney U
test
• Used when normality assumption doesn’t hold esp. for small samples
• Hypothesis test for assessing the assumption that one of the sets of
samples have a larger value than others
• Ranks are assigned to the values used for comparison
• Assumptions:
• Sample is randomly drawn
• Observations are independent
Steps
1. Rank all the values irrespective of the particular group
2. Sum the ranks in each group
Original
values
Ranks
W1=52 W2=101
U statistic
Decision is based on the
value of U
• For one tailed: u1 or u2
• For two tailed:
u = min (u1;u2)
Reject the null hypothesis
whenever the test statistic
u/u1/u2 is less than critical
value
Comparing two paired groups: Wilcoxon
signed-rank test
Paired tests are used when the the observations between groups are
dependent in some way:
• Variable is measured before-after an intervention
• Subjects are recruited as matched pairs (such as for age, sex, co-
morbidities)
• ‘twins’ or siblings recruited as pairs
• ‘right-left pairs’– ex different treatment for right and lefty eye
Assumption: each pair chosen is random and independent
Wilcoxon signed-rank test
• Non-parametric test for paired data sets
• Tests the hypothesis that there is no difference between two paired groups
Steps:
• Calculate difference between each matched pair keeping track of the sign
• Rank the absolute value of differences for ‘positive’ and ‘negative’
differences ignoring the sign
• Calculate the sums of two groups ‘positive’ and ‘negative’ differently
• Calculate test statistic and compute the p-value
Kruskal-Wallis test
• Similar to one-way ANOVA and extension of Mann Whitney U test
• Non-parametric test for comparing the medians between more than
two groups of observation for a given variable
• Ranks are given to all the observations f/b
• Sum of the ranks for each group are calculated
• Test statistic: H follows a chi square distribution with df= k-1
Summary: non-parametric tests
• Nonparametric tests are less powerful: ‘some information is discarded
while using ranks’
• Sample size: compute the sample size for parametric test and add
15%
• Nonparametric tests are usually not reported with CIs
• Nonparametric tests are not readily extended to regression models
Variable Parametric test
(paired test)
Non-parametric test
(paired test)
Quantitative variable; 2
groups
Mean or median
Unpaired t test
(paired t test)
Mann Whitney U test
(Wilcoxon signed rank
test)
Quantitative variable; > 2
groups
Mean or median
One Way ANOVA
(repeated measures
ANOVA)
Kruskal Wallis test
(Friedman test)
Categorical variable/
proportions
Chi square test
(Mc Nemar test)

More Related Content

Similar to Non-parametric tests:correlation.pptx

CHAPTER 2 - NORM, CORRELATION AND REGRESSION.ppt
CHAPTER 2  - NORM, CORRELATION AND REGRESSION.pptCHAPTER 2  - NORM, CORRELATION AND REGRESSION.ppt
CHAPTER 2 - NORM, CORRELATION AND REGRESSION.pptkriti137049
 
Correlation _ Regression Analysis statistics.pptx
Correlation _ Regression Analysis statistics.pptxCorrelation _ Regression Analysis statistics.pptx
Correlation _ Regression Analysis statistics.pptxkrunal soni
 
Test of significance in Statistics
Test of significance in StatisticsTest of significance in Statistics
Test of significance in StatisticsVikash Keshri
 
Inferential Statistics- Dr Ryan Thomas Williams
Inferential Statistics- Dr Ryan Thomas WilliamsInferential Statistics- Dr Ryan Thomas Williams
Inferential Statistics- Dr Ryan Thomas WilliamsRyan Williams
 
parameter Estimation and effect size
parameter Estimation and effect size parameter Estimation and effect size
parameter Estimation and effect size hannantahir30
 
Correlation and Regression
Correlation and RegressionCorrelation and Regression
Correlation and Regressionjasondroesch
 
A presentation for Multiple linear regression.ppt
A presentation for Multiple linear regression.pptA presentation for Multiple linear regression.ppt
A presentation for Multiple linear regression.pptvigia41
 
Common Statistical Terms - Biostatistics - Ravinandan A P.pdf
Common Statistical Terms - Biostatistics - Ravinandan A P.pdfCommon Statistical Terms - Biostatistics - Ravinandan A P.pdf
Common Statistical Terms - Biostatistics - Ravinandan A P.pdfRavinandan A P
 
20151120221133 how to analyze survey research data
20151120221133 how to analyze survey research data20151120221133 how to analyze survey research data
20151120221133 how to analyze survey research dataRohaniza Rejab
 
Chapter 13 Data Analysis Inferential Methods and Analysis of Time Series
Chapter 13 Data Analysis Inferential Methods and Analysis of Time SeriesChapter 13 Data Analysis Inferential Methods and Analysis of Time Series
Chapter 13 Data Analysis Inferential Methods and Analysis of Time SeriesInternational advisers
 
Unit-III Correlation and Regression.pptx
Unit-III Correlation and Regression.pptxUnit-III Correlation and Regression.pptx
Unit-III Correlation and Regression.pptxAnusuya123
 
Correlation and Regression Analysis.pptx
Correlation and Regression Analysis.pptxCorrelation and Regression Analysis.pptx
Correlation and Regression Analysis.pptxasemzkgmu
 

Similar to Non-parametric tests:correlation.pptx (20)

CHAPTER 2 - NORM, CORRELATION AND REGRESSION.ppt
CHAPTER 2  - NORM, CORRELATION AND REGRESSION.pptCHAPTER 2  - NORM, CORRELATION AND REGRESSION.ppt
CHAPTER 2 - NORM, CORRELATION AND REGRESSION.ppt
 
ANOVA test and correlation
ANOVA test and correlationANOVA test and correlation
ANOVA test and correlation
 
Nonparametric Statistics
Nonparametric StatisticsNonparametric Statistics
Nonparametric Statistics
 
Correlation _ Regression Analysis statistics.pptx
Correlation _ Regression Analysis statistics.pptxCorrelation _ Regression Analysis statistics.pptx
Correlation _ Regression Analysis statistics.pptx
 
Test of significance in Statistics
Test of significance in StatisticsTest of significance in Statistics
Test of significance in Statistics
 
Non parametric-tests
Non parametric-testsNon parametric-tests
Non parametric-tests
 
statistical test.pptx
statistical test.pptxstatistical test.pptx
statistical test.pptx
 
lecture_5.pptx
lecture_5.pptxlecture_5.pptx
lecture_5.pptx
 
Inferential Statistics- Dr Ryan Thomas Williams
Inferential Statistics- Dr Ryan Thomas WilliamsInferential Statistics- Dr Ryan Thomas Williams
Inferential Statistics- Dr Ryan Thomas Williams
 
parameter Estimation and effect size
parameter Estimation and effect size parameter Estimation and effect size
parameter Estimation and effect size
 
Correlation and Regression
Correlation and RegressionCorrelation and Regression
Correlation and Regression
 
A presentation for Multiple linear regression.ppt
A presentation for Multiple linear regression.pptA presentation for Multiple linear regression.ppt
A presentation for Multiple linear regression.ppt
 
chi_square test.pptx
chi_square test.pptxchi_square test.pptx
chi_square test.pptx
 
Common Statistical Terms - Biostatistics - Ravinandan A P.pdf
Common Statistical Terms - Biostatistics - Ravinandan A P.pdfCommon Statistical Terms - Biostatistics - Ravinandan A P.pdf
Common Statistical Terms - Biostatistics - Ravinandan A P.pdf
 
20151120221133 how to analyze survey research data
20151120221133 how to analyze survey research data20151120221133 how to analyze survey research data
20151120221133 how to analyze survey research data
 
Regression
Regression Regression
Regression
 
Chapter38
Chapter38Chapter38
Chapter38
 
Chapter 13 Data Analysis Inferential Methods and Analysis of Time Series
Chapter 13 Data Analysis Inferential Methods and Analysis of Time SeriesChapter 13 Data Analysis Inferential Methods and Analysis of Time Series
Chapter 13 Data Analysis Inferential Methods and Analysis of Time Series
 
Unit-III Correlation and Regression.pptx
Unit-III Correlation and Regression.pptxUnit-III Correlation and Regression.pptx
Unit-III Correlation and Regression.pptx
 
Correlation and Regression Analysis.pptx
Correlation and Regression Analysis.pptxCorrelation and Regression Analysis.pptx
Correlation and Regression Analysis.pptx
 

Recently uploaded

VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...Suhani Kapoor
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...Florian Roscheck
 
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Delhi Call girls
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfLars Albertsson
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Callshivangimorya083
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz1
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...soniya singh
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...dajasot375
 
Predicting Employee Churn: A Data-Driven Approach Project Presentation
Predicting Employee Churn: A Data-Driven Approach Project PresentationPredicting Employee Churn: A Data-Driven Approach Project Presentation
Predicting Employee Churn: A Data-Driven Approach Project PresentationBoston Institute of Analytics
 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts ServiceSapana Sha
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptxAnupama Kate
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxEmmanuel Dauda
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingNeil Barnes
 
Unveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystUnveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystSamantha Rae Coolbeth
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptSonatrach
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998YohFuh
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPramod Kumar Srivastava
 
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiLow Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiSuhani Kapoor
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 

Recently uploaded (20)

VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
 
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdf
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
 
Predicting Employee Churn: A Data-Driven Approach Project Presentation
Predicting Employee Churn: A Data-Driven Approach Project PresentationPredicting Employee Churn: A Data-Driven Approach Project Presentation
Predicting Employee Churn: A Data-Driven Approach Project Presentation
 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts Service
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptx
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data Storytelling
 
Unveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystUnveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data Analyst
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
 
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiLow Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 

Non-parametric tests:correlation.pptx

  • 2. Inferential statistics • Comparison of two (or more) variables • Qual. Vs Qual eg. htn vs smoking__ (count/proportions) • Quant. Vs Qual eg. BP vs sex • Quant. Vs quant. eg. BP vs weight__ (metric/interval data) • Drawing inference from the sample for our population of interest
  • 3. Scatter plots • A way of portraying a relationship between two quantitative variables Linear Non-linear No relationship • Correlation and regression
  • 4. Regression and correlation • Analyze the association between two quantitative variables • Assume independent observations • Assume a linear relationship • Allow hypothesis testing of relationship– drawing inferences on the population • Regression: gives the ‘best-fit’ line to the data • Correlation: gives a measure of scatter of data points around this line
  • 5. Regression line: y = bx + a Least squares method: line is fitted to minimize the sum of the squares of vertical distances of the observed values from the line
  • 6. The regression equation • Gives the ‘best fit’ line to the data • The regression coefficient ‘b’: • Measures the relationship between variables • ‘amount of change in the variable y for a unit change in x’ • Positive for a direct relationship and negative for an inverse one
  • 7. Correlation • While regression equation measures the average relationship between two variables • Correlation gives the strength or goodness of fit of the relationship • Correlation coefficient (Pearson’s) r : lies between -1 to +1
  • 8. Coefficient of determination : r2 • Interpreted as the percentage of total variation in the dependent variable (y) explained by the regression line or just alone by the variation in the particular independent variable (x) • r2 of 1 would imply that 100 percent of variation is explained by variation in x • Values less than 1 imply that other ‘unknown’ variables exist which can explain y to a certain extent
  • 9. Hypothesis testing • The sample statistics b and r used to make inferences on the population parameters • Assumptions for valid inferences: o Independent data (two scatter points are independent) o Linear relationship in mean of y vs x o Distribution of y normal for each x o Variances the same at each x • Confidence intervals and p values are obtained based on t distribution
  • 10. When the assumptions do not hold • Residual analysis • Polynomial regression: y = a + bx + cx2 • Data transformations • Rank correlation: if data transformation fails
  • 11. Spearman rank correlation coefficient rs/ρ Rank data/ ordinal data
  • 12. Significance test on Spearman’s ρ • The test statistic is ρ/rs itself • If the calculated coefficient is within the limits +/- rc (critical value) given in the table for • ‘n’ pairs (10) • two sided significance level α (5 %) • then the null hypothesis (that there is no actual correlation) can’t be rejected For the example the value is +/- 0.6485, so the its concluded that there is no difference between the ranks assigned by the two assessors
  • 14. Wilcoxon rank-sum test/ Mann-Whitney U test • Used when normality assumption doesn’t hold esp. for small samples • Hypothesis test for assessing the assumption that one of the sets of samples have a larger value than others • Ranks are assigned to the values used for comparison • Assumptions: • Sample is randomly drawn • Observations are independent
  • 15. Steps 1. Rank all the values irrespective of the particular group 2. Sum the ranks in each group Original values Ranks W1=52 W2=101
  • 16. U statistic Decision is based on the value of U • For one tailed: u1 or u2 • For two tailed: u = min (u1;u2) Reject the null hypothesis whenever the test statistic u/u1/u2 is less than critical value
  • 17. Comparing two paired groups: Wilcoxon signed-rank test Paired tests are used when the the observations between groups are dependent in some way: • Variable is measured before-after an intervention • Subjects are recruited as matched pairs (such as for age, sex, co- morbidities) • ‘twins’ or siblings recruited as pairs • ‘right-left pairs’– ex different treatment for right and lefty eye Assumption: each pair chosen is random and independent
  • 18. Wilcoxon signed-rank test • Non-parametric test for paired data sets • Tests the hypothesis that there is no difference between two paired groups Steps: • Calculate difference between each matched pair keeping track of the sign • Rank the absolute value of differences for ‘positive’ and ‘negative’ differences ignoring the sign • Calculate the sums of two groups ‘positive’ and ‘negative’ differently • Calculate test statistic and compute the p-value
  • 19. Kruskal-Wallis test • Similar to one-way ANOVA and extension of Mann Whitney U test • Non-parametric test for comparing the medians between more than two groups of observation for a given variable • Ranks are given to all the observations f/b • Sum of the ranks for each group are calculated • Test statistic: H follows a chi square distribution with df= k-1
  • 20. Summary: non-parametric tests • Nonparametric tests are less powerful: ‘some information is discarded while using ranks’ • Sample size: compute the sample size for parametric test and add 15% • Nonparametric tests are usually not reported with CIs • Nonparametric tests are not readily extended to regression models
  • 21. Variable Parametric test (paired test) Non-parametric test (paired test) Quantitative variable; 2 groups Mean or median Unpaired t test (paired t test) Mann Whitney U test (Wilcoxon signed rank test) Quantitative variable; > 2 groups Mean or median One Way ANOVA (repeated measures ANOVA) Kruskal Wallis test (Friedman test) Categorical variable/ proportions Chi square test (Mc Nemar test)

Editor's Notes

  1. Basically built on t distributions