SlideShare a Scribd company logo
Sampling Bias
Dr.K.Prabhakar
Bias
• Once we collect the data we represent the data by way of a model.
Let us assume a linear model.
• This may be written as y(outcome)= a1x1+a2x2+a3x3+…+anxn+ error
• Therefore we predict that there will be an error as the outcome is
expressed as a set of predictor variables multiplied by a set of
coefficients the parameters the a in the equation and tell us about
the relationship between the predictor and outcome variable.
• The prediction will not be perfect as there will be an error as we are
using sample data to predict the outcome variable.
The contexts for bias
• Things that bias the parameter estimates
• Things that bias standard errors and confidence intervals
• Things that bias test statistics and p-values. These bias are related. If
the test statistics are bias then the confidence intervals will be biased.
A bias in confidence intervals will bias the test statistics.
• If the test statistics is biased then the results will be biased and we
need to identify and eliminate the biases as much as possible.
Assumptions that lead to bias
1. Presence of outliners
2. Additivity and linearity
3. Normality
4. Homoscedasticity or homogeneity of variance
5. Independence
Outliers
• Presence of outliers in data will bias the data.
• For example if the class average marks is 60 and standard deviation is
10 marks then if there is a presence of zero marks or 100 marks by
few students may bias the data.
• The outliers need to be identified and removed or replaced to have a
better representation of the data. It generally affect the mean of the
data as well as some of the squares errors. The sum of the squares is
used to compute the standard deviation, which in turn is used to
estimate the standard error. The standard error is used for confidence
intervals around the parameter estimates. This it will have a domino
effect on the results.
Additivity and Linearity
• The assumption is the outcome variable is linearly related to all
predictors. That means the relationship may be summed up as a
straight line.
• If there are several predictors as we have see the equation
y(outcome)= a1x1+a2x2+a3x3+…+anxn+ error
their combined effect is described by adding their effects together.
The model can described accurately by the equation given here.
Assumption of Normality
• There is a mistaken belief that assumption of normality = the data need to be
from normally distributed. This misconception stems from the fact that if the
data is normally distributed then errors in the model as well as sampling
distribution is also normally distributed.
• The central limit theorem means that there are different situations in which we
can assume normality regardless of the shape of the sample data.
• Normality matters when you construct confidence intervals around parameters of
the model or compute significance tests relating to those parameters then
assumption of normality matters in small samples.
• As long as the sample size is fairly large, outliers are taken into account then
assumption of normality will not be a pressing concern.
• Lumley, T., Diehr, P., Emerson, S., & Chen, L. (2002). The importance of
the normality assumption in large public health data sets. Annual review of
public health, 23(1), 151-169.
Homoscedasticity or homogeneity of variance

More Related Content

What's hot

Multivariate reg analysis
Multivariate reg analysisMultivariate reg analysis
Multivariate reg analysis
Irfan Hussain
 
Introduction to Structural Equation Modeling
Introduction to Structural Equation ModelingIntroduction to Structural Equation Modeling
Introduction to Structural Equation Modeling
Azmi Mohd Tamil
 
M1 regression metrics_middleschool
M1 regression metrics_middleschoolM1 regression metrics_middleschool
M1 regression metrics_middleschool
aiclub_slides
 
Methods of point estimation
Methods of point estimationMethods of point estimation
Methods of point estimation
Suruchi Somwanshi
 
Statistical Methods to Handle Missing Data
Statistical Methods to Handle Missing DataStatistical Methods to Handle Missing Data
Statistical Methods to Handle Missing DataTianfan Song
 
Biostatistics Workshop: Missing Data
Biostatistics Workshop: Missing DataBiostatistics Workshop: Missing Data
Biostatistics Workshop: Missing Data
HopkinsCFAR
 
Lab report walk through
Lab report walk throughLab report walk through
Lab report walk through
serenaasya
 
Estimation Theory
Estimation TheoryEstimation Theory
Estimation Theory
Seung Ho Choi
 
Use of Linear Regression in Machine Learning for Ranking
Use of Linear Regression in Machine Learning for RankingUse of Linear Regression in Machine Learning for Ranking
Use of Linear Regression in Machine Learning for Ranking
ijsrd.com
 
Statistical Methods
Statistical MethodsStatistical Methods
Statistical Methodsguest2137aa
 
R - Multiple Regression
R - Multiple RegressionR - Multiple Regression
R - Multiple Regression
Learnbay Datascience
 
CS550 Presentation - On comparing classifiers by Slazberg
CS550 Presentation - On comparing classifiers by SlazbergCS550 Presentation - On comparing classifiers by Slazberg
CS550 Presentation - On comparing classifiers by Slazberg
mustafa sarac
 
Regression
RegressionRegression
Regression
Rohit Sharma
 
Introduction to principal component analysis (pca)
Introduction to principal component analysis (pca)Introduction to principal component analysis (pca)
Introduction to principal component analysis (pca)
Mohammed Musah
 
Point estimation
Point estimationPoint estimation
Point estimation
Shahab Yaseen
 
Lecture note 2
Lecture note 2Lecture note 2
Lecture note 2
sreenu t
 
Polynomials 12.2 12.4
Polynomials 12.2 12.4Polynomials 12.2 12.4
Polynomials 12.2 12.4
RobinFilter
 
Lesson 10 rm psych stats & graphs 2013
Lesson 10   rm psych stats & graphs 2013Lesson 10   rm psych stats & graphs 2013
Lesson 10 rm psych stats & graphs 2013coburgpsych
 

What's hot (19)

Multivariate reg analysis
Multivariate reg analysisMultivariate reg analysis
Multivariate reg analysis
 
Introduction to Structural Equation Modeling
Introduction to Structural Equation ModelingIntroduction to Structural Equation Modeling
Introduction to Structural Equation Modeling
 
M1 regression metrics_middleschool
M1 regression metrics_middleschoolM1 regression metrics_middleschool
M1 regression metrics_middleschool
 
Methods of point estimation
Methods of point estimationMethods of point estimation
Methods of point estimation
 
Statistical Methods to Handle Missing Data
Statistical Methods to Handle Missing DataStatistical Methods to Handle Missing Data
Statistical Methods to Handle Missing Data
 
Biostatistics Workshop: Missing Data
Biostatistics Workshop: Missing DataBiostatistics Workshop: Missing Data
Biostatistics Workshop: Missing Data
 
Lab report walk through
Lab report walk throughLab report walk through
Lab report walk through
 
Estimation Theory
Estimation TheoryEstimation Theory
Estimation Theory
 
Use of Linear Regression in Machine Learning for Ranking
Use of Linear Regression in Machine Learning for RankingUse of Linear Regression in Machine Learning for Ranking
Use of Linear Regression in Machine Learning for Ranking
 
Statistical Methods
Statistical MethodsStatistical Methods
Statistical Methods
 
R - Multiple Regression
R - Multiple RegressionR - Multiple Regression
R - Multiple Regression
 
CS550 Presentation - On comparing classifiers by Slazberg
CS550 Presentation - On comparing classifiers by SlazbergCS550 Presentation - On comparing classifiers by Slazberg
CS550 Presentation - On comparing classifiers by Slazberg
 
Regression
RegressionRegression
Regression
 
Introduction to principal component analysis (pca)
Introduction to principal component analysis (pca)Introduction to principal component analysis (pca)
Introduction to principal component analysis (pca)
 
Point estimation
Point estimationPoint estimation
Point estimation
 
Lecture note 2
Lecture note 2Lecture note 2
Lecture note 2
 
Polynomials 12.2 12.4
Polynomials 12.2 12.4Polynomials 12.2 12.4
Polynomials 12.2 12.4
 
The Chi Square Test
The Chi Square TestThe Chi Square Test
The Chi Square Test
 
Lesson 10 rm psych stats & graphs 2013
Lesson 10   rm psych stats & graphs 2013Lesson 10   rm psych stats & graphs 2013
Lesson 10 rm psych stats & graphs 2013
 

Similar to Bias in Research Methods

regression.pptx
regression.pptxregression.pptx
regression.pptx
aneeshs28
 
Multiple linear regression
Multiple linear regressionMultiple linear regression
Multiple linear regression
Avjinder (Avi) Kaler
 
Lect w8 w9_correlation_regression
Lect w8 w9_correlation_regressionLect w8 w9_correlation_regression
Lect w8 w9_correlation_regression
Rione Drevale
 
Unit III_Ch 17_Probablistic Methods.pptx
Unit III_Ch 17_Probablistic Methods.pptxUnit III_Ch 17_Probablistic Methods.pptx
Unit III_Ch 17_Probablistic Methods.pptx
smithashetty24
 
03 Data Mining Techniques
03 Data Mining Techniques03 Data Mining Techniques
03 Data Mining Techniques
Valerii Klymchuk
 
Errors2
Errors2Errors2
Errors2
sjsuchaya
 
Descriptive Statistics
Descriptive StatisticsDescriptive Statistics
Descriptive Statistics
Kush Kulshrestha
 
Error in chemical analysis
Error in chemical analysisError in chemical analysis
Error in chemical analysis
Suresh Selvaraj
 
chapter12.ppt
chapter12.pptchapter12.ppt
chapter12.ppt
EndrisHEbrahim
 
Correlation in Statistics
Correlation in StatisticsCorrelation in Statistics
Correlation in Statistics
Avjinder (Avi) Kaler
 
Normal distribtion curve
Normal distribtion curveNormal distribtion curve
Normal distribtion curve
AliRaza1767
 
L1 statistics
L1 statisticsL1 statistics
L1 statisticsdapdai
 
statistical estimation
statistical estimationstatistical estimation
statistical estimation
Amish Akbar
 
Ch3_Statistical Analysis and Random Error Estimation.pdf
Ch3_Statistical Analysis and Random Error Estimation.pdfCh3_Statistical Analysis and Random Error Estimation.pdf
Ch3_Statistical Analysis and Random Error Estimation.pdf
Vamshi962726
 
Machine learning session4(linear regression)
Machine learning   session4(linear regression)Machine learning   session4(linear regression)
Machine learning session4(linear regression)
Abhimanyu Dwivedi
 
template.pptx
template.pptxtemplate.pptx
template.pptx
uzmasulthana3
 
R training4
R training4R training4
R training4
Hellen Gakuruh
 
DSE-2, ANALYTICAL METHODS.pptx
DSE-2, ANALYTICAL METHODS.pptxDSE-2, ANALYTICAL METHODS.pptx
DSE-2, ANALYTICAL METHODS.pptx
Mathabhanga College
 
Physics 1.2b Errors and Uncertainties
Physics 1.2b Errors and UncertaintiesPhysics 1.2b Errors and Uncertainties
Physics 1.2b Errors and Uncertainties
JohnPaul Kennedy
 
Presentation1
Presentation1Presentation1
Presentation1
Nalini Singh
 

Similar to Bias in Research Methods (20)

regression.pptx
regression.pptxregression.pptx
regression.pptx
 
Multiple linear regression
Multiple linear regressionMultiple linear regression
Multiple linear regression
 
Lect w8 w9_correlation_regression
Lect w8 w9_correlation_regressionLect w8 w9_correlation_regression
Lect w8 w9_correlation_regression
 
Unit III_Ch 17_Probablistic Methods.pptx
Unit III_Ch 17_Probablistic Methods.pptxUnit III_Ch 17_Probablistic Methods.pptx
Unit III_Ch 17_Probablistic Methods.pptx
 
03 Data Mining Techniques
03 Data Mining Techniques03 Data Mining Techniques
03 Data Mining Techniques
 
Errors2
Errors2Errors2
Errors2
 
Descriptive Statistics
Descriptive StatisticsDescriptive Statistics
Descriptive Statistics
 
Error in chemical analysis
Error in chemical analysisError in chemical analysis
Error in chemical analysis
 
chapter12.ppt
chapter12.pptchapter12.ppt
chapter12.ppt
 
Correlation in Statistics
Correlation in StatisticsCorrelation in Statistics
Correlation in Statistics
 
Normal distribtion curve
Normal distribtion curveNormal distribtion curve
Normal distribtion curve
 
L1 statistics
L1 statisticsL1 statistics
L1 statistics
 
statistical estimation
statistical estimationstatistical estimation
statistical estimation
 
Ch3_Statistical Analysis and Random Error Estimation.pdf
Ch3_Statistical Analysis and Random Error Estimation.pdfCh3_Statistical Analysis and Random Error Estimation.pdf
Ch3_Statistical Analysis and Random Error Estimation.pdf
 
Machine learning session4(linear regression)
Machine learning   session4(linear regression)Machine learning   session4(linear regression)
Machine learning session4(linear regression)
 
template.pptx
template.pptxtemplate.pptx
template.pptx
 
R training4
R training4R training4
R training4
 
DSE-2, ANALYTICAL METHODS.pptx
DSE-2, ANALYTICAL METHODS.pptxDSE-2, ANALYTICAL METHODS.pptx
DSE-2, ANALYTICAL METHODS.pptx
 
Physics 1.2b Errors and Uncertainties
Physics 1.2b Errors and UncertaintiesPhysics 1.2b Errors and Uncertainties
Physics 1.2b Errors and Uncertainties
 
Presentation1
Presentation1Presentation1
Presentation1
 

More from Centre for Social Initiative and Management

Epistemology and Learning for Researchers and Teachers
Epistemology and Learning for Researchers and TeachersEpistemology and Learning for Researchers and Teachers
Epistemology and Learning for Researchers and Teachers
Centre for Social Initiative and Management
 
The Crooked Timber of New India [Autosaved].pptx
The Crooked Timber of New India [Autosaved].pptxThe Crooked Timber of New India [Autosaved].pptx
The Crooked Timber of New India [Autosaved].pptx
Centre for Social Initiative and Management
 
Qualitative research and use of Nvivo
Qualitative research and use of NvivoQualitative research and use of Nvivo
Qualitative research and use of Nvivo
Centre for Social Initiative and Management
 
Impact of covid pandemic on indian economy future
Impact of covid pandemic on indian economy futureImpact of covid pandemic on indian economy future
Impact of covid pandemic on indian economy future
Centre for Social Initiative and Management
 
Learning
LearningLearning
Introduction to qualitative research and nvivo 12
Introduction to qualitative research and nvivo 12Introduction to qualitative research and nvivo 12
Introduction to qualitative research and nvivo 12
Centre for Social Initiative and Management
 
Examiners Expectations from PhD Thesis
Examiners Expectations from PhD ThesisExaminers Expectations from PhD Thesis
Examiners Expectations from PhD Thesis
Centre for Social Initiative and Management
 
Fundamental of Research
Fundamental of Research Fundamental of Research
Reporting Results of Statistical Analysis
Reporting Results of Statistical Analysis Reporting Results of Statistical Analysis
Reporting Results of Statistical Analysis
Centre for Social Initiative and Management
 
Sample Size Determination
Sample Size DeterminationSample Size Determination
Sampling Concepts
 Sampling Concepts Sampling Concepts
Sampling
 Sampling Sampling
Variables, Theory and Sampling Map
Variables, Theory and Sampling MapVariables, Theory and Sampling Map
Variables, Theory and Sampling Map
Centre for Social Initiative and Management
 
Role of Good Governance Practices
Role of Good Governance Practices Role of Good Governance Practices
Role of Good Governance Practices
Centre for Social Initiative and Management
 
Innovations for next 30 years and business
Innovations for next 30 years and businessInnovations for next 30 years and business
Innovations for next 30 years and business
Centre for Social Initiative and Management
 
Companies Act 2013 and Corporate Social Responsibility
Companies Act 2013 and Corporate Social Responsibility Companies Act 2013 and Corporate Social Responsibility
Companies Act 2013 and Corporate Social Responsibility
Centre for Social Initiative and Management
 

More from Centre for Social Initiative and Management (20)

Epistemology and Learning for Researchers and Teachers
Epistemology and Learning for Researchers and TeachersEpistemology and Learning for Researchers and Teachers
Epistemology and Learning for Researchers and Teachers
 
The Crooked Timber of New India [Autosaved].pptx
The Crooked Timber of New India [Autosaved].pptxThe Crooked Timber of New India [Autosaved].pptx
The Crooked Timber of New India [Autosaved].pptx
 
Qualitative research and use of Nvivo
Qualitative research and use of NvivoQualitative research and use of Nvivo
Qualitative research and use of Nvivo
 
Impact of covid pandemic on indian economy future
Impact of covid pandemic on indian economy futureImpact of covid pandemic on indian economy future
Impact of covid pandemic on indian economy future
 
Learning
LearningLearning
Learning
 
Introduction to qualitative research and nvivo 12
Introduction to qualitative research and nvivo 12Introduction to qualitative research and nvivo 12
Introduction to qualitative research and nvivo 12
 
Examiners Expectations from PhD Thesis
Examiners Expectations from PhD ThesisExaminers Expectations from PhD Thesis
Examiners Expectations from PhD Thesis
 
Fundamental of Research
Fundamental of Research Fundamental of Research
Fundamental of Research
 
Reporting Results of Statistical Analysis
Reporting Results of Statistical Analysis Reporting Results of Statistical Analysis
Reporting Results of Statistical Analysis
 
Sample Size Determination
Sample Size DeterminationSample Size Determination
Sample Size Determination
 
Sampling Concepts
 Sampling Concepts Sampling Concepts
Sampling Concepts
 
Sampling
 Sampling Sampling
Sampling
 
Variables, Theory and Sampling Map
Variables, Theory and Sampling MapVariables, Theory and Sampling Map
Variables, Theory and Sampling Map
 
Role of Good Governance Practices
Role of Good Governance Practices Role of Good Governance Practices
Role of Good Governance Practices
 
Individualization
IndividualizationIndividualization
Individualization
 
The twelve commandments to live better by one of my friend
 The twelve commandments to live better by one of my friend  The twelve commandments to live better by one of my friend
The twelve commandments to live better by one of my friend
 
Innovations for next 30 years and business
Innovations for next 30 years and businessInnovations for next 30 years and business
Innovations for next 30 years and business
 
Companies Act 2013 and Corporate Social Responsibility
Companies Act 2013 and Corporate Social Responsibility Companies Act 2013 and Corporate Social Responsibility
Companies Act 2013 and Corporate Social Responsibility
 
Sight Care Foundation
Sight Care Foundation Sight Care Foundation
Sight Care Foundation
 
Project guidelines for mba
Project guidelines for mbaProject guidelines for mba
Project guidelines for mba
 

Recently uploaded

一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
ewymefz
 
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
ukgaet
 
一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单
ocavb
 
Jpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization SampleJpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization Sample
James Polillo
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
axoqas
 
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
NABLAS株式会社
 
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project PresentationPredicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Boston Institute of Analytics
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
Subhajit Sahu
 
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
nscud
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
vcaxypu
 
Empowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptxEmpowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptx
benishzehra469
 
standardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghhstandardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghh
ArpitMalhotra16
 
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
AbhimanyuSinha9
 
Business update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMIBusiness update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMI
AlejandraGmez176757
 
SOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape ReportSOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape Report
SOCRadar
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
jerlynmaetalle
 
Q1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundQ1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year Rebound
Oppotus
 
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdfSample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Linda486226
 
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
yhkoc
 

Recently uploaded (20)

一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
 
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
 
一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单
 
Jpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization SampleJpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization Sample
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
 
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
 
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project PresentationPredicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
 
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
 
Empowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptxEmpowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptx
 
standardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghhstandardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghh
 
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
 
Business update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMIBusiness update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMI
 
SOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape ReportSOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape Report
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
 
Q1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundQ1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year Rebound
 
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdfSample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
 
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
 

Bias in Research Methods

  • 2. Bias • Once we collect the data we represent the data by way of a model. Let us assume a linear model. • This may be written as y(outcome)= a1x1+a2x2+a3x3+…+anxn+ error • Therefore we predict that there will be an error as the outcome is expressed as a set of predictor variables multiplied by a set of coefficients the parameters the a in the equation and tell us about the relationship between the predictor and outcome variable. • The prediction will not be perfect as there will be an error as we are using sample data to predict the outcome variable.
  • 3. The contexts for bias • Things that bias the parameter estimates • Things that bias standard errors and confidence intervals • Things that bias test statistics and p-values. These bias are related. If the test statistics are bias then the confidence intervals will be biased. A bias in confidence intervals will bias the test statistics. • If the test statistics is biased then the results will be biased and we need to identify and eliminate the biases as much as possible.
  • 4. Assumptions that lead to bias 1. Presence of outliners 2. Additivity and linearity 3. Normality 4. Homoscedasticity or homogeneity of variance 5. Independence
  • 5. Outliers • Presence of outliers in data will bias the data. • For example if the class average marks is 60 and standard deviation is 10 marks then if there is a presence of zero marks or 100 marks by few students may bias the data. • The outliers need to be identified and removed or replaced to have a better representation of the data. It generally affect the mean of the data as well as some of the squares errors. The sum of the squares is used to compute the standard deviation, which in turn is used to estimate the standard error. The standard error is used for confidence intervals around the parameter estimates. This it will have a domino effect on the results.
  • 6. Additivity and Linearity • The assumption is the outcome variable is linearly related to all predictors. That means the relationship may be summed up as a straight line. • If there are several predictors as we have see the equation y(outcome)= a1x1+a2x2+a3x3+…+anxn+ error their combined effect is described by adding their effects together. The model can described accurately by the equation given here.
  • 7. Assumption of Normality • There is a mistaken belief that assumption of normality = the data need to be from normally distributed. This misconception stems from the fact that if the data is normally distributed then errors in the model as well as sampling distribution is also normally distributed. • The central limit theorem means that there are different situations in which we can assume normality regardless of the shape of the sample data. • Normality matters when you construct confidence intervals around parameters of the model or compute significance tests relating to those parameters then assumption of normality matters in small samples. • As long as the sample size is fairly large, outliers are taken into account then assumption of normality will not be a pressing concern. • Lumley, T., Diehr, P., Emerson, S., & Chen, L. (2002). The importance of the normality assumption in large public health data sets. Annual review of public health, 23(1), 151-169.