Saudi Board of Preventive Medicine, Riyadh Ministry of Health, KSA
Dr. S. A. Rizwan, M.D.
Demystifying statistics series: Meta-analysis course
Analysis of small datasets
Dr. S. A. Rizwan, M.D.,
Public Health Specialist,
Saudi Board of Preventive Medicine,
Riyadh, Kingdom of Saudi Arabia
11/25/19
Outline
• What is small?
• Misconceptions about small datasets
• Where do we see small datasets?
• Problems with small datasets
• Descriptive statistics for small datasets
• Inferential statistics for small datasets
What is small?
• n<30 rule
  • Arbitrary
  • Not always correct
  • For full multivariate techniques, even n = 100 may be considered small
• When do we call a study sample small?
  • The outcome is highly influenced by one or two cases
  • Valid estimates of parameters and their standard errors (SE) are not possible
  • Iterative estimation methods do not converge
  • The sample size is not appropriate for the effect size to be detected
  • Distributions of the data are not consistent
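
The first criterion above (an outcome highly influenced by one or two cases) can be checked with a leave-one-out recalculation. This is a minimal sketch only: the function name `leave_one_out_means` and the sample values are hypothetical and do not come from the slides.

```python
from statistics import mean

def leave_one_out_means(values):
    """Return the mean recomputed with each observation dropped in turn."""
    return [mean(values[:i] + values[i + 1:]) for i in range(len(values))]

# Hypothetical small sample with one extreme value (assumed data)
sample = [4.0, 5.0, 5.0, 6.0, 30.0]

full_mean = mean(sample)            # mean using all five cases
loo = leave_one_out_means(sample)   # five means, each omitting one case
spread = max(loo) - min(loo)        # how far a single case can move the mean

print("full mean:", full_mean)
print("leave-one-out means:", loo)
print("spread:", spread)
```

A spread that is large relative to the full-sample mean suggests that a single case is driving the result.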
Misconceptions about small datasets
• Some think statistics cannot be used with small datasets
• Some think small datasets are not useful
• Small-sample analysis is sometimes likened to making astronomical observations with binoculars (i.e., only big things such as planets and meteors can be seen)
• However, Galileo used low-power telescopes in his time to discover the moons of Jupiter
Where do we see small datasets?
• Brand new drug trials
• Preclinical studies
• Animal experiments (esp. requiring sacrifice)
• Limited biological samples (like organs)
• Proof of concept studies
• Brand new or expensive technologies or tests (e.g., fMRI)
• Neurosurgery/neuropsychology
Problems with small datasets
• Non-normal distributions (which limit the statistical procedures available)
• Outliers
• Statistical significance is less likely
• Practical significance is less likely
• Perceived deficiency in generalizability
• Lower power and higher margin of error
• Only big effects can be detected, so reported effect sizes tend to be inflated
• Inflated false discovery rate
• Low reproducibility
• Reduced scope for multiple subgroup analyses
• Because small-sample analyses require compromises, they are difficult to justify
• Small sample size also prevents us from properly estimating and modeling the populations we sample from.
• As a consequence, small n stops us from answering a fundamental, yet often ignored, empirical question: how do distributions differ?
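
The point about inflated effect sizes can be illustrated with a small simulation. This is a sketch under stated assumptions, not an analysis from the slides: the true standardized effect (0.3), the group size (10 per arm), the number of simulated studies, the seed, and the function name `simulate_significant_effects` are all hypothetical choices.

```python
import random
from math import sqrt
from statistics import mean, variance

def simulate_significant_effects(true_d=0.3, n=10, n_studies=2000,
                                 t_crit=2.101, seed=1):
    """Simulate two-group studies; return effect estimates from 'significant' ones.

    t_crit = 2.101 is the two-sided 5% critical value of t with 18 df,
    so it matches the default n = 10 per group only.
    """
    rng = random.Random(seed)
    significant = []
    for _ in range(n_studies):
        control = [rng.gauss(0.0, 1.0) for _ in range(n)]
        treated = [rng.gauss(true_d, 1.0) for _ in range(n)]
        diff = mean(treated) - mean(control)
        # Pooled SD for two groups of equal size
        pooled_sd = sqrt((variance(treated) + variance(control)) / 2)
        t = diff / (pooled_sd * sqrt(2 / n))
        if t > t_crit:  # significant in the expected direction
            significant.append(diff / pooled_sd)  # estimated standardized effect
    return significant

sig = simulate_significant_effects()
print("share of studies reaching significance:", len(sig) / 2000)
print("mean estimated effect among them:", mean(sig))
print("true effect:", 0.3)
```

With 10 cases per group, a study can only reach significance if its estimated standardized effect exceeds about 0.94, so every "significant" estimate overstates a true effect of 0.3.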
Descriptive statistics for small datasets
• Mean (sometimes appropriate)
• Median, IQR, range
• Log or other transformations; geometric mean
• Outlier examination
• Displaying frequencies (counts) instead of percentages
• Publishing the entire dataset as a table
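
The descriptive summaries listed above can be computed with Python's standard `statistics` module. The data values below are hypothetical and serve only to show how the mean, median, and geometric mean diverge in a small, skewed sample; the quartile method (`"inclusive"`) is one of several conventions.

```python
from statistics import mean, median, geometric_mean, quantiles

# Hypothetical right-skewed measurements, n = 8 (assumed data)
values = [1.0, 2.0, 2.0, 3.0, 4.0, 5.0, 8.0, 40.0]

# Quartiles using the "inclusive" method (treats the data as the full population)
q1, _, q3 = quantiles(values, n=4, method="inclusive")

summary = {
    "n": len(values),                          # report counts, not only percentages
    "mean": mean(values),                      # pulled upward by the extreme value
    "median": median(values),
    "geometric_mean": geometric_mean(values),  # mean on the log scale, back-transformed
    "iqr": (q1, q3),
    "range": (min(values), max(values)),
}

for name, value in summary.items():
    print(f"{name}: {value}")
```

With so few cases, printing the raw values alongside the summary (the "entire dataset as a table" option) is often feasible as well.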
Inferential statistics for small datasets
• Nonparametric/exact hypothesis tests
• (N-1) finite population correction for tests
• Power calculation in case of non-significant tests
• Data simulation techniques
• Bayesian inferences
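
As an illustration of an exact hypothesis test, the sketch below enumerates every possible reassignment of observations to two groups (an exact permutation test for a difference in means). It is one example of the exact tests mentioned above, not necessarily the one the course uses; the function name and data are hypothetical, and full enumeration is practical only for very small samples.

```python
from itertools import combinations
from statistics import mean

def exact_permutation_test(group_a, group_b):
    """Two-sided exact permutation p-value for a difference in group means."""
    pooled = list(group_a) + list(group_b)
    n_a = len(group_a)
    observed = abs(mean(group_a) - mean(group_b))
    total = 0
    extreme = 0
    # Enumerate every way of choosing which observations form group A
    for idx in combinations(range(len(pooled)), n_a):
        chosen = set(idx)
        a = [pooled[i] for i in chosen]
        b = [pooled[i] for i in range(len(pooled)) if i not in chosen]
        total += 1
        # Small tolerance guards against floating-point ties
        if abs(mean(a) - mean(b)) >= observed - 1e-12:
            extreme += 1
    return extreme / total

# Hypothetical data: two groups of four with no overlap
p_value = exact_permutation_test([1, 2, 3, 4], [5, 6, 7, 8])
print("exact two-sided p-value:", p_value)
```

Here only 2 of the 70 possible splits are as extreme as the observed one, which also shows that very small samples limit how small an exact p-value can be.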
• Confidence intervals for small datasets/non-normal distributions
  • Based on the t distribution
  • Log-transformed intervals
  • Exact method
  • Adjusted Wald interval for proportions
  • Score method
  • Bootstrapping and Monte Carlo simulations
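
Two of the interval methods named above can be sketched for a single proportion: the score (Wilson) interval and the adjusted Wald interval, taken here to mean the Agresti-Coull form. That reading, the function names, and the example counts (2 events in 10) are assumptions for illustration.

```python
from math import sqrt

Z95 = 1.959964  # standard normal quantile for a two-sided 95% interval

def wilson_interval(successes, n, z=Z95):
    """Score (Wilson) confidence interval for a proportion."""
    p = successes / n
    denom = 1 + z**2 / n
    centre = (p + z**2 / (2 * n)) / denom
    half = z * sqrt(p * (1 - p) / n + z**2 / (4 * n**2)) / denom
    return centre - half, centre + half

def adjusted_wald_interval(successes, n, z=Z95):
    """Adjusted Wald (Agresti-Coull) interval: add z^2/2 successes and z^2/2 failures."""
    n_adj = n + z**2
    p_adj = (successes + z**2 / 2) / n_adj
    half = z * sqrt(p_adj * (1 - p_adj) / n_adj)
    # Clip to the valid range for a proportion
    return max(0.0, p_adj - half), min(1.0, p_adj + half)

# Hypothetical small study: 2 events among 10 participants
wilson = wilson_interval(2, 10)
adj_wald = adjusted_wald_interval(2, 10)
print("score (Wilson):", wilson)
print("adjusted Wald: ", adj_wald)
```

Both intervals are centred on the same adjusted proportion rather than on the raw 2/10, and the adjusted Wald interval is the slightly wider of the two.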
When to use exact tests in SPSS
Approaches to analysis of small datasets
• Informative analysis
  • Data analysis is informative when it addresses the question that motivated the research
  • Hypothesis testing should be sufficiently powered to detect meaningful effects
  • As a compromise, conduct descriptive analyses to set the stage
• Finite population correction
  • Assumes random sampling without replacement and accounts for the reduction in sampling error as the sampling fraction f = n/N increases toward 1
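
The finite population correction can be sketched as the usual multiplier sqrt((N - n) / (N - 1)) applied to the standard error of a mean. The function names and the numbers used (SD of 10, population of 100) are hypothetical.

```python
from math import sqrt

def fpc(n, N):
    """Finite population correction factor sqrt((N - n) / (N - 1))."""
    if not 0 < n <= N:
        raise ValueError("need 0 < n <= N")
    if N == 1:
        return 0.0  # the single unit is the whole population
    return sqrt((N - n) / (N - 1))

def se_mean_with_fpc(sd, n, N):
    """Standard error of the mean under sampling without replacement."""
    return sd / sqrt(n) * fpc(n, N)

# Hypothetical population of N = 100 with sample SD = 10
for n in (10, 25, 50, 100):
    f = n / 100  # sampling fraction
    print(f"n={n:3d}  f={f:.2f}  SE={se_mean_with_fpc(10.0, n, 100):.3f}")
```

The printed standard errors shrink faster than 1/sqrt(n) alone would suggest and reach zero when the whole population is sampled (f = 1).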
• Design and measurement issues to optimize research
  • If the goal is to detect a significant effect, there are two options for increasing t (a general t statistic is the ratio of a parameter estimate to its standard error):
    • Approaches for increasing the parameter estimate
      • Sharpen the focus and increase the dosage in the treatment (Rx) group
      • Ensure there is no hint of the active component in the control group
      • Focus the treatment directly on the causal mechanism
    • Approaches for decreasing the SE
      • Increase the sample size
      • Make full use of the data, including incomplete cases, by handling missing data via imputation
      • In a multivariate model, add more explanatory variables at the cost of degrees of freedom (df)
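
The two routes to a larger t can be made concrete with a small calculation. As a simplifying assumption, the sketch uses the one-sample case where SE = SD / sqrt(n); the function name and all numbers are hypothetical.

```python
from math import sqrt

def t_statistic(estimate, sd, n):
    """t as the ratio of an estimate to its standard error, with SE = sd / sqrt(n)."""
    return estimate / (sd / sqrt(n))

# Hypothetical baseline study: estimate 2.0, SD 5.0, n = 16
baseline = t_statistic(estimate=2.0, sd=5.0, n=16)

# Route 1: increase the parameter estimate (e.g., a stronger, more focused treatment)
bigger_effect = t_statistic(estimate=3.0, sd=5.0, n=16)

# Route 2: decrease the SE, by a larger sample or by less unexplained variability
bigger_sample = t_statistic(estimate=2.0, sd=5.0, n=36)
less_noise = t_statistic(estimate=2.0, sd=4.0, n=16)

print(baseline, bigger_effect, bigger_sample, less_noise)
```

In this example, raising the estimate from 2.0 to 3.0 increases t by the same amount as raising n from 16 to 36, which shows why sharpening the treatment can matter as much as recruiting more participants.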
• Design and measurement issues to optimize research (continued)
  • The outcome measure chosen should be reliable, to minimize attenuation, and sensitive, to maximize the odds of detecting a difference
  • Focus on proximal rather than distal outcomes, since proximal outcomes are easier to demonstrate
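
The effect of unreliable measurement can be sketched with the classical attenuation formula, in which the observed correlation equals the true correlation multiplied by the square root of the product of the two reliabilities. Treating the slide's "attenuation" as this formula is an assumption, and the function name and values are hypothetical.

```python
from math import sqrt

def attenuated_correlation(true_r, reliability_x, reliability_y):
    """Observed correlation expected under the classical attenuation formula."""
    return true_r * sqrt(reliability_x * reliability_y)

# Hypothetical true correlation of 0.5 measured with instruments of differing reliability
good_measures = attenuated_correlation(0.5, 0.9, 0.9)
poor_measures = attenuated_correlation(0.5, 0.6, 0.6)

print("reliable measures:  ", good_measures)
print("unreliable measures:", poor_measures)
```

A weaker observed association needs a larger sample to detect, so unreliable outcome measures are especially costly when n is small.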
• Multivariate models
  • There is substantial evidence of people using so-called large-sample multivariate techniques with samples that are clearly small
  • In cluster studies, fewer than 30 clusters is small
  • Growth models, exploratory factor analysis studies, and structural equation models with fewer than 100 participants are small
  • For multilevel modeling, small might be considered fewer than 40 clusters
    • Approaches include restricted maximum likelihood, restricted maximum likelihood with the Kenward-Roger correction, and the wild cluster bootstrap
  • By another rule of thumb, structural equation modeling with fewer than 200 people is considered a small sample
• Bayesian methods
  • Bayesian statistics incorporate prior knowledge along with a given set of current observations in order to make statistical inferences
  • The prior information could come from observational data
  • Particularly useful where current test data are lacking but there is a strong prior understanding of the parameter
  • By incorporating prior information about a parameter, a posterior distribution for that parameter can be produced and an adequate estimate of reliability can be obtained
  • Example applications include estimating poverty in a small area, such as a school district, or estimating a treatment effect
  • Bayesian modeling suggests a middle ground: an estimate that lies between the direct estimate and the regression estimate
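
The "middle ground" idea can be illustrated with the simplest conjugate case, a Beta prior updated with binomial data. This is a stand-in for the small-area models the slide alludes to, not the same model: here the prior mean plays the role of the regression estimate. The prior parameters, data, and function names are hypothetical.

```python
def beta_binomial_update(prior_a, prior_b, successes, n):
    """Posterior Beta parameters after observing `successes` out of `n` trials."""
    return prior_a + successes, prior_b + (n - successes)

def beta_mean(a, b):
    """Mean of a Beta(a, b) distribution."""
    return a / (a + b)

# Hypothetical prior: mean 0.30, carrying the weight of 20 prior observations
prior_a, prior_b = 6.0, 14.0
prior_mean = beta_mean(prior_a, prior_b)

# Hypothetical small study: 6 events among 10 participants
successes, n = 6, 10
direct_estimate = successes / n

post_a, post_b = beta_binomial_update(prior_a, prior_b, successes, n)
posterior_mean = beta_mean(post_a, post_b)

print("prior mean:     ", prior_mean)
print("direct estimate:", direct_estimate)
print("posterior mean: ", posterior_mean)
```

The posterior mean lies between the prior mean and the direct estimate, and it moves toward the direct estimate as the study's n grows relative to the weight of the prior.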
Take home messages
• Small datasets are not all bad
• They can be useful in very specific situations
• A thorough understanding of statistical methods for small datasets is required to draw proper conclusions
• Beware of conclusions drawn by applying regular (large-sample) statistics to small datasets
Thank you
Kindly email your queries to sarizwan1986@outlook.com