“Strengthening the Research Capacity through
Statistical Interventions”
Dr. Jitendra Kumar Chaudhary
Assistant Professor
Department of Animal Genetics & Breeding,
CVSc. & A.H., Central Agricultural University (CAU),
Selesih, Aizawl, Mizoram-796014
Email: vetjitu@gmail.com
Data Transformation:
If a measurement variable does not fit a normal
distribution, or its standard deviation differs greatly
between groups, consider transforming the data before
analysis.
Normality Test:
1) Skewness & Kurtosis
2) Shapiro-Wilk test
3) Histogram
Procedure to test normality in SPSS
Go to Analyze > Descriptive Statistics > Explore. Put the
measurement variable in the Dependent List and the grouping
variable in the Factor List. Click Plots, tick Histogram and
"Normality plots with tests", then click Continue and OK.
Results
Divide the skewness and the kurtosis by their respective
standard errors. If both ratios lie between -1.96 and +1.96,
the data can be treated as normally distributed (ratios only
slightly outside this range are usually still acceptable).
Check whether the Shapiro-Wilk test is significant. A
non-significant result (conventionally p > 0.05) means the data
are consistent with normality; a significant result means the
data are not normal.
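The skewness and kurtosis check above can be sketched in a few lines of Python. This is a minimal illustration, not the SPSS implementation: the function names `skew_kurt_z` and `looks_normal` and the sample data are hypothetical, and the formulas are the usual small-sample-adjusted skewness, excess kurtosis and their standard errors, which are assumed here to match what SPSS reports.

```python
import math

def skew_kurt_z(values):
    """Return (skewness / SE, kurtosis / SE) for a list of numbers."""
    n = len(values)
    if n < 4:
        raise ValueError("need at least 4 observations")
    mean = sum(values) / n
    dev = [x - mean for x in values]
    s2 = sum(d ** 2 for d in dev) / (n - 1)      # sample variance
    if s2 == 0:
        raise ValueError("all values are identical")
    sum3 = sum(d ** 3 for d in dev)
    sum4 = sum(d ** 4 for d in dev)
    # Small-sample-adjusted skewness and excess kurtosis
    skew = n * sum3 / ((n - 1) * (n - 2) * s2 ** 1.5)
    kurt = (n * (n + 1) * sum4 / ((n - 1) * (n - 2) * (n - 3) * s2 ** 2)
            - 3 * (n - 1) ** 2 / ((n - 2) * (n - 3)))
    # Standard errors of the two statistics
    se_skew = math.sqrt(6 * n * (n - 1) / ((n - 2) * (n + 1) * (n + 3)))
    se_kurt = 2 * se_skew * math.sqrt((n * n - 1) / ((n - 3) * (n + 5)))
    return skew / se_skew, kurt / se_kurt

def looks_normal(values, limit=1.96):
    """Apply the -1.96 to +1.96 rule to both ratios."""
    z_skew, z_kurt = skew_kurt_z(values)
    return abs(z_skew) <= limit and abs(z_kurt) <= limit

symmetric = [1, 2, 3, 4, 5, 6, 7, 8, 9]          # hypothetical data
skewed = [1, 1, 1, 1, 2, 2, 3, 5, 9, 20]         # hypothetical data
print(looks_normal(symmetric), looks_normal(skewed))
```

The Shapiro-Wilk test is not reimplemented here; if SciPy is available, `scipy.stats.shapiro` returns the test statistic and its p-value.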
The following three transformations are the most
commonly used in biological research:
a) Logarithmic Transformation
b) Square root Transformation
c) Arc sine or Angular Transformation
Logarithmic transformation for whole number counts
with wide range
• This transformation is suitable for data where the
variance is proportional to the square of the mean, where
the coefficient of variation (S.D./Mean) is constant, or
where effects are multiplicative.
• These conditions are generally found in data that are
whole numbers and cover a wide range of values.
• Examples: number of insects per plot, number of egg
masses per plant or per unit area.
Natural logarithm of a positive number = LN(cell reference
of the value to be transformed).
The natural logarithm is based on the constant e
(2.71828182845904).
Logarithm of a positive number to base 10 = LOG10(cell
reference of the value to be transformed), or
LOG(number, 10).
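The effect of the logarithmic transformation can be illustrated with a short Python sketch. The counts below are hypothetical and were chosen so that the second group is ten times the first, i.e. the standard deviation grows with the mean while the coefficient of variation stays constant; `log10_transform` is an illustrative helper, not part of any package.

```python
import math
import statistics

def log10_transform(values):
    """Base-10 logarithm of each value; all values must be positive."""
    if any(v <= 0 for v in values):
        raise ValueError("logarithms need strictly positive values")
    return [math.log10(v) for v in values]

low_counts = [8, 10, 12]        # hypothetical counts, small mean
high_counts = [80, 100, 120]    # same pattern, ten times larger

# Raw data: the standard deviations differ by a factor of ten.
print(statistics.stdev(low_counts), statistics.stdev(high_counts))

# Log-transformed data: the standard deviations are (nearly) identical.
print(statistics.stdev(log10_transform(low_counts)),
      statistics.stdev(log10_transform(high_counts)))

# Natural and base-10 logarithms differ only by a constant factor.
print(math.log(100) / math.log(10), math.log10(100))
```

Because a zero count has no logarithm, the helper rejects non-positive values rather than guessing how they should be handled.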
Square root transformation
• The transformation is appropriate for data sets where
the variance is proportional to the mean.
• Such data consist of small whole numbers, for example
counts of rare events such as the number of infested
plants in a plot, the number of insects caught in traps,
or the number of weeds per plot.
• These data sets generally follow the Poisson distribution,
and the square root transformation brings a Poisson
distribution closer to a normal distribution.
• Square root transformation = SQRT(cell reference of the
value to be transformed)
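Why the square root suits Poisson counts can be checked numerically. The sketch below is an illustration under stated assumptions: it computes the variance of the raw counts and of their square roots directly from the Poisson probabilities, for a few arbitrarily chosen means; the helper names are hypothetical.

```python
import math

def poisson_pmf(lam, kmax):
    """Poisson probabilities P(X = k) for k = 0..kmax, built iteratively."""
    probs = [math.exp(-lam)]
    for k in range(1, kmax + 1):
        probs.append(probs[-1] * lam / k)
    return probs

def variance_raw_and_sqrt(lam):
    """Return (variance of X, variance of sqrt(X)) for X ~ Poisson(lam)."""
    kmax = int(lam + 12 * math.sqrt(lam) + 30)   # far enough into the tail
    probs = poisson_pmf(lam, kmax)

    def variance(f):
        m1 = sum(p * f(k) for k, p in enumerate(probs))
        m2 = sum(p * f(k) ** 2 for k, p in enumerate(probs))
        return m2 - m1 ** 2

    return variance(float), variance(math.sqrt)

for lam in (2, 8, 32):   # arbitrary example means
    raw, root = variance_raw_and_sqrt(lam)
    print(f"mean {lam:>2}: variance of counts = {raw:.2f}, "
          f"variance of sqrt(counts) = {root:.3f}")
```

The variance of the raw counts equals the mean, so it changes from group to group, whereas the variance of the square roots approaches 1/4 as the mean grows and therefore depends much less on the mean.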
Arcsine Transformation for Proportions or Percentages
• The transformation is appropriate for proportion data,
i.e. data obtained from counts and expressed as decimal
fractions or percentages.
• Proportions follow a binomial distribution, and this
transformation brings the distribution closer to normal.
• The angular transformation takes the arcsine of the square
root of the proportion (divide percentages by 100 first):
Arcsine transformation = ASIN(SQRT(cell reference))*(180)*(7/22),
e.g. ASIN(SQRT(0.05))*(180)*(7/22). The factor 180*(7/22)
approximates 180/π, so the result is in degrees.
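As a minimal Python sketch of the angular transformation, assuming the standard form arcsin(√p) with p a proportion between 0 and 1, and using `math.degrees` instead of the 7/22 approximation of 1/π. The function names are hypothetical.

```python
import math

def arcsine_degrees(p):
    """Angular transformation of a proportion p (0 to 1), in degrees."""
    if not 0.0 <= p <= 1.0:
        raise ValueError("proportion must lie between 0 and 1")
    return math.degrees(math.asin(math.sqrt(p)))

def arcsine_from_percent(pct):
    """Same transformation for a percentage between 0 and 100."""
    return arcsine_degrees(pct / 100.0)

for p in (0.05, 0.25, 0.50, 0.95):   # example proportions
    print(f"p = {p:.2f} -> {arcsine_degrees(p):.2f} degrees")

# The spreadsheet factor 180*(7/22) is close to, but not exactly, 180/pi.
print(180 * 7 / 22, 180 / math.pi)
```

A proportion of 0.5 maps to 45 degrees, and the scale is stretched near 0 and 1, where binomial proportions have the smallest variance.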
