SlideShare a Scribd company logo
1 of 20
Data Analysis

 Anindita C. Rao
Data Analysis
D                                         Data Analysis    Results
A                                                          interpretation
T
A



C
O
L   Getting data ready
L   for analysis          Feel for data     Goodness of    Hypothesis
E   •Editing              1.Mean            Data           Testing
    •Handling Blank       2.S.D.            •Reliability   •Appropriate
C
    Reponses              3.Correlation     •Validity      statistical tools
T                         4.Frequency
    •Coding
I   •Categorizing         distribution
O   •Creating data file
N   •Programming
Data Preparation
•   Getting data ready for analysis(SPSS)
•   Editing
•   Handling Blank Reponses
•   Coding
•   Categorizing
•   Creating data file
•   Programming
Coding & Categorization
• Usually a number to each response
• Coding of questionnaires
• If male=1 and if female=2
• Negatively worded questions
Some compromises with ethics helps you in
  practical life Strongly disagree1-2-3-4-5-6-7Strongly agree

• EXCEL we designate specific columns to
  specific questions and responses
Descriptive statistics
• Frequency(tabulation)
• Measure of central tendency(mean,
  median & mode)
• Measure of Dispersion(range, variance,
  standard deviation & interquartile range)
Inferential Statistics
• Correlation( Direction, Strength &
  Significance)     -1 to +1
• Parametric: Pearson’s correlation
• Non Parametric: Spearman’s Correlation
  & Kendall’s rank correlation(ordinal)
Data Analysis
•   Simple tabulation & Cross tabulation
•   ANOVA
•   Correlation & Regression
•   Discriminant Analysis
•   Factor Analysis
•   Conjoint Analysis
•   Multidimensional Scaling
•   Cluster Analysis
Tabulation                                                       Variables



              Male              Female                   Total



              25                35                       60



                                                                                        Frequency
                                         Family Background


Descriptive
                                       Frequen                 Valid     Cumulative
Statistics                               cy        Percent    Percent     Percent

                     Valid   Nuclear
                                              72       60.0       60.0         60.0

                             Joint
                                              48       40.0       40.0        100.0

                             Total
                                           120        100.0      100.0
Cross Tabulation
     Two variable interaction
                          Variable
                                                                                                 1(nominal)
Variable
2(nominal)
                                    Qualification * Centrality Crosstabulation



    Count


                                                                  Centrality


                                     -2          0            1                2        3          4       Total

    Qualification   Doctorate
                                          4            6             0             18       6          0           34

                    Post Graduate
                                          0            4             3             20       14         8           49

                    Graduate
                                          0            0             5             28       4          0           37

    Total
                                          4          10              8             66       24         8       120
Chi square
• To determine the systematic association
  between two variables
• Null hypothesis: no association
• Expected cell frequencies comparison with
  actual cell frequencies
• Greater the discrepancies, greater will be
  chi square statistic
Chi square test

• Two nominal variables
• Cross tabulation
• Non parametric
• SPSS code
1. Analyze from SPSS bar
2.Analyze> Nonparametric test> Chi-square
Exercise
•    In this case study , we are observing association between educational
     background(independent variable) of the PGDBM students and their performance in the
     terms of grade(dependent variable) secured. We want to test at 90% and 95% confidence
     level, what is the level of significance of association.(refer to file in SPSS)
Educational Background Code
B.Com              1
B.E.               2
B.Sc.              3
BBA               4
B.A.               5

Grades as follows
Grade Obtained Grade Code
A                 1
B                 2
C                 3
t test
• Significant mean difference between two
  groups
• Nominal variable on interval or ratio scale
• (smokers & non smokers on extent of well
  being)
• Sample size less than 30
• Df = N-1
• Mann Whitney U test
ANOVA
• Significant mean difference among
  multiple groups



• Multiple regression
Variance caused by independent variable on
  dependent variables simultaneously
Hypothesis Testing
•   Errors
•   Normal population
•   Degrees of freedom
•   One tailed or two tailed
•   Single Population
•   . p value
Univariate Techniques
             Metric Data                                   Nonmetric data
             (interval or ratio)                           (nominal or ordinal scale)

                                                          One sample
                                                                             Two or More Sample
                                                          •Frequency
One sample                         Two or More
                                                          •Chi square
                                   samples
•T test
                                                          •K-S
•Z test
                                                          •Runs
                       Independent           Related
                                                          •Binomial
                       •Two group t          •Paired t
                       test                  test
                                                         Independent          Related
                       •Z test
                                                         •Chi square          •Sign
                       •One way
                       anova                             •Mann whitney        •Wilcoxon

                                                         •Median              •Mc Nemar

                                                         •K-S                 •Chi Square

                                                         •K-W Anova
Multivariate Techniques
                                            Independence
        Dependence
                                            Techniques
        Techniques

                                                           Interobject
One dependent     More than one   Variable
                                                           Similarity
variable          dependent       Interdependence
                  variable                                 •Cluster
•Cross tabs                       •Factor Analysis
                                                           Analysis
                  •Cannonical
•Anova &          correlation                              •MDS
Covariance
                  •Multiple
•Multiple         discriminant
regression
•Discriminant
•Conjoint
Criterion(dependent)
                     One                                      Two or More

                  Nomin    Ordinal        Interval      No     Ordinal      Interval
                  al                                    min
                                                        al
One    Nominal    Chi      •Sign test     Analysis of                       Multiple
                  Squar    •Mann          variance                          discriminant
                  e        whinney                                          analysis
                           •Krushal
                           Wallis Anova

       Ordinal             •Spearman
                           rank
                           Correlation
                           •Kendall’s
                           rank
                           correlation

Two    Interval   Anova                   Regression    Ano                 Multiple
or                                        Analysis      va                  Regression
More                                                                        Analysis
       Nominal             Friedman       Anova                             Anova
                           two way        Factorial
                           analysis       Design
       Ordinal

More Related Content

Similar to Data Analysis Techniques and Methods

T11 types of tests
T11 types of testsT11 types of tests
T11 types of testskompellark
 
T11 types of tests
T11 types of testsT11 types of tests
T11 types of testskompellark
 
Fixed Designs for Psychological Research
Fixed Designs for Psychological ResearchFixed Designs for Psychological Research
Fixed Designs for Psychological ResearchGrant Heller
 
statistical analysis for data analtics for students
statistical analysis for data analtics for studentsstatistical analysis for data analtics for students
statistical analysis for data analtics for studentsviju001
 
Inferential statistics quantitative data - single sample and 2 groups
Inferential statistics   quantitative data - single sample and 2 groupsInferential statistics   quantitative data - single sample and 2 groups
Inferential statistics quantitative data - single sample and 2 groupsDhritiman Chakrabarti
 
Is peer review any good? A quantitative analysis of peer review
Is peer review any good? A quantitative analysis of peer reviewIs peer review any good? A quantitative analysis of peer review
Is peer review any good? A quantitative analysis of peer reviewAliaksandr Birukou
 
AI 바이오 (2_3일차).pdf
AI 바이오 (2_3일차).pdfAI 바이오 (2_3일차).pdf
AI 바이오 (2_3일차).pdfH K Yoon
 
Biostatistics basics-biostatistics4734
Biostatistics basics-biostatistics4734Biostatistics basics-biostatistics4734
Biostatistics basics-biostatistics4734AbhishekDas15
 
2008 JSM - Meta Study Data vs Patient Data
2008 JSM - Meta Study Data vs Patient Data2008 JSM - Meta Study Data vs Patient Data
2008 JSM - Meta Study Data vs Patient DataTerry Liao
 

Similar to Data Analysis Techniques and Methods (20)

Hypothesis testing
Hypothesis testingHypothesis testing
Hypothesis testing
 
Stats7.0
Stats7.0Stats7.0
Stats7.0
 
T11 types of tests
T11 types of testsT11 types of tests
T11 types of tests
 
T11 types of tests
T11 types of testsT11 types of tests
T11 types of tests
 
Week 11 collecting_data
Week 11 collecting_dataWeek 11 collecting_data
Week 11 collecting_data
 
Statistic and orthodontic by almuzian
Statistic and orthodontic by almuzianStatistic and orthodontic by almuzian
Statistic and orthodontic by almuzian
 
Stats - Intro to Quantitative
Stats -  Intro to Quantitative Stats -  Intro to Quantitative
Stats - Intro to Quantitative
 
Fixed Designs for Psychological Research
Fixed Designs for Psychological ResearchFixed Designs for Psychological Research
Fixed Designs for Psychological Research
 
statistical analysis for data analtics for students
statistical analysis for data analtics for studentsstatistical analysis for data analtics for students
statistical analysis for data analtics for students
 
Inferential statistics quantitative data - single sample and 2 groups
Inferential statistics   quantitative data - single sample and 2 groupsInferential statistics   quantitative data - single sample and 2 groups
Inferential statistics quantitative data - single sample and 2 groups
 
0 introduction
0  introduction0  introduction
0 introduction
 
Is peer review any good? A quantitative analysis of peer review
Is peer review any good? A quantitative analysis of peer reviewIs peer review any good? A quantitative analysis of peer review
Is peer review any good? A quantitative analysis of peer review
 
Statistics
StatisticsStatistics
Statistics
 
Statistics
StatisticsStatistics
Statistics
 
AI 바이오 (2_3일차).pdf
AI 바이오 (2_3일차).pdfAI 바이오 (2_3일차).pdf
AI 바이오 (2_3일차).pdf
 
UNIT 5.pptx
UNIT 5.pptxUNIT 5.pptx
UNIT 5.pptx
 
Biostatistics basics-biostatistics4734
Biostatistics basics-biostatistics4734Biostatistics basics-biostatistics4734
Biostatistics basics-biostatistics4734
 
Statistics in orthodontics
Statistics in orthodonticsStatistics in orthodontics
Statistics in orthodontics
 
Statistics in orthodontics
Statistics in orthodonticsStatistics in orthodontics
Statistics in orthodontics
 
2008 JSM - Meta Study Data vs Patient Data
2008 JSM - Meta Study Data vs Patient Data2008 JSM - Meta Study Data vs Patient Data
2008 JSM - Meta Study Data vs Patient Data
 

More from H9460730008

Variables, theoretical framework and hypotheses
Variables, theoretical framework and hypothesesVariables, theoretical framework and hypotheses
Variables, theoretical framework and hypothesesH9460730008
 
Problem formulation
Problem formulationProblem formulation
Problem formulationH9460730008
 
Multidimensional scaling
Multidimensional scalingMultidimensional scaling
Multidimensional scalingH9460730008
 
Measurement of variable& scaling
Measurement of variable& scalingMeasurement of variable& scaling
Measurement of variable& scalingH9460730008
 
Measurement of variable& scaling (2)
Measurement of variable& scaling (2)Measurement of variable& scaling (2)
Measurement of variable& scaling (2)H9460730008
 
Experimental design
Experimental designExperimental design
Experimental designH9460730008
 
Exercise problem formulation
Exercise problem formulationExercise problem formulation
Exercise problem formulationH9460730008
 
Pki enabling applications and mca implementation in tcs
Pki enabling applications and mca implementation in tcsPki enabling applications and mca implementation in tcs
Pki enabling applications and mca implementation in tcsH9460730008
 
Mca circulars rationale and implications
Mca circulars rationale and implicationsMca circulars rationale and implications
Mca circulars rationale and implicationsH9460730008
 
Lesson from movie lakshya
Lesson from movie lakshyaLesson from movie lakshya
Lesson from movie lakshyaH9460730008
 
Lesson from movie lagaan
Lesson from movie lagaanLesson from movie lagaan
Lesson from movie lagaanH9460730008
 

More from H9460730008 (20)

Variables, theoretical framework and hypotheses
Variables, theoretical framework and hypothesesVariables, theoretical framework and hypotheses
Variables, theoretical framework and hypotheses
 
Sampling
SamplingSampling
Sampling
 
Research design
Research designResearch design
Research design
 
Problem formulation
Problem formulationProblem formulation
Problem formulation
 
Multidimensional scaling
Multidimensional scalingMultidimensional scaling
Multidimensional scaling
 
Measurement of variable& scaling
Measurement of variable& scalingMeasurement of variable& scaling
Measurement of variable& scaling
 
Measurement of variable& scaling (2)
Measurement of variable& scaling (2)Measurement of variable& scaling (2)
Measurement of variable& scaling (2)
 
Literature.ppt
Literature.pptLiterature.ppt
Literature.ppt
 
Literature
LiteratureLiterature
Literature
 
Experimental design
Experimental designExperimental design
Experimental design
 
Exercise problem formulation
Exercise problem formulationExercise problem formulation
Exercise problem formulation
 
Data collection
Data collectionData collection
Data collection
 
Cluster
ClusterCluster
Cluster
 
Budgeting
BudgetingBudgeting
Budgeting
 
Bayesian
BayesianBayesian
Bayesian
 
Research intro
Research introResearch intro
Research intro
 
Pki enabling applications and mca implementation in tcs
Pki enabling applications and mca implementation in tcsPki enabling applications and mca implementation in tcs
Pki enabling applications and mca implementation in tcs
 
Mca circulars rationale and implications
Mca circulars rationale and implicationsMca circulars rationale and implications
Mca circulars rationale and implications
 
Lesson from movie lakshya
Lesson from movie lakshyaLesson from movie lakshya
Lesson from movie lakshya
 
Lesson from movie lagaan
Lesson from movie lagaanLesson from movie lagaan
Lesson from movie lagaan
 

Data Analysis Techniques and Methods

  • 2. Data Analysis D Data Analysis Results A interpretation T A C O L Getting data ready L for analysis Feel for data Goodness of Hypothesis E •Editing 1.Mean Data Testing •Handling Blank 2.S.D. •Reliability •Appropriate C Reponses 3.Correlation •Validity statistical tools T 4.Frequency •Coding I •Categorizing distribution O •Creating data file N •Programming
  • 3. Data Preparation • Getting data ready for analysis(SPSS) • Editing • Handling Blank Reponses • Coding • Categorizing • Creating data file • Programming
  • 4. Coding & Categorization • Usually a number to each response • Coding of questionnaires • If male=1 and if female=2 • Negatively worded questions Some compromises with ethics helps you in practical life Strongly disagree1-2-3-4-5-6-7Strongly agree • EXCEL we designate specific columns to specific questions and responses
  • 5. Descriptive statistics • Frequency(tabulation) • Measure of central tendency(mean, median & mode) • Measure of Dispersion(range, variance, standard deviation & interquartile range)
  • 6. Inferential Statistics • Correlation( Direction, Strength & Significance) -1 to +1 • Parametric: Pearson’s correlation • Non Parametric: Spearman’s Correlation & Kendall’s rank correlation(ordinal)
  • 7. Data Analysis • Simple tabulation & Cross tabulation • ANOVA • Correlation & Regression • Discriminant Analysis • Factor Analysis • Conjoint Analysis • Multidimensional Scaling • Cluster Analysis
  • 8. Tabulation Variables Male Female Total 25 35 60 Frequency Family Background Descriptive Frequen Valid Cumulative Statistics cy Percent Percent Percent Valid Nuclear 72 60.0 60.0 60.0 Joint 48 40.0 40.0 100.0 Total 120 100.0 100.0
  • 9. Cross Tabulation Two variable interaction Variable 1(nominal) Variable 2(nominal) Qualification * Centrality Crosstabulation Count Centrality -2 0 1 2 3 4 Total Qualification Doctorate 4 6 0 18 6 0 34 Post Graduate 0 4 3 20 14 8 49 Graduate 0 0 5 28 4 0 37 Total 4 10 8 66 24 8 120
  • 10. Chi square • To determine the systematic association between two variables • Null hypothesis: no association • Expected cell frequencies comparison with actual cell frequencies • Greater the discrepancies, greater will be chi square statistic
  • 11. Chi square test • Two nominal variables • Cross tabulation • Non parametric • SPSS code 1. Analyze from SPSS bar 2.Analyze> Nonparametric test> Chi-square
  • 12.
  • 13. Exercise • In this case study , we are observing association between educational background(independent variable) of the PGDBM students and their performance in the terms of grade(dependent variable) secured. We want to test at 90% and 95% confidence level, what is the level of significance of association.(refer to file in SPSS) Educational Background Code B.Com 1 B.E. 2 B.Sc. 3 BBA 4 B.A. 5 Grades as follows Grade Obtained Grade Code A 1 B 2 C 3
  • 14.
  • 15. t test • Significant mean difference between two groups • Nominal variable on interval or ratio scale • (smokers & non smokers on extent of well being) • Sample size less than 30 • Df = N-1 • Mann Whitney U test
  • 16. ANOVA • Significant mean difference among multiple groups • Multiple regression Variance caused by independent variable on dependent variables simultaneously
  • 17. Hypothesis Testing • Errors • Normal population • Degrees of freedom • One tailed or two tailed • Single Population • . p value
  • 18. Univariate Techniques Metric Data Nonmetric data (interval or ratio) (nominal or ordinal scale) One sample Two or More Sample •Frequency One sample Two or More •Chi square samples •T test •K-S •Z test •Runs Independent Related •Binomial •Two group t •Paired t test test Independent Related •Z test •Chi square •Sign •One way anova •Mann whitney •Wilcoxon •Median •Mc Nemar •K-S •Chi Square •K-W Anova
  • 19. Multivariate Techniques Independence Dependence Techniques Techniques Interobject One dependent More than one Variable Similarity variable dependent Interdependence variable •Cluster •Cross tabs •Factor Analysis Analysis •Cannonical •Anova & correlation •MDS Covariance •Multiple •Multiple discriminant regression •Discriminant •Conjoint
  • 20. Criterion(dependent) One Two or More Nomin Ordinal Interval No Ordinal Interval al min al One Nominal Chi •Sign test Analysis of Multiple Squar •Mann variance discriminant e whinney analysis •Krushal Wallis Anova Ordinal •Spearman rank Correlation •Kendall’s rank correlation Two Interval Anova Regression Ano Multiple or Analysis va Regression More Analysis Nominal Friedman Anova Anova two way Factorial analysis Design Ordinal