SlideShare a Scribd company logo
1 of 2
Download to read offline
04-09-2012




       Understanding Equivalent calculations                                                                                                                   Understanding Equivalent calculations
                                                  n (n + 1)
                    Test Statistic U = n1n2 + 1 1           − R1
                                                      2
                    = # of ( X i , Y j ) pairs where X i < Y j
                                                                                                                                                       The p-value and hence the conclusion would be the same
                                                          n1n2                         n1n2 (n1 + n2 + 1)                                              irrespective of what form of the test statistic is used, i.e.
                    Under H 0 , µU =                           ,          σU =
                                                           2                                  12                                                                                       R1 or R2                         ɶ
                                                                                                                                                                                                                or U or U
                                  U-µ U
                    and                      has N(0,1) distribution                                                                                   But need to be careful on (a) left or right tailed, depending on the
                                   σU
                                                                                n ( n + 1)
                                                                                                                                                       form (b) Mean to be subtracted (S.E would be the same)
         (n1 + n2 )( n1 + n2 + 1)                                   ɶ
                                                                   U = n1n2 + 2 2            − R2
R1 + R2 =                                                                             2
                      2
                       n ( n + n + 1)
                                                                   = # of ( X i , Y j ) pairs where X i > Y j
Under H 0 : E ( R1 ) = 1 1 2
                              2                                         ɶ
                                                                   U + U = n1n2
                                                                      ɶ nn    nn
                                                                   ⇒ U − 1 2 = 1 2 −U
                                                                         2     2




               Numerical illustration: small sample                                                                                                                   14-34 in Aczel -Sounderpandyan
                 Q. Do Model B planes fly faster? (modified ex14.4)
                 Travel time in two models of copter-planes :
                                                                                                                                                           Test if the (average) current ratio for the 3 industries are the same.
                 Model A: 35                    38         40 43                  n1 = 4
                 Model B: 27                    29          36                n2 = 3
                 R1 = 3 + 5 + 6 + 7 = 21                           R2 = 1 + 2 + 4 = 7
                         4×5                                                                                                                                                                                                             mean sd
                U = 12 +     − 21 = 1 Easier to note this directly from pairs!
                           2
                 p-value=P(U ≤ 1) = 0.0571 (Table 9 in P798)
                                                                                                                                                       A              1.38 1.55                 1.9       2     1.22 2.11 1.98 1.61      1.719 0.324
         Note that the distribution of U is symmetric (about…?) under the null hypothesis
                                                                                           Let the rank of X obs. be r1 ,… rn1 . R1 = r1 + … + rn1
                                                                                                                                        n1 ( n1 − 1)
                                                                                                                                                       B              2.33         2.5          2.79 3.01 1.99 2.45                      2.512 0.356
    Can you see why U = No. of (X i , Y j ) pairs with X i < Y j                           U = (n1 + n2 − r1 ) + … (n1 + n2 − rn1 ) −
               n1 (n1 + 1)                                                                                                                    2
    = n1n2 +               − R1                                                                                      n1 ( n1 − 1)
                    2                                                                      = n1 ( n1 + n2 ) − R1 −
                                                                                                                           2                           C              1.06 1.37 1.09 1.65 1.44 1.11                                      1.287 0.238
                                                                                                      n1 (n1 + 1)
                                                                                           = n1n2 +               − R1
                                                                                                           2




                                        Kruskal-Wallis Test                                                                                                       Solving 14-34 using Kruskal-Wallis
                                                                                                                                                           Industry    current ratio     rank

         • For comparing means of more than 2                                                                                                                A
                                                                                                                                                             A
                                                                                                                                                                         1.38
                                                                                                                                                                         1.55
                                                                                                                                                                                          6
                                                                                                                                                                                          8
           populations – alternative to ANOVA                                                                                                                A            1.9            11
                                                                                                                                                             A             2             14                        ranksum sample size R^2/n
         • Use if data is ordinal or the assumptions of                                                                                                      A           1.22             4           A                79       8        780.125
           ANOVA are violated                                                                                                                                A
                                                                                                                                                             A
                                                                                                                                                                         2.11
                                                                                                                                                                         1.98
                                                                                                                                                                                         15
                                                                                                                                                                                         12           B               103         6        1768.167
         • Pull all observations and rank them                                                                                                               A           1.61             9
                                                                                                                                                             B           2.33            16           C               28          6        130.6667
         • Compute total of the ranks of observations                                                                                                        B            2.5            18
                                                                                                                                                             B           2.79            19           total           210        20       2678.958
           coming from 1st, 2nd ,3rd… populations                                                                                                            B           3.01            20
                                                                                                                                                             B           1.99            13
         • Null distribution is Chi-square with k-1 d.f                                                                                                      B           2.45            17                   12 × 2678.96
                                                                                                                                                             C           1.06             1           T.S. is                − 3 × 21 = 13.54
                                                  12        R                 2                                                                              C           1.37             5                      20 × 21
                                  T.S. is
                                                n(n + 1 )
                                                          ∑ n − 3(n + 1 )    i                                                                               C
                                                                                                                                                             C
                                                                                                                                                                         1.09
                                                                                                                                                                         1.65
                                                                                                                                                                                          2
                                                                                                                                                                                         10           p − value = P(χ 2 df > 13.54) = 0.001
                                                                                                                                                                                                                       2
                                                             i
                                                                                                                                                             C           1.44             7
                                                                                                                                                             C           1.11             3




                                                                                                                                                                                                                                                        1
04-09-2012




                                                                                                              Run test:
                                   Problem 3
                                                                                                       A test for randomness
                                                                                        • Which of the following sequences appear to be ‘random’?
                                                                                            – HTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHT
                                                                                            – HHHHHHHHHHHHHHTTTTTTTTTTTTTTTTTT
             A sequence of small glass sculptures was inspected for                         – HHHHHHHHHHHTTTTTTTTTTTHHHHHHHHH
             shipping damage. The sequence of acceptable and damaged                    •   NONE! How to determine objectively or statistically?
             pieces was as follows:                                                     •   Calculate the no. of runs
                                                                                        •   A run is a sequence of identical symbols/events
             D,A,A,A,D,D,D,D,D,A,A,D,D,A,A,A,A,D,A,A,D,D,D,D,D                          •   Too many (or few) runs indicate lack of randomness
                                                                                            – HTHT HTHT HTHT HTHT HTHT HTHT HTHT HT
                                                                                            – HHHHHHHHHHHHHHTTTTTTTTTTTTTTTTTT
             Test for the randomness of the damage to the shipment using                    – HHHHHHHHHHHTTTTTTTTTTTHHHHHHHHH
             the 0.05 significance level.




                          Run test (cont.)                                                           Solution to Problem 3
    • How to determine too many or too few?
    • Acceptable no. of runs depend on n1, n2                                           nA = 11, nD = 14,
                                                                                             2 × 11× 14
  If H 0 (the sequence is 'randomly' mixed) is true,
                                                       r-µr
                                                              has approximately         µr =            + 1 = 13.32,
                                                       σr                                     11 + 14
  N(0,1) distribution, provided either n1or n 2 moderately large ( ≥ 10).                      2 × 11×14(2 ×11× 14 − 25)
  Small sample distributions are avaliable (Table 8 page 796-797).
                                                                                        σr =                                 = 2.41
                                                                                                        25 2 × 24
                                                                                        Atα = 0.05, the C.R. is | Z | > 1.96.
                       2n1n2                2n1n2 (2n1n2 − n1 − n2 )
                                                                                                                                  9 - 13.32
            µr =              +1    σr =                                                The observed r = 9, and value of the T.S. is        = −1.79.
                      n1 + n2               (n1 + n2 )2 (n1 + n2 − 1)                                                                2.41
                                                                                        So at 5% level we conclude that damages occur randomly




Data summarization           Expected Value
 And presentation           in decision making
                              Decision trees Discrete: General, Binomial, Poisson


    Probability             Random variable
                           And its Distribution
                                              Continuous: General, Normal,Exponential
                                                        T, Chi-square, F

    Confidence interval/
     Testing hypothesis                                      Sampling
                                                       Sampling distribution of
                  π                                             X,   p, S 2
  1 or 2
                  µ
 sample
                  σ
                                            Goodness of Fit
            ANOVA                                                                 NP
  Test for indep/homogeneity




                                                                                                                                                            2

More Related Content

What's hot

(Mhs)2 calculator enrichment
(Mhs)2 calculator enrichment(Mhs)2 calculator enrichment
(Mhs)2 calculator enrichmentpwalkermathlead
 
Analysis of monitoring of connection between
Analysis of monitoring of connection betweenAnalysis of monitoring of connection between
Analysis of monitoring of connection betweenprjpublications
 
Algebras for programming languages
Algebras for programming languagesAlgebras for programming languages
Algebras for programming languagesYoshihiro Mizoguchi
 
11.application of matrix algebra to multivariate data using standardize scores
11.application of matrix algebra to multivariate data using standardize scores11.application of matrix algebra to multivariate data using standardize scores
11.application of matrix algebra to multivariate data using standardize scoresAlexander Decker
 
Application of matrix algebra to multivariate data using standardize scores
Application of matrix algebra to multivariate data using standardize scoresApplication of matrix algebra to multivariate data using standardize scores
Application of matrix algebra to multivariate data using standardize scoresAlexander Decker
 
Lesson 16: Inverse Trigonometric Functions
Lesson 16: Inverse Trigonometric FunctionsLesson 16: Inverse Trigonometric Functions
Lesson 16: Inverse Trigonometric FunctionsMatthew Leingang
 
Chapter 2 pertubation
Chapter 2 pertubationChapter 2 pertubation
Chapter 2 pertubationNBER
 
Theory of Relational Calculus and its Formalization
Theory of Relational Calculus and its FormalizationTheory of Relational Calculus and its Formalization
Theory of Relational Calculus and its FormalizationYoshihiro Mizoguchi
 
Bernheim calculusfinal
Bernheim calculusfinalBernheim calculusfinal
Bernheim calculusfinalrahulrasa
 
001 basic concepts
001 basic concepts001 basic concepts
001 basic conceptsphysics101
 
Lecture on solving1
Lecture on solving1Lecture on solving1
Lecture on solving1NBER
 
Lesson22 -optimization_problems_slides
Lesson22  -optimization_problems_slidesLesson22  -optimization_problems_slides
Lesson22 -optimization_problems_slidesMatthew Leingang
 
A Coq Library for the Theory of Relational Calculus
A Coq Library for the Theory of Relational CalculusA Coq Library for the Theory of Relational Calculus
A Coq Library for the Theory of Relational CalculusYoshihiro Mizoguchi
 

What's hot (19)

presentazione
presentazionepresentazione
presentazione
 
Session 6
Session 6Session 6
Session 6
 
(Mhs)2 calculator enrichment
(Mhs)2 calculator enrichment(Mhs)2 calculator enrichment
(Mhs)2 calculator enrichment
 
Analysis of monitoring of connection between
Analysis of monitoring of connection betweenAnalysis of monitoring of connection between
Analysis of monitoring of connection between
 
Algebras for programming languages
Algebras for programming languagesAlgebras for programming languages
Algebras for programming languages
 
19 6
19 619 6
19 6
 
Lesson 1: Functions
Lesson 1: FunctionsLesson 1: Functions
Lesson 1: Functions
 
11.application of matrix algebra to multivariate data using standardize scores
11.application of matrix algebra to multivariate data using standardize scores11.application of matrix algebra to multivariate data using standardize scores
11.application of matrix algebra to multivariate data using standardize scores
 
Application of matrix algebra to multivariate data using standardize scores
Application of matrix algebra to multivariate data using standardize scoresApplication of matrix algebra to multivariate data using standardize scores
Application of matrix algebra to multivariate data using standardize scores
 
Lesson 16: Inverse Trigonometric Functions
Lesson 16: Inverse Trigonometric FunctionsLesson 16: Inverse Trigonometric Functions
Lesson 16: Inverse Trigonometric Functions
 
Analytic function
Analytic functionAnalytic function
Analytic function
 
Chapter 2 pertubation
Chapter 2 pertubationChapter 2 pertubation
Chapter 2 pertubation
 
Theory of Relational Calculus and its Formalization
Theory of Relational Calculus and its FormalizationTheory of Relational Calculus and its Formalization
Theory of Relational Calculus and its Formalization
 
Bernheim calculusfinal
Bernheim calculusfinalBernheim calculusfinal
Bernheim calculusfinal
 
001 basic concepts
001 basic concepts001 basic concepts
001 basic concepts
 
Em10 fl
Em10 flEm10 fl
Em10 fl
 
Lecture on solving1
Lecture on solving1Lecture on solving1
Lecture on solving1
 
Lesson22 -optimization_problems_slides
Lesson22  -optimization_problems_slidesLesson22  -optimization_problems_slides
Lesson22 -optimization_problems_slides
 
A Coq Library for the Theory of Relational Calculus
A Coq Library for the Theory of Relational CalculusA Coq Library for the Theory of Relational Calculus
A Coq Library for the Theory of Relational Calculus
 

Similar to Session 20

Mean variance
Mean varianceMean variance
Mean varianceodkoo
 
2 senarai rumus add maths k2 trial spm sbp 2010
2 senarai rumus add maths k2 trial spm sbp 20102 senarai rumus add maths k2 trial spm sbp 2010
2 senarai rumus add maths k2 trial spm sbp 2010zabidah awang
 
2 senarai rumus add maths k1 trial spm sbp 2010
2 senarai rumus add maths k1 trial spm sbp 20102 senarai rumus add maths k1 trial spm sbp 2010
2 senarai rumus add maths k1 trial spm sbp 2010zabidah awang
 
Mth3101 Advanced Calculus Chapter 2
Mth3101 Advanced Calculus Chapter 2Mth3101 Advanced Calculus Chapter 2
Mth3101 Advanced Calculus Chapter 2saya efan
 
Adiabatic Theorem for Discrete Time Evolution
Adiabatic Theorem for Discrete Time EvolutionAdiabatic Theorem for Discrete Time Evolution
Adiabatic Theorem for Discrete Time Evolutiontanaka-atushi
 
CVPR2010: Advanced ITinCVPR in a Nutshell: part 5: Shape, Matching and Diverg...
CVPR2010: Advanced ITinCVPR in a Nutshell: part 5: Shape, Matching and Diverg...CVPR2010: Advanced ITinCVPR in a Nutshell: part 5: Shape, Matching and Diverg...
CVPR2010: Advanced ITinCVPR in a Nutshell: part 5: Shape, Matching and Diverg...zukun
 
Engr 371 midterm february 1997
Engr 371 midterm february 1997Engr 371 midterm february 1997
Engr 371 midterm february 1997amnesiann
 
Analisis Korespondensi
Analisis KorespondensiAnalisis Korespondensi
Analisis Korespondensidessybudiyanti
 
2 senarai rumus add maths k1 trial spm sbp 2010
2 senarai rumus add maths k1 trial spm sbp 20102 senarai rumus add maths k1 trial spm sbp 2010
2 senarai rumus add maths k1 trial spm sbp 2010zabidah awang
 
2 senarai rumus add maths k2 trial spm sbp 2010
2 senarai rumus add maths k2 trial spm sbp 20102 senarai rumus add maths k2 trial spm sbp 2010
2 senarai rumus add maths k2 trial spm sbp 2010zabidah awang
 
Statistical Tools for the Quality Control Laboratory and Validation Studies
Statistical Tools for the Quality Control Laboratory and Validation StudiesStatistical Tools for the Quality Control Laboratory and Validation Studies
Statistical Tools for the Quality Control Laboratory and Validation StudiesInstitute of Validation Technology
 
Some Dynamical Behaviours of a Two Dimensional Nonlinear Map
Some Dynamical Behaviours of a Two Dimensional Nonlinear MapSome Dynamical Behaviours of a Two Dimensional Nonlinear Map
Some Dynamical Behaviours of a Two Dimensional Nonlinear MapIJMER
 
Lesson 24: Area and Distances (worksheet solutions)
Lesson 24: Area and Distances (worksheet solutions)Lesson 24: Area and Distances (worksheet solutions)
Lesson 24: Area and Distances (worksheet solutions)Matthew Leingang
 

Similar to Session 20 (20)

Mean variance
Mean varianceMean variance
Mean variance
 
2 senarai rumus add maths k2 trial spm sbp 2010
2 senarai rumus add maths k2 trial spm sbp 20102 senarai rumus add maths k2 trial spm sbp 2010
2 senarai rumus add maths k2 trial spm sbp 2010
 
2 senarai rumus add maths k1 trial spm sbp 2010
2 senarai rumus add maths k1 trial spm sbp 20102 senarai rumus add maths k1 trial spm sbp 2010
2 senarai rumus add maths k1 trial spm sbp 2010
 
Math report
Math reportMath report
Math report
 
Number theory
Number theoryNumber theory
Number theory
 
Mth3101 Advanced Calculus Chapter 2
Mth3101 Advanced Calculus Chapter 2Mth3101 Advanced Calculus Chapter 2
Mth3101 Advanced Calculus Chapter 2
 
Legendre
LegendreLegendre
Legendre
 
Semi-Magic Squares From Snake-Shaped Matrices
Semi-Magic Squares From Snake-Shaped MatricesSemi-Magic Squares From Snake-Shaped Matrices
Semi-Magic Squares From Snake-Shaped Matrices
 
Adiabatic Theorem for Discrete Time Evolution
Adiabatic Theorem for Discrete Time EvolutionAdiabatic Theorem for Discrete Time Evolution
Adiabatic Theorem for Discrete Time Evolution
 
CVPR2010: Advanced ITinCVPR in a Nutshell: part 5: Shape, Matching and Diverg...
CVPR2010: Advanced ITinCVPR in a Nutshell: part 5: Shape, Matching and Diverg...CVPR2010: Advanced ITinCVPR in a Nutshell: part 5: Shape, Matching and Diverg...
CVPR2010: Advanced ITinCVPR in a Nutshell: part 5: Shape, Matching and Diverg...
 
Engr 371 midterm february 1997
Engr 371 midterm february 1997Engr 371 midterm february 1997
Engr 371 midterm february 1997
 
Analisis Korespondensi
Analisis KorespondensiAnalisis Korespondensi
Analisis Korespondensi
 
2 senarai rumus add maths k1 trial spm sbp 2010
2 senarai rumus add maths k1 trial spm sbp 20102 senarai rumus add maths k1 trial spm sbp 2010
2 senarai rumus add maths k1 trial spm sbp 2010
 
2 senarai rumus add maths k2 trial spm sbp 2010
2 senarai rumus add maths k2 trial spm sbp 20102 senarai rumus add maths k2 trial spm sbp 2010
2 senarai rumus add maths k2 trial spm sbp 2010
 
one + 1
one + 1one + 1
one + 1
 
Statistical controls for qc
Statistical controls for qcStatistical controls for qc
Statistical controls for qc
 
Statistical Tools for the Quality Control Laboratory and Validation Studies
Statistical Tools for the Quality Control Laboratory and Validation StudiesStatistical Tools for the Quality Control Laboratory and Validation Studies
Statistical Tools for the Quality Control Laboratory and Validation Studies
 
Some Dynamical Behaviours of a Two Dimensional Nonlinear Map
Some Dynamical Behaviours of a Two Dimensional Nonlinear MapSome Dynamical Behaviours of a Two Dimensional Nonlinear Map
Some Dynamical Behaviours of a Two Dimensional Nonlinear Map
 
Lesson 24: Area and Distances (worksheet solutions)
Lesson 24: Area and Distances (worksheet solutions)Lesson 24: Area and Distances (worksheet solutions)
Lesson 24: Area and Distances (worksheet solutions)
 
Add maths 2
Add maths 2Add maths 2
Add maths 2
 

More from vivek_shaw

Lecture 11 market structure- perfect competition
Lecture 11  market structure- perfect competitionLecture 11  market structure- perfect competition
Lecture 11 market structure- perfect competitionvivek_shaw
 
Lecture 9 costs
Lecture 9  costsLecture 9  costs
Lecture 9 costsvivek_shaw
 
Lecture 8 production, optimal inputs
Lecture 8  production, optimal inputsLecture 8  production, optimal inputs
Lecture 8 production, optimal inputsvivek_shaw
 
Lecture 8 production, optimal inputs (1)
Lecture 8  production, optimal inputs (1)Lecture 8  production, optimal inputs (1)
Lecture 8 production, optimal inputs (1)vivek_shaw
 
Lecture 3 dds sand elasticity
Lecture 3  dds sand elasticityLecture 3  dds sand elasticity
Lecture 3 dds sand elasticityvivek_shaw
 
Consumertheory1
Consumertheory1Consumertheory1
Consumertheory1vivek_shaw
 
Consumer theory 2
Consumer theory 2Consumer theory 2
Consumer theory 2vivek_shaw
 
Class2 market, demand and supply
Class2  market, demand and supplyClass2  market, demand and supply
Class2 market, demand and supplyvivek_shaw
 
Policy implications
Policy implicationsPolicy implications
Policy implicationsvivek_shaw
 
Man org session 3-org and technology_5th july 2012
Man org session 3-org and technology_5th july 2012Man org session 3-org and technology_5th july 2012
Man org session 3-org and technology_5th july 2012vivek_shaw
 
Man org session 14_org decision making_16th august 2012
Man org session 14_org decision making_16th august 2012Man org session 14_org decision making_16th august 2012
Man org session 14_org decision making_16th august 2012vivek_shaw
 
Man org session 12_org learning_3rd august 2012
Man org session 12_org learning_3rd august 2012Man org session 12_org learning_3rd august 2012
Man org session 12_org learning_3rd august 2012vivek_shaw
 
Man org session 11 interorganizational relationships_2nd august 2012
Man org session 11 interorganizational relationships_2nd august 2012Man org session 11 interorganizational relationships_2nd august 2012
Man org session 11 interorganizational relationships_2nd august 2012vivek_shaw
 
Man org session 10_org control_27th july 2012
Man org session 10_org control_27th july 2012Man org session 10_org control_27th july 2012
Man org session 10_org control_27th july 2012vivek_shaw
 

More from vivek_shaw (20)

Lecture 11 market structure- perfect competition
Lecture 11  market structure- perfect competitionLecture 11  market structure- perfect competition
Lecture 11 market structure- perfect competition
 
Lecture 9 costs
Lecture 9  costsLecture 9  costs
Lecture 9 costs
 
Lecture 8 production, optimal inputs
Lecture 8  production, optimal inputsLecture 8  production, optimal inputs
Lecture 8 production, optimal inputs
 
Lecture 8 production, optimal inputs (1)
Lecture 8  production, optimal inputs (1)Lecture 8  production, optimal inputs (1)
Lecture 8 production, optimal inputs (1)
 
Lecture 3 dds sand elasticity
Lecture 3  dds sand elasticityLecture 3  dds sand elasticity
Lecture 3 dds sand elasticity
 
Game theory 3
Game theory 3Game theory 3
Game theory 3
 
Game theory 1
Game theory 1Game theory 1
Game theory 1
 
Game theory 2
Game theory 2Game theory 2
Game theory 2
 
Ford motors
Ford motorsFord motors
Ford motors
 
Consumertheory1
Consumertheory1Consumertheory1
Consumertheory1
 
Consumer theory 2
Consumer theory 2Consumer theory 2
Consumer theory 2
 
Class2 market, demand and supply
Class2  market, demand and supplyClass2  market, demand and supply
Class2 market, demand and supply
 
Auctions 1
Auctions 1Auctions 1
Auctions 1
 
Costs2
Costs2Costs2
Costs2
 
Policy implications
Policy implicationsPolicy implications
Policy implications
 
Man org session 3-org and technology_5th july 2012
Man org session 3-org and technology_5th july 2012Man org session 3-org and technology_5th july 2012
Man org session 3-org and technology_5th july 2012
 
Man org session 14_org decision making_16th august 2012
Man org session 14_org decision making_16th august 2012Man org session 14_org decision making_16th august 2012
Man org session 14_org decision making_16th august 2012
 
Man org session 12_org learning_3rd august 2012
Man org session 12_org learning_3rd august 2012Man org session 12_org learning_3rd august 2012
Man org session 12_org learning_3rd august 2012
 
Man org session 11 interorganizational relationships_2nd august 2012
Man org session 11 interorganizational relationships_2nd august 2012Man org session 11 interorganizational relationships_2nd august 2012
Man org session 11 interorganizational relationships_2nd august 2012
 
Man org session 10_org control_27th july 2012
Man org session 10_org control_27th july 2012Man org session 10_org control_27th july 2012
Man org session 10_org control_27th july 2012
 

Session 20

  • 1. 04-09-2012 Understanding Equivalent calculations Understanding Equivalent calculations n (n + 1) Test Statistic U = n1n2 + 1 1 − R1 2 = # of ( X i , Y j ) pairs where X i < Y j The p-value and hence the conclusion would be the same n1n2 n1n2 (n1 + n2 + 1) irrespective of what form of the test statistic is used, i.e. Under H 0 , µU = , σU = 2 12 R1 or R2 ɶ or U or U U-µ U and has N(0,1) distribution But need to be careful on (a) left or right tailed, depending on the σU n ( n + 1) form (b) Mean to be subtracted (S.E would be the same) (n1 + n2 )( n1 + n2 + 1) ɶ U = n1n2 + 2 2 − R2 R1 + R2 = 2 2 n ( n + n + 1) = # of ( X i , Y j ) pairs where X i > Y j Under H 0 : E ( R1 ) = 1 1 2 2 ɶ U + U = n1n2 ɶ nn nn ⇒ U − 1 2 = 1 2 −U 2 2 Numerical illustration: small sample 14-34 in Aczel -Sounderpandyan Q. Do Model B planes fly faster? (modified ex14.4) Travel time in two models of copter-planes : Test if the (average) current ratio for the 3 industries are the same. Model A: 35 38 40 43 n1 = 4 Model B: 27 29 36 n2 = 3 R1 = 3 + 5 + 6 + 7 = 21 R2 = 1 + 2 + 4 = 7 4×5 mean sd U = 12 + − 21 = 1 Easier to note this directly from pairs! 2 p-value=P(U ≤ 1) = 0.0571 (Table 9 in P798) A 1.38 1.55 1.9 2 1.22 2.11 1.98 1.61 1.719 0.324 Note that the distribution of U is symmetric (about…?) under the null hypothesis Let the rank of X obs. be r1 ,… rn1 . R1 = r1 + … + rn1 n1 ( n1 − 1) B 2.33 2.5 2.79 3.01 1.99 2.45 2.512 0.356 Can you see why U = No. of (X i , Y j ) pairs with X i < Y j U = (n1 + n2 − r1 ) + … (n1 + n2 − rn1 ) − n1 (n1 + 1) 2 = n1n2 + − R1 n1 ( n1 − 1) 2 = n1 ( n1 + n2 ) − R1 − 2 C 1.06 1.37 1.09 1.65 1.44 1.11 1.287 0.238 n1 (n1 + 1) = n1n2 + − R1 2 Kruskal-Wallis Test Solving 14-34 using Kruskal-Wallis Industry current ratio rank • For comparing means of more than 2 A A 1.38 1.55 6 8 populations – alternative to ANOVA A 1.9 11 A 2 14 ranksum sample size R^2/n • Use if data is ordinal or the assumptions of A 1.22 4 A 79 8 780.125 ANOVA are violated A A 2.11 1.98 15 12 B 103 6 1768.167 • Pull all observations and rank them A 1.61 9 B 2.33 16 C 28 6 130.6667 • Compute total of the ranks of observations B 2.5 18 B 2.79 19 total 210 20 2678.958 coming from 1st, 2nd ,3rd… populations B 3.01 20 B 1.99 13 • Null distribution is Chi-square with k-1 d.f B 2.45 17 12 × 2678.96 C 1.06 1 T.S. is − 3 × 21 = 13.54 12 R 2 C 1.37 5 20 × 21 T.S. is n(n + 1 ) ∑ n − 3(n + 1 ) i C C 1.09 1.65 2 10 p − value = P(χ 2 df > 13.54) = 0.001 2 i C 1.44 7 C 1.11 3 1
  • 2. 04-09-2012 Run test: Problem 3 A test for randomness • Which of the following sequences appear to be ‘random’? – HTHTHTHTHTHTHTHTHTHTHTHTHTHTHTHT – HHHHHHHHHHHHHHTTTTTTTTTTTTTTTTTT A sequence of small glass sculptures was inspected for – HHHHHHHHHHHTTTTTTTTTTTHHHHHHHHH shipping damage. The sequence of acceptable and damaged • NONE! How to determine objectively or statistically? pieces was as follows: • Calculate the no. of runs • A run is a sequence of identical symbols/events D,A,A,A,D,D,D,D,D,A,A,D,D,A,A,A,A,D,A,A,D,D,D,D,D • Too many (or few) runs indicate lack of randomness – HTHT HTHT HTHT HTHT HTHT HTHT HTHT HT – HHHHHHHHHHHHHHTTTTTTTTTTTTTTTTTT Test for the randomness of the damage to the shipment using – HHHHHHHHHHHTTTTTTTTTTTHHHHHHHHH the 0.05 significance level. Run test (cont.) Solution to Problem 3 • How to determine too many or too few? • Acceptable no. of runs depend on n1, n2 nA = 11, nD = 14, 2 × 11× 14 If H 0 (the sequence is 'randomly' mixed) is true, r-µr has approximately µr = + 1 = 13.32, σr 11 + 14 N(0,1) distribution, provided either n1or n 2 moderately large ( ≥ 10). 2 × 11×14(2 ×11× 14 − 25) Small sample distributions are avaliable (Table 8 page 796-797). σr = = 2.41 25 2 × 24 Atα = 0.05, the C.R. is | Z | > 1.96. 2n1n2 2n1n2 (2n1n2 − n1 − n2 ) 9 - 13.32 µr = +1 σr = The observed r = 9, and value of the T.S. is = −1.79. n1 + n2 (n1 + n2 )2 (n1 + n2 − 1) 2.41 So at 5% level we conclude that damages occur randomly Data summarization Expected Value And presentation in decision making Decision trees Discrete: General, Binomial, Poisson Probability Random variable And its Distribution Continuous: General, Normal,Exponential T, Chi-square, F Confidence interval/ Testing hypothesis Sampling Sampling distribution of π X, p, S 2 1 or 2 µ sample σ Goodness of Fit ANOVA NP Test for indep/homogeneity 2