Stage 2 Call the number of values in the smaller sample n1 and the number of samples in the larger sample n2.
Stage 3 Place all the values together in rank order. (from smallest to largest)
Stage 4 Inspect each B in turn and count the number of A’s which precede it. Add up the total to get a U value.
Stage 4 repeat stage 4, but this time inspect each A in turn and count the number of B’s which precede it. Add up the total to get a second U value.
Taking the smaller of the two U values look up the probability value associated with it in the appropriate table. Multiply this by 100 to get the percentage probability figure that the difference between your two sample sets could have occurred by chance.
Stage 5 You now have two U values. Select the smallest of the two.
Using your values of n1 and n2, look up the critical value of U in table J.
If the critical value U is more than your calculated value of U then there is a 5% or less probability that the difference between the two samples sets could have occurred by chance.
If the critical value of U is less than your calculated value of U, then there is a more than 5% probability that the difference between the two sample sets could have occurred by chance alone.
Look up your critical value using your Z value. Multiply this by 100 to find the percentage probability that the difference between the two sample sets could be due to chance.
The Chi Squared test can only be used on data which has the following characteristics.
The data must be in the form of frequencies counted in each of a number of categories. Data on the interval scale (Data which have a precise numerical meaning - height above sea level, population of a town etc) can be grouped into categories to enable this test.
Total observations need to be more than 20.
The expected frequency should not be less than 5.
The observations should not be such that one influences the other.
Stage 4- Calculate the degrees of freedom. This is simply one less than the total number of categories.
Df= n – 1.
Where Df = degrees of freedom.
N= Number of categories in the test.
Stage 5- Using the worksheet, the calculated value of Xsquared and the degrees of freedom, read off the probability that the data frequencies you are testing could have occurred by chance.
You now wish to know if these results are a true reflection of the nature of each rock type or whether they could be simply the result of chance.
The Chi Squared test can be used because…
The data is in the form of counts.
The total number of streams observed exceeds 20.
The expected frequency in any one fraction exceeds 4. This is the number of streams you would expect if rock type had no influence on stream densities. In this case it is the total number of streams (100) divided by the number of rock types (4).
The observations are independent (the number of streams on one rock type does not influence the number of streams on another).
The hypothesis we are trying to prove is that there is a significant difference between the sample data sets i.e rock type does influence stream density.
21.
The Data. 13 4.0 43.6 1.0 25 20 Sandstone 25 15 Limestone 25 58 Granite 25 7 Chalk (O-E)squared E Expected frequency (e) Observed Frequency O Rock type
From the graph we read off the degrees of freedom (3) on the horizontal axis against the xsqaured value (61.6) on the vertical.
The resulting point is above the line marked 0.1 chance in 100. This means that the probability that the data given above could be due to chance alone is less than 1 in 100.
Be the first to comment