2. STUDY DESIGN
Problems: For gym goers in given areas, is age related to amount of time spent at gym
Approach: A frame of all gym goers would be hard if not impossible to obtain. Therefore,
we will use Systematic Sampling.
Solution: First each student will find the 3 closest gyms near their house (students live in
different areas so a region of gyms will be selected) then randomly pick two of the gyms.
We will look at the time the gym is open and divide it into 2 slots “am/pm”. We will the
go each gym and find out on average how many people attend in a week divide that by 7
to figure out N. n= 50. We will divide N/n and get our k (each students may be different).
For gym 1 (randomly decided) the student will survey when they open. For gym 2,
student will go when they are half way through the day; each time collecting data.
6. STATISTICS FOR FIRST VARIABLE
(AGE)
mean – 37.218
standard deviation – 3.86
five-number summary – 16, 25, 33, 48.5, 71
range – 55
mode – 33
outliers none (less then 10.25 and more then 83.75)
7. HISOGRAM FOR FIRST VARIBLE
(AGE)
The data is skewed
to the right showing
a decrease in gym
goers as age
increases
9. STATISTICS FOR SECOND VARIABLE
(TIME SPENT AT GYM IN MINUTES)
mean – 66.1835
standard deviation – 5.19
five-number summary – 15, 45, 60, 90, 150
range - 135
mode – 60
outliers 15, 15, 20, 20 (less then 22.5 and larger then 157.5)
10. HISOGRAM FOR SECOND VARIABLE
(TIME SPENT AT GYM IN MINUTES)
The data is not
skewed in either
direction nor
symmetrical.
However, the
bin that
represents 47-62
min is double
any other bin.
12. SCATTERPLOT WITH LINE OF
REGRESSION AND LINEAR
COEFFICIENT
160
140
120
100
80
60
40
20
0
0 10 20 30 40 50 60 70 80
13. ANALAYSIS
In analyzing a relation between age and time spent at the gym, we
found no correlation.
Overall, data for gym goers was found to have a correlation
coefficient of (r=.0258683725), much less than the critical value
(.195) at .05 with 100 degrees of freedom (the largest data
available on the table).
Age does not affect the amount of time spent in the gym.
We are 95% confident that a relationship exists because r*p < .05 or
.195*.0001<.05 and because .0000195 is less than .05 we know the
relationship exists.
14. THINGS WE FOUND
INTERESTING
The data returned was in increments of 15 minutes; since we usually give time
in increments of quarter hours.
The age diversity of the gym goers.
Senior gym goers work out just as long or longer then the middle-aged gym
goer.
The most common answer we received was one hour.
There were a lot more people that were not willing to talk to us then expected;
and many did not want to give us their age as well.
We would be interested to see what would happen if we could actually record
beginning and ending times for workouts and see if the workout times would
actually be an hour, or if the time would include travel to the gym, working
out, showering, and other activities in preparation and finishing off of the
workout as well; along with if people over estimate how long they will
workout.
15. PARTICIPATION
STUDY DESIGN (2) SARA LARSEN
DATA SLIDES (3-5) MATT BASSETT
FIRST VARIABLE STATS (6) SARA LARSEN
CHARTS FOR FIRST VARIABLE (7-8) JAIME ELSEY
SECOND VARIABLE STATS (9) SARA LARSEN
CHARTS FOR SECOND VARIABLE (10-11) SARA LARSEN
SCATTERPLOT (12) WILLIAM TEAGUE
ANALYSIS (13) WILLIAM AND MATT
INTERESTING THINGS (14) SARA, MATTHEW, & WILLIAM