10  Repeated Sample t-test

In the last chapter, we assumed the two groups were independent. That is, each bag of chips was independent of the other. However, sometimes we are interested in change over time or measuring the a similar variables within individuals. While many of the steps we took in the last chapter remain the same (if you skipped it, please go back!), there are some important differences.

10.1 Creatine and Muscles

You have a theory that exercise converts adenosine triphosphate (ATP) into adenosine diphosphate (ADP). When we ingest creatine phosphate, we convert ADP back into ATP and can exercise for longer. Imagine you are interested in the impact of a new type of creatine on how much weight someone can lift. You hypothesise that individuals will lift a different amount after ingesting creatine regularly for one month. Formally, we might consider the hypothesis:

\(H0: \mu_{d} = 0\) where d represent the difference in scores within a pair \(H1: \mu_{d} \neq 0\)

You decide to recruit 10 individuals from the local gym, measure how much they can bench press. Then, you give them 5g each day for one month and then re-measure their bench press strength. This pre-post design results in the following data:

kable(creatine) %>% 
  kable_material()
ID Bench1 Bench2
1 137 147
2 180 177
3 169 187
4 155 174
5 172 186
6 170 167
7 154 175
8 144 155
9 172 199
10 160 155

The t-statistic is: \(t = \frac{\Delta{\overline{x}_i}}{se_{diff}}\)

Our next step is to compute the difference scores. Note that higher difference scores reflect lifing more at T2, after ingesting the creatine:

kbl(creatine) %>% 
  kable_material()
ID Bench1 Bench2 Difference
1 137 147 10
2 180 177 -3
3 169 187 18
4 155 174 19
5 172 186 14
6 170 167 -3
7 154 175 21
8 144 155 11
9 172 199 27
10 160 155 -5

Our difference scores are simply one score subtract the other:

\(diff = \Delta{x_i} = x_{i1}-x_{i2}\)

The mean of our differences scores above is \(\overline{x}_{diff} =\) 10.9.

We can calculate the standard error of our differences scores in a similar way as the last chapter:

\(se_d = \frac{\sum(d_i-\overline{x}_{diff})^2\frac{1}{N}}{\sqrt{n}}\)

Calculating our squared difference scores, we get:

creatine %>% 
  select(ID, Difference, square_diff) %>% 
  kable() %>% 
  kable_styling(full_width = F)
ID Difference square_diff
1 10 0.81
2 -3 193.21
3 18 50.41
4 19 65.61
5 14 9.61
6 -3 193.21
7 21 102.01
8 11 0.01
9 27 259.21
10 -5 252.81

and the resulting sum of the squared differences is 1126.9. The sd of the difference scores is 11.1897771. Therefore:

\(se_{diff} = \frac{\sqrt{1126.9\frac{1}{10-1}}}{\sqrt{10}}=3.5385\)

and:

\(t = \frac{10.9}{3.5385}= 3.08\)

We can check our results with a formal analysis in R. We would need to specify the argument paired = TRUE.

t.test(data=creatine_long, Weight~Time, paired=T)

    Paired t-test

data:  Weight by Time
t = -3.0804, df = 9, p-value = 0.01313
alternative hypothesis: true mean difference is not equal to 0
95 percent confidence interval:
 -18.904684  -2.895316
sample estimates:
mean difference 
          -10.9 

Thus, we would conclude that our data regarding changes in bench press weight lifted is unlikely given a true null hypothesis, t = 3.08, p = 013.

10.2 Practice Problem

Practice Problem: You are a researchers for Clearly Contact Lenses. You are asked to determine if their contacts improve the confidence of their users. You recruit 8 glasses wearers and measure their confidence. Then, clearly provide the glasses wearers contact lenses. After one month of wearing contacts, you re-assess the individuals’ confidence rating. The following data are obtained:

ID Glasses Contacts
1 52 52
2 46 52
3 59 60
4 68 64
5 60 59
6 61 63
7 47 42
8 60 61

Paired sample t-tests have more statistical power when compared to independent samples t-test, given the same number of observations.

10.3 Conclusion

Test Used for Hypothesis Formula
independent t-test Testing if two group means are the same \(H0: \mu_1 = \mu_2\) \(t = \frac{\bar{x_1} - \bar{x_2}} {\sqrt{\frac{s^2_p}{n_1}+\frac{s^2_p}{n_2}}}\)
paired t-test Testing changes in mean score \(H0: \mu_{diff}=0\) \(t = \frac{\Delta{x_i}}{se_{diff}}\)