Home
Up

 

Section 3: Two dependent means - matched pairs

First a definition:

 Two samples are independent if the sample selected from one population is not related to the sample selected from the other population. If one sample is related to the other, the samples are dependent. When we get samples from the same subject or one value from each of two samples sharing the same characteristic they are called paired or matched samples

 In many cases these are "before and after" situations. Look at this situation

 Example 1: The "new grapefruit diet" has been advertised to be a remarkable way to lose weight. Ten males subjects are put on the diet and here are the results after one month

Subject

1

2

3

4

5

6

7

8

9

10

Before

168

215

225

192

207

224

246

198

201

195

After

156

200

223

190

203

201

211

176

194

203

 

Before we begin the hypothesis test on this example, we need to define some terms/concepts

 1. Two dependent samples must be selected from two populations in a way that is random

 2. Each of the two populations must be normally distributed

 Notation:

  = mean value of the differences d for the population of paired data

   = mean value of the differences d for the paired sample data

  = standard deviation of the differences for the paired sample data

 n = number of pairs of data

 To do a hypothesis test on paired data, we find the differences between the two sets of data (usually after - before) and then test it against 0. Here is the example given above:

 

Subject

1

2

3

4

5

6

7

8

9

10

Before

168

215

225

192

207

224

246

198

201

195

After

156

200

223

190

203

201

211

176

194

203

Difference

(after-before)

-12

-15

-2

-2

-4

-23

-35

-22

-7

8

 If the diet "works" we would expect the average difference to be significantly less than 0.  Since we are doing after minus before we would want the after weight to be smaller in this situation. So we make this our claim:

 

 Lets use the 0.05 significance level

 Now we need a test statistic - here it is:

  , where n-1 = degrees of freedom - note this is a t test often called the paired t test - use this if you have 30 or fewer pairs, use the normal distribution if you have greater than 30 pairs

 Now lets solve the problem like we would any hypothesis test. First calculate the mean and standard deviation of the differences, in our case

 

 For 9 degrees of freedom the critical value is t = -1.833. Here is the picture

 So we reject the null hypothesis. The sample evidence supports the claim that the diet works

 We can also do a confidence interval - Use the formula:

 

The confidence interval of the mean difference is as follows:

 

 

Where  and degrees of freedom = n-1

 In our case , so the confidence interval is

Notice the confidence interval does not contain 0, so we can say we are 95% confident that the actual mean difference is between –20.4615 lbs and –2.3385 lbs. So it is unlikely that the mean difference is 0 (diet has no affect) or greater than zero.

 Back to top