# Stratification and Interaction

##### Which Summary Measure to Use?

- Weighted averages are usually best
- Mantel-Haenszel is easy to compute and can handle zeros
- MLE measures are difficult and typically require a computer

#### Weighted Average in MH Summaries  


Consider the following table:

<table border="1" id="bkmrk-sample-1-sample-2-n-" style="border-collapse: collapse; width: 100%;"><tbody><tr><td style="width: 33.2934%;">  
</td><td style="width: 33.2934%;">Sample 1  
</td><td style="width: 33.2934%;">Sample 2  
</td></tr><tr><td style="width: 33.2934%;">n  
</td><td style="width: 33.2934%;">30  
</td><td style="width: 33.2934%;">70  
</td></tr><tr><td style="width: 33.2934%;">x\_bar  
</td><td style="width: 33.2934%;">5  
</td><td style="width: 33.2934%;">8</td></tr></tbody></table>

Weighted average of population -&gt; ((30\*5)+(70\*8))/(30+70) = 7.1

The average mean is closer to the cohort with a larger sample size. We can calculate any weighted average with the general form:

[![image-1663597273168.png](https://bookstack.mitchellhenschel.com/uploads/images/gallery/2022-09/scaled-1680-/image-1663597273168.png)](https://bookstack.mitchellhenschel.com/uploads/images/gallery/2022-09/image-1663597273168.png)

Where theta\_hat is an estimator, such as mean or OR.

The MH Odds Ratio and RR can be described as weighted averages:

[![image-1663597467897.png](https://bookstack.mitchellhenschel.com/uploads/images/gallery/2022-09/scaled-1680-/image-1663597467897.png)](https://bookstack.mitchellhenschel.com/uploads/images/gallery/2022-09/image-1663597467897.png)

Where the weights are (b\*c)/n

[![image-1663597533686.png](https://bookstack.mitchellhenschel.com/uploads/images/gallery/2022-09/scaled-1680-/image-1663597533686.png)](https://bookstack.mitchellhenschel.com/uploads/images/gallery/2022-09/image-1663597533686.png)

Where (a/n\_1) / (b/n\_0) is the risk ratio in each stratum, (b\*n\_1 / n) is the weight

#### Assumptions of Mantel-Haenszel Summary Measures

- Observations are independent from each other
- All observations are identically distributed
- **The common effect assumption should hold:**
    - Follow-up cohort study - The stratum-specific risk ratios are all equal across the strata
    - Case-control - The stratum specific odds ratios are all equal across the strata

MH measures are biased if the correctness of the common effect assumptions cannot be justified.

An extreme example: When interaction exists with **protective** and **detrimental** effects across strata; Protective effects negative in numerator in a stratum, and detrimental effects positive in numerator in another stratum.

### Precision-based Summary Estimators

Also called **Woolf's Method**. Precision-based summary estimators are also weighted averages. Weighing each stratum according to its sampling error **gives the most weight to the strata with the smallest variance**. Precision-based are designed to have the greatest precision (smallest standard error). For Ratios we often take the log scale for a more symmetrical distribution. The general approach:

[![image-1663598561474.png](https://bookstack.mitchellhenschel.com/uploads/images/gallery/2022-09/scaled-1680-/image-1663598561474.png)](https://bookstack.mitchellhenschel.com/uploads/images/gallery/2022-09/image-1663598561474.png)

This is the sum of the products of each stratum-specific ratio times its weight, all divided by the sum of weights.

#### Precision-based Summary Odds Ratio

[![image-1663598719717.png](https://bookstack.mitchellhenschel.com/uploads/images/gallery/2022-09/scaled-1680-/image-1663598719717.png)](https://bookstack.mitchellhenschel.com/uploads/images/gallery/2022-09/image-1663598719717.png)

Thus, Var(ln(OR\_hat) ~ 1/a + 1/b + 1/c + 1/d

##### Precision-based Summary Risk Ratio

[![image-1663598910847.png](https://bookstack.mitchellhenschel.com/uploads/images/gallery/2022-09/scaled-1680-/image-1663598910847.png)](https://bookstack.mitchellhenschel.com/uploads/images/gallery/2022-09/image-1663598910847.png)

Thus the Var(ln(RR\_hat)) = ((1-p\_hat1)/(n\_1\*p\_hat1) + (1 - p\_hat2)/(n\_2\*p\_hat2))

### Confidence Intervals of Summary Measures

There are 2 types of CI intervals: Test-based (from a test statistic) and Precision-based (uses standard error). Most of the time both will yield very similar intervals.

##### Test-Based CI

[![image-1663600038654.png](https://bookstack.mitchellhenschel.com/uploads/images/gallery/2022-09/scaled-1680-/image-1663600038654.png)](https://bookstack.mitchellhenschel.com/uploads/images/gallery/2022-09/image-1663600038654.png)

##### Precision-based CI  


[![image-1663598835526.png](https://bookstack.mitchellhenschel.com/uploads/images/gallery/2022-09/scaled-1680-/image-1663598835526.png)](https://bookstack.mitchellhenschel.com/uploads/images/gallery/2022-09/image-1663598835526.png)

[![image-1663598945870.png](https://bookstack.mitchellhenschel.com/uploads/images/gallery/2022-09/scaled-1680-/image-1663598945870.png)](https://bookstack.mitchellhenschel.com/uploads/images/gallery/2022-09/image-1663598945870.png)

Where the standard error is the square root of the variance above.

#### Comparision

Precision-based summary ratios are straightforward, and best when the number of strata is small, and sample size within each strata is large. **Cannot** be calculated when any cell in any stratum is 0 as log(0) is undefined, though one could correct .5 at risk of bias.

MH Method can handle 0 cells. The assumption is that all counts are large enough, if there are small counts in some strata the CI will not be valid.

### Hypothesis Testing of Interaction

Tests for interaction (effect modification):

 H0: OR1 = OR2 = ... = ORg / H0: RR1 = RR2 = ... RRg

Tests of Association from Stratified 2x2 Tables:

 H0: No association and the summary (adjusted) measure = 1

#### Breslow-Day Test

This is default test for interaction in SAS.

Steps: Calculate summary OR, use summary OR to get expected number of exposed cases per strata, if no interaction compare with actual number of exposed cases for each strata

 H0: OR1 = ... ORg (g strata)

 H1: at least two measures are different

Conclusion: We have \[in\]sufficent evidence to \[reject/accept\] the null hypothesis that all the associations between X and Y adjusted by strata are equivalent.

[![image-1663601308161.png](https://bookstack.mitchellhenschel.com/uploads/images/gallery/2022-09/scaled-1680-/image-1663601308161.png)](https://bookstack.mitchellhenschel.com/uploads/images/gallery/2022-09/image-1663601308161.png)

Where a = observed value in g<sup>th</sup> stratum and a<sup>|</sup> = fitted or expected value of under H0 in g<sup>th</sup> stratum

[![image-1663601551874.png](https://bookstack.mitchellhenschel.com/uploads/images/gallery/2022-09/scaled-1680-/image-1663601551874.png)](https://bookstack.mitchellhenschel.com/uploads/images/gallery/2022-09/image-1663601551874.png)

a<sup>|</sup> should be comparable with table margins (determines whether to add or subtract the radical)

Variance under H0 in the g<sup>th</sup> stratum:

[![image-1663601420989.png](https://bookstack.mitchellhenschel.com/uploads/images/gallery/2022-09/scaled-1680-/image-1663601420989.png)](https://bookstack.mitchellhenschel.com/uploads/images/gallery/2022-09/image-1663601420989.png)

Assume a common OR (mOR) and create adjusted:

[![image-1663601505331.png](https://bookstack.mitchellhenschel.com/uploads/images/gallery/2022-09/scaled-1680-/image-1663601505331.png)](https://bookstack.mitchellhenschel.com/uploads/images/gallery/2022-09/image-1663601505331.png)

#### Woolf Test

- Can be used for RR or OR
- Calculate summary OR, compare strata-specific ORs to summary OR
- .5 is added to each cell as a small-sample adjustment (optional)

[![image-1663603113544.png](https://bookstack.mitchellhenschel.com/uploads/images/gallery/2022-09/scaled-1680-/image-1663603113544.png)](https://bookstack.mitchellhenschel.com/uploads/images/gallery/2022-09/image-1663603113544.png)

Most often, Breslow-Day and Woolf's test produce similar test statistics. Woolf's method has a theoretical derivation of the weights based on large counts in each cell. If there are small counts in a strata, the CI is invalid.