# Stratification and Interaction ##### Which Summary Measure to Use? - Weighted averages are usually best - Mantel-Haenszel is easy to compute and can handle zeros - MLE measures are difficult and typically require a computer #### Weighted Average in MH Summaries Consider the following table:
Sample 1 Sample 2
n 30 70
x\_bar 5 8
Weighted average of population -> ((30\*5)+(70\*8))/(30+70) = 7.1 The average mean is closer to the cohort with a larger sample size. We can calculate any weighted average with the general form: [![image-1663597273168.png](https://bookstack.mitchellhenschel.com/uploads/images/gallery/2022-09/scaled-1680-/image-1663597273168.png)](https://bookstack.mitchellhenschel.com/uploads/images/gallery/2022-09/image-1663597273168.png) Where theta\_hat is an estimator, such as mean or OR. The MH Odds Ratio and RR can be described as weighted averages: [![image-1663597467897.png](https://bookstack.mitchellhenschel.com/uploads/images/gallery/2022-09/scaled-1680-/image-1663597467897.png)](https://bookstack.mitchellhenschel.com/uploads/images/gallery/2022-09/image-1663597467897.png) Where the weights are (b\*c)/n [![image-1663597533686.png](https://bookstack.mitchellhenschel.com/uploads/images/gallery/2022-09/scaled-1680-/image-1663597533686.png)](https://bookstack.mitchellhenschel.com/uploads/images/gallery/2022-09/image-1663597533686.png) Where (a/n\_1) / (b/n\_0) is the risk ratio in each stratum, (b\*n\_1 / n) is the weight #### Assumptions of Mantel-Haenszel Summary Measures - Observations are independent from each other - All observations are identically distributed - **The common effect assumption should hold:** - Follow-up cohort study - The stratum-specific risk ratios are all equal across the strata - Case-control - The stratum specific odds ratios are all equal across the strata MH measures are biased if the correctness of the common effect assumptions cannot be justified. An extreme example: When interaction exists with **protective** and **detrimental** effects across strata; Protective effects negative in numerator in a stratum, and detrimental effects positive in numerator in another stratum. ### Precision-based Summary Estimators Also called **Woolf's Method**. Precision-based summary estimators are also weighted averages. Weighing each stratum according to its sampling error **gives the most weight to the strata with the smallest variance**. Precision-based are designed to have the greatest precision (smallest standard error). For Ratios we often take the log scale for a more symmetrical distribution. The general approach: [![image-1663598561474.png](https://bookstack.mitchellhenschel.com/uploads/images/gallery/2022-09/scaled-1680-/image-1663598561474.png)](https://bookstack.mitchellhenschel.com/uploads/images/gallery/2022-09/image-1663598561474.png) This is the sum of the products of each stratum-specific ratio times its weight, all divided by the sum of weights. #### Precision-based Summary Odds Ratio [![image-1663598719717.png](https://bookstack.mitchellhenschel.com/uploads/images/gallery/2022-09/scaled-1680-/image-1663598719717.png)](https://bookstack.mitchellhenschel.com/uploads/images/gallery/2022-09/image-1663598719717.png) Thus, Var(ln(OR\_hat) ~ 1/a + 1/b + 1/c + 1/d ##### Precision-based Summary Risk Ratio [![image-1663598910847.png](https://bookstack.mitchellhenschel.com/uploads/images/gallery/2022-09/scaled-1680-/image-1663598910847.png)](https://bookstack.mitchellhenschel.com/uploads/images/gallery/2022-09/image-1663598910847.png) Thus the Var(ln(RR\_hat)) = ((1-p\_hat1)/(n\_1\*p\_hat1) + (1 - p\_hat2)/(n\_2\*p\_hat2)) ### Confidence Intervals of Summary Measures There are 2 types of CI intervals: Test-based (from a test statistic) and Precision-based (uses standard error). Most of the time both will yield very similar intervals. ##### Test-Based CI [![image-1663600038654.png](https://bookstack.mitchellhenschel.com/uploads/images/gallery/2022-09/scaled-1680-/image-1663600038654.png)](https://bookstack.mitchellhenschel.com/uploads/images/gallery/2022-09/image-1663600038654.png) ##### Precision-based CI [![image-1663598835526.png](https://bookstack.mitchellhenschel.com/uploads/images/gallery/2022-09/scaled-1680-/image-1663598835526.png)](https://bookstack.mitchellhenschel.com/uploads/images/gallery/2022-09/image-1663598835526.png) [![image-1663598945870.png](https://bookstack.mitchellhenschel.com/uploads/images/gallery/2022-09/scaled-1680-/image-1663598945870.png)](https://bookstack.mitchellhenschel.com/uploads/images/gallery/2022-09/image-1663598945870.png) Where the standard error is the square root of the variance above. #### Comparision Precision-based summary ratios are straightforward, and best when the number of strata is small, and sample size within each strata is large. **Cannot** be calculated when any cell in any stratum is 0 as log(0) is undefined, though one could correct .5 at risk of bias. MH Method can handle 0 cells. The assumption is that all counts are large enough, if there are small counts in some strata the CI will not be valid. ### Hypothesis Testing of Interaction Tests for interaction (effect modification): H0: OR1 = OR2 = ... = ORg / H0: RR1 = RR2 = ... RRg Tests of Association from Stratified 2x2 Tables: H0: No association and the summary (adjusted) measure = 1 #### Breslow-Day Test This is default test for interaction in SAS. Steps: Calculate summary OR, use summary OR to get expected number of exposed cases per strata, if no interaction compare with actual number of exposed cases for each strata H0: OR1 = ... ORg (g strata) H1: at least two measures are different Conclusion: We have \[in\]sufficent evidence to \[reject/accept\] the null hypothesis that all the associations between X and Y adjusted by strata are equivalent. [![image-1663601308161.png](https://bookstack.mitchellhenschel.com/uploads/images/gallery/2022-09/scaled-1680-/image-1663601308161.png)](https://bookstack.mitchellhenschel.com/uploads/images/gallery/2022-09/image-1663601308161.png) Where a = observed value in gth stratum and a| = fitted or expected value of under H0 in gth stratum [![image-1663601551874.png](https://bookstack.mitchellhenschel.com/uploads/images/gallery/2022-09/scaled-1680-/image-1663601551874.png)](https://bookstack.mitchellhenschel.com/uploads/images/gallery/2022-09/image-1663601551874.png) a| should be comparable with table margins (determines whether to add or subtract the radical) Variance under H0 in the gth stratum: [![image-1663601420989.png](https://bookstack.mitchellhenschel.com/uploads/images/gallery/2022-09/scaled-1680-/image-1663601420989.png)](https://bookstack.mitchellhenschel.com/uploads/images/gallery/2022-09/image-1663601420989.png) Assume a common OR (mOR) and create adjusted: [![image-1663601505331.png](https://bookstack.mitchellhenschel.com/uploads/images/gallery/2022-09/scaled-1680-/image-1663601505331.png)](https://bookstack.mitchellhenschel.com/uploads/images/gallery/2022-09/image-1663601505331.png) #### Woolf Test - Can be used for RR or OR - Calculate summary OR, compare strata-specific ORs to summary OR - .5 is added to each cell as a small-sample adjustment (optional) [![image-1663603113544.png](https://bookstack.mitchellhenschel.com/uploads/images/gallery/2022-09/scaled-1680-/image-1663603113544.png)](https://bookstack.mitchellhenschel.com/uploads/images/gallery/2022-09/image-1663603113544.png) Most often, Breslow-Day and Woolf's test produce similar test statistics. Woolf's method has a theoretical derivation of the weights based on large counts in each cell. If there are small counts in a strata, the CI is invalid.