Recently Updated Pages
Classification
Classification is often used to describe modeling of a categorical outcome. In binary classific...
Intro to Cluster Analysis
Clustering refers to a very broad set of techniques for finding subgroups, or clusters, in a data...
Power and Sample Size Calculations for Association Studies
Review of errors and difference in means/proportions: Type II error is represented by beta and typ...
Survival Analysis II
In survival data the dependent variable is always survival time (or time until an event with the ...
Association Testing in Related Individuals
Family data are correlated, which could lead to inflation in test statistics if not accounted for. M...
Haplotypes and Imputation
When multiple markers/SNPs are genotyped in a gene or gene region, the SNPs may be in linkage dis...
Principal Component Analysis
The goal of supervised learning methods (regression and classification) is to predict outcome/res...
Surveillance Defined
Surveillance is the ongoing systematic collection, analysis and interpretation of outcome-specifi...
Analysis of 2x2 Tables
Review of Measures of Association Exposed Unexposed Disease a b m1 No Di...
Multiple Comparisons and Evaluating Significance
In 1978 Restriction Fragment Length Polymorphisms (RFLPs) were used for linkage analysis. In 1...
Tree Based Methods
Classification and regression trees can be generated from multivariable data sets using recursive...
Midterm Cheat Sheet
Linear Regression Predicting a CI new obs adds a 1 to se(y): 𝛽0 + 𝛽2x...
Logistic Regression in Matched Studies
In case-control studies, matching cases and controls on a potential confounder improves the effici...
Association Testing in Unrelated Individuals
In association testing we are interested in the effect of a specific allele in the population. We...
Logistic Regression
Stratified analysis can be used to adjust for confounding, but the results can be difficult to ad...
Variable Selection
Variable selection is intended to select the "best subset" of predictors. Variable selection shou...
Regression Diagnostics
Estimation and inference from the regression model depend on several assumptions. These assu...
Dummy Variables and Analysis of Covariance
So far we have mostly seen quantitative variables in regression models, but many variables of int...
Population Genetics
Genotype and allele frequency estimation is the first step in studying a polymorphism. Used for f...
Matching
The aim of matching is to remove confounding by matching subjects to be similar on a potential confo...