Advanced Search
Search Results
165 total results found
Genetics and Inheritance
Gregor Mendel was an Austrian Monk in the late 1800s who experimented with pea plants to find trait inheritance patterns in successive generations. Mendel's Laws Principle of Segregation - Two alleles of a homologous pair of chromosomes separate (segrega...
Mutiple Linear Regression and Estimation
Multiple Linear Regression analysis can be looked upon as an extension of simple linear regression analysis to the situation in which more than one independent variables must be considered. The general model with response Y and regressors X1, X2,... Xp: Su...
Stratification and Interaction
Which Summary Measure to Use? Weighted averages are usually best Mantel-Haenszel is easy to compute and can handle zeros MLE measures are difficult and typically require a computer Weighted Average in MH Summaries Consider the following table: ...
Linkage Analysis
The primary goal of linkage analysis is to determine the location (chromosome and region) of genes influencing a specific trait. We accomplish this by looking for evidence of co-inheritance of the trait with other genes or markers whose locations are known, an...
Model Fitting: Inference
Given several predictors and a response, we need to figure out whether all are needed. Consider a large model, Ξ©, and a smaller model, π, which consist of a subset of predictors in Ξ©. If there is not much difference in fit, we perfer the smaller model. ...
Standardization
The objective of standardization is the compare the rates of a disease (or outcome) between population which differ in underlying characteristics (age, sex, race, etc) that may affect the overall rate of disease; In essence it's another way to account for conf...
Assessing the Genetic Component of a Phenotype
A phenotype is the appearance of an individual, which results from the interaction of the person's genetic makeup and their environment. Phenotypes can be categorical or numerical. If we are interested in the genetic component of a traitΒ there are different m...
Dummy Variables and Analysis of Covariance
So far we have mostly seen quantitative variables in regression models, but many variables of interest are qualitative (sex, status, etc). To add such information to a model, we can set up a indicator/dummy variable. For example, we could set up a variable X...
Broken Stick Regression, Polynomial Regression, Splines
Transformations of the response and predictors can improve the fit and correct violations of model assumptions such as non-constant error variance. This chapter focuses on the transformation of predictors. The idea behind Broken Stick Regression/Segmented Reg...
Matching
The aim of matching is remove confounding by matching subjects to be similar on a potential confounder. Doing so eliminates (or reduces) confounding, as well as reducing variability thereby increasing power. Recall a paired t-test with two independent samples...
Population Genetics
Genotype and Allele Frequency Estimation is the first step in studying a polymorphism. Used for family data and independent individuals in a population. We can use a subset of individuals who are independent and count alleles, or use the maximum likelihood met...
Regression Diagnostics
The estimation and inference from the regression model depends on several assumptions. These assumptions need to be checked using regression diagnostics. We divide the potential problems into three categories: Error: π ~ N(0, π2I); i.e. the errors are: ...
Logistic Regression
Stratified analysis can be used to adjust for confounding, but the results can be difficult to adjust multiple confounders. If we have too many strata, we could end up with very small tables or 0 counts for some cells. We can instead use Logistic Regression wh...
Association Testing in Unrelated Individuals
In association testing we are interested in the effect of a specific allele in the population. We ask the question: "Is allele X1 more common in affected individuals than unaffected individuals?" We do not need family data to answer this, but we can use it if ...
Variable Selection
Variable selection is intended to select the "best subset" of predictors. Variable selection shouldn't be separated from the rest of the model; Outliers and influential points can change the model we select. Transformations of the variables can have an impact ...
Logistic Regression in Matched Studies
In case-control studies matching cases and controls on a potential confounder improves the efficiency of a study and removes/reduces bias. Logistic regression controlling for a stratification variable allows us to examine multiple risk factors and control for...
Multiple Comparisons and Evaluating Significance
In 1978 Restricted Fragment Linked Polymorphisms (RFLPSs) were used for linkage analysis. In 1987 the first human genetic map was created. In 1989 microstellite markers made genome-wide linkage studies possible. 1990-2003 the human genome project was sequ...
Midterm Cheat Sheet
Linear Regression Predicting a CI new obs adds a 1 to se(y): π½0 + π½2x +/- t* Multiple Linear Regression and Estimation π»0 : π½1 = π½2 = π½3 = β― = π½π = 0 Β v.s. Β π»1 : not all π½π = 0, π = 1, β¦ , π rejection ...
Tree Based Methods
Classification and regression trees can be generated from multivariable data sets using recursive partitioning with: Gini index or entropy for a categorical outcome Residual sum of squares for a continuous outcome We can use cross-validation to assess t...
Survival Analysis I
Survival analysis is a measure of time until an event occurs. It doesn't only measure death as an outcome, and can adjust for covariates just as a logistic regression. But while a logistic regression only requires knowledge of whether an outcome occurred, surv...