Skip to main content

Recently Updated Pages

Heterogeneous Graph Learning

Advanced Topics

Knowledge graphs are visualization of information with multi-type relations (edges) among some mu...

Updated 2 months ago by Elkip

Propensity Score Weighting Analysis

Advanced Topics

Unlike randomized clinical trials, observational studies must adjust for differences such as conf...

Updated 2 months ago by Elkip

Survival Analysis I

Statistical Methods in Epidemiology

Survival analysis is a measure of time until an event occurs. It doesn't only measure death as an...

Updated 6 months ago by Elkip

Advanced Machine Learning

Advanced Topics

Recall that in an ordinary multiple linear regression, we have a set of p predictor variables mea...

Updated 7 months ago by Elkip

Forecasting with Geospacial Data

Advanced Topics

Geo-statistics is a subfield of statistics focused on spatial or spatiotemporal datasets, AKA dat...

Updated 7 months ago by Elkip

Sampling

Advanced Topics

In the practical use of statistics, we don't have an infinite amount of data. An enormous amount ...

Updated 10 months ago by Elkip

Scala

Scala + Spark

I'll start with a disclaimer: These are notes written by an experienced Java dev, thus some le...

Updated 1 year ago by Elkip

DataFrames and Advanced Techniques

Scala + Spark

A Spark DataSet is an extension of the RDD object. It has rows, can run queries, and has a schema...

Updated 1 year ago by Elkip

Intro to Spark / RDDs

Scala + Spark

Apache Spark Spark is a fast and general engine for large-scale data processing. The user writes...

Updated 1 year ago by Elkip

File Structure and Linked Views

D3.js

After adding a lot of different event listeners, the JavaScript file can become messy. This secti...

Updated 1 year ago by Elkip

Data Driven Documents

D3.js

Introduction D3 is a JS library which can be used to create charts and visualiztions, but to call...

Updated 1 year ago by Elkip

Layouts and Structured Data

D3.js

Now that I've covered the basics of programming in D3, let's take a look at some of the other coo...

Updated 1 year ago by Elkip

Making Graphs

D3.js

Scales Scales are functions that map from an input domain to an output range Linear Scales Lin...

Updated 1 year ago by Elkip

Dynamic and Interactive Content

D3.js

Thus far we've looked at building static content, but the backbone of D3.js are it's beautiful tr...

Updated 1 year ago by Elkip

The Basics of Design

D3.js

Data visualizations should be easy to interpret and look credible. To do this there are several f...

Updated 2 years ago by Elkip

GLM for Correlated Data

Generalized Linear Models

So far the models we've covered assume independence between observations collected on separate in...

Updated 2 years ago by Elkip

Survival - Time to Failure

Generalized Linear Models

Analysis of survival data is more complex than than other methods we've seen so far; We can't jus...

Updated 2 years ago by Elkip

Time Series Models

Analysis of Correlated Data

While standard regression we must assume observations are independent from one another, but with ...

Updated 2 years ago by Elkip

Gamma Regression

Generalized Linear Models

Consider a continuous dependent variable that is positive-valued, such as a length of a hospital ...

Updated 2 years ago by Elkip

Correlated Data in Clincal Trials

Applied Statistics in Clinical Trials

Note: My BS857 Notebook on Correlated Data goes much further in depth than the below. So far we ...

Updated 2 years ago by Elkip