Recently Updated Pages

Heterogeneous Graph Learning

Advanced Topics

Knowledge graphs are visualizations of information with multi-type relations (edges) among some mu...

Updated 1 month ago by Elkip

Propensity Score Weighting Analysis

Advanced Topics

Unlike randomized clinical trials, observational studies must adjust for differences such as conf...

Updated 1 month ago by Elkip

Survival Analysis I

Statistical Methods in Epidemiology

Survival analysis measures the time until an event occurs. It doesn't only measure death as an...

Updated 4 months ago by Elkip

Advanced Machine Learning

Advanced Topics

Recall that in an ordinary multiple linear regression, we have a set of p predictor variables mea...

Updated 5 months ago by Elkip

Forecasting with Geospatial Data

Advanced Topics

Geo-statistics is a subfield of statistics focused on spatial or spatiotemporal datasets, AKA dat...

Updated 5 months ago by Elkip

Sampling

Advanced Topics

In the practical use of statistics, we don't have an infinite amount of data. An enormous amount ...

Updated 8 months ago by Elkip

Scala

Scala + Spark

I'll start with a disclaimer: These are notes written by an experienced Java dev, thus some le...

Updated 1 year ago by Elkip

DataFrames and Advanced Techniques

Scala + Spark

A Spark Dataset is an extension of the RDD object. It has rows, can run queries, and has a schema...

Updated 1 year ago by Elkip

Intro to Spark / RDDs

Scala + Spark

Apache Spark: Spark is a fast and general engine for large-scale data processing. The user writes...

Updated 1 year ago by Elkip

File Structure and Linked Views

D3.js

After adding a lot of different event listeners, the JavaScript file can become messy. This secti...

Updated 1 year ago by Elkip

Data Driven Documents

D3.js

Introduction: D3 is a JS library which can be used to create charts and visualizations, but to call...

Updated 1 year ago by Elkip

Layouts and Structured Data

D3.js

Now that I've covered the basics of programming in D3, let's take a look at some of the other coo...

Updated 1 year ago by Elkip

Making Graphs

D3.js

Scales: Scales are functions that map from an input domain to an output range. Linear Scales: Lin...

Updated 1 year ago by Elkip

Dynamic and Interactive Content

D3.js

Thus far we've looked at building static content, but the backbone of D3.js is its beautiful tr...

Updated 1 year ago by Elkip

The Basics of Design

D3.js

Data visualizations should be easy to interpret and look credible. To do this there are several f...

Updated 1 year ago by Elkip

GLM for Correlated Data

Generalized Linear Models

So far the models we've covered assume independence between observations collected on separate in...

Updated 2 years ago by Elkip

Survival - Time to Failure

Generalized Linear Models

Analysis of survival data is more complex than other methods we've seen so far; we can't jus...

Updated 2 years ago by Elkip

Time Series Models

Analysis of Correlated Data

In standard regression we must assume observations are independent of one another, but with ...

Updated 2 years ago by Elkip

Gamma Regression

Generalized Linear Models

Consider a continuous dependent variable that is positive-valued, such as the length of a hospital ...

Updated 2 years ago by Elkip

Correlated Data in Clinical Trials

Applied Statistics in Clinical Trials

Note: My BS857 Notebook on Correlated Data goes into much more depth than the notes below. So far we ...

Updated 2 years ago by Elkip