Eric Bunch's Blog

Cython Examples: Random Sampling and Latent Dirichlet Allocation

Python is great, but sometimes is too slow for my needs. In this post, we will walk through how to get up and running with Cython, and go through some examples including how to perform fast random sampling--even faster than numpy in some cases!--and will show how to implement the collapsed Gibbs sampler for Latent Dirichlet Allocation.

Read More →

Posted on 17 Aug 2019 by Eric Bunch

Spectral Clustering

In this post we will investigate spectral clustering, which uses the eigenvalue decomposition of a data set's Laplacian matrix. We will look into the eigengap heuristic, which give guidelines on how many clusters to choose, as well as an example using breast cancer proteome data.

Read More →

Posted on 01 Jun 2018 by Eric Bunch

Calculating Homology of a Simplicial Complex Using Smith Normal Form

In this post we will define homology and see how to compute it for a simplicial complex using Smith Normal Form.

Read More →

Posted on 24 May 2018 by Eric Bunch

Topological Data Analysis and Persistent Homology

In this post we explore persistent homology and how it is constructed.

Read More →

Posted on 26 Apr 2018 by Eric Bunch

The Simpsons' Best Episode Ever by the Data

Using Kaggle's Simpsons data set, we determine which episode is the definitive Best Episode Ever!

Read More →

Posted on 03 Aug 2017 by Eric Bunch

Forecasting new home sales in the U.S.

We use R to forecast the time series of monthly sales of new one-family houses sold in the USA from 1973 to 1996.

Read More →

Posted on 25 May 2017 by Eric Bunch

Luigi vs. Airflow

Comparison of data pipelining libraries Spotify's Luigi and Airbnb's Airflow.

Read More →

Posted on 21 May 2017 by Eric Bunch

Breakout Detection by Twitter

Describing Twitter's breakout detection package.

Read More →

Posted on 02 Mar 2016 by Eric Bunch

Transfer files and directories to an EC2 instance

I'll show how to tansfer a file or directory from your computer to an existing EC2 instance.

Read More →

Posted on 15 Nov 2015 by Eric Bunch

Setting up an Amazon Ubuntu EC2 instance and configuring with Python2.7.9 and SciPy stack

I go through how to how to set up an Amazon EC2 instance and setting up an environment for scientific computation.

Read More →

Posted on 15 Nov 2015 by Eric Bunch