Automating Auto-SARIMA for Large-Scale Time Series Forecasting

For a recent project as part of my bootcamp at The Flatiron School, I wanted to explore the power of SARIMA forecasting for predicting important climate metrics provided by the NOAA, such as average temperature and precipitation. Given the limited time that I had for this project, I was only able to really focus onContinue reading “Automating Auto-SARIMA for Large-Scale Time Series Forecasting”

What is “Statistically Significant” Anyways?

Anyone who has ever done work in the biological sciences, demography, sociology, psychology, and now, data science, knows what a p-value is. It’s the first thing people usually reference when showing that an observed result is “statistically significant” when performing Null Hypothesis Significance Testing (NHST). The problem, though, is that p-values are widely misunderstood areContinue reading “What is “Statistically Significant” Anyways?”

Getting Started with Scikit-learn’s Toy Datasets

Introduction The first library, seaborn, is what we’ll be using to visualize our data. It provides the capabilities to make a variety of beautiful looking plots in only a few lines of code. The next library, pandas, is what we’ll be using to view our data and get some basic information about the different featuresContinue reading “Getting Started with Scikit-learn’s Toy Datasets”

Design a site like this with WordPress.com
Get started