Data Structure Sketches
I’m very much a visual learner, and this little collection of graphical representations of classic data structures really helped to hammer home some of their core concepts. - John Forstmeier
Old Principles, New Approaches: Bayes in Practice
This article is a four-in-one deal! It compiles posts about modern applications of Bayes’ theorem in model training, A/B testing, and classification and ranking algorithms. Some classics are classics for a reason. - Katt Baum
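For readers who want the underlying rule in front of them, here is a minimal sketch of a single Bayesian update for a yes/no hypothesis. It is not taken from the linked posts; the function name `posterior` and the example numbers are made up for illustration.

```python
def posterior(prior: float, likelihood: float, false_positive_rate: float) -> float:
    """Return P(H | E) for a binary hypothesis H given evidence E.

    prior               -- P(H), belief before seeing the evidence
    likelihood          -- P(E | H)
    false_positive_rate -- P(E | not H)
    """
    # Total probability of the evidence: P(E) = P(E|H)P(H) + P(E|not H)P(not H)
    evidence = likelihood * prior + false_positive_rate * (1.0 - prior)
    # Bayes' theorem: P(H|E) = P(E|H)P(H) / P(E)
    return likelihood * prior / evidence


if __name__ == "__main__":
    # Hypothetical numbers: 1% base rate, 90% hit rate, 5% false alarms.
    p = posterior(prior=0.01, likelihood=0.90, false_positive_rate=0.05)
    print(f"{p:.3f}")  # prints 0.154
```

Even with a 90% hit rate, the low base rate keeps the updated belief at roughly 15%, which is the kind of intuition these posts build on.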
Complexity: the new analytics frontier
Anna dives into five super tough issues analytics engineers are starting to face now that the role is maturing and organizations have more data models than ever. I particularly liked her take on how “death by notifications” makes it harder for ops to be effective. - Blake Burch
Key-Value Databases, Explained
I think this is a good, high-level intro to NoSQL DBs. NoSQL databases and data models are becoming very popular in the wild, and if you’re a data engineer you are almost certainly going to come across them alongside standard data models. This article provides a gentle and simple explanation of how a NoSQL data model works, as well as the popular options out there. - Wes Poulsen
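To make the key-value idea concrete, here is a toy in-memory sketch. It is an illustration only, not how any particular database is implemented, and the class name `KeyValueStore` and its methods are hypothetical.

```python
from typing import Dict, Optional


class KeyValueStore:
    """Toy in-memory key-value store: every value is looked up by its key."""

    def __init__(self) -> None:
        self._data: Dict[str, bytes] = {}

    def put(self, key: str, value: bytes) -> None:
        # Insert a new key or overwrite the existing value.
        self._data[key] = value

    def get(self, key: str) -> Optional[bytes]:
        # Return the stored value, or None if the key is absent.
        return self._data.get(key)

    def delete(self, key: str) -> bool:
        # Remove the key; report whether it was present.
        return self._data.pop(key, None) is not None


if __name__ == "__main__":
    store = KeyValueStore()
    store.put("user:42", b'{"name": "Ada"}')
    print(store.get("user:42"))
    print(store.delete("user:42"))
    print(store.get("user:42"))
```

The store treats each value as an opaque blob, so the only way in is through the key. That is the basic trade-off of the model: very simple, fast lookups in exchange for giving up queries over the contents of the values.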
How to Correctly Select a Sample From a Huge Dataset in Machine Learning
At my previous company, Homesnap, we used a machine learning model to generate a “Likelihood to List” score for every residential property in the United States. I knew we were pulling a ton of different data points, but I was honestly confused about how we were able to make sense of that much data. This article appealed to me because it focuses on learning from a smaller sample, or what it even refers to as the minimum amount required. - Joseph McDermott
Fighting Data Silos with Data Literacy and Transparency for All
As someone new to the world of data, the idea of improving data literacy company-wide is very appealing. The team at Inside Big Data offers good ideas on how to do it. - Jon Davidson
3 Simple Ways to Speed Up Your Python Code
I feel like the majority of the time we spend learning about our processes is spent on how to build new things. A lot like orchestration, the speed and efficiency of your code often isn’t thought about until you’ve finished your task and have to move on to something else. - Steven Johnson