All Hands on Data #26
As we near the beginning of the World Cup over the weekend, our team has shared some of their favorite articles from across the world of data.
9 Predictions for Data in 2023
The author lists several of his own predictions for the coming year. Two that stood out to me were the expectation that cloud data warehouses would effectively become backends for applications and the increased usage of large language models in data projects (sounds like something we're working on at Shipyard...). - John Forstmeier
Should I Learn Julia?
As a fan of Julia, I think the strengths that it has to offer will become more and more important as the size of datasets continue to grow. It's an elegant language, with a very low barrier of entry to learn - Wes Poulsen
How Data Visualization is Essential for the Banking and Finance Sector
I'm a sucker for visualization. Does this mean I'd pick a picture book over a novel? No comment. But when it comes to data - I really think it's the easiest way to communicate your point to a broad audience. And as the consumer of said data, the easiest way for you to quickly understand it. I've never been involved in the banking industry, so it's interesting to see what types of data visualizations they may be using. - Joseph McDermott
How to gather requirements for your data project
As a QA Engineer, this article hits pretty close to home. Gathering requirements from stake holders can sometimes be a frustrating ordeal. This article gives a great outline of requirement gathering along with some useful tips. - Jon Davidson
Decision Trees Explained -- Entropy, Information Gain, Gini Index, CCP Pruning
As a non-data scientist, I find some data science concepts a bit mysterious. The decision tree, however, always seemed quite intuitive to me. I had no idea the complexity that goes into choosing how to branch and what to prune to make a tree with the most predictive value. Dash takes a highly complex process and clearly explains the approaches, math, and advantages/disadvantages of decision trees. - Katt Baum
Writing Your First dbt Package
dbt is such a useful tool. It makes data engineering a bit easier to on-board or at least it helped me early in my career. Pedram does a great job outlining how to get started with examples. - Steven Johnson