All Hands on Data #42
They say that you should beware the Ides of March. Thankfully, I've read that a data newsletter is the perfect defense against Brutus.
ChatGPT is a data privacy nightmare. If you’ve ever posted online, you ought to be concerned
While there have been concerns on the data used to train ChatGPT, the article raises a point that the terms of service indicates that any submissions can also be used to improve the large language model. Maybe think twice before hucking your SSN in there. ; ) - John Forstmeier
The Incompatibility of Open Core and Profit Incentives
Can open core products truly be trusted? Are they incentivized to continue updating their open source product? Or is there an inevitable rug pull on the horizon? This article explores the downsides of the open core model for data tooling and covers alternative incentive structures that may better align with your own business goals. - Blake Burch
ChatGPT’s Electricity Consumption
Since ChatGPT's release, I use it on a semi-regular basis to help jump start projects and, I'm sure, there are many more like me. This has made me quite curious about the amount of resources it takes to run ChatGPT given its behemoth popularity. This is the first in a two part series about ChatGPT's electricity consumption. While mostly educated guess work, it does pull from other's research and is fascinating. - Katt Baum
Unit Testing for Data Engineers
This article addresses what I feel to be one of the biggest areas of improvement in data engineering: unit testing. The author addresses the challenges to unit testing (and lack thereof) in the data ecosystem, and makes the case for testable code. - Wes Poulsen
Say Goodbye to Pandas
Competitors to Pandas has always been interesting to me because Pandas was always one of the most baseline libraries that you learned when you starting learning Python. Lately, it seems like Polars is having a great run of interest from the data community. Vivek does a great job here explaining why you might want to try Polars and how to get started. - Steven Johnson