All Hands on Data #39
Are you anti-panda (the library not the bear)? This week's AHoD is just for you! Our team has articles about alternatives to Pandas along with a few other fun topics. Check it out below!
A Personal Data Architecture for Fun and Profit
The author gives a quick rundown of their personal data pipeline for collecting, storing, and analyzing personal information. This is used to identify and encourage areas of improvement (e.g. finances, health) with support from hard data. Focusing on data-driven improvements can be beneficial both to companies and to individuals. - John Forstmeier
The 5 Crucial Principles To Build A Responsible AI Framework
The potential of AI is colossal and terrific (equal parts exhilarating and frightening). The ethical discussions about the building and using of AI have been fascinating. Agrawal provides a decent list five tenets for the creating a responsible AI: human-centeredness, fairness, privacy, transparency, and security. - Katt Baum
Lineman Stationarity - A data-driven metric for offensive linemen
Let's talk sports data all day!! It's always cool to see how some instinctive theories about how a sport is played show up the raw numbers. Although widely thrown under the umbrella term "analytics", this information is the next level for moving data science and sports forward. - Eric Elsken
Questions to Ask When Prioritizing Projects
I've been in a lot of spirited discussions over which projects to prioritize. TJ does a great job of outlining some important questions that should be asked during those discussions. I'll definitely have these suggestions in my tool belt next time a discussion pops up. - Steven Johnson
Pandas or Polars? Who Wins?
It is interesting to see new challengers to the dominant way rise up. While Pandas won't be going away anytime soon, it is clear that Polars already has the edge in several respects. - Wes Poulsen
Why are Data Scientists obsessed with PySpark over Pandas
I've seen a recent trend in articles and discussion of tools to use instead of Pandas. This article continues that trend with PySpark. Tushar does a great job of outlining why you should use PySpark over Pandas. - Steven Johnson