Robert Nakano

Robert Nakano

Using Pandas Profiling for Meal Kit Data Validation

Methods: Exploratory Data Analysis, Pandas Profiling

Pandas Profiling

Summary

One of the many weekly tasks at Pick a Kit is collecting meal kit menus from various services. This data is used in the Pick a Kit list of menus.

We have been looking at better ways to understand and validate our data on a weekly basis: enter Pandas Profiling. With a few lines of code, Pandas Profiling generates reports using data from a pandas dataframe. Check out a sample of Pick a Kit weekly meal kit data above!

Conclusion

It's great to have a quick, reproducible view of data in a variety of dimensions! Like any data report, the true benefit comes with the analysis; we'll be using this report as one entry point to understanding. Additionaly, Hofstadter's Law was in full effect; in order to get Pandas Profiling working, I performed a much needed update to Windows 10, Windows Subsystem for Linux (WSL), and updated Node.js, Python, and their respective packages. A reminder to keep paying down your technical debt!