Comet Office Hours: Seven Simple Steps to Standardizing the ML Experiment (1/26/22)
Welcome to another recap of the Comet ML Office Hours, powered by The Artists of Data Science! This week we’re covering Session 4 of our new series. This session took place Jan. 26th, 2022 and we were joined by Jimmy Whitaker of Pachyderm, Dr. Abe Gong of Great Expectations, and Data Governance Analyst and Heartbeat contributor Matt Blasa.
As a reminder, we’d love to see any and all of you at these fifty minute sessions—so feel free to register for upcoming Office Hours sessions here! As always, there’s a lot more in the full session (which you can find on our YouTube channel), so be sure to check it out, alongside clips from roundtables, webinars, and previous Office Hours.
All About Data
This session was all about data – understanding, validating, versioning, and engineering it. Because without good data, your model won’t perform optimally and could disappoint down the road.
Matt Blasa of Brinks Home Security discussed his role as a Data Governance Analyst and how that plays into the larger ML experiment pipeline.
One thing that did come up in the course of the data discussion was the idea of “Data Curators.” This was a new term for Harpreet and inspired Dr. Abe Gong to bring up Emilie Schario’s suggestion of taking the term “Data Scientist” and exploding it into multiple job titles.
Hear more from Dr. Gong in the clip below.
Cautionary Tales – And Plenty of Them
Throughout the discussion a number of stories came up about “upside down models” and what happens if you don’t properly version your data.
One story that came up from Jimmy Whitaker was about digits in NLP models. Specifically, a transcription model that was outputting different things for dates – sometimes written as digits and sometimes written out as words.
Check out the clip below for the full story:
Make sure to read Matt’s articles on Heartbeat.
As always, there’s more to be discussed and discovered. Check out Comet’s contributor-led publication Heartbeat as well as our YouTube, Twitter, and LinkedIn for more great information.
- Comet Office Hours Bookshelf
- Office Hours YouTube Playlist
- Follow Comet ML on Twitter
- Connect with Comet on LinkedIn
Curious to Learn more? Join Us!
We run these virtual Office Hours every Wednesday at 11am EST (New York, NY). Completely free to attend and participate, and we’d love to see any and all of you there! We’ve got a great series planned and welcome questions for Harpreet or any of our guests via email to email@example.com.
But most importantly: