|
Today we’re joined by Kelley Rivoire, engineering manager working on machine learning infrastructure at Stripe. Kelley and I caught up at a recent Strata Data conference to discuss: • Her talk "Scaling model training: From flexible training APIs to resource management with Kubernetes." • Stripe’s machine learning infrastructure journey, including their start from a production focus. • Internal tools used at Stripe, including Railyard, an API built to manage model training at scale & more! The complete show notes can be found at twimlai.com/talk/272. Visit twimlcon.com to learn more about the TWIMLcon: AI Platforms conference! The first 10 listeners who register get their ticket for 75% off using the discount code TWIMLFIRST! Follow along with the entire AI Platforms Vol 2 series at twimlai.com/aiplatforms2. Thanks to SigOpt for their continued support of the podcast, and their sponsorship of this episode! Check out their machine learning experimentation and optimization suite, and get a free trial at twimlai.com/sigopt. |