Search

Home > Computer Science > An AI stack: from scaling AI workloads to evaluating LLMs
Podcast: Computer Science
Episode:

An AI stack: from scaling AI workloads to evaluating LLMs

Category: Education
Duration: 00:55:58
Publish Date: 2026-02-26 12:35:57
Description: Hilary Term 2026 Strachey Lecture with Professor Ion Stoica, An AI stack: from scaling AI workloads to evaluating LLMs Large language models (LLMs) have taken the world by storm, enabling new applications, intensifying GPU shortages, and raising concerns about the accuracy of their outputs. In this talk, I will present several projects I have worked on to address these challenges. Specifically, I will focus on Ray, a distributed framework for scaling AI workloads, vLLM and SGLang, two high-throughput inference engines for LLMs, and LMArena, a platform for accurate LLM benchmarking. I will conclude with key lessons learned and outline directions for future research.
Total Play: 0

Some more Podcasts by Oxford University

40+ Episodes
Latin Americ .. 200+     20+
20+ Episodes
Greek and Ro .. 80+     10+
4 Episodes
Ancient Egyp .. 60+     20+
10+ Episodes
"British" Wo .. 20+     10+
5 Episodes
Digital Sket .. 10+     1
20+ Episodes
Protecting t .. 20+     5
60+ Episodes
Surgical Gra .. 10+     2
90+ Episodes
Ethics, Law .. 10+     5