Search

Home > SuperDataScience > 706: Large Language Model Leaderboards and Benchmarks
Podcast: SuperDataScience
Episode:

706: Large Language Model Leaderboards and Benchmarks

Category: Business
Duration: 00:33:27
Publish Date: 2023-08-18 11:00:19
Description: In this episode, Caterina Constantinescu dives deep into Large Language Models (LLMs), spotlighting top leaderboards, evaluation benchmarks, and real-world user perceptions. Plus, discover the challenges of dataset contamination and the intricacies of platforms like HELM and Chatbot Arena. Additional materials: www.superdatascience.com/706 Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
Total Play: 0