Search

Home > SuperDataScience > 692: Lossless LLM Weight Compression: Run Huge Models on a Single GPU
Podcast: SuperDataScience
Episode:

692: Lossless LLM Weight Compression: Run Huge Models on a Single GPU

Category: Business
Duration: 00:07:39
Publish Date: 2023-06-30 11:00:21
Description: Join Jon as he navigates listeners through the innovative SpQR approach—a cutting-edge, lossless LLM weight compression technique that harnesses the power of quantization. Tune in as Jon delves into the four steps behind this groundbreaking method in this week's episode. Additional materials: www.superdatascience.com/692 Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
Total Play: 0