|
Exposing AI Vulnerabilities with Mozilla's Gen AI Bug Bounty Manager - Marco Figueroa In this special weekend edition of Hashtag Trending, host Jim Love sits down with Marco Figueroa, the Gen AI Bug Bounty Program Manager for Mozilla's ODIN project. They delve into the challenges and intricacies of bypassing security guardrails in large language models like ChatGPT and Claude. Marco shares insights from his storied career in cybersecurity, his role at Mozilla, and the innovative techniques hackers use to jailbreak AI systems. Learn about prompt engineering, prompt injection, and prompt hacking, and discover how Mozilla's ODIN project aims to set new standards in AI security. 00:00 Introduction and Guest Introduction 00:22 Understanding Large Language Models and Jailbreaking 02:02 Recent Jailbreaking Techniques and Discoveries 04:41 Interview with Marco Figueroa: Career Journey 10:12 Marco's Work at Mozilla and the ODIN Project 16:50 Exploring Prompt Injection and Hacking 23:20 Future of AI Security and Final Thoughts 38:00 Conclusion and Contact Information |