Cohort-based courses
Guided programs to get real results.
AI Evals For Engineers & PMs
4.7
·4 weeks·Sep 7 – Oct 2
Hamel Husain ML Engineer with 20 years of experience
Shreya Shankar ML Systems & Applied AI Evals Researcher
AI Evals and Analytics Playbook
5.0
·3 weeks·May 11 – Jun 1

Stella Liu Head of AI Applied Science
Amy Chen Cofounder, AI Evals & Analytics
Beyond Evals: Designing Improvement Flywheels for AI Products
NEW·3 weeks·Jun 6 – Jun 27
.png&w=256&q=75)
Aishwarya Naresh RegantiAI Founder & Advisor to F500s | Ex-AWS
1-day workshops
Short, focused sessions to build specific skills.
Free Lightning Lessons
Interactive sessions to explore new topics.
Vibe Code Annotation UIs for AI Analytics Evals
·Jun 24·60 minutes629 StudentsLive
Shane ButlerHow to Setup Evals For Agents
·30 minutes1,423 StudentsWatch
Harrison Chase, Hamel Husain, andRaise Your Technical Bar as an AI-Native PM
·30 minutes15,809 StudentsWatch
Jason P. Yoong and Gayathri Keerthana (GK)From Automation to Multi-Agent Architectures
·3 lessons1,345 StudentsWatch
Hamza FarooqShip a Production Cursor Agent System in 30 Minutes
·Jun 24·30 minutes90 StudentsLive
Carmelo IariaDesign Evals Users Will Trust
·45 minutes769 StudentsWatch
Aishwarya Naresh RegantiBuild Your AI Evals & Analytics Playbook
·30 minutes503 StudentsWatch
Stella Liu and Amy ChenAutomating Evals With Claude Code + Phoenix
·60 minutes2,354 StudentsWatch
Mikyo King and Hamel HusainAI Evals for Product Managers
·60 minutes2,004 StudentsWatch
Anshumani RuddraSetting Eval for AI Agents & Scaling with Auto-Evaluation
·30 minutes860 StudentsWatch
Mahesh YadavProduction Grade AI Evals by Braintrust.dev
·30 minutes487 StudentsWatch
Mengying LiModern Information Retrieval Evaluation In The RAG Era
·45 minutes5,279 StudentsWatch
Nandan Thakur, Hamel Husain, and Shreya ShankarPart 3: Building Robust Evaluations for AI Agents
·60 minutes138 StudentsWatch
Hamza Farooq and Gabriela de QueirozRun Eval Loops and Guardrails for Cursor Agents
·May 27·30 minutes78 StudentsLive
Carmelo IariaDebug Cursor Agent Failures Before Production
·Jun 10·30 minutes36 StudentsLive
Carmelo IariaDebug the weird stuff your AI does (in less than 1 hour)
·45 minutes5,150 StudentsWatch.webp&w=1536&q=75)
Marily Nika and Hamel HusainHow OpenAI Customers Use Evals To Build Better AI Products
·30 minutes1,080 StudentsWatch
Jim Blomo and Hamel HusainOptimize Your Dev Setup For Evals w/ Cursor Rules & MCP
·30 minutes686 StudentsWatch
Isaac Flath, Hamel Husain, and Shreya ShankarPractical Evaluation Strategies for AI Agents
·45 minutes467 StudentsWatch
Hamza Farooq and Gabriela de QueirozCalibrate LLM-as-a-judge for Real-world Impact
·45 minutes205 StudentsWatch
Eddie LandesbergEvaluating AI Agents
·45 minutes1,428 StudentsWatch
Amir Feizpour and Samuel Dion-GirardeauHow Evals Made GitHub Copilot Happen
·30 minutes891 StudentsWatch
John Berryman, Shawn Simister, and Hamel HusainLearn Agentic AI: Setting agents metrics and evaluations
·45 minutes852 StudentsWatch
Mahesh YadavOnline Evals and Production Monitoring
·60 minutes831 StudentsWatch
Jason Liu, Ben Hylak, and Sidhant BendreEvaluation Driven Development for Agentic AI Systems
·45 minutes582 StudentsWatch
Aurimas GriciūnasHow to Drive AI Evals Adoption
·30 minutes322 StudentsWatch
Dr Sebastian FoxScale Evals Without the Chaos
·45 minutes248 StudentsWatch
Aishwarya Naresh RegantiEvals for Voice AI: Learnings from Google Evals Team
·30 minutes240 StudentsWatch
Ravin KumarEvals in Action With Arize
·45 minutes200 StudentsWatch
Laurie VossCollaborative AI Evals with Human Feedback
·30 minutes112 StudentsWatch
Rogério ChavesEvaluating AI Agents before Users Break Them
·60 minutes87 StudentsWatch
Aki Wijesundara, PhD, Marc Klingen, and Lotte VerheydenGo Beyond AI Evals: Diagnose and Decide
·45 minutes51 StudentsWatch
Rajiv ShahEvals for Everyone
·3 lessons2,193 StudentsWatch
Aishwarya & KiritiError Analysis: The AI Engineer’s Best ROI
·60 minutes1,514 StudentsWatch
Hamel Husain and Shreya ShankarEvaluating Agentic AI Applications Beyond Vibe Checks
·45 minutes1,249 StudentsWatch
Aishwarya Naresh Reganti, Kiriti Badam, and Claire LongoUnderstanding Embedding Performance through Generative Evals
·60 minutes1,181 StudentsWatch
Jason Liu and Kelly HongOptimize Structured Data Retrieval With Evals
·45 minutes843 StudentsWatch
Daniel Svonava and Hamel HusainAI Systems Under Pressure: Red-Team Before You Ship
·60 minutes802 StudentsWatch
Krystal JacksonEvaluate AI agents with Confidence
·45 minutes800 StudentsWatch
Mahesh YadavImprove reliability of your AI applications
·30 minutes747 StudentsWatch
Shreya RajpalBuild Your Own Eval Tools With Notebooks!
·45 minutes610 StudentsWatch
Vincent D. Warmerdam, Hamel Husain, and Shreya ShankarHow You Catch Production Hallucinations in Real Time
·60 minutes504 StudentsWatch
Jason Liu and Julia NeaguScaling Judge-Time Compute for Robust Auto LLM Evaluation
·60 minutes489 StudentsWatch
Jason Liu and Leonard TangStrategies for building self-improving document processing
·60 minutes428 StudentsWatch
Jason Liu and Eli BadgioMaster Evaluation Techniques for LLM Apps
·30 minutes411 StudentsWatch
Haroon ChouderyUnderstand SHAP (SHapley Additive exPlanations)
·30 minutes310 StudentsWatch
Patrick HallReliable RAG Agents: Intent-Driven Failure Detection
·60 minutes297 StudentsWatch
Jason Liu and Ben HylakCreate MCP Tool Evals Before You Ship
·45 minutes282 StudentsWatch
Emmanuel ParaskakisDon't Tweak Prompts. Engineer Agents.
·30 minutes274 StudentsWatch
Hugo Bowne-Anderson and Skylar PayneMastering LLM Application Testing
·30 minutes239 StudentsWatch
Hugo Bowne-Anderson and Stefan KrawczykSynthetic RAG evaluation
·60 minutes210 StudentsWatch
Alexey Grigorev and Doug Turnbull🛠 Synthetic Data Flywheels: Build Reliable LLM Apps Faster
·30 minutes187 StudentsWatch
Hugo Bowne-Anderson and Stefan KrawczykThe Hidden Signal in Production AI Logs
·60 minutes171 StudentsWatch
Jason Liu and Scott ClarkHow to test and improve your AI agents
·45 minutes166 StudentsWatch
Jacob BankDe-Risking LLM Model Switches w Evals & Prompt Optimization
·45 minutes145 StudentsWatch
Amir Feizpour and Hugo MailhotStay Ahead in AI: Evaluate Any New LLM in 15 Minutes
·30 minutes93 StudentsWatch
Sherveen MashayekhiSetting up your first AI eval with a LLM-as-judge
·45 minutes60 StudentsWatch
Madalina Turlea and Catalina TurleaHow to test AI when you don't have any data yet
·45 minutes23 StudentsWatch
Madalina Turlea and Catalina Turlea


