1 of 18

AMAN KHAN

Beyond Vibe Checks: Mastering Evals

2 of 18

A bit about me

3 of 18

What we’ll cover

  • My 5 principles
  • What are evals and why they matter
  • Evaluate “AI trip planning” data

4 of 18

Who is this session for?

5 of 18

My Five Essential Skills

  1. AI Fundamentals
  2. Customer Obsession with Curiosity
  3. Rapid Prototyping
  4. Learning from Great AI Experiences
  5. Mastering Evals and Observability

6 of 18

My Five Essential Skills

  1. AI Fundamentals
  2. Customer Obsession with Curiosity
  3. Rapid Prototyping
  4. Learning from Great AI Experiences
  5. Mastering Evals and Observability

7 of 18

My Five Essential Skills

  • AI Fundamentals
  • Customer Obsession with Curiosity
  • Rapid Prototyping
  • Learning from Great AI Experiences
  • Mastering Evals and Observability

8 of 18

LLMs Hallucinate

Your job is to make sure they don’t embarrass you, your company or brand

9 of 18

What is an Eval?

10 of 18

What is an Eval?

11 of 18

Evaluating with vibes…

© All Rights Reserved

| We Make AI Work

12 of 18

… to Thrive

Coding

© All Rights Reserved

| We Make AI Work

13 of 18

14 of 18

15 of 18

Looking ahead

What is fundamentally shifting in the PM role?

16 of 18

Q&A !

17 of 18

18 of 18

Thank You!

@_amankhan

/amanberkeley

aiproductplaybook.com