Этот материал находится в платной подписке. Оформи премиум подписку и смотри или слушай AI Evals For Engineers & PMs, а также все другие курсы, прямо сейчас!
Премиум
  • Урок 1. 00:56:41
    1. Lesson 1. Fundamentals & Lifecycle LLM Application Evaluation
  • Урок 2. 01:01:39
    2. Lesson 2. Systematic Error Analysis
  • Урок 3. 00:43:03
    3. Braintrust Tutorial w Wayde Gilliam
  • Урок 4. 01:40:14
    4. Optional. Office Hours
  • Урок 5. 00:59:34
    5. Lesson 3. More Error Analysis & Collaborative Evaluation
  • Урок 6. 01:00:35
    6. Lesson 4. Automated Evaluators
  • Урок 7. 00:44:43
    7. Taming diffusion QR codes with evals and inference-time scaling w Charles Frye
  • Урок 8. 00:28:26
    8. 10x Your RAG Evaluation by Avoiding these Pitfalls w Skylar Payne
  • Урок 9. 01:18:26
    9. Optional. Office Hours
  • Урок 10. 00:47:12
    10. Optional. Office Hours
  • Урок 11. 00:05:13
    11. Lesson 5. More Automated Evaluators
  • Урок 12. 00:59:46
    12. Lesson 6. RAG & Complex Architectures
  • Урок 13. 00:31:09
    13. Scaling Inference-Time Compute for Better LLM Judges w Leonard Tang
  • Урок 14. 00:46:39
    14. Building custom eval tools with coding agents w Isaac Flath
  • Урок 15. 00:30:03
    15. From Vibe Checks to Evals to Feedback Loops - Case Studies in Al System Maturities w David Karam
  • Урок 16. 00:38:26
    16. A Playbook For Building Al Agents You Can Trust w Udi Menkes
  • Урок 17. 00:34:16
    17. Al Evals in Vertical Industries (such as healthcare, finance and law) w Dr Chris Lovejoy
  • Урок 18. 00:49:03
    18. Arize Phoenix tutorial W Mikyo King
  • Урок 19. 00:22:32
    19. Optional. Office Hours
  • Урок 20. 00:24:20
    20. Optional. Office Hours
  • Урок 21. 00:55:49
    21. Optional. Office Hours
  • Урок 22. 00:59:03
    22. Lesson 7. Efficient Continuous Human Review Systems
  • Урок 23. 01:03:11
    23. Lesson 8. Cost Optimization
  • Урок 24. 00:33:38
    24. Techniques for evaluating agents w SallyAnn DeLucia (Arize)
  • Урок 25. 00:48:24
    25. LangSmith Tutorial w Harrison Chase
  • Урок 26. 01:10:21
    26. From Noob to 5 Automated Evals in 4 Weeks (as a PM) w Teresa Torres
  • Урок 27. 01:42:26
    27. Solvelt. The Thinking Developer's Environment w Jeremy Howard & Johno Whitaker
  • Урок 28. 01:00:49
    28. Testing Real Al Products LIVE w Robert Ta
  • Урок 29. 00:45:00
    29. Fireside Chat with DSP Creator w Omar Khattab
  • Урок 30. 01:06:31
    30. Optional. Office Hours
  • Урок 31. 01:05:26
    31. Optional. Office Hours (Bonus)
  • Урок 32. 00:10:50
    HW 1&2 walkthrough with Braintrust (pre-recorded) 1
  • Урок 33. 00:05:13
    HW 1&2 walkthrough with Braintrust (pre-recorded) 2
  • Урок 34. 00:15:04
    HW 1&2 walkthrough with Phoenix (pre-recorded)
  • Урок 35. 00:22:41
    HW 1&2 walkthrough with LangSmith (pre-recorded)
  • Урок 36. 00:21:41
    HW 3 walkthrough with Braintrust (pre-recorded)
  • Урок 37. 00:16:40
    HW 3 walkthrough with Phoenix (pre-recorded)
  • Урок 38. 00:23:11
    HW 4 walkthrough with Braintrust (pre-recorded)
  • Урок 39. 00:16:39
    HW 4 walkthrough with Phoenix (pre-recorded)
  • Урок 40. 00:22:03
    HW 5 walkthrough with Braintrust (pre-recorded)
  • Урок 41. 00:14:58
    HW 5 walkthrough with Phoenix (pre-recorded)