This material is available by paid subscription. Get a premium subscription to watch or listen to AI Evals For Engineers & PMs, along with all other courses, right now!
  1. Video 1. 00:56:41
    1. Lesson 1. Fundamentals & Lifecycle LLM Application Evaluation
  2. Video 2. 01:01:39
    2. Lesson 2. Systematic Error Analysis
  3. Video 3. 00:43:03
    3. Braintrust Tutorial w Wayde Gilliam
  4. Video 4. 01:40:14
    4. Optional. Office Hours
  5. Video 5. 00:59:34
    5. Lesson 3. More Error Analysis & Collaborative Evaluation
  6. Video 6. 01:00:35
    6. Lesson 4. Automated Evaluators
  7. Video 7. 00:44:43
    7. Taming diffusion QR codes with evals and inference-time scaling w Charles Frye
  8. Video 8. 00:28:26
    8. 10x Your RAG Evaluation by Avoiding these Pitfalls w Skylar Payne
  9. Video 9. 01:18:26
    9. Optional. Office Hours
  10. Video 10. 00:47:12
    10. Optional. Office Hours
  11. Video 11. 00:05:13
    11. Lesson 5. More Automated Evaluators
  12. Video 12. 00:59:46
    12. Lesson 6. RAG & Complex Architectures
  13. Video 13. 00:31:09
    13. Scaling Inference-Time Compute for Better LLM Judges w Leonard Tang
  14. Video 14. 00:46:39
    14. Building custom eval tools with coding agents w Isaac Flath
  15. Video 15. 00:30:03
    15. From Vibe Checks to Evals to Feedback Loops - Case Studies in AI System Maturities w David Karam
  16. Video 16. 00:38:26
    16. A Playbook For Building AI Agents You Can Trust w Udi Menkes
  17. Video 17. 00:34:16
    17. AI Evals in Vertical Industries (such as healthcare, finance and law) w Dr Chris Lovejoy
  18. Video 18. 00:49:03
    18. Arize Phoenix tutorial w Mikyo King
  19. Video 19. 00:22:32
    19. Optional. Office Hours
  20. Video 20. 00:24:20
    20. Optional. Office Hours
  21. Video 21. 00:55:49
    21. Optional. Office Hours
  22. Video 22. 00:59:03
    22. Lesson 7. Efficient Continuous Human Review Systems
  23. Video 23. 01:03:11
    23. Lesson 8. Cost Optimization
  24. Video 24. 00:33:38
    24. Techniques for evaluating agents w SallyAnn DeLucia (Arize)
  25. Video 25. 00:48:24
    25. LangSmith Tutorial w Harrison Chase
  26. Video 26. 01:10:21
    26. From Noob to 5 Automated Evals in 4 Weeks (as a PM) w Teresa Torres
  27. Video 27. 01:42:26
    27. SolveIt. The Thinking Developer's Environment w Jeremy Howard & Johno Whitaker
  28. Video 28. 01:00:49
    28. Testing Real AI Products LIVE w Robert Ta
  29. Video 29. 00:45:00
    29. Fireside Chat with DSP Creator w Omar Khattab
  30. Video 30. 01:06:31
    30. Optional. Office Hours
  31. Video 31. 01:05:26
    31. Optional. Office Hours (Bonus)
  32. Video 32. 00:10:50
    HW 1&2 walkthrough with Braintrust (pre-recorded) 1
  33. Video 33. 00:05:13
    HW 1&2 walkthrough with Braintrust (pre-recorded) 2
  34. Video 34. 00:15:04
    HW 1&2 walkthrough with Phoenix (pre-recorded)
  35. Video 35. 00:22:41
    HW 1&2 walkthrough with LangSmith (pre-recorded)
  36. Video 36. 00:21:41
    HW 3 walkthrough with Braintrust (pre-recorded)
  37. Video 37. 00:16:40
    HW 3 walkthrough with Phoenix (pre-recorded)
  38. Video 38. 00:23:11
    HW 4 walkthrough with Braintrust (pre-recorded)
  39. Video 39. 00:16:39
    HW 4 walkthrough with Phoenix (pre-recorded)
  40. Video 40. 00:22:03
    HW 5 walkthrough with Braintrust (pre-recorded)
  41. Video 41. 00:14:58
    HW 5 walkthrough with Phoenix (pre-recorded)