Week 4

📄 This Week's Overview

Resources

Notes

Channels

Messages

Channels

Messages

Back

Start Here

📄

This Week's Overview

Lesson 1

This week covers advanced topics, focusing on scaling human review and connecting evaluation results to pipeline improvements. First, we will explore interfaces for human review and error analysis. You will learn why dedicated annotation tools can speed up your analyze and measure processes (moving beyond spreadsheets or off-the-shelf tools).

Second, we will briefly cover the "Improve" phase of the development lifecycle. After identifying failures (Week 1) and measuring them (Weeks 2-3), the final step is fixing them. We will survey some accuracy improvement techniques: e.g., prompt engineering, adding more context, fine-tuning.

Objectives

Readings (link to course reader)

Optional Coding Homework

Code Snippets from Lessons

Slides: Lesson 7 | Lesson 8

CompleteComplete this lesson

0%

[

Home

](/parlance-labs/evals/2025-3/home)[

Community

](/parlance-labs/evals/2025-3)