diff --git a/src/agentlab/analyze/error_analysis.py b/src/agentlab/analyze/error_analysis.py
index 07406a43..5a36db58 100644
--- a/src/agentlab/analyze/error_analysis.py
+++ b/src/agentlab/analyze/error_analysis.py
@@ -133,7 +133,7 @@
 FEW-SHOT CLASSIFICATION EXAMPLES
 --------------------------------------------------------------------------------
-1) EXAMPLE A (Benchmarl Error - Benchmark Design Error)
+1) EXAMPLE A (Benchmark Error - Benchmark Design Error)
 • Context: The agent correctly finds a cheaper product meeting the user's criteria, but the benchmark expects a more expensive product and marks the solution as wrong.
 • Classification: ["Benchmark Design Error"]