Add ToolErrorRecovery capability#171

Open
DouweM wants to merge 2 commits into main from capability/tool-error-recovery

Conversation

@DouweM
Contributor

@DouweM DouweM commented Apr 10, 2026

Summary

  • Adds ToolErrorRecovery capability that catches unhandled tool execution errors and recovers gracefully, preventing agent run crashes
  • Three configurable strategies: 'inform' (the default; returns the error message to the model), ('retry', N) (retries up to N times, then informs), and ('fallback', value) (returns a static value)
  • Per-tool strategy configuration via tool_strategies dict, with default_strategy for unconfigured tools
  • Per-run state isolation via for_run() for retry count tracking
  • Convenience constructors retry() and fallback() for readable strategy definitions

Test plan

  • Unit tests for convenience constructors (retry, fallback) including validation
  • Unit tests for strategy validation (_validate_strategy) covering all valid and invalid forms
  • Unit tests for error formatting with and without traceback
  • Construction validation tests (valid defaults, custom strategies, invalid input)
  • for_run() isolation: fresh instance with reset retry counts
  • inform strategy: error message returned to model (with/without traceback)
  • fallback strategy: static value returned (None, string, dict)
  • retry strategy: success on first attempt, success after failures, exhaustion falls back to inform
  • Non-retry strategies pass through to on_tool_execute_error (wrap_tool_execute re-raises)
  • Strategy resolution: tool-specific overrides and default fallthrough
  • Retry count resets on success across multiple calls
  • Public API import from pydantic_harness
  • 100% code coverage, pyright strict mode (0 errors), ruff lint and format clean

Closes #61

🤖 Generated with Claude Code

DouweM and others added 2 commits April 2, 2026 05:33
…acefully

Catches unhandled tool execution errors and applies configurable recovery
strategies (inform, retry, fallback) per tool, preventing agent run crashes
and enabling the model to self-correct.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
…r to ToolErrorRecovery

- retry() now accepts retry_delay (base seconds for 2^attempt backoff) and
  retryable_exceptions (tuple of exception types eligible for retry)
- ToolErrorRecovery gains max_total_errors: after N total errors across all
  tools, recovery stops and errors propagate as-is
- Per-run state (_total_errors) is reset by for_run() alongside _retry_counts
- Full test coverage for all new features including backoff timing verification,
  exception subclass matching, cross-tool budget exhaustion, and validation

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
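The backoff schedule implied by this commit message (base seconds times 2^attempt) and the subclass matching of retryable_exceptions can be sketched as follows. Both the formula and the isinstance-based matching are inferred from the description above, not copied from the source.

```python
# Inferred from the commit message: retry_delay is the base in seconds,
# and attempt k waits retry_delay * 2**k before retrying.
def backoff_delays(retry_delay: float, attempts: int) -> list[float]:
    return [retry_delay * (2 ** attempt) for attempt in range(attempts)]

# retryable_exceptions is a tuple of exception types; subclasses match,
# exactly as with isinstance().
def is_retryable(exc: BaseException,
                 retryable_exceptions: tuple[type[BaseException], ...]) -> bool:
    return isinstance(exc, retryable_exceptions)

backoff_delays(0.5, 4)                                    # → [0.5, 1.0, 2.0, 4.0]
is_retryable(ConnectionResetError(), (ConnectionError,))  # → True (subclass match)
is_retryable(ValueError("bad"), (ConnectionError,))       # → False
```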

@devin-ai-integration (bot) left a comment


Devin Review found 2 potential issues.

View 2 additional findings in Devin Review.


Comment on lines +304 to +306
    # If the exception isn't retryable, stop immediately.
    if not isinstance(exc, retryable_exceptions):
        return _format_error(call.tool_name, exc, include_traceback=self.include_traceback)

🟡 Non-retryable exceptions bypass the max_total_errors budget check in wrap_tool_execute

In wrap_tool_execute, the non-retryable exception check at line 305 returns an inform message before the budget exhaustion check at line 309. This means when a tool configured with a retry strategy and a custom retryable_exceptions filter encounters a non-retryable exception, it will always be "recovered" (returned as an inform string) even if max_total_errors budget is already exhausted. This contradicts the documented contract of max_total_errors (src/pydantic_harness/tool_error_recovery.py:222-228): "Once the budget is exhausted, subsequent errors propagate as-is instead of being recovered."

Concrete scenario triggering the bug

With ToolErrorRecovery(tool_strategies={'t': retry(3, retryable_exceptions=(ConnectionError,))}, max_total_errors=0), raising a ValueError will increment _total_errors to 1, then hit the non-retryable check and return an inform message — even though _budget_exhausted() would return True (1 > 0). The budget check on line 309 is never reached.
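The ordering problem reduces to a toy control-flow model. The stand-in below mirrors the order of checks described in the finding; it is illustrative only, not the actual wrap_tool_execute source.

```python
def handle_error(exc: BaseException,
                 retryable_exceptions: tuple[type[BaseException], ...],
                 budget_exhausted: bool) -> str:
    """Mirrors the order of checks as written in the PR under review."""
    if not isinstance(exc, retryable_exceptions):  # checked first...
        return "inform"       # ...so a non-retryable error is always recovered
    if budget_exhausted:      # ...and this check is never reached for it
        raise exc
    return "retry"

# A non-retryable ValueError is 'recovered' even with the budget exhausted:
handle_error(ValueError("boom"), (ConnectionError,), budget_exhausted=True)
# → 'inform', contradicting the documented max_total_errors contract
```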

Suggested change

Before:

    # If the exception isn't retryable, stop immediately.
    if not isinstance(exc, retryable_exceptions):
        return _format_error(call.tool_name, exc, include_traceback=self.include_traceback)

After:

    # If the error budget is exhausted, let the error propagate.
    if self._budget_exhausted():
        raise
    # If the exception isn't retryable, stop immediately.
    if not isinstance(exc, retryable_exceptions):
        return _format_error(call.tool_name, exc, include_traceback=self.include_traceback)

Comment on lines +308 to +310
    # If the error budget is exhausted, let the error propagate.
    if self._budget_exhausted():
        raise

🚩 Potential double-counting of _total_errors if framework calls on_tool_execute_error after wrap_tool_execute raises

When the retry strategy's budget is exhausted, wrap_tool_execute re-raises the exception at line 310 (after already incrementing _total_errors at line 301). If the PydanticAI framework then calls on_tool_execute_error for this propagated exception, _total_errors would be incremented again at src/pydantic_harness/tool_error_recovery.py:337. This depends on the framework's hook dispatch behavior — specifically whether on_tool_execute_error fires for exceptions that escape wrap_tool_execute. Without access to the pydantic-ai AbstractCapability source, I can't confirm whether this happens. If it does, the error count would be inflated, though in practice it wouldn't change behavior since the budget is already exhausted at that point.


@DouweM
Contributor Author

DouweM commented Apr 10, 2026

Originally posted by @DouweM in #158 comment (PR closed due to history rewrite)

Audit vs prior art: ToolErrorRecovery

Worth adding now:

  • Exponential backoff for retries
  • max_total_errors budget across all tools
  • retryable_exceptions filter

Follow-up opportunities:

  • Error categorization, metrics/reporting



Development

Successfully merging this pull request may close these issues.

Tool Error Recovery capability

2 participants