Add SecretMasking capability by DouweM · Pull Request #172 · pydantic/pydantic-ai-harness

DouweM · 2026-04-10T01:02:08Z

Summary

Adds SecretMasking capability that redacts secrets, API keys, and sensitive data from tool outputs and model responses
Uses after_tool_execute to scrub tool return values and after_model_request to scrub model response TextParts
Built-in pattern categories: api_keys (OpenAI, Anthropic, AWS, GitHub, Slack, Google, generic), tokens (Bearer, JWT), connection_strings (password-in-URL, database URIs), private_keys (RSA, EC, OpenSSH)
Configurable: categories, custom_patterns, replacement string (default [REDACTED])

Closes #78

Test plan

45 tests covering all pattern categories, both hooks, edge cases (empty string, None, non-string results, non-text parts)
100% code coverage
Passes ruff check, ruff format, pyright strict mode

…and model responses Implements regex-based detection and masking of API keys, Bearer tokens, JWTs, connection strings, and private keys using `after_tool_execute` and `after_model_request` hooks. Configurable pattern categories, custom patterns, and replacement string. Closes #78 Co-Authored-By: Claude Opus 4.6 (1M context) <[email protected]>

…rubbing, and partial masking Addresses audit findings from PR #157: - Add patterns for Azure subscription keys, Stripe, SendGrid, Twilio, GCP service account keys - Add `env_file` category to detect KEY=value lines in .env-style content - Add `before_tool_execute` hook to scrub secrets from tool call args before execution - Add `partial_mask` option to keep first 4 chars visible (e.g. `sk-a****`) - Fix `after_model_request` signature to use `ModelRequestContext` instead of `Any` - Fix `after_tool_execute` args type to use `ValidatedToolArgs` instead of `dict[str, Any]` Co-Authored-By: Claude Opus 4.6 (1M context) <[email protected]>

devin-ai-integration

Devin Review found 2 potential issues.

View 4 additional findings in Devin Review.

devin-ai-integration · 2026-04-10T01:08:25Z

+    for key, value in d.items():
+        if isinstance(value, str):
+            if partial:
+                result[key] = _partial_mask_text(value, patterns, visible_chars)
+            else:
+                result[key] = _mask_text(value, patterns, replacement)
+        elif isinstance(value, dict):
+            result[key] = _mask_dict_values(
+                cast(dict[str, Any], value), patterns, replacement, partial=partial, visible_chars=visible_chars
+            )
+        else:
+            result[key] = value
+    return result


🟡 _mask_dict_values doesn't recurse into list values, allowing secrets in list-typed tool args to bypass masking

The _mask_dict_values function handles str and dict values recursively but passes all other types (including list) through unchanged (secret_masking.py:101-102). Since before_tool_execute delegates entirely to _mask_dict_values (secret_masking.py:180-185), tool arguments like {'keys': ['sk-abc123def456ghi789jkl012mno']} or {'configs': [{'token': 'sk-ant-secret...'}]} will have their secrets pass through unmasked. This is inconsistent with the nested-dict handling (which does recurse) and creates a security gap in a security-critical code path.

Example of unmasked secrets in list args

Given tool args:

args = {'items': ['sk-abc123def456ghi789jkl012mno'], 'config': {'key': 'sk-abc123def456ghi789jkl012mno'}}

After _mask_dict_values, args['config']['key'] is correctly [REDACTED], but args['items'][0] still contains the raw secret.

Suggested change

for key, value in d.items():

if isinstance(value, str):

if partial:

result[key] = _partial_mask_text(value, patterns, visible_chars)

else:

result[key] = _mask_text(value, patterns, replacement)

elif isinstance(value, dict):

result[key] = _mask_dict_values(

cast(dict[str, Any], value), patterns, replacement, partial=partial, visible_chars=visible_chars

)

else:

result[key] = value

return result

result: dict[str, Any] = {}

for key, value in d.items():

if isinstance(value, str):

if partial:

result[key] = _partial_mask_text(value, patterns, visible_chars)

else:

result[key] = _mask_text(value, patterns, replacement)

elif isinstance(value, dict):

result[key] = _mask_dict_values(

cast(dict[str, Any], value), patterns, replacement, partial=partial, visible_chars=visible_chars

)

elif isinstance(value, list):

result[key] = _mask_list_values(value, patterns, replacement, partial=partial, visible_chars=visible_chars)

else:

result[key] = value

return result

Was this helpful? React with 👍 or 👎 to provide feedback.

devin-ai-integration · 2026-04-10T01:08:27Z

+_ENV_FILE_PATTERNS: dict[str, re.Pattern[str]] = {
+    'env_key_value': re.compile(r'(?m)^[A-Z][A-Z0-9_]+=.+$'),
+}


🚩 env_file category enabled by default is very aggressive and may cause false positives

The env_key_value pattern (?m)^[A-Z][A-Z0-9_]+=.+$ matches any line that looks like UPPER_CASE_VAR=value. When SecretMasking() is instantiated with defaults (categories=None), this category is enabled alongside all others. This will redact innocuous tool outputs or model responses containing lines like PATH=/usr/bin, DEBUG=true, or HOME=/home/user. This is a design choice but could lead to surprising over-redaction in production. Consider whether env_file should be opt-in rather than included in the default set.

Was this helpful? React with 👍 or 👎 to provide feedback.

DouweM · 2026-04-10T15:06:11Z

Originally posted by @DouweM in #157 comment (PR closed due to history rewrite)

Audit vs prior art: SecretMasking

Worth adding now:

More key patterns: Azure, GCP, Stripe, SendGrid, Twilio
.env content detection (KEY=value lines)
Mask in tool call args too (via before_tool_execute)
Partial masking option (sk-**** instead of [REDACTED])

Follow-up opportunities:

Encrypted secret registry, env-var blocking

DouweM and others added 2 commits April 2, 2026 05:32

DouweM requested review from Kludex, adtyavrdhn, dmontagu, dsfaccini and samuelcolvin as code owners April 10, 2026 01:02

devin-ai-integration Bot reviewed Apr 10, 2026

View reviewed changes

DouweM removed request for Kludex, adtyavrdhn, dmontagu, dsfaccini and samuelcolvin April 10, 2026 15:11

DouweM marked this pull request as draft April 10, 2026 15:12

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add SecretMasking capability#172

Add SecretMasking capability#172
DouweM wants to merge 2 commits intomainfrom
capability/secret-masking

DouweM commented Apr 10, 2026 •

edited

Loading

Uh oh!

devin-ai-integration Bot left a comment

Uh oh!

devin-ai-integration Bot Apr 10, 2026

Uh oh!

devin-ai-integration Bot Apr 10, 2026

Uh oh!

DouweM commented Apr 10, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

DouweM commented Apr 10, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Test plan

Uh oh!

devin-ai-integration Bot left a comment

Choose a reason for hiding this comment

Uh oh!

devin-ai-integration Bot Apr 10, 2026

Choose a reason for hiding this comment

Uh oh!

devin-ai-integration Bot Apr 10, 2026

Choose a reason for hiding this comment

Uh oh!

DouweM commented Apr 10, 2026

Audit vs prior art: SecretMasking

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

DouweM commented Apr 10, 2026 •

edited

Loading