Add Skills capability for progressive tool loading by DouweM · Pull Request #183 · pydantic/pydantic-ai-harness

DouweM · 2026-04-10T01:02:46Z

Summary

Implements a Skills capability (AbstractCapability subclass) that enables progressive tool loading: agents discover skills via search_skills(query) and activate them via load_skill(name), keeping unloaded tools hidden from the model's context window
Skills can be defined in Python (with callable tools or FunctionToolset) or loaded from markdown files with YAML frontmatter (pure knowledge packages)
Per-run state isolation via for_run(), spec-serializable via from_spec(dirs=[...]), and tool visibility controlled through the prepare_tools hook

Closes #22. Partially addresses #40.

Test plan

🤖 Generated with Claude Code

Implements a Skills capability (AbstractCapability subclass) that lets agents discover and load skill packages on demand, preserving context window by hiding unloaded tools. Provides search_skills and load_skill meta-tools, supports both Python-defined and markdown-based skills. Closes #22. Partially addresses #40. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

…lerance - Add `unload_skill(name)` meta-tool to remove a skill's tools from the loaded set, freeing context window space - Improve `search_skills` with word-boundary matching: split query into words, match each against name/description, rank results by match count - Document that unknown frontmatter keys are silently ignored for agentskills.io compatibility (already worked, now explicit + tested) - Add 9 new tests covering all new behavior Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

…ranch - Add test for frontmatter lines without colon (line 134) - Add test for get_toolset with FunctionToolset skills (lines 230-231) - Mark tool stub functions as `# pragma: no cover` (never called, only registered) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

devin-ai-integration

Devin Review found 2 potential issues.

View 4 additional findings in Devin Review.

devin-ai-integration · 2026-04-10T01:07:41Z

+        raise ValueError(f'Missing YAML frontmatter in {source}')
+
+    # Find closing delimiter
+    end = stripped.find('---', 3)


🟡 Frontmatter closing delimiter search matches --- inside values, not just on its own line

_parse_skill_markdown at src/pydantic_harness/skills.py:117 uses stripped.find('---', 3) to locate the closing frontmatter delimiter. This performs a simple substring search, so it will match --- appearing within a frontmatter value (e.g., description: A---B) rather than requiring --- to be on its own line, which is the standard YAML frontmatter convention. When triggered, the frontmatter is truncated at the embedded ---, the description (or other value) is silently cut short, and the remainder is incorrectly treated as the body/instructions.

Example of incorrect parse

Input:

--- name: my-skill description: Long---description --- Body text

find('---', 3) matches the --- inside Long---description at character 34 instead of the actual closing delimiter. Result: description is parsed as Long, and body becomes description\n---\nBody text.

Suggested change

end = stripped.find('---', 3)

end = stripped.find('\n---', 3)

if end == -1:

raise ValueError(f'Unclosed YAML frontmatter in {source}')

frontmatter_text = stripped[3:end].strip()

body = stripped[end + 4 :].strip() or None

Was this helpful? React with 👍 or 👎 to provide feedback.

devin-ai-integration · 2026-04-10T01:07:43Z

+    async def for_run(self, ctx: RunContext[AgentDepsT]) -> Skills[AgentDepsT]:
+        """Return a fresh copy with empty loaded-skills state."""
+        clone: Skills[AgentDepsT] = Skills(skills=self.skills)
+        return clone


🚩 for_run clone shares skills list reference — meta-tool binding depends on get_ re-extraction*

The for_run method at src/pydantic_harness/skills.py:200 creates a clone via Skills(skills=self.skills), sharing the same skills list by reference. The critical design question is whether get_toolset() is re-called on the clone after for_run. The AbstractCapability.for_run docstring says it is "Called once per run, before get_*() re-extraction", which implies get_toolset() is indeed re-invoked on the clone. If so, the meta-tools (_search_skills, _load_skill, _unload_skill) would be bound methods of the clone, and _loaded_skill_names mutations during a run would correctly affect the same instance that prepare_tools checks. This is correct under the documented lifecycle. However, if any future pydantic-ai version changes the re-extraction behavior, this would silently break — the meta-tools would mutate the original instance's state while prepare_tools reads from the clone's state, making skill loading appear to have no effect.

Was this helpful? React with 👍 or 👎 to provide feedback.

DouweM · 2026-04-10T15:09:28Z

Originally posted by @DouweM in #133 comment (PR was recreated)

Audit vs prior art: Skills

Worth adding now:

agentskills.io tolerance: ignore unknown frontmatter keys instead of failing
unload_skill(name) tool to free context window
Fuzzy/word-boundary search in search_skills

Follow-up opportunities:

Remote skill registries (git-based, like VStorm)
Dependency resolution between skills
Resource/script separation (agentskills.io pattern)

mtessar · 2026-04-12T04:12:44Z

@DouweM i am excited to see skills being added. Thanks for working on it!

…212) * Split skills module into package to match project conventions Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * Move skills tests to tests/_skills/ to match project conventions Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

dergachoff · 2026-04-17T12:32:09Z

Do I understand correctly that dynamic loading/unloading skills breaks cache?

DouweM · 2026-04-23T18:43:23Z

It's clear that the people want skills support :)

From discussing on Slack (join, thread), it's also clear that people mean different things when they say that: some are you really looking for full filesys/shell user-provided skills support, but many primarily want "programmatic skills" that are defined code-side (and could be loaded from a file or a DB, but not necessarily the user's own/sandbox FS).

Check out the thread Slack thread if you have opinions. We're meeting next week with a couple of champions from the Slack thread to make sure we build the right thing, and not get distracted by agentskills.io if we don't have to.

DouweM and others added 3 commits April 2, 2026 05:27

DouweM requested review from Kludex, adtyavrdhn, dmontagu, dsfaccini and samuelcolvin as code owners April 10, 2026 01:02

devin-ai-integration Bot reviewed Apr 10, 2026

View reviewed changes

DouweM removed request for Kludex, adtyavrdhn, dmontagu, dsfaccini and samuelcolvin April 10, 2026 15:12

DouweM marked this pull request as draft April 10, 2026 15:13

adtyavrdhn self-assigned this Apr 15, 2026

DouweM added this to the 2026-05 milestone Apr 23, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Skills capability for progressive tool loading#183

Add Skills capability for progressive tool loading#183
DouweM wants to merge 4 commits intomainfrom
capability/skills

DouweM commented Apr 10, 2026 •

edited

Loading

Uh oh!

devin-ai-integration Bot left a comment

Uh oh!

devin-ai-integration Bot Apr 10, 2026

Uh oh!

devin-ai-integration Bot Apr 10, 2026

Uh oh!

DouweM commented Apr 10, 2026

Uh oh!

mtessar commented Apr 12, 2026

Uh oh!

dergachoff commented Apr 17, 2026

Uh oh!

DouweM commented Apr 23, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

-    end = stripped.find('---', 3)
+    end = stripped.find('\n---', 3)
+    if end == -1:
+        raise ValueError(f'Unclosed YAML frontmatter in {source}')
+    frontmatter_text = stripped[3:end].strip()
+    body = stripped[end + 4 :].strip() or None

Conversation

DouweM commented Apr 10, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Test plan

Uh oh!

devin-ai-integration Bot left a comment

Choose a reason for hiding this comment

Uh oh!

devin-ai-integration Bot Apr 10, 2026

Choose a reason for hiding this comment

Uh oh!

devin-ai-integration Bot Apr 10, 2026

Choose a reason for hiding this comment

Uh oh!

DouweM commented Apr 10, 2026

Audit vs prior art: Skills

Uh oh!

mtessar commented Apr 12, 2026

Uh oh!

dergachoff commented Apr 17, 2026

Uh oh!

DouweM commented Apr 23, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

DouweM commented Apr 10, 2026 •

edited

Loading