Commit f162a7a
feat+revert(v0.22): domain enumeration in graph context; strategy text reverted
PHASE 2.1 (kept): build_graph_context now emits a "Domains in this
corpus: krra (90125), x2bee (19843), assort (13909)" line whenever
the corpus has 2+ distinct properties._domain_id values. Direct SQL
DISTINCT (not list_nodes sampling) so the count is accurate even
when nodes are stored in domain-contiguous order.
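A minimal sketch of how such an enumeration line could be built from a full-table query rather than a sample. The schema here is an assumption for illustration: a hypothetical `nodes(id, domain_id)` table in SQLite, whereas the real store keeps the value under `properties._domain_id` and its layout is not shown in this commit. The function name `domain_enumeration_line` is also hypothetical, and `GROUP BY` is used in place of a bare `DISTINCT` so the per-domain counts come back in the same query.

```python
import sqlite3
from typing import Optional


def domain_enumeration_line(conn: sqlite3.Connection, min_domains: int = 2) -> Optional[str]:
    """Return a 'Domains in this corpus: ...' line, or None below min_domains.

    Assumes a hypothetical nodes(id, domain_id) table; the real schema may differ.
    """
    # Aggregate over the whole table so the result does not depend on
    # storage order (a sampled page could contain a single domain only).
    rows = conn.execute(
        """
        SELECT domain_id, COUNT(*) AS n
        FROM nodes
        WHERE domain_id IS NOT NULL
        GROUP BY domain_id
        ORDER BY n DESC, domain_id ASC
        """
    ).fetchall()
    if len(rows) < min_domains:
        return None
    parts = ", ".join(f"{domain} ({count})" for domain, count in rows)
    return f"Domains in this corpus: {parts}"
```

Sampling the first page of a domain-contiguous table would likely see only one domain, which is the failure mode the full aggregate avoids.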
PHASE 2.2 (reverted strategy text, kept enumeration): tested a
3-step strategy prompt block ("identify domains → fan-out → verify").
Re-bench cross-domain shows:
v0.22 hits 3/12 (same as v0.21), but degraded partial coverage:
xd001: miss → hit (assort=6 unlocked) ✓
xd008: hit → miss (assort=10 → 0) ✗
xd010: 2/3 → 1/3 partial coverage ✗
xd012: krra=83 → found=0 catastrophic ✗
Net: zero change in hits, net negative on partial coverage. This is
the same brittle deterministic-prompt-shift dynamic seen in v0.20.
Strategy block reverted; only the factual enumeration line remains
(zero behavioural push: the agent can read it but isn't being steered).
Pattern (3 iterations now): adding text to AGENT_SYSTEM at temp=0/
seed=42 is a coin-flip per query. v0.19 +4, v0.20 -5 (reverted as
v0.20.1 +1), v0.22 +0. Conclusion: prompt-tuning won't move
cross-domain coverage. Need tool-level fan-out (Phase 2.3): modify
search/deep_search to split into per-domain sub-searches internally
so the agent makes one call and gets multi-domain results without
behaviour change.
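One way the Phase 2.3 idea could look, as a sketch only: the tool takes a single call, runs one sub-search per domain, and interleaves the results so no domain crowds out the others. Everything named here is hypothetical (`fan_out_search`, the `search_one(query, domain, k)` callback, the `per_domain_k` parameter, and the round-robin merge); the commit does not specify how search/deep_search will be modified.

```python
from typing import Callable, Dict, List, Optional

Hit = Dict[str, object]
# Hypothetical single-domain search: (query, domain or None, k) -> ranked hits.
SearchFn = Callable[[str, Optional[str], int], List[Hit]]


def fan_out_search(
    query: str,
    domains: List[str],
    search_one: SearchFn,
    per_domain_k: int = 5,
) -> List[Hit]:
    """Run one sub-search per domain and interleave the results round-robin."""
    # With zero or one domain there is nothing to fan out over.
    if len(domains) < 2:
        only = domains[0] if domains else None
        return search_one(query, only, per_domain_k)

    per_domain = [search_one(query, d, per_domain_k) for d in domains]

    # Round-robin merge: take each domain's rank-0 hit, then rank-1, and so on,
    # so every domain with results appears near the top of the merged list.
    merged: List[Hit] = []
    for rank in range(per_domain_k):
        for hits in per_domain:
            if rank < len(hits):
                merged.append(hits[rank])
    return merged
```

Because the fan-out happens inside the tool, the agent's prompt and call pattern stay unchanged, which is the property the commit message is after.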
Tests: 983 pass.
Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
1 parent b2f94dc commit f162a7a
2 files changed
Lines changed: 76 additions & 0 deletions
File 1: +38 lines (new lines 9-46), inserted after original line 8.
File 2: +28 lines (new lines 239-266), inserted after original line 238; +10 lines (new lines 287-296), inserted after original line 258.