Applied autoresearch to model routing policy — ZeroAPI eval loop #508
dorukardahan
started this conversation in
Show and tell
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
I took the measure→experiment→promote cycle from autoresearch and applied it to AI model routing policy instead of model training.
The problem: I pay for 5 AI subscriptions (OpenAI, Kimi, GLM, MiniMax, Qwen — $109/mo total). Each has different strengths. Manually picking the right model per task is guesswork.
What ZeroAPI does: An OpenClaw gateway plugin that classifies tasks via keyword matching (<1ms, no LLM call) and routes to the benchmark leader for that category. 155 models scored across 15 benchmarks from Artificial Analysis.
The autoresearch connection: All routing constants (category keywords, risk levels, vision detection, TTFT thresholds, fallback ordering) live in a JSON config. Every routing decision is logged. A built-in eval script (
scripts/eval.ts) analyzes the logs and reports:The loop: run eval → change one config constant → restart → wait for traffic → re-eval → keep what improves routing, revert what doesnt.
Same pattern as autoresearch but manual instead of automated — routing policy needs human judgment, not overnight hill-climbing.
Repo: https://github.com/dorukardahan/ZeroAPI
Open source, MIT. Works with any OpenClaw setup.
Beta Was this translation helpful? Give feedback.
All reactions