Agent Comparisons

Benchmark harness for evaluating how reliably different agent architectures follow a required sequential execution order as complexity (number of steps N) increases.

Current focus:

Agent with tools: single orchestrator agent delegating to N sub-agent tools
Agent graph: N-node pydantic-graph where an LLM routes between nodes
Deep agents: planned

Prerequisites

Python 3.12+
An OpenAI API key

Local Setup

From the repo root:

python -m venv .venv
pip install -e .

Then activate the virtual environment in your shell.

Create .env from .env.example and set environment variables:

OPENAI_API_KEY

Optional (for Logfire telemetry):

LOGFIRE_KEY
LOGFIRE_ENVIRONMENT

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
agents		agents
.env.example		.env.example
.gitignore		.gitignore
.python-version		.python-version
README.md		README.md
main.py		main.py
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Agent Comparisons

Prerequisites

Local Setup

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Agent Comparisons

Prerequisites

Local Setup

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages