OpenOutcry

The point-in-time environment for trading agents, and the contract they speak

Whoever defines the agent interface owns the ecosystem. OpenOutcry is the leak-free trading floor every agent runs on, and the language-agnostic contract that makes any agent scorable.

Why · Quickstart · Train an agent · Surfaces · The contract · Architecture · Tech stack

Why

An eval is useless without an environment. A benchmark scores trajectories; something has to produce them. OpenOutcry is that producer: a leak-free, point-in-time market environment wrapped in a dead-simple, language-agnostic agent contract. The harness sends an Observation, the agent returns a Decision, repeat.

Two properties make it trustworthy rather than a toy:

Look-ahead is structurally impossible. The environment owns the time cursor and the data layer has no API to read a future bar, so an agent cannot peek by construction, not by policing.
Trajectories are recompute-from-raw-decisions. A run records only the agent's decisions; a separate verifier replays them against the frozen data to recompute a byte-identical result. A tampered trajectory recomputes differently, so an agent cannot lie about its returns.

The strategic bet is interface ownership: if every trading agent in the open ecosystem conforms to OpenOutcry's Observation/Decision contract, then SharpeBench is the natural scorer and the whole funnel (env, trajectory, score, leaderboard) runs on one standard. This is the OpenAI-Gym moment for trading agents. The interface is the product; the simulator is the credibility behind it.

An agent is just a program that reads an observation and writes a decision, in any language. Conform to the contract, and you are scorable everywhere.

Status: published, active (pre-1.0)

Published to crates.io, npm, and PyPI at v0.6.0, depending on the published sharpebench-sim 0.0.8 engine (not a vendored copy). CI is green across four surfaces: Rust (fmt, clippy -D warnings, tests, a WASM target build), cargo-deny, the npm package, and the Python wheel (maturin + pytest).

Beyond the core reset/step lifecycle, the environment now ships a full reinforcement-learning training surface:

Capability	What it is
Procedural scenarios	A seeded generator (`ScenarioSpec` / `generate_scenario`) using Procgen's integer-seed-interval model, with `Calm` / `Hard` / `Extreme` volatility-and-jump tiers and provably disjoint `train_test_split`, cross-runtime golden-hashed for byte-identical generation.
Generalization gap	`generalization_gap` measures train-vs-held-out deflated Sharpe over disjoint seed bands, turning "did it overfit" into one number scored by the SharpeBench kernel.
`verifiers` training env	A PrimeIntellect `verifiers` `MultiTurnEnv` that steps the market bar-by-bar, over a multi-row scenario `Dataset`, with an `XMLParser` decision protocol and a GRPO-safe bounded reward scored by the real SharpeBench `score_run` (deflated Sharpe, pass^k, process checks).
Pluggable reward schemes	`load_environment(reward_scheme=...)` selects from a registry: an online differential Sharpe (Moody-Saffell, aligns the training signal with the deflated-Sharpe score), Sortino, drawdown-penalized, turnover-penalized, loss-averse. Schemes shape training only; the rank key stays the SharpeBench kernel.
More env tasks	Beyond the position env: a `PortfolioEnv` (simplex allocation, log-return), an `ExecutionEnv` (VWAP/TWAP implementation-shortfall MDP), and a `MarketMakingEnv` (Avellaneda-Stoikov) that ships its closed-form analytical-optimal policy as a baseline, so agents are scored on regret-vs-provably-optimal.
Scenario families	Calm/Hard/Extreme vol-jump tiers plus `CointegratedPairs` (genuine mean-reverting spread), `RegimeShift` (trend-to-whipsaw), curriculum chaining (`CurriculumEnv`/`regime_curriculum`), and a frozen named held-out eval-seed regression set.
Rich observations	Opt-in, causal, leak-free obs augmentations computed from the point-in-time history: technical indicators (RSI/MACD/Bollinger/...), multi-timescale momentum, rolling covariance, a spread z-score, a recursive Kalman hedge-ratio spread (a `KalmanSpreadObservation` for the cointegrated-pairs scenario emitting the innovation-normalized z, leak-free by construction, superseding the rolling-OLS approximation), a Kalman constant-velocity trend obs (filtered velocity + sign), a synthetic seed-derived news/sentiment channel, and time-to-horizon. Declarative via `PreprocessingConfig`.
Risk + eval axes	Drawdown stop-out, turbulence-halt, liquidation-cascade, and a cross-sectional deleveraging circuit-breaker (flattens the oversold subset when universe-wide breadth flips oversold, distinct from the single-asset breakers) wrappers; a forecast-quality calibrated eval axis (FinPILOT), per-regime breakdown, efficient-frontier/Kelly baselines, a deterministic episode-failure taxonomy + suite rollup (clean / bankrupt / stopped-out / cascade-wiped / mandate-breach), and a full risk/profit metrics panel (Calmar/Sortino/VaR/CVaR/tail/turnover).
Vectorized rollouts	`VecTradingEnv` runs B scenario lanes in lockstep (rayon, structure-of-arrays JSON, current-Gymnasium `AutoresetMode`, async `send`/`recv`), exposed as a `gymnasium.vector` env.
Point-in-time-safe wrappers	Causal normalize (no future-bar leak), `TimeLimit`, `FrameStack`, `RecordEpisodeStatistics`, vector-env variants, and `flatten`/`unflatten` Dict-obs helpers, plus a `check_env` conformance harness that proves seed-determinism (and adopts Gymnasium's own `check_env`).
Gymnasium registration	Versioned, namespaced IDs: `gymnasium.make("OpenOutcry/Hard-v1")` and `make_vec(...)` route to the scalar and vector envs, with `-Eval-v1` variants on a disjoint held-out seed band.
Multi-agent markets	A PettingZoo `MultiAgentOpenOutcryEnv` (batched competition: N agents on one frozen scenario, SharpeBench-ranked), an `EndogenousMarketEnv` (a shared-book market where aggregate flow moves the cleared price via Kyle + Almgren-Chriss impact), and an `LOBMarketEnv` over a real deterministic limit-order-book matching engine (integer ticks, price-time priority, market/limit/cancel/modify, depth-ladder + microprice + queue-imbalance obs), now with a single-price call-auction uncross (`uncross`, the max-matched-volume open/close equilibrium the CDA book lacked) and a read-only walk-the-book `sweep_cost` query (average fill price + slippage without mutating the book or the tape).
Market realism gate	A `stylized_facts` / `certify_realism` diagnostic (Python, off the byte-identical hot path) that certifies a generated panel exhibits the Cont stylized facts, fat tails, volatility clustering, gain/loss skew, aggregational Gaussianity, Zumbach timescale asymmetry, Fano intermittency, so a drifted generator that emits near-Gaussian markets fails the realism proof instead of silently letting agents win on unrealistic tape.
Per-scenario mandates	Each scenario samples a trading mandate (long-only, market-neutral, drawdown-capped, beat-a-benchmark); the `verifiers` rubric is mandate-conditioned, so wrong-objective behavior is penalized, not just unrewarded.
Offline-RL + checkpointing	`to_minari` exports rollouts as a Farama Minari dataset (leak-safe, `recover_environment`-ready); `CheckpointableEnv` clones/restores/branches market state for tree search (O(1) native engine snapshot or replay-from-decisions); `OpenOutcryFuncEnv` is a stateless `gymnasium.functional.FuncEnv` view.
Benchmark protocol	A committed `EVALUATION.md`: the canonical eval contract, the disjoint train/held-out split, and a baseline leaderboard (deflated Sharpe to beat is 0.0; pass^k degrades Calm to Hard to Extreme).
Harness integration	An MCP server (`reset` / `step` / `spec` tools) so any MCP agent harness drives an episode with zero glue, a `LookaheadGuard` that refuses agent operations reading future data, versioned JSONL rollout traces that re-score offline through the SharpeBench kernel, and a cost-adjusted `RunMetrics` block for leaderboard ranking.

The determinism-critical core (the engine, scoring, scenario generation, mandates, the market-clearing model, the execution-noise integrity knob) lives in Rust so a published number is byte-identical across every surface; the per-ecosystem adapters (gymnasium, PettingZoo, Minari, verifiers, MCP) are thin and live in the language each ecosystem speaks.

Not yet shipped: the PrimeIntellect Environments-Hub listing and the Gordon conforming-agent adapter.

Quickstart

cargo add openoutcry        # the Rust crate (re-exports the engine + the wire contract)

use openoutcry::{TradingEnv, Dataset, CostModel, Window, BuyAndHold, Agent};

let data = Dataset::synthetic(4, 120, 1);
let mut env = TradingEnv::new(data, Window { start: 20, end: 120 }, CostModel::default(), 7);
let mut agent = BuyAndHold;
let mut obs = env.reset();
loop {
    let decision = agent.decide(&obs);
    let step = env.step(decision);     // -> { observation, reward, done, info }
    obs = step.observation;
    if step.done { break; }
}

Both stepping surfaces (the open-loop TradingEnv and the closed-loop run_backtest) call one shared per-step body, so a trajectory the env produces is byte-identical to the equivalent backtest, enforced by a test.

import gymnasium, openoutcry          # a first-class Gymnasium env
env = openoutcry.OpenOutcryEnv(n_symbols=4, n_days=120, seed=1)
obs, info = env.reset(seed=1)         # the seed selects the scenario
obs, reward, terminated, truncated, info = env.step(env.action_space.sample())

Train an agent

The Python package is a PrimeIntellect verifiers environment, so an agent trains over a distribution of leak-free scenarios scored by the SharpeBench kernel:

import openoutcry

# A multi-turn env that steps the market bar-by-bar over many seeded scenarios.
env = openoutcry.load_environment(n_windows=256, n_symbols=4, n_days=120)

# Measure overfitting directly: train vs a provably disjoint held-out seed band.
gap = openoutcry.generalization_gap(
    lambda seed: openoutcry.OpenOutcryEnv(n_symbols=4, n_days=120, seed=seed),
    n_train=64, n_test=64,
)
print(gap["gap_deflated_sharpe"])

Scale with vectorized rollouts. OpenOutcryVectorEnv steps B independent scenario lanes in lockstep (rayon under the hood), exposing the standard gymnasium.vector API:

import numpy as np, openoutcry

vec = openoutcry.OpenOutcryVectorEnv(num_envs=64, n_symbols=4, n_days=120)
obs, infos = vec.reset()
actions = np.full((vec.num_envs, len(vec.symbols)), 0.25, dtype=np.float32)
obs, rewards, terminated, truncated, infos = vec.step(actions)   # arrays of length 64

Compose point-in-time-safe wrappers. Standard gym wrappers, but the normalizers are causal, so no future bar ever leaks into the running statistics:

import openoutcry
from openoutcry import TimeLimit, CausalNormalizeObservation, RecordEpisodeStatistics

env = RecordEpisodeStatistics(            # info["episode"] carries Sharpe + max drawdown
    CausalNormalizeObservation(
        TimeLimit(openoutcry.OpenOutcryEnv(n_symbols=4, n_days=120), max_episode_steps=64)
    )
)

Register it like any Gymnasium env. Importing openoutcry registers versioned IDs, so the whole RL ecosystem reaches it through muscle memory:

import gymnasium, openoutcry
env = gymnasium.make("OpenOutcry/Hard-v1")            # difficulty + version pinned in the ID
vec = gymnasium.make_vec("OpenOutcry/Hard-v1", num_envs=8)

Trade a real multi-agent market. In EndogenousMarketEnv the agents' aggregate flow moves the price they all see (Kyle permanent + Almgren-Chriss temporary impact), not a frozen path:

from openoutcry import EndogenousMarketEnv     # a PettingZoo ParallelEnv

market = EndogenousMarketEnv(n_agents=3, n_symbols=4, n_days=120, seed=1)
obs, infos = market.reset(seed=1)
# every agent submits a target-weight order; the book clears once and the cleared
# price reflects everyone's flow, so one agent's size moves the others' fills.
obs, rewards, terminations, truncations, infos = market.step(
    {a: market.action_space(a).sample() for a in market.agents}
)

A one-command prime-rl GRPO training config lives in examples/prime-rl/; see docs/training.md for the full loop (install, vf-eval baseline, uv run rl).

Use it from anywhere

One Rust engine, scored identically across every surface, because they run the same code.

Surface	Get it	What it is
Rust crate	`cargo add openoutcry`	The env, the procedural scenario generator, the batched `VecTradingEnv`, the mandate / execution-noise / market-clearing cores, and the governed wire contract, re-exporting the leak-free engine.
Python	`pip install openoutcry`	Gymnasium (`Env` + `vector` + registered IDs), PettingZoo (competition + endogenous market), the `verifiers` training env, Minari export, checkpointing, `FuncEnv`, point-in-time-safe wrappers, traces, and an MCP server, over the pyo3 binding.
npm	`npm i @general-liquidity/openoutcry`	A typed JS/TS API over the engine compiled to WASM.
WASM	`openoutcry-wasm`	The wasm-bindgen bridge the npm package and Gordon (Bun) embed.

import { runBaseline } from "@general-liquidity/openoutcry";

// The identical Rust engine, in the browser or Bun: run a baseline over a seeded panel.
const run = runBaseline({
  agent: "momentum",
  dataset: { synthetic: { n_symbols: 4, n_days: 120, seed: 1 } },
  seed: 7,
});
console.log(run.returns.length, run.cost);   // per-period returns + realized execution cost

The agent itself can be written in any language: a conforming agent is a program that reads MarketObservation JSON (stdin or POST /decide) and writes Decision JSON. Reference agents in Rust, TypeScript, and Python double as the conformance smoke tests (crates/openoutcry/examples/).

The agent contract

The load-bearing standard. An Observation is point-in-time; a Decision is a set of target-weight orders:

// Observation (harness -> agent)
{ "date": "2025-01-02", "cash": 1.0,
  "symbols": [{ "symbol": "AAPL", "close_history": [187.2, 188.0, 190.4] }],
  "portfolio": [] }

// Decision (agent -> harness)
{ "orders": [{ "symbol": "AAPL", "action": "buy", "target_weight": 0.5 }] }

CONTRACT_VERSION tracks the wire shape and evolves additively only (new fields are optional with defaults), pinned by published JSON Schemas plus a conformance kit. The validate_decision_json boundary check rejects malformed decisions before they reach the engine. See crates/openoutcry/GOVERNANCE.md and crates/openoutcry/contract/.

Architecture

A Rust Cargo workspace, #![forbid(unsafe_code)], that depends on the published SharpeBench engine rather than vendoring it, so the env and the benchmark cannot drift.

sharpebench-sim (published 0.0.8) ... the leak-free point-in-time engine
        |
   crates/openoutcry ......... the env, scenario generator, batched VecTradingEnv,
        |                       mandate / exec-noise / market-clearing cores, the
        |                       Gym reset/step, and the governed wire contract
        |-- crates/openoutcry-wasm   the engine as WASM (-> the npm package)
        |-- crates/openoutcry-py     pyo3 + the ecosystem adapters (maturin)
        +-- npm/openoutcry           the typed TS wrapper over the wasm

Crate / package	Role
`openoutcry`	The Rust moat: `TradingEnv` (`reset`/`step`), `VecTradingEnv` (batched), the procedural scenario generator, the mandate and execution-noise cores, the `MarketClearing` impact engine, the `Scenario`/crisis-suite bundle, the re-exported wire contract plus scored `Run`, `CONTRACT_VERSION`, the conformance kit, and reference agents.
`openoutcry-wasm`	Pure JSON kernels (`run_baseline`, `replay_run`, `generate_scenario`, ...) plus wasm-bindgen exports, the identical engine for JS/TS.
`openoutcry-py`	A pyo3 extension over the Rust cores plus the thin ecosystem adapters: Gymnasium (`Env`/`vector`/registration), PettingZoo (competition + endogenous market), `verifiers`, Minari export, checkpointing, `FuncEnv`, wrappers, traces, and MCP (built by maturin). Optional extras: `verifiers`, `minari`, `pettingzoo`.
`@general-liquidity/openoutcry`	The typed npm wrapper over the WASM kernel.

Tech stack

Technology	Role
Rust	The engine and env, pure `f64`, deterministic, no `unsafe`, batched with rayon
WebAssembly	The engine for non-Rust hosts (`wasm-bindgen`)
TypeScript	The typed npm package
Python	The pyo3 binding and Gymnasium adapter (built by maturin)
Gymnasium	The RL env standard the Python adapter conforms to (`reset`/`step`/spaces/`vector`/registration)
PettingZoo	The multi-agent API the competition and endogenous-market envs conform to (passes `parallel_api_test`)
Minari	The offline-RL dataset standard `to_minari` exports trajectories into
Prime Intellect `verifiers`	The RLVR `Environment`/`Rubric` standard the training env conforms to
serde	Deterministic JSON for the wire contract (`float_roundtrip` for byte-exact replay)
GitHub Actions	CI: fmt, clippy, tests, wasm, cargo-deny, npm, maturin

Governance

The contract is governed in the open: additive-only evolution, a published deprecation window, and a conformance badge (see GOVERNANCE.md). Hosted by General Liquidity to start; the credibility is the leak-free-by-construction substrate plus recompute-to-verify trajectories, not trust in the host. Gordon (GL's agent) conforms to the contract like any other entrant.

License

Dual-licensed under either MIT or Apache-2.0, at your option.

_{The trading floor every agent runs on.}

Name		Name	Last commit message	Last commit date
Latest commit History 86 Commits
.github/workflows		.github/workflows
crates		crates
docs		docs
examples/prime-rl		examples/prime-rl
npm/openoutcry		npm/openoutcry
scripts		scripts
.gitignore		.gitignore
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
EVALUATION.md		EVALUATION.md
LICENSE-APACHE		LICENSE-APACHE
LICENSE-MIT		LICENSE-MIT
README.md		README.md
RELEASING.md		RELEASING.md
deny.toml		deny.toml
release.toml		release.toml
rust-toolchain.toml		rust-toolchain.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

OpenOutcry

The point-in-time environment for trading agents, and the contract they speak

Why

Status: published, active (pre-1.0)

Quickstart

Train an agent

Use it from anywhere

The agent contract

Architecture

Tech stack

Governance

License

About

Licenses found

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

OpenOutcry

The point-in-time environment for trading agents, and the contract they speak

Why

Status: published, active (pre-1.0)

Quickstart

Train an agent

Use it from anywhere

The agent contract

Architecture

Tech stack

Governance

License

About

Resources

License

Licenses found

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages