spring-agent-flow

Build stateful multi-agent workflows in Java — with graphs, retries, and persistence.

No orchestration code. No glue logic. Just define your agents and run.

🚀 Live demo of a real-world use case built with Spring Agent Flow

🔴 Explore a real-world B2B USE CASE built with Spring Agent Flow:

👉 https://huggingface.co/spaces/datallmhub/multi-agent-customer-ops

What it shows

Multi-agent orchestration (Triage → Lookup → Policy → Writer)
Hybrid AI + deterministic business logic
Typed shared state across agents
End-to-end decision traceability

Run locally

You can run the demo locally using the full source code:

👉 https://github.com/datallmhub/multi-agent-customer-ops

⚡ In 60 seconds

ExecutorAgent researcher = ExecutorAgent.builder()
        .chatClient(chatClient)
        .systemPrompt("Find key facts.")
        .build();

ExecutorAgent writer = ExecutorAgent.builder()
        .chatClient(chatClient)
        .systemPrompt("Write a clear report.")
        .build();

CoordinatorAgent coordinator = CoordinatorAgent.builder()
        .executors(Map.of("research", researcher, "writing", writer))
        .routingStrategy(RoutingStrategy.llmDriven(chatClient))
        .build();

AgentResult result = coordinator.execute(
        AgentContext.of("Compare Claude 4 and GPT-5"));

System.out.println(result.text());

Output:

=== Multi-Agent Coordination ===

Request: Compare Claude 4 and GPT-5

[router]   Routing to: research
[research] Gathering facts...

[router]   Routing to: writing
[writing]  Generating report...

Result:
Claude 4 excels in reasoning and long-context tasks.
GPT-5 shows stronger tool integration and instruction following.

This is a multi-step, stateful workflow with routing, coordination, and resilience — without writing orchestration code.

⭐ If this saves you time, consider starring the repo.

🧠 Why this exists

Real-world AI systems are not one LLM call.

They are:

multi-step
stateful
failure-prone
long-running

Spring AI gives you primitives. spring-agent-flow gives you a runtime.

📊 How it works

A coordinator routes tasks across agents, executing a graph with shared state, retries, and checkpoints.

🧠 Two levels of control

Level 1 — Squad API (recommended)

Dynamic routing, minimal setup. A CoordinatorAgent routes to ExecutorAgents — you focus on the agents, not the plumbing.

CoordinatorAgent coordinator = CoordinatorAgent.builder()
        .executors(Map.of(
            "research", researchExecutor,
            "analysis", analysisExecutor,
            "writing",  writingExecutor
        ))
        .routingStrategy(RoutingStrategy.llmDriven(chatClient))
        .build();

AgentResult result = coordinator.execute(AgentContext.of("..."));

Level 2 — Graph API

Explicit flows, loops, conditions, full control.

AgentGraph graph = AgentGraph.builder()
        .addNode("research", researcher)
        .addNode("analyze",  analyzer)
        .addNode("write",    writer)
        .addEdge("research", "analyze")
        .addEdge(Edge.conditional("analyze",
                ctx -> ctx.get(CONFIDENCE).doubleValue() < 0.7,
                "research"))                               // loop back
        .addEdge("analyze", "write")                       // fallback: forward
        .errorPolicy(ErrorPolicy.RETRY_ONCE)
        .build();

AgentResult result = graph.invoke(AgentContext.of("..."));

🧭 When should I use this?

Use it if:

your agent needs multiple LLM calls
your workflow has branches or loops
failures (retry, resume, rate limits) matter
multiple agents must coordinate

Avoid it if:

you just call ChatClient once

⚔️ Why not just Spring AI or simple loops?

Approach	Limitation
Spring AI alone	Low-level primitives only — you write the orchestration
Manual `while` loops	Don't scale, retries are hard, state becomes fragile
LangChain-style flows	Limited execution control, Python-first

spring-agent-flow provides:

explicit execution graphs
built-in resilience (retry + circuit breaker)
durable, typed state

Spring AI	spring-agent-flow
Primitives (`ChatClient`, tools)	Structured runtime (`AgentGraph`, `CoordinatorAgent`)
Manual orchestration	Graph-based execution
No durable state	Typed shared state + checkpoints
Retry logic in user code	Built-in retry + circuit breaker
No resume	Interrupt + resume support

🚀 Try it in 30 seconds (no API key required)

git clone https://github.com/datallmhub/spring-agent-flow.git
cd spring-agent-flow
mvn install -DskipTests -q
mvn -pl spring-agent-flow-samples exec:java

👉 Runs a real multi-agent workflow with routing, coordination, and state — fully simulated.

📦 Samples included

The project ships with ready-to-run examples — no LLM required.

Example	What it shows	Run
`MultiAgentCoordination`	Multi-agent routing with CoordinatorAgent	default
`MinimalPipeline`	Simple 2-step workflow using AgentGraph	`-Dexec.mainClass="...MinimalPipeline"`
`AdvancedGraphDemo`	Loops, conditions, state, listeners	`-Dexec.mainClass="...AdvancedGraphDemo"`

👉 Start with MultiAgentCoordination — it demonstrates the full power of the framework.

🧩 What you get

⚡ No orchestration code required
🧠 Stateful agent workflows
🔁 Built-in retries & circuit breakers
📊 Graph-based execution
💾 Durable checkpoints (JDBC / Redis)
🔌 Native Spring AI integration
📡 Streaming support
📈 Micrometer metrics

🧱 Architecture

Layered architecture showing coordination, execution, resilience, and persistence on top of Spring AI.

🛠 Installation

Requirements: Java 17+, Spring Boot 3.x, Spring AI 1.0+

Distributed via JitPack.

Maven

<repositories>
    <repository>
        <id>jitpack.io</id>
        <url>https://jitpack.io</url>
    </repository>
</repositories>

<dependency>
    <groupId>com.github.datallmhub.spring-agent-flow</groupId>
    <artifactId>spring-agent-flow-starter</artifactId>
    <version>v0.5.0</version>
</dependency>

Gradle

repositories {
    maven { url 'https://jitpack.io' }
}

dependencies {
    implementation 'com.github.datallmhub.spring-agent-flow:spring-agent-flow-starter:v0.5.0'
}

Modules

Module	Use case
`spring-agent-flow-starter`	Spring Boot auto-config, properties, Micrometer listener
`spring-agent-flow-core`	Minimal API (`Agent`, `AgentContext`, `StateKey`, `AgentResult`)
`spring-agent-flow-graph`	`AgentGraph`, `RetryPolicy`, `CircuitBreakerPolicy` SPI, checkpoint contract
`spring-agent-flow-squad`	`CoordinatorAgent`, `ExecutorAgent`, `ReActAgent`, `ParallelAgent`, `RoutingStrategy`
`spring-agent-flow-checkpoint`	`JdbcCheckpointStore`, `RedisCheckpointStore`, Jackson codec
`spring-agent-flow-resilience4j`	`CircuitBreakerPolicy` adapter backed by Resilience4j
`spring-agent-flow-cli-agents`	`CliAgentNode` — runs Claude Code / Codex / Gemini CLI agents as graph nodes
`spring-agent-flow-test`	`MockAgent`, `TestGraph` for unit-testing graphs

Minimal application.yml:

spring:
  ai:
    agents:
      enabled: true
      default-error-policy: RETRY_ONCE
      observability:
        metrics: true

📡 Streaming

graph.invokeStream(AgentContext.of("hello"))
    .subscribe(event -> {
        switch (event) {
            case AgentEvent.Token t         -> System.out.print(t.chunk());
            case AgentEvent.NodeTransition x -> System.out.println("\n--> " + x.to());
            case AgentEvent.Completed c     -> System.out.println("\n[done]");
            default -> {}
        }
    });

💾 Typed state — no `Map<String, Object>`

// Declare keys with types — compile-time safety
StateKey<Double> CONFIDENCE = StateKey.of("confidence", Double.class);
StateKey<String> SUMMARY    = StateKey.of("summary",    String.class);

// Use them anywhere
AgentContext ctx = context.with(CONFIDENCE, 0.85);
double score = ctx.get(CONFIDENCE);  // no cast needed

🔁 Resilience

AgentGraph.builder()
    .errorPolicy(ErrorPolicy.FAIL_FAST)          // or RETRY_ONCE / SKIP_NODE
    .retryPolicy(RetryPolicy.exponential(3, Duration.ofMillis(200)))
    .addNode("llm", flakyAgent,
             RetryPolicy.exponential(5, Duration.ofMillis(500)),   // per-node override
             new Resilience4jCircuitBreakerPolicy(registry))        // per-node breaker
    .build();

See resilient-typed-executor.md and circuit-breaker.md.

📈 Observability (Micrometer)

Metric	Tags	Description
`agents.execution.count`	`agent`, `graph`, `status`	Per-node execution count
`agents.execution.duration`	`agent`, `graph`	Per-node execution time
`agents.graph.transitions`	`graph`, `from`, `to`	Node-to-node transitions
`agents.execution.errors`	`agent`, `graph`, `cause`	Error count by type

🧪 Testing without an LLM

MockAgent mock = MockAgent.builder()
        .thenReturn("First response")
        .thenReturn("Second response")
        .build();

TestGraph.Trace trace = TestGraph.trace(
        AgentGraph.builder()
            .addNode("a", mock)
            .addNode("b", MockAgent.returning("done"))
            .addEdge("a", "b"));

AgentResult result = trace.invoke(AgentContext.of("test"));

assertThat(trace.visitedInOrder("a", "b")).isTrue();
assertThat(result.text()).isEqualTo("done");

📚 Recipes

ReAct loop — self-correcting agent with observation/action cycles
Supervisor pattern — coordinator re-routes until done
Parallel executors — fan-out/fan-in
Subgraphs — plug a graph in as a node
Human-in-the-loop — interrupt, wait for human input, resume
Durable runs — JDBC or Redis checkpoint store, resume after crash
Resilient typed executor — tool audit + typed output + retry
Circuit breaker — trip upstream calls with Resilience4j

📈 Roadmap

Version	Focus
0.5 (current)	Subgraphs, parallel fan-out, cancellation, typed output, `RetryPolicy`, `CircuitBreakerPolicy`, JDBC/Redis checkpoint store
1.0	API stabilization, documentation, community feedback
1.1	Crew roles (CrewAI-inspired), auto-config for checkpoint backends
2.0	OpenTelemetry tracing, MCP integration, Agent-as-Tool

📝 Note on scope

This project is independent and not affiliated with spring-ai-community/agent-client.

That project focuses on CLI agent integrations (Claude Code, Codex, Gemini).

spring-agent-flow focuses on something different: a graph-based runtime for stateful, multi-step agent workflows on top of Spring AI.

🤝 Contributing

Contributions welcome! Please see CONTRIBUTING.md for guidelines.

This project follows the Apache 2.0 License.

Inspiration

LangGraph — graph-based orchestration
CrewAI — role-based agent teams
AWS Strands — agent patterns for Java
Spring AI — the foundation we build on

Name		Name	Last commit message	Last commit date
Latest commit History 48 Commits
.github		.github
docs		docs
spring-agent-flow-checkpoint		spring-agent-flow-checkpoint
spring-agent-flow-cli-agents		spring-agent-flow-cli-agents
spring-agent-flow-core		spring-agent-flow-core
spring-agent-flow-graph		spring-agent-flow-graph
spring-agent-flow-resilience4j		spring-agent-flow-resilience4j
spring-agent-flow-samples		spring-agent-flow-samples
spring-agent-flow-squad		spring-agent-flow-squad
spring-agent-flow-starter		spring-agent-flow-starter
spring-agent-flow-test		spring-agent-flow-test
.gitignore		.gitignore
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
pom.xml		pom.xml

Folders and files

Latest commit

History

Repository files navigation

spring-agent-flow

🚀 Live demo of a real-world use case built with Spring Agent Flow

What it shows

Run locally

⚡ In 60 seconds

🧠 Why this exists

📊 How it works

🧠 Two levels of control

Level 1 — Squad API (recommended)

Level 2 — Graph API

🧭 When should I use this?

⚔️ Why not just Spring AI or simple loops?

🚀 Try it in 30 seconds (no API key required)

📦 Samples included

🧩 What you get

🧱 Architecture

🛠 Installation

Maven

Gradle

Modules

📡 Streaming

💾 Typed state — no Map<String, Object>

🔁 Resilience

📈 Observability (Micrometer)

🧪 Testing without an LLM

📚 Recipes

📈 Roadmap

📝 Note on scope

🤝 Contributing

Inspiration

About

Topics

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

💾 Typed state — no `Map<String, Object>`

Packages