SemOPT uses Monte Carlo Tree Search (MCTS) to guide multi-stage LLM generation, converting a natural-language optimization problem into executable Gurobi Python code, and using execution + audit-based scoring to drive the search.
The current implementation follows this state chain:
- Problem (natural language) → SolutionStrategy (JSON, 4 fields)
- SolutionStrategy → MathModel (JSON → formatted text)
  - The LLM returns a JSON object with sets/parameters/variables/objective/constraints, which the code formats into a readable “math model” text block.
- MathModel → Gurobi Code (Python)
Note: the tree logger prints Depth 1 as “Structured Text”, but the actual state type is SolutionStrategyState.
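The JSON-to-text step in the middle of the chain can be sketched as follows. This is an illustrative sketch, not SemOPT's actual formatter (which lives under `semopt/states/`); the five section keys mirror the README, but the function name and layout are assumptions.

```python
import json

# Hypothetical sketch: flatten a Layer-2 JSON model into a readable
# "math model" text block. The real formatter in semopt/states/ may differ.
def format_math_model(model_json: str) -> str:
    model = json.loads(model_json)
    sections = ["sets", "parameters", "variables", "objective", "constraints"]
    lines = []
    for key in sections:
        if key in model:
            lines.append(f"{key.upper()}:")
            value = model[key]
            items = value if isinstance(value, list) else [value]
            for item in items:
                lines.append(f"  {item}")
    return "\n".join(lines)

example = '{"variables": ["x >= 0", "y >= 0"], "objective": "max 3x + 2y"}'
print(format_math_model(example))
```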
Install dependencies:

```bash
pip install -r requirements.txt
```

This project depends on gurobipy. Make sure Gurobi is installed and properly licensed; otherwise, even correct generated code may fail to execute.
The CLI entry point is semopt/main.py. From the repo root, run it as a module:
```bash
export OPENAI_API_KEY="your-api-key-here"
python -m semopt.main --file benchmark/EasyLP/problem_001/desc.txt -n 8
```

Benchmark problems in this repo use `desc.txt` as the problem file name.
```bash
export OPENAI_API_KEY="your-api-key-here"
python -m semopt.main "Your optimization problem description here" -n 8
```

CLI arguments:

- `problem`: positional problem description (optional; you can use `--file` instead)
- `--file`, `-f`: read the problem from a file
- `--iterations`, `-n`: number of MCTS iterations (overrides `config.num_iterations`)
- `--log-file`, `-l`: path to the log file
- `--no-console`: disable console output (the log file still works)
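The flag set described above can be sketched with `argparse`. This is a hypothetical reconstruction of the CLI surface; the real parser lives in `semopt/main.py` and may differ in details such as help text and validation.

```python
import argparse

# Hypothetical sketch of the CLI described in the README; the actual
# parser in semopt/main.py may differ.
def build_parser() -> argparse.ArgumentParser:
    parser = argparse.ArgumentParser(prog="semopt")
    parser.add_argument("problem", nargs="?",
                        help="problem description (optional if --file is given)")
    parser.add_argument("--file", "-f", help="read the problem from a file")
    parser.add_argument("--iterations", "-n", type=int,
                        help="number of MCTS iterations")
    parser.add_argument("--log-file", "-l", help="path to the log file")
    parser.add_argument("--no-console", action="store_true",
                        help="disable console output")
    return parser

args = build_parser().parse_args(
    ["--file", "benchmark/EasyLP/problem_001/desc.txt", "-n", "8"])
print(args.file, args.iterations, args.no_console)
```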
Configuration is loaded by `Config.from_env()` in `semopt/config.py`:
- `OPENAI_API_KEY`: required if either generation or evaluation uses the API (by default, evaluation uses the API); optional only when `USE_LOCAL_MODEL=true` and `USE_LOCAL_FOR_EVALUATION=true`
- `OPENAI_MODEL`: model name for generation (default: `gpt-4.1-nano`)
- `OPENAI_BASE_URL`: OpenAI-compatible base URL
- `SEED`: random seed (default: `42`)
- `NUM_ITERATIONS`: default number of iterations (default: `8`; can be overridden by `-n`/`--iterations`)
- `LOG_FILE`: log file path (default: empty; if not provided, it falls back to `logs/mcts_run.log`)
- `LAMBDA_VAL`: mixing weight for prior vs. backpropagated value in UCT (default: `0.5`)
- `OMEGA`: exploration constant used in UCT (default: `1.414`)
- `GENERATION_MAX_PARALLELISM`: max parallelism for Layer 1/2 generation (default: `4`)
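A minimal sketch of env-based loading in the spirit of `Config.from_env()` is shown below. The field names and defaults mirror the variables listed above, but the class layout is an assumption; the real implementation in `semopt/config.py` may differ.

```python
import os
from dataclasses import dataclass

# Hypothetical sketch of env-based config loading; field names mirror the
# environment variables above, but the real Config class may differ.
@dataclass
class Config:
    openai_model: str = "gpt-4.1-nano"
    seed: int = 42
    num_iterations: int = 8
    lambda_val: float = 0.5
    omega: float = 1.414

    @classmethod
    def from_env(cls) -> "Config":
        return cls(
            openai_model=os.getenv("OPENAI_MODEL", "gpt-4.1-nano"),
            seed=int(os.getenv("SEED", "42")),
            num_iterations=int(os.getenv("NUM_ITERATIONS", "8")),
            lambda_val=float(os.getenv("LAMBDA_VAL", "0.5")),
            omega=float(os.getenv("OMEGA", "1.414")),
        )

cfg = Config.from_env()
print(cfg.num_iterations)  # 8 unless NUM_ITERATIONS is set in the environment
```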
Local model (vLLM) related:
- `USE_LOCAL_MODEL`: use a local model for generation (default: `false`)
- `USE_LOCAL_FOR_EVALUATION`: use a local model for evaluation/audits (default: `false`)
- `LOCAL_MODEL_PATH`: local model path; required when `USE_LOCAL_MODEL=true` or `USE_LOCAL_FOR_EVALUATION=true` (validated in code)
- `EVALUATION_MODEL`: evaluation model name (default: `gemini-2.5-flash-lite`; used in API mode)
- `EVALUATION_MODEL_PATH`: local evaluation model path
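The validation rule described above (a local path is required when local mode is enabled) can be sketched as a simple dispatch. This is a hypothetical illustration of the flags' interaction, not SemOPT's actual wiring; the function name is invented.

```python
import os

# Hypothetical sketch of how the flags above could select a backend;
# SemOPT's actual model wiring may differ.
def pick_generation_backend():
    if os.getenv("USE_LOCAL_MODEL", "false").lower() == "true":
        path = os.getenv("LOCAL_MODEL_PATH")
        if not path:
            # Mirrors the "validated in code" requirement described above.
            raise ValueError("LOCAL_MODEL_PATH is required when USE_LOCAL_MODEL=true")
        return ("local", path)
    return ("api", os.getenv("OPENAI_MODEL", "gpt-4.1-nano"))

print(pick_generation_backend())
```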
A typical run produces:
- Tree-structured logs: console and/or log file (`--log-file`/`LOG_FILE`; the default fallback is `logs/mcts_run.log`)
- Best Path: the selected Depth 0 → 3 path with scores
- Final code: printed in the logs as “FINAL GENERATED CODE”
- debug_responses/: saved Layer 1/2 prompts and raw responses (useful for debugging JSON repair/parsing)
The executor parses the objective value from STDOUT (via a regex), so the Layer 3 prompt enforces printing:

```
Optimal objective value: {model.objVal}
```
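Such STDOUT parsing could look like the sketch below. The exact regex used by SemOPT's `GurobiExecutor` is not shown in this README, so the pattern here is an assumption matching the enforced print format above.

```python
import re

# Hypothetical sketch of parsing the objective value from stdout; the
# actual regex in SemOPT's executor may differ.
PATTERN = re.compile(
    r"Optimal objective value:\s*([-+]?\d*\.?\d+(?:[eE][-+]?\d+)?)")

def parse_objective(stdout: str):
    m = PATTERN.search(stdout)
    return float(m.group(1)) if m else None

print(parse_objective("Optimal objective value: 123.5"))  # 123.5
print(parse_objective("Model is infeasible"))             # None
```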
```
semopt/
├── states/       # States (Problem, SolutionStrategy, MathModel, Code)
├── mcts/         # MCTS (Node/Tree/RolloutController)
├── generation/   # LLM interface and Layer 1/2/3 generators
├── processing/   # Screener / SemanticMerger / SMTMerger / CandidateScorer
├── evaluation/   # GurobiExecutor / forward audit / backward recon / scorer
├── utils/        # logger, prompt templates, JSON repair, etc.
├── config.py     # env-based configuration
└── main.py       # CLI entry point and run_semopt
```