Synthesizing Post-Training Data for LLMs through Multi-Agent Simulation
MATRIX-Tuned-Model outperforms other models, including Llama-3-8B-Instruct, while using significantly less training data.
To do: 🔥 Code release
10/18/2024: We released the preprint paper on arXiv.
Our MATRIX simulator generates realistic and diverse scenarios with 1,000 real-world-grounded agents and structured communication (agent grouping, with inter- and intra-group communication).
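For intuition, here is a minimal, self-contained sketch of agent-grouping-based structured communication; all names (`Agent`, `simulate_round`, the moderator convention) are illustrative placeholders, not the actual MATRIX API.

```python
# A minimal sketch of agent-grouping-based structured communication.
# All names here are illustrative, not the MATRIX API.
import random
from dataclasses import dataclass, field

@dataclass
class Agent:
    agent_id: int
    profile: str                       # real-world-grounded persona text
    inbox: list = field(default_factory=list)

def simulate_round(agents, num_groups=10):
    """One round: group agents, exchange messages within each group,
    then let one moderator per group talk across groups."""
    random.shuffle(agents)
    size = max(1, len(agents) // num_groups)
    groups = [agents[i:i + size] for i in range(0, len(agents), size)]

    # Intra-group communication: every agent broadcasts to its own group.
    for group in groups:
        for sender in group:
            message = f"[agent {sender.agent_id}] acting as: {sender.profile}"
            for receiver in group:
                if receiver is not sender:
                    receiver.inbox.append(message)

    # Inter-group communication: the first agent of each group acts as a
    # moderator and forwards a summary to the other moderators.
    moderators = [group[0] for group in groups]
    for sender in moderators:
        summary = f"[moderator {sender.agent_id}] digest of {len(sender.inbox)} messages"
        for receiver in moderators:
            if receiver is not sender:
                receiver.inbox.append(summary)

agents = [Agent(i, f"persona #{i}") for i in range(1000)]
simulate_round(agents)
```

Dense messaging stays inside each group while only moderators communicate across groups, which is one plausible way to keep a 1,000-agent simulation tractable.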
Overview of the proposed post-training data generation process (MATRIX-Gen) from scenarios
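The sketch below illustrates one plausible way a simulated scenario could be turned into an SFT example; `query_llm` (any text-in/text-out LLM call) and the prompt wording are our assumptions, not the paper's exact MATRIX-Gen pipeline.

```python
# Illustrative sketch of deriving one SFT pair from a simulated scenario.
# `query_llm` and the prompt wording are assumptions, not the exact pipeline.
def scenario_to_sft_pair(scenario: str, query_llm) -> dict:
    """Ask an LLM for an instruction grounded in the scenario, then answer it;
    the resulting (instruction, response) pair becomes one SFT example."""
    instruction = query_llm(
        "Given the following simulated scenario, write one realistic user "
        f"instruction it could give rise to:\n{scenario}"
    )
    response = query_llm(instruction)
    return {"instruction": instruction, "response": response}
```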
- MATRIX-Gen-SFT
Models instruction-tuned from Llama-3-8B on MATRIX-Gen-SFT consistently outperform those trained on baseline datasets of the same size across both benchmarks.
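As a concrete reference point, a minimal SFT run over such data could look like the following; this assumes Hugging Face's `trl` library (our choice of tooling, not necessarily the authors'), and the file path and hyperparameters are placeholders.

```python
# A minimal SFT run, assuming Hugging Face's trl library; the dataset path
# and hyperparameters are placeholders, not the paper's setup.
from datasets import load_dataset
from trl import SFTConfig, SFTTrainer

# Each row is expected to carry a chat-style "messages" list built from the
# MATRIX-Gen-SFT (instruction, response) pairs.
dataset = load_dataset("json", data_files="matrix_gen_sft.jsonl", split="train")

trainer = SFTTrainer(
    model="meta-llama/Meta-Llama-3-8B",      # base model used in the paper
    train_dataset=dataset,
    args=SFTConfig(output_dir="matrix-sft-model"),
)
trainer.train()
```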
- MATRIX-Gen-DPO
Models preference-tuned from the MATRIX-SFT-Model on MATRIX-Gen-DPO outperform baselines trained with equivalent data quantities on both benchmarks.
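Analogously, a minimal preference-tuning step with `trl`'s `DPOTrainer` might look like this; again, the library choice, paths, and hyperparameters are assumptions, not the authors' exact recipe.

```python
# A minimal preference-tuning step with trl's DPOTrainer; library choice,
# paths, and hyperparameters are assumptions.
from datasets import load_dataset
from transformers import AutoModelForCausalLM, AutoTokenizer
from trl import DPOConfig, DPOTrainer

model = AutoModelForCausalLM.from_pretrained("matrix-sft-model")   # SFT checkpoint
tokenizer = AutoTokenizer.from_pretrained("matrix-sft-model")

# Each row needs "prompt", "chosen", and "rejected" fields, here built
# from the MATRIX-Gen-DPO preference pairs.
dataset = load_dataset("json", data_files="matrix_gen_dpo.jsonl", split="train")

trainer = DPOTrainer(
    model=model,                    # the reference model defaults to a frozen copy
    args=DPOConfig(output_dir="matrix-dpo-model", beta=0.1),
    train_dataset=dataset,
    processing_class=tokenizer,
)
trainer.train()
```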
- MATRIX-Gen-Code & MATRIX-Gen-Safe
- MATRIX-Gen-MT
Increasing the number of agents and scenarios significantly improves model performance. Agent-grouping-based structured communication produces the highest-quality scenarios, while random communication and no communication yield lower-quality results.
Please cite our paper if you find the repository helpful.
```bibtex
@article{Tang2024SynthesizingPD,
  title={Synthesizing Post-Training Data for LLMs through Multi-Agent Simulation},
  author={Shuo Tang and Xianghe Pang and Zexi Liu and Bohan Tang and Rui Ye and Xiaowen Dong and Yanfeng Wang and Siheng Chen},
  journal={arXiv preprint arXiv:2410.14251},
  year={2024},
  url={https://arxiv.org/abs/2410.14251}
}
```