Skip to content

ShuoTang123/MATRIX-Gen

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 

Repository files navigation

MATRIX-Gen

Synthesizing Post-Training Data for LLMs through Multi-Agent Simulation Paper

MATRIX-Tuned-Model outperforms others, including Llama-3-8B-Instruct, with significantly less data.

News

  • To do: 🔥 Code release

  • 10/18/2024: We released the preprint paper in arxiv.

Simulation

Our MATRIX generates realistic and diverse scenarios with 1000 real-world-grounded agents and structured communication (agent grouping, inter- and intra-group communication )

MATRIX-Gen

Overview of the proposed post-training data generation process (MATRIX-Gen) from scenarios

Data Quality in the General Domain

  • MATRIX-Gen-SFT

Models instruction-tuned on Llama3-8B using MATRIX-Gen-SFT consistently outperform those trained on baseline datasets with the same data quantity across both benchmarks.

  • MATRIX-Gen-DPO

Models preference-tuned on MATRIX-SFT-Model using MATRIX-Gen-DPO outperform baselines with equivalent data quantities on both benchmarks.

Data Quality in the Specific Domain

  • MATRIX-Gen-Code & MATRIX-Gen-Safe
  • MATRIX-Gen-MT

Effect of Scenario Scale & Structured Communication

Increasing scales of agents and scenarios significantly improves model performances. Agent-grouping-based structured communication produces the highest quality scenarios, while random communication and no communication yield lower quality results.

Citation

Please cite our paper if you find the repository helpful.

@inproceedings{Tang2024SynthesizingPD,
  title={Synthesizing Post-Training Data for LLMs through Multi-Agent Simulation},
  author={Shuo Tang and Xianghe Pang and Zexi Liu and Bohan Tang and Rui Ye and Xiaowen Dong and Yanfeng Wang and Siheng Chen},
  year={2024},
  url={https://arxiv.org/abs/2410.14251}
}

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published