add GARDO: Reinforcing Diffusion Models without Reward Hacking by tinnerhrhe · Pull Request #216 · yifan123/flow_grpo

tinnerhrhe · 2026-01-05T11:40:08Z

GARDO is introduced in the paper "GARDO: Reinforcing Diffusion Models without Reward Hacking" (https://arxiv.org/abs/2512.24138), which studies effective methods for relieving reward hacking without compromising sample efficiency.

This update includes recipes for GARDO training.

tinnerhrhe added 2 commits January 5, 2026 19:30

add GARDO

5fda0ba

update readme

de6d2a3

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add GARDO: Reinforcing Diffusion Models without Reward Hacking#216

add GARDO: Reinforcing Diffusion Models without Reward Hacking#216
tinnerhrhe wants to merge 2 commits intoyifan123:mainfrom
tinnerhrhe:main

tinnerhrhe commented Jan 5, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

tinnerhrhe commented Jan 5, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant