Skip to content

vikrant-bhati/qwen-grpo

About

We created this project to implement GRPO on Qwen-2.5 small 3B model on Medical dataset. This was a team project with 4 other members

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors