-
Notifications
You must be signed in to change notification settings - Fork 17
Pull requests: LLM360/Reasoning360
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Adding Tool-N1 data set to training mix with sync rl
#154
opened Feb 12, 2026 by
jb3618columbia
Collaborator
Loading…
Implement pass rate-based curriculum learning with weighted sampling
#153
opened Jan 24, 2026 by
jb3618columbia
Collaborator
Loading…
[BREAKING][refactor] Refactor codebase and add async capabilities
#152
opened Dec 18, 2025 by
nightlessbaron
Collaborator
Loading…
5 tasks
[algo] Adding CISPO policy loss
#150
opened Nov 6, 2025 by
twkillian
Contributor
Loading…
2 tasks done
Bump sglang[all] from 0.4.6.post5 to 0.5.4.post1
dependencies
Pull requests that update a dependency file
python
Pull requests that update python code
#148
opened Oct 27, 2025 by
dependabot
bot
Loading…
Bump torchvision from 0.20.1 to 0.24.0
dependencies
Pull requests that update a dependency file
python
Pull requests that update python code
#147
opened Oct 20, 2025 by
dependabot
bot
Loading…
Bump ray from 2.46.0 to 2.50.1
dependencies
Pull requests that update a dependency file
python
Pull requests that update python code
#146
opened Oct 20, 2025 by
dependabot
bot
Loading…
Pr upstream verl merge diffaware
#137
opened Sep 29, 2025 by
Jianshu-She
Collaborator
Loading…
7 tasks
Bump tokenizers from 0.21 to 0.22.1
dependencies
Pull requests that update a dependency file
python
Pull requests that update python code
#136
opened Sep 22, 2025 by
dependabot
bot
Loading…
Update numpy requirement from <2.0.0 to <3.0.0
dependencies
Pull requests that update a dependency file
python
Pull requests that update python code
#126
opened Aug 26, 2025 by
dependabot
bot
Loading…
Add reward function for SynLogic dataset
#123
opened Aug 14, 2025 by
LiqunMa
Collaborator
Loading…
1 task
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.