Actions: huggingface/trl
Actions
2,500+ workflow runs
2,500+ workflow runs
loss_type="chunked_nll" under DeepSpeed ZeRO-3
Build PR Documentation
#16617:
Pull request #5873
synchronize
by
qgallouedec
loss_type="chunked_nll" under DeepSpeed ZeRO-3
Build PR Documentation
#16616:
Pull request #5873
synchronize
by
qgallouedec
loss_type="chunked_nll" under DeepSpeed ZeRO-3
Build PR Documentation
#16615:
Pull request #5873
synchronize
by
qgallouedec
AsyncGRPOTrainer: add ProcessorMixin handling
Build PR Documentation
#16606:
Pull request #5895
synchronize
by
rycerzes