Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feature(wqj): add vllm collector #856

Open
wants to merge 17 commits into
base: main
Choose a base branch
from
Open

Conversation

wqj2004
Copy link

@wqj2004 wqj2004 commented Feb 7, 2025

Overview

The whole RLHF training pipeline

onlinerlhf

Goal

In this PR, we need to finish the collector part in the above figure and add its unittest.

TODO

  • add vllm inferencer for LLM/VLM
  • scale up vllm inferencer in multi-GPUs
  • add several datasets definition @PaParaZz1 feature(nyz): add rlhf dataset #854
  • survey proper VQA dataset for RLHF training
  • add the final collector and test it on the above dataset
  • add tutorial and API doc

@PaParaZz1 PaParaZz1 changed the title feature(wqj): dev-vllm feature(wqj): add vllm collector Feb 8, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants