Skip to content

Commit f794edf

Browse files
improve docs
1 parent 496c506 commit f794edf

File tree

1 file changed

+6
-6
lines changed

1 file changed

+6
-6
lines changed

post-training/online_training.md

Lines changed: 6 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -1,12 +1,12 @@
11
# Online Training
22

33
## Prerequisites
4-
Want to use REINFORCE to train your model? First you'll need to build a custom vllm package.
4+
Want to use [REINFORCE](https://arxiv.org/abs/2402.14740) to train your model? First you'll need to build a custom vllm package.
55

6-
[synth-vllm](https://github.com/SynthLabsAI/synth-vllm) is a fork of [vllm](https://github.com/vllm-project/vllm)
7-
that has been modified to support using the weights in NeoX by sharing the GPU memory location of the model weights.
6+
[synth-vllm](https://github.com/SynthLabsAI/synth-vllm) is a fork of [vllm](https://github.com/vllm-project/vllm) maintained by [SynthLabs](https://www.synthlabs.ai/)
7+
that has been modified to support using the weights in GPT-NeoX by sharing the GPU memory location of the model weights.
88

9-
It currently supports llama models and pythia models.
9+
It currently supports Llama and Pythia models.
1010

1111
### Building the package
1212

@@ -34,15 +34,15 @@ pip install -e .
3434

3535
## Training
3636

37-
If you haven't already, run this command to generate the weights:
37+
If you haven't already, run this command to generate a copy of the Llama-3 weights in GPT-NeoX format:
3838
```bash
3939
python tools/ckpts/convert_hf_llama_to_neox.py --tp 4 --model meta-llama/Meta-Llama-3-8B-Instruct --model_path checkpoints/neox_converted/llama3-8b-instruct
4040
```
4141

4242
[online_example.sh](online_example.sh), [online_data_example_llama3.py](online_data_example_llama3.py) is an example of
4343
how to train a model using the synth-vllm package on a single node.
4444

45-
This assumes you are using a conda environment with NeoX installed under the name `neox`.
45+
This assumes you are using a conda environment with GPT-NeoX installed under the name `neox`.
4646

4747
To run the example, execute the following commands:
4848

0 commit comments

Comments
 (0)