Get LoRA script to work for single gpus #35

k8tems · 2024-02-15T02:53:40Z

This is my first pull request in 7 yrs or something so apologies in advance if I do anything wrong.
Currently, train_c_lora.py does not support training on a single GPU: it fails either on missing environment variables or on calls to torch.distributed.barrier(), which (I think) is only supposed to work in multi-GPU environments.
(e.g. #28, #17)
This pull request makes some modifications to add single-GPU support.
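
For illustration, here is a minimal sketch of the "missing environment variables" failure and one way to guard against it. The variable name WORLD_SIZE follows the torchrun convention and is an assumption here; the helper names read_int_env and is_single_process are hypothetical and not taken from train_c_lora.py.

```python
import os


def read_int_env(name, default, env=os.environ):
    """Return an environment variable as an int, or `default` when it is unset."""
    value = env.get(name)
    # int(None) raises "TypeError: int() argument must be a string ... not 'NoneType'",
    # which matches the error reported in #17, so fall back before converting.
    return default if value is None else int(value)


def is_single_process(env=os.environ):
    """Treat a missing or 1-valued WORLD_SIZE as a single-process run (assumed convention)."""
    return read_int_env("WORLD_SIZE", 1, env) == 1
```

A launcher could use a check like this to decide whether to skip distributed setup entirely, instead of failing while parsing a missing value.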

k8tems · 2024-02-15T02:56:01Z

I added a boolean parameter to the script that indicates whether to use a single GPU.
Not sure if this is the best way to go.
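
One way such a flag could be passed without breaking existing invocations is an optional trailing positional argument that is only read when present. The sketch below assumes the config path is argv[1] and the flag is argv[2]; the function name parse_cli and the accepted spellings of "true" are hypothetical, not the script's actual behavior.

```python
TRUE_WORDS = {"1", "true", "yes"}  # assumed spellings, for illustration only


def parse_cli(argv):
    """Split an argv-style list into (config_path, single_gpu)."""
    if len(argv) < 2:
        raise ValueError("expected a config path as the first argument")
    config_path = argv[1]
    # Only look at argv[2] when it exists, so the original one-argument
    # invocation keeps working and defaults to multi-GPU behavior.
    single_gpu = len(argv) > 2 and argv[2].strip().lower() in TRUE_WORDS
    return config_path, single_gpu
```

Calling parse_cli(sys.argv) at startup would then yield the flag to pass down to the training code. A named option via argparse would be more explicit, at the cost of changing the command-line interface.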

asutermo · 2024-02-16T21:41:40Z

train/base.py

-    def save_checkpoints(self, models: Models, optimizers: Optimizers, suffix=None):
-        barrier()
+    def save_checkpoints(self, models: Models, optimizers: Optimizers, suffix=None, single_gpu=False):
+        if single_gpu:

shouldn't this be 'if not single_gpu:'?

k8tems · 2024-02-16

I think that's fixed in this commit: c00a449
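
The corrected condition from this thread can be sketched roughly as follows. This is a simplified stand-in, not the code in train/base.py: the models and optimizers arguments are dropped, barrier_fn replaces torch.distributed.barrier so the example runs without a GPU, and the recursion on suffix is a made-up trigger used only to show why the flag has to be forwarded.

```python
class Trainer:
    def __init__(self, barrier_fn):
        # barrier_fn stands in for torch.distributed.barrier (assumption of this sketch).
        self.barrier_fn = barrier_fn
        self.saved = []

    def save_checkpoints(self, suffix=None, single_gpu=False):
        # Synchronize ranks only when more than one process takes part.
        if not single_gpu:
            self.barrier_fn()
        self.saved.append(suffix)
        # Hypothetical recursion: after a suffixed save, also write the unsuffixed one.
        # The flag must be relayed; otherwise the default (False) re-enables the barrier.
        if suffix is not None:
            self.save_checkpoints(suffix=None, single_gpu=single_gpu)


calls = []
trainer = Trainer(lambda: calls.append("barrier"))
trainer.save_checkpoints(suffix="step_10", single_gpu=True)
```

With single_gpu=True the stand-in barrier is never invoked, including in the nested call; with the default it runs once per save.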

rlucatoor

Works great for me, good job

k8tems added 2 commits February 15, 2024 02:39

Add single gpu support for LoRAs

55e00b4

Call barrier when single_gpu is False

c00a449

k8tems and others added 3 commits February 15, 2024 12:20

Also relay single_gpu to the recursive save_checkpoints invocation

e6cc655

Only reference argv[2] when there are sufficient parameters to preserve original CLI interface

fd33bf7

args -> sys.argv

e72b970

asutermo reviewed Feb 16, 2024

rlucatoor approved these changes Feb 19, 2024

rlucatoor mentioned this pull request Feb 19, 2024

TypeError: int() argument must be a string, a bytes-like object or a real number, not 'NoneType' #17

Open
