
LoRA - or other adapters #177

@johnml1135

Description


Hugging Face has a library called PEFT that provides various methods for efficient fine-tuning of foundation models. One of these methods is LoRA. During fine-tuning, small low-rank weight matrices are trained instead of the full weight matrices of the base model; the rest of the model is frozen. The outputs of the low-rank matrices are added to those of the frozen base weights. This greatly reduces the number of trainable parameters, and therefore the memory and compute needed during fine-tuning.
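
As background, this is the standard LoRA formulation (from the original LoRA paper, not anything specific to this issue): for a frozen weight matrix $W_0$, LoRA learns a low-rank update $\Delta W = BA$, so the layer computes

```math
h = W_0 x + B A x, \qquad B \in \mathbb{R}^{d \times r},\; A \in \mathbb{R}^{r \times k},\; r \ll \min(d, k)
```

Only $A$ and $B$ are trained, which is why the trainable parameter count drops so sharply.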

The goal of this issue is to successfully fine-tune NLLB using the LoRA capability in PEFT. This would greatly reduce the resources needed to fine-tune NLLB and potentially allow us to try fine-tuning larger models.
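
A minimal sketch of what this could look like with PEFT's LoRA support. The checkpoint name, target module names, and hyperparameters below are illustrative assumptions, not decisions made in this issue:

```python
# Sketch: wrap an NLLB checkpoint with LoRA adapters via PEFT.
# r, lora_alpha, lora_dropout, and target_modules are example values only.
from transformers import AutoModelForSeq2SeqLM, AutoTokenizer
from peft import LoraConfig, TaskType, get_peft_model

model_name = "facebook/nllb-200-distilled-600M"  # assumed checkpoint for illustration
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSeq2SeqLM.from_pretrained(model_name)

lora_config = LoraConfig(
    task_type=TaskType.SEQ_2_SEQ_LM,
    r=16,                                 # rank of the low-rank update matrices
    lora_alpha=32,                        # scaling factor applied to the update
    lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],  # attention projections in NLLB's (M2M100-style) layers
)

# Freeze the base model and attach the trainable LoRA matrices.
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # reports the (small) trainable fraction of parameters
```

The wrapped model can then be passed to a standard Seq2Seq training loop or `Seq2SeqTrainer` in place of the full model.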

Labels: optimization (Model training/inferencing optimization), research (Research topics)


Status: ✅ Done
