Load LoRA with uneven Rank #7954

mutatedducks97 · 2025-04-24T22:11:25Z

Summary

Internally invoke makes use of CustomLinear module to inject LoRA into but does not support fused weights from uneven rank from up and down matrices. In this PR we create a fused matrix to address this issue. Since huggingface diffusers implements FLUX using a fused kqv The rank of the LoRA generated from a diffuser FLUX model will be different.

Before this PR the following issues would occur. This resolves it.

For QKV attention layers (img_attn and txt_atn):
- CustomLinear(in_features=3072, out_features=9216)
- LoRA matrices: down=[12, 3072], up=[9216, 4]
- The error shows: cannot multiply (9216x4) with (12x3072)
- We need it to output a matrix with shape (9216, 3072)

New shape:torch.Size([9216, 3072])

For linear1 layers:
- CustomLinear(in_features=3072, out_features=21504)
- LoRA matrices: down=[16, 3072], up=[21504, 4]
- The error shows: cannot multiply (21504x4) with (16x3072)
- We need it to output a matrix with shape (21504, 3072)

Default Flux

50 steps training

350 steps training Flux

Related Issues / Discussions

QA Instructions

Merge Plan

Checklist

The PR has a short but descriptive title, suitable for a changelog
Tests added / updated (if applicable)
Documentation added / updated (if applicable)
Updated What's New copy (if doing a release after this PR)

mutatedducks97 added 2 commits April 24, 2025 17:08

integrate loRA

a75edb1

Merge branch 'main' into sam/model-load

3bfe79a

mutatedducks97 requested review from lstein, blessedcoolant, brandonrising, hipsterusername and jazzhaiku as code owners April 24, 2025 22:11

github-actions bot added python PRs that change python files backend PRs that change backend files labels Apr 24, 2025

mutatedducks97 marked this pull request as draft April 24, 2025 22:11

idk anymore tbh

2e12c19

mutatedducks97 marked this pull request as ready for review April 25, 2025 11:46

mutatedducks97 changed the title ~~Sam/model load~~ Load LoRA with uneven Rank Apr 25, 2025

enable fused matrix for quantized models

66448f6

hipsterusername closed this May 2, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Load LoRA with uneven Rank #7954

Load LoRA with uneven Rank #7954

Uh oh!

mutatedducks97 commented Apr 24, 2025 •

edited

Loading

Uh oh!

Uh oh!

Load LoRA with uneven Rank #7954

Load LoRA with uneven Rank #7954

Uh oh!

Conversation

mutatedducks97 commented Apr 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Default Flux

50 steps training

350 steps training Flux

Related Issues / Discussions

QA Instructions

Merge Plan

Checklist

Uh oh!

Uh oh!

mutatedducks97 commented Apr 24, 2025 •

edited

Loading