Fix kontext finetune issue when batch size >1 #11921

Open — wants to merge 5 commits into base: main
Conversation

mymusise (Contributor)
What does this PR do?

Problem

Training fails with a shape mismatch error when custom instance prompts are used with batch_size > 1, because the dataloader can emit a partial final batch.

Solution

Set drop_last=True in BucketBatchSampler so that every batch has the configured size during training. This prevents the shape mismatch error that occurred when the last batch was smaller than the specified batch size.
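To illustrate the fix, here is a minimal, self-contained sketch of the drop_last logic (not the actual BucketBatchSampler from the training script; the `batches` helper below is hypothetical and only mirrors the relevant behavior):

```python
def batches(indices, batch_size, drop_last):
    """Yield batches of indices. With drop_last=True, the trailing
    partial batch is discarded so every batch has a uniform size."""
    batch = []
    for i in indices:
        batch.append(i)
        if len(batch) == batch_size:
            yield batch
            batch = []
    # A leftover partial batch is only emitted when drop_last=False.
    if batch and not drop_last:
        yield batch

# 10 samples with batch_size=4: drop_last=False yields a final batch
# of 2, which is what triggered the shape mismatch during training.
print(list(batches(range(10), 4, drop_last=False)))
# [[0, 1, 2, 3], [4, 5, 6, 7], [8, 9]]
print(list(batches(range(10), 4, drop_last=True)))
# [[0, 1, 2, 3], [4, 5, 6, 7]]
```

The trade-off is that up to batch_size - 1 samples per epoch are skipped, which is negligible except for very small datasets.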

Testing

Verified the fix by running training with custom instance prompts and batch_size > 1; the shape mismatch error no longer occurs after this change.

Fixes # (issue)


Before submitting

  • This PR fixes a bug in the training script.
  • Did you read the contributor guideline?
  • Did you read our philosophy doc?
  • Was this discussed/approved via a GitHub issue or the forum? (N/A if not discussed)
  • Did you make sure to update the documentation with your changes? (N/A for code-only bugfix)
  • Did you write any new necessary tests? (Manual test performed)

Who can review?

Anyone in the community is free to review the PR once the tests have passed.

For this example script and dataloader logic, relevant reviewers could be:

Signed-off-by: mymusise <[email protected]>
@mymusise mymusise changed the title Fix kontext finetune issue where batch size >1 Fix kontext finetune issue when batch size >1 Jul 14, 2025
asomoza (Member) commented Jul 15, 2025

cc: @linoytsaban

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

linoytsaban (Collaborator) left a comment
thanks @mymusise!
and thanks @asomoza for the tag. I think I initially had it as False so as not to waste samples on small datasets, but I didn't make the adjustments needed to support batches of varying sizes. Better to have it as True, as this PR suggests.

5 participants