Replies: 3 comments
It shouldn't be limited to 10s; the max sequence length of the model is 2048 tokens, which is ~163 seconds of audio. If you have a lot of context, e.g. previous turns or voice prompts, it'll reduce the maximum generation length you can produce. Also take a look at
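A back-of-envelope sketch of that token budget, using only the numbers above (2048 tokens ≈ 163 s, so roughly 12.5 tokens per second of audio); the helper name is hypothetical:

```python
# Rough audio budget for the model, assuming the thread's numbers:
# a 2048-token max sequence length covering ~163 seconds of audio.
MAX_SEQ_LEN = 2048
SECONDS_AT_MAX = 163
TOKENS_PER_SECOND = MAX_SEQ_LEN / SECONDS_AT_MAX  # ~12.56 tokens/s

def remaining_audio_seconds(context_tokens: int) -> float:
    """Estimate how many seconds of audio can still be generated once
    `context_tokens` of the window are consumed by previous turns,
    voice prompts, and the text prompt itself."""
    return max(0, MAX_SEQ_LEN - context_tokens) / TOKENS_PER_SECOND

print(remaining_audio_seconds(0))     # full window available: 163.0 s
print(remaining_audio_seconds(1024))  # half the window used: 81.5 s
```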
Thanks Zack. Then the code below should work just fine (taken from the example)?

```python
import torch
from transformers import CsmForConditionalGeneration, AutoProcessor

model_id = "sesame/csm-1b"
device = "cuda" if torch.cuda.is_available() else "cpu"

# load the model and the processor
processor = AutoProcessor.from_pretrained(model_id)
model = CsmForConditionalGeneration.from_pretrained(model_id, device_map=device)

# prepare the inputs via the chat template
conversation = [
    {"role": "0", "content": [{"type": "text", "text": "long set of text pasted here in actuality"}]},
]
inputs = processor.apply_chat_template(
    conversation,
    tokenize=True,
    return_dict=True,
).to(device)

# run inference and save the generated audio
audio = model.generate(**inputs, output_audio=True)
processor.save_audio(audio, "copypasta.wav")
```

This is resulting in only 10s of output, despite the input text being much longer than 10 seconds' worth of audio.
We didn't implement the transformers version of CSM; you'll want to check their docs: https://huggingface.co/docs/transformers/model_doc/csm. In the transformers library there are standard ways to change the max output length, e.g. via GenerationConfig.
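A minimal sketch of what that looks like with the transformers generation API (the `max_new_tokens` value here is illustrative, not tuned; the `model.generate` lines are shown as comments since they require the loaded model from the example above):

```python
from transformers import GenerationConfig

# Standard transformers way to lift the output-length cap: generation stops
# after max_new_tokens new tokens (or the model's max sequence length,
# whichever comes first).
gen_config = GenerationConfig(max_new_tokens=1500, do_sample=True)

# Then pass it to generate, e.g.:
#   audio = model.generate(**inputs, output_audio=True, generation_config=gen_config)
# or, equivalently, pass the kwarg directly:
#   audio = model.generate(**inputs, output_audio=True, max_new_tokens=1500)
```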
I only seem to be able to generate 10s of audio (running on Google Colab). Looking for a solution that lets me generate longer audio files.