Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Generate songs based on a reference #27

Open
MJ54pwnz opened this issue Jan 30, 2025 · 8 comments
Open

Generate songs based on a reference #27

MJ54pwnz opened this issue Jan 30, 2025 · 8 comments
Labels
enhancement New feature or request

Comments

@MJ54pwnz
Copy link

Hi! Thank you so much for this incredible work, you just put an end to closed source platforms like Suno. I have a question about using reference as a base for a song. Can we take a reference song for example and change lyrics somehow? Or generate a brand new song with new sound and etc?

@a43992899
Copy link
Collaborator

Can we take a reference song for example and change lyrics somehow?

Or generate a brand new song with new sound and etc?

Could you elaborate?

@MJ54pwnz
Copy link
Author

@a43992899
Oh yeah, now I get it, yes, that's the feature I was talking about, ICL. I was also thinking about the fact that you could copy the style and melody with vocals, but change the lyrics while keeping the same style and voice. And about the elaborating, I was thinking about taking a song and changing its genre but trying to keep the style, like turning a pop song into a rock song. Again, thank you so much for this work, if you need any help with data collection I'd be happy to help!

@a43992899
Copy link
Collaborator

It is hard for ICL to copy melody for now.

If you're referring to editing lyrics while preserving the melody, our current model does not support this directly. While there might be complex workarounds to achieve music continuation that aligns with lyrics editing, they are quite tricky to realize.

A more elegant solution would be to continue pretraining the stage 1 checkpoint to enable song-level Fill-In-the-Middle (FIM). However, this approach would require substantial computational resources. We may consider adding this feature in YuE v2.

@MJ54pwnz
Copy link
Author

MJ54pwnz commented Jan 30, 2025

Okay that's makes sense, we'll you guys don't have to rush, take as much time as you need!^^ and the last question. Will it be possible to fine tune the model on 24gb gpu? Also I think it would be wonderful for you guys to make an discord server so all people could share their samples and talk about your amazing models

@frink
Copy link

frink commented Jan 31, 2025

It would be great if there was a flag that could be flipped to copy the melody instead of the style. (Kinda thinking like Mureka...)

@a43992899
Copy link
Collaborator

a43992899 commented Jan 31, 2025

I plan to add melody control in the future. However, the open-source community currently lacks high-quality melody transcription models.

Music Information Retrieval (MIR) progress lags behind advancements in generative AI. That said, I’ve noticed the use of chroma conditioning in MusicGen, which could be a promising direction. Perhaps someone could fine-tune a model to enable MusicGen-style melody control.

@a43992899 a43992899 added the enhancement New feature or request label Feb 3, 2025
@LCF2764
Copy link

LCF2764 commented Feb 4, 2025

I plan to add melody control in the future. However, the open-source community currently lacks high-quality melody transcription models.

Music Information Retrieval (MIR) progress lags behind advancements in generative AI. That said, I’ve noticed the use of chroma conditioning in MusicGen, which could be a promising direction. Perhaps someone could fine-tune a model to enable MusicGen-style melody control.

Great idea to include melody control! I specialize in singing melody transcription and have gotten solid results in this area. Happy to collaborate if you need support!!

@a43992899
Copy link
Collaborator

@LCF2764

Thanks! That sounds great—do you have any models on Hugging Face that you plan to open source? I'd love to try them out.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

4 participants