Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Questions about data preparation #15

Open
nkkanee opened this issue Apr 23, 2024 · 0 comments
Open

Questions about data preparation #15

nkkanee opened this issue Apr 23, 2024 · 0 comments

Comments

@nkkanee
Copy link

nkkanee commented Apr 23, 2024

I'm a beginner and I apologize for my lack of knowledge. I am currently preparing the data. Could you please tell me the steps to prepare the data?

First, we created train.tsv using prepare_manifest.py. After that, I used this train.tsv file to refer to HuBERT and created train.km and dict.km.txt.

What should I do then?

Also, how should I handle prepare_codecs_from_manifest.py?

Sorry for my poor writing

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant