You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
According to huggingface documentation for wav2vec2 and HuBERT, "return_attention_mask": true should be specified for models trained with extractor_mode: layer and you have default/group .
Though, I haven't figured out for sure from your code did you use attention_mask or not during training. I suspect you didn't since you specified pad_audio: false.
Could you please clarify this misalignment and confirm if you were using attention_mask during training or not?
Thank you!
The text was updated successfully, but these errors were encountered:
Hey!
In
fairseq
for both MERT-v1-95M and MERT-v1-330M you specifiedHowever in
huggingface
you haveI see discrepancy here:
normalize: false
vs"do_normalize": true
huggingface
documentation for wav2vec2 and HuBERT,"return_attention_mask": true
should be specified for models trained withextractor_mode: layer
and you havedefault/group
.Though, I haven't figured out for sure from your code did you use
attention_mask
or not during training. I suspect you didn't since you specifiedpad_audio: false
.Could you please clarify this misalignment and confirm if you were using
attention_mask
during training or not?Thank you!
The text was updated successfully, but these errors were encountered: