Skip to content

Harshness of speech #10

@adriandewitts

Description

@adriandewitts

Hi there,

Thank you all for the great work on Optispeech. @w11wo and I have been getting some great results.

I have a sound engineering background, and I’ve got a good idea of the different kinds of issues related to voice quality. I’d like to help and contribute from that perspective.

Optispeech doesn’t have the usual artefacts we’ve heard before, which has been great.

I’ve noticed the “harshness” of our trained voices. This screenshot from the Audacity spectrogram shows this. The bottom window is the original recorded voice, and the top is from Optispeech.

The screenshot points out one example of the ’s’ sound or sibilant sound. In the original, you can see that there is a gentle rise and fall of higher frequencies from 4-18k. Optispeech’s character is loud sibilance across the spectrum with a pronounced start and stop. You can make out the sibilant sounds in the speech.

Please let me know if you have any questions or if there is any way that I can help the project in general.

screenshot_2024-09-23_at_10 35 30___am

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions