Shouts, non recognized words #92

hanz-s · 2023-09-24T07:48:02Z

hanz-s
Sep 24, 2023

Hello,

is there a way to increase recognition of shouts, or voice that is'nt necessarily a word, to not get filtered when they are not recognized as a word. so one can get the timing of those shouts in the ultrastar text file? would i need a language model trained for this, oder can i add an additional model somehow which increases supports this anlong with the normal language models?

rakuri255 · 2023-09-24T18:18:05Z

rakuri255
Sep 24, 2023
Maintainer

Its currently a difficult task, since the llm models only give timestamps for words. I think when you find an model on huggingface which is trained for singing, that would be the best chance.

0 replies

rakuri255 · 2023-09-25T22:26:49Z

rakuri255
Sep 25, 2023
Maintainer

I have been thinking about it again. Since we have the voice separated, we could analyze the volume. Then we could say that if there is no word/timestamp, then there is some kind of "lalala".
We need this approach anyway to eliminate the silent pause in the word itself.

1 reply

hanz-s Nov 11, 2023
Author

nice! a placeholder would be worth gold. maybe one could set a param with a min volume / length of human voice to be recognized for a placeholder position.
thank your for your feedback to my question!

rakuri255 · 2023-10-05T12:29:39Z

rakuri255
Oct 5, 2023
Maintainer

We have now some spectrograms to play with.

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Shouts, non recognized words #92

{{title}}

Replies: 3 comments 1 reply

{{title}}

{{title}}

{{title}}

{{title}}

Select a reply

Shouts, non recognized words #92

hanz-s Sep 24, 2023

Replies: 3 comments · 1 reply

rakuri255 Sep 24, 2023 Maintainer

rakuri255 Sep 25, 2023 Maintainer

hanz-s Nov 11, 2023 Author

rakuri255 Oct 5, 2023 Maintainer

hanz-s
Sep 24, 2023

Replies: 3 comments 1 reply

rakuri255
Sep 24, 2023
Maintainer

rakuri255
Sep 25, 2023
Maintainer

hanz-s Nov 11, 2023
Author

rakuri255
Oct 5, 2023
Maintainer