Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Seems not work on speechocean dataset #9

Open
louislau1129 opened this issue Aug 14, 2022 · 1 comment
Open

Seems not work on speechocean dataset #9

louislau1129 opened this issue Aug 14, 2022 · 1 comment

Comments

@louislau1129
Copy link

louislau1129 commented Aug 14, 2022

Hi, @felixkreuk , first thank you for open-sourced such good repo on unsupervised phoneme segmentation. Recently, I conduct several experiments on SpeechOcean 762 dataset, which is a standard speech scoring dataset.

  1. First, I directly apply the provided pretrained boundary detection model on this corpus, and only found about 50% F1 and R value.
  2. I suspect this may relates to the domain mismatch problem, so I try to re-train this boundary detection model on the SpeechOcean corpus from scratch, but still attains about 50% F1 and R value, it is far lag from the referenced force aligned boundary result.

The following result screenshot is about using the random initialized model (without start training) to predict:
guKmKj9ga2

The following result screenshot is about using the trained model to predict:
Z0bB4fn1HS

Any idea on fixing this issue? Thanks in advance!

@louislau1129
Copy link
Author

(Ps: I have re-produce the reported result on timit using the code

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant