Not getting alignment properly #628
Comments
Use SoX to remove silence from the audio files. It's not yet a complete success, but it has brought some improvement.

import glob
import os
import subprocess

def remove_silence(input_file, output_file):
    try:
        # Apply sox's silence effect to the start of the file, then reverse
        # and repeat so the end is handled too; the final reverse restores the order.
        subprocess.run([
            'sox', input_file, output_file, 'silence', '2', '0.1', '1%', 'reverse', 'silence', '2', '0.1', '1%', 'reverse'
        ], check=True)
        print(f'Successfully removed silence from {input_file} and saved to {output_file}')
    except subprocess.CalledProcessError as e:
        print(f'Error occurred: {e}')

def process_folder(input_folder, output_folder):
    # Create the output folder if it does not exist
    os.makedirs(output_folder, exist_ok=True)
    # Process all of the wav files in input_folder
    for wav_file in glob.glob(os.path.join(input_folder, '*.wav')):
        file_name = os.path.basename(wav_file)
        output_wav = os.path.join(output_folder, file_name)
        remove_silence(wav_file, output_wav)

# '~' is not expanded by os, glob, or subprocess (no shell is involved), so expand it here
input_folder = os.path.expanduser('~/data/train')
output_folder = os.path.expanduser('~/data/processed_train')
process_folder(input_folder, output_folder)
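To check whether the trimming actually changed anything, it can help to compare clip durations before and after. The following is only a rough sketch, not part of the script above: `wav_duration_seconds` and `report_trimmed` are hypothetical helper names, and it assumes the files are uncompressed PCM WAV, which is what Python's standard `wave` module can read.

```python
import glob
import os
import wave


def wav_duration_seconds(path):
    """Return the duration of a PCM WAV file in seconds."""
    with wave.open(path, 'rb') as wf:
        return wf.getnframes() / wf.getframerate()


def report_trimmed(input_folder, output_folder):
    """Return (file name, seconds before, seconds after) for each WAV found in both folders."""
    rows = []
    for src in sorted(glob.glob(os.path.join(input_folder, '*.wav'))):
        name = os.path.basename(src)
        dst = os.path.join(output_folder, name)
        if not os.path.exists(dst):
            continue  # no processed counterpart, e.g. sox failed on this file
        rows.append((name, wav_duration_seconds(src), wav_duration_seconds(dst)))
    return rows


if __name__ == '__main__':
    # Folder names are taken from the script above; adjust as needed.
    before_dir = os.path.expanduser('~/data/train')
    after_dir = os.path.expanduser('~/data/processed_train')
    for name, before, after in report_trimmed(before_dir, after_dir):
        print(f'{name}: {before:.2f}s -> {after:.2f}s')
```

A clip that comes out much shorter than expected, or nearly empty, suggests the silence parameters are cutting into speech rather than only removing padding.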
Hi!
I trained Tacotron 2 for more than 60,000 steps, but I cannot get a proper alignment.
The alignment graph is as follows. Does anyone know the cause of this?
I'm training on 100 samples of elderly voice data selected from the Common Voice dataset.
Training performance was not good in previous attempts, so I looked through other issues for a fix.
Sadly, none of what I tried worked.