The AgeVoxCeleb is the large-scale, balanced, and multi-modal age corpus that contains both video and speech of the same speakers from a wide age range.
The AgeVoxCeleb contains nearly 168k videos of approximately 5000 speakers selected from VoxCeleb2. All the videos are labeled with the speaker’s real age estimated using each celebrity's name and title of the original YouTube video.
To know the detail of how we built AgeVoxceleb, please refer to our paper presented in ICASSP2021.
In this repository, only annotation labels are distributed. To obtain the data including audio files, cropped video files, and other meta information, please visit the original web-page of VoxCeleb2 and download them.
Each line in utt2spk.train
and utt2spk.test
corresponds to a pair of the segment-id and annotated age labels
in training and test sets.
Note that the segment-id is composed of speaker-ID, video-ID, and segment-ID defined in the original VoxCeleb.
In addition, please note that how to divide the data into training and test sets is different from ones defined in the original VoxCeleb dataset.
There is another work [1] that is closely related to us but conducted independently of our work. The key difference between our work and theirs is that they automatically annotated age labels in a more strict way. Specifically, they use video semantic and people identity information taken from multiple data sources to get the accurate speaker's birth year and the recording date. Please refer to their Arxiv paper[1] and Github repository for detailed information.
[1] Khaled Hechmi, Trung Ngo Trong, Ville Hautamaki, Tomi Kinnunen, ``VoxCeleb Enrichment for Age and Gender Recognition,'' arXiv:2109.13510, 2021,
If you use this dataset, please cite:
@inproceedings{tawara2021age,
title={Age-VOX-Celeb: Multi-Modal Corpus for Facial and Speech Estimation},
author={Tawara, Naohiro and Ogawa, Atsunori and Kitagishi, Yuki and Kamiyama, Hosana},
booktitle={ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)},
pages={6963--6967},
year={2021},
organization={IEEE}
}