Designing High-Coverage Multi-level Text Corpus for Non-professional-voice Conservation
Jůzová, M., Tihelka, D., Matoušek, J. (2016): Designing High-Coverage Multi-level Text Corpus for Non-professional-voice Conservation. In: Andrey Ronzhin, Rodmonga Potapova, Géza Németh (2016) Speech and Computer. Volume 9811 of the series Lecture Notes in Computer Science, pp 207-215. ISBN: 978-3-319-43957-0 (Print) 978-3-319-43958-7 (Online). doi:10.1007/978-3-319-43958-7_24
Abstract: The paper focuses on building a text corpus suitable for conserving the voices of non-professional speakers who are losing their voices due to serious health problems. Since we do not know in advance how many sentences a speaker will be able to record, we propose a multi-level greedy algorithm which can ensure the coverage of the selected texts by various phonetic and prosodic units. The resulting coverage is presented for various corpus sizes and compared to that of a generic TTS corpus recorded by a healthy professional speaker.
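
The abstract does not spell out the algorithm, so the following is only a minimal sketch of one plausible reading of "multi-level greedy" selection, not the authors' implementation. It assumes each sentence is already annotated with a set of units per level (for example phones at the first level and diphones at the second); the function names `greedy_level` and `multi_level_selection` and the data layout are hypothetical. Each level greedily adds the sentence that contributes the most not-yet-covered units, and the levels are concatenated into one ordered list, so a speaker who stops recording early has still covered the earlier levels.

```python
def greedy_level(units, selected, covered):
    """Greedily pick sentences for one level (hypothetical helper).

    units[i] is the set of units that sentence i contains at this level.
    Selection stops when no remaining sentence adds an uncovered unit.
    """
    order = []
    remaining = [i for i in range(len(units)) if i not in selected]
    while remaining:
        # Sentence with the largest number of still-uncovered units;
        # ties go to the lowest sentence index.
        best = max(remaining, key=lambda i: len(units[i] - covered))
        if not units[best] - covered:
            break  # nothing new can be covered at this level
        order.append(best)
        selected.add(best)
        covered |= units[best]
        remaining.remove(best)
    return order


def multi_level_selection(levels):
    """Return sentence indices in recording order (assumed layout).

    levels[k][i] is the unit set of sentence i at level k; earlier
    levels are treated as more important and are covered first.
    """
    selected = set()
    order = []
    for units in levels:
        # Sentences chosen at earlier levels already cover some units here.
        covered = set()
        for i in selected:
            covered |= units[i]
        order += greedy_level(units, selected, covered)
    return order
```

Because the output is a single ordered list, any prefix of it can serve as a smaller corpus, which matches the stated motivation that the number of recordable sentences is unknown in advance.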