Matoušek, J. and Tihelka, D.: Annotation Errors Detection in TTS Corpora. Proceedings of INTERSPEECH 2013, pp. 1511-1515, Lyon, France, 2013.

We investigate the problem of automatic detection of annotation errors in single-speaker read-speech corpora used for text-to-speech (TTS) synthesis. Various word-level feature sets are used, and several detection methods are evaluated: classifiers based on support vector machines, extremely randomized trees, and k-nearest neighbors, as well as novelty and outlier detection. We show that annotation error detection performs very well at both the word and the utterance level, with high precision and recall and an F1 measure of almost 90% and 97%, respectively.
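As an illustration of how the word-level and utterance-level figures reported in the abstract relate, the following minimal sketch computes precision, recall, and F1 for binary error flags at both levels. It is not the authors' evaluation code: the toy data, the function names, and the rule that an utterance counts as erroneous when any of its words is flagged are all assumptions made for this example.

```python
from typing import Dict, List, Tuple


def precision_recall_f1(gold: List[bool], pred: List[bool]) -> Tuple[float, float, float]:
    """Return (precision, recall, F1), where True means 'annotation error'."""
    tp = sum(g and p for g, p in zip(gold, pred))
    fp = sum((not g) and p for g, p in zip(gold, pred))
    fn = sum(g and (not p) for g, p in zip(gold, pred))
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return precision, recall, f1


def to_utterance_level(utt_ids: List[str], flags: List[bool]) -> Dict[str, bool]:
    """Assumed aggregation: an utterance is erroneous if any of its words is."""
    result: Dict[str, bool] = {}
    for utt, flag in zip(utt_ids, flags):
        result[utt] = result.get(utt, False) or flag
    return result


# Hypothetical toy data: one entry per word, grouped into three utterances.
utt_ids = ["u1", "u1", "u1", "u2", "u2", "u3", "u3", "u3"]
gold = [False, True, False, False, False, True, True, False]
pred = [False, True, False, True, False, True, False, False]

# Word-level scores: every word is one decision.
word_scores = precision_recall_f1(gold, pred)

# Utterance-level scores: collapse word flags per utterance first.
gold_utt = to_utterance_level(utt_ids, gold)
pred_utt = to_utterance_level(utt_ids, pred)
utts = sorted(gold_utt)
utt_scores = precision_recall_f1(
    [gold_utt[u] for u in utts], [pred_utt[u] for u in utts]
)

print("word-level (P, R, F1):", word_scores)
print("utterance-level (P, R, F1):", utt_scores)
```

Under this assumed aggregation, a missed word in an utterance that contains another detected error does not count as a miss at the utterance level, which is one way utterance-level scores can come out higher than word-level ones.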

Title: Annotation Errors Detection in TTS Corpora
Authors: Matoušek, J.; Tihelka, D.
Title (Czech): Detekce anotačních chyb v korpusech pro TTS
Publication language: English
Publication date: 29.8.2013
Year of publication: 2013
Publication type: Paper in proceedings
Book title: Proceedings of INTERSPEECH 2013
Pages: 1511 - 1515
ISBN: 978-1-62993-443-3
Place of publication: Lyon, France
Date: 25.8.2013 - 29.8.2013
Keywords: annotation error detection, classification, novelty detection, read speech corpora, speech synthesis


 author = {Matou\v{s}ek, J and Tihelka, D.},
 title = {Annotation Errors Detection in TTS Corpora},
 year = {2013},
 address = {Lyon, France},
 pages = {1511--1515},
 booktitle = {Proceedings of INTERSPEECH 2013},
 ISBN = {978-1-62993-443-3},
 url = {http://www.kky.zcu.cz/en/publications/MatousekJ_2013_AnnotationErrors},