
Detail of publication


Matoušek, J. and Tihelka, D.: Annotation Errors Detection in TTS Corpora. Proceedings of INTERSPEECH 2013, pp. 1511-1515, Lyon, France, 2013.




We investigate the automatic detection of annotation errors in single-speaker read-speech corpora used for text-to-speech (TTS) synthesis. We use various word-level feature sets and evaluate several detection methods based on support vector machines, extremely randomized trees, and k-nearest neighbors, as well as novelty and outlier detection. We show that annotation error detection performs very well at both the word and utterance levels, with high precision and recall and an F1 measure of almost 90% and 97%, respectively.
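The precision, recall, and F1 scores mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' evaluation code: the toy labels, the function names, and in particular the rule that an utterance counts as erroneous when any of its words is flagged are assumptions made only for illustration.

```python
from typing import Dict, List, Tuple


def precision_recall_f1(gold: List[bool], predicted: List[bool]) -> Tuple[float, float, float]:
    """Compute precision, recall and F1 for the positive class (True = annotation error)."""
    if len(gold) != len(predicted):
        raise ValueError("gold and predicted must have the same length")
    tp = sum(1 for g, p in zip(gold, predicted) if g and p)
    fp = sum(1 for g, p in zip(gold, predicted) if not g and p)
    fn = sum(1 for g, p in zip(gold, predicted) if g and not p)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return precision, recall, f1


def utterance_labels(word_flags: Dict[str, List[bool]]) -> Dict[str, bool]:
    """Assumed aggregation rule: an utterance is erroneous if any of its words is."""
    return {utt: any(flags) for utt, flags in word_flags.items()}


def flatten(word_flags: Dict[str, List[bool]], order: List[str]) -> List[bool]:
    """Concatenate per-utterance word flags in a fixed utterance order."""
    return [flag for utt in order for flag in word_flags[utt]]


# Hypothetical toy data: per-word error flags for three utterances.
gold_words = {
    "utt1": [False, True, False],
    "utt2": [False, False],
    "utt3": [True, True, False],
}
predicted_words = {
    "utt1": [False, True, True],
    "utt2": [False, False],
    "utt3": [True, False, False],
}

order = sorted(gold_words)

# Word-level scores: every word is one decision.
word_scores = precision_recall_f1(
    flatten(gold_words, order), flatten(predicted_words, order)
)

# Utterance-level scores: every utterance is one decision.
gold_utts = utterance_labels(gold_words)
predicted_utts = utterance_labels(predicted_words)
utt_scores = precision_recall_f1(
    [gold_utts[u] for u in order], [predicted_utts[u] for u in order]
)

print("word level (P, R, F1):", word_scores)
print("utterance level (P, R, F1):", utt_scores)
```

In this toy data one missed word and one false alarm lower the word-level scores, while every utterance is still classified correctly. That is one plausible reason utterance-level scores can exceed word-level ones under the assumed aggregation rule.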


Title: Annotation Errors Detection in TTS Corpora
Author: Matoušek, J.; Tihelka, D.
Language: English
Date of publication: 29 Aug 2013
Year: 2013
Type of publication: Papers in proceedings of reviewed conferences
Book title: Proceedings of INTERSPEECH 2013
Page: 1511 - 1515
ISBN: 978-1-62993-443-3
Address: Lyon, France
Date: 25 Aug 2013 - 29 Aug 2013


Keywords: annotation error detection, classification, novelty detection, read speech corpora, speech synthesis


 author = {Matou\v{s}ek, J and Tihelka, D.},
 title = {Annotation Errors Detection in TTS Corpora},
 year = {2013},
 address = {Lyon, France},
 pages = {1511-1515},
 booktitle = {Proceedings of INTERSPEECH 2013},
 ISBN = {978-1-62993-443-3},
 url = {},