Přejít na obsah

Detail publikace


Trmal Jan and Zelinka Jan and Luděk Müller : Adaptation of a Feedforward Artificial Neural Network Using a Linear Transform . Text, Speech and Dialogue, Lecture Notes in Computer Science, vol. 6231, p. 423-430, Springer Berlin / Heidelberg, 2010.

PDF ke stažení



In this paper we present a novel method for adaptation of a multi-layer perceptron neural network (MLP ANN). Nowadays, the adaptation of the ANN is usually done as an incremental retraining either of a subset or the complete set of the ANN parameters. However, since sometimes the amount of the adaptation data is quite small, there is a fundamental drawback of such approach – during retraining, the network parameters can be easily overfitted to the new data. There certainly are techniques that can help overcome this problem (early-stopping, cross-validation), however application of such techniques leads to more complex and possibly more data hungry training procedure. The proposed method approaches the problem from a different perspective. We use the fact that in many cases we have an additional knowledge about the problem. Such additional knowledge can be used to limit the dimensionality of the adaptation problem. We applied the proposed method on speaker adaptation of a phoneme recognizer based on traps (Temporal Patterns) parameters. We exploited the fact that the employed traps parameters are constructed using log-outputs of mel-filter bank and by virtue of reformulating the first layer weight matrix adaptation problem as a mel-filter bank output adaptation problem, we were able to significantly limit the number of free variables. Adaptation using the proposed method resulted in a substantial improvement of phoneme recognizer accuracy.

Detail publikace

Název: Adaptation of a Feedforward Artificial Neural Network Using a Linear Transform
Autor: Trmal Jan ; Zelinka Jan ; Luděk Müller
Název - česky: Adaptace ANN pomocí LT
Jazyk publikace: anglicky
Datum vydání: 1.9.2010
Rok vydání: 2010
Typ publikace: Článek z časopisu
Název časopisu / knihy: Text, Speech and Dialogue
Svazek: Lecture Notes in Computer Science
Číslo vydání: 6231
Strana: 423 - 430
ISBN: 3-642-15759-9
ISSN: 0302-9743
Místo vydání: Springer Berlin / Heidelberg
Datum: 6.9.2010 - 10.9.2010
/ 2011-03-15 17:36:49 /


 author = {Trmal Jan and Zelinka Jan and Lud\v{e}k M\"{u}ller},
 title = {Adaptation of a Feedforward Artificial Neural Network Using a Linear Transform},
 year = {2010},
 journal = {Text, Speech and Dialogue},
 address = {Springer Berlin / Heidelberg},
 volume = {6231},
 pages = {423-430},
 series = {Lecture Notes in Computer Science},
 ISBN = {3-642-15759-9},
 ISSN = {0302-9743},
 url = {http://www.kky.zcu.cz/en/publications/TrmalJan_2010_Adaptationof},