M. Bulín, J. Švec, and P. Ircing: Full-text search through MALACH archive using speech recognition. 2020.


We present a search engine that enables full-text search through audiovisual archives such as MALACH. We use state-of-the-art speech recognition methods to obtain text transcripts, which are subsequently indexed in a database. Speech recognition is performed in advance to create an index that allows fast search through all indexed records. The search results are presented through a new user interface. The interface is based on current web technologies and provides fast, intuitive audiovisual browsing of the parts containing the searched phrase. It supports multilingual search as well as filtering and sorting of the results. The overall design of the interface is as general as possible, which allows it to be deployed on various MALACH-like archives and, beyond video recordings, to be used for browsing other material such as scanned documents in the same manner.

Title: Full-text search through MALACH archive using speech recognition
Authors: M. Bulín; J. Švec; P. Ircing
Title (Czech): Full-textové vyhledávání v archivu MALACH využitím rozpoznávání řeči
Publication language: English
Date of publication: 27 January 2020
Year of publication: 2020
Publication type: Presentation at conferences and seminars in the Czech Republic
Date: 27-28 January 2020
