DARPA - RATS - Robust Automatic Transcription of Speech

Czech title:DARPA - RATS - Robustní automatická transkripce řeči
Reseach leader:Matějka Pavel
Team leaders:Burget Lukáš, Černocký Jan
Team members:Fér Radek, Glembek Ondřej, Heřmanský Hynek, Karafiát Martin, Kobes Michal (UPGM FIT VUT), Novotný Ondřej, Ogawa Tetsuji (WasUni), Ondel Lucas, Plchot Oldřich, Popková Anna (FIT VUT), Silnova Anna, Skácel Miroslav, Veselý Karel
Agency:Raytheon BBN Technologies
Code:P14322-BBN
Start:2015-02-23
End:2017-03-31
Keywords:speech recognition, speaker recognition, language recognition, keyword spotting, robustness, noise, transmission channels
Annotation:
Existing speech signal processing technologies are inadequate for most noisy or degraded speech signals that are important to military intelligence. The Robust Automatic Transcription of Speech (RATS) program is creating algorithms and software for performing the following tasks on potentially speech-containing signals received over communication channels that are extremely noisy and/or highly distorted: Speech Activity Detection, Language Identification, Speaker Identification and Key Word Spotting.

Publications

2016BRUMMER Niko, SWART Albert du Preez, PRIETO Jesús J., GARCIA Perera Leibny Paola, MATĚJKA Pavel, PLCHOT Oldřich, DIEZ Sánchez Mireia, SILNOVA Anna, JIANG Xiaowei, NOVOTNÝ Ondřej, ROHDIN Johan A., GLEMBEK Ondřej, GRÉZL František, BURGET Lukáš, ONDEL Lucas, PEŠÁN Jan, ČERNOCKÝ Jan, KENNY Patrick, ALAM Jahangir, BHATTACHARYA Gautam and ZEINALI Hossein et al. ABC NIST SRE 2016 SYSTEM DESCRIPTION. San Diego: National Institute of Standards and Technology, 2016.
 LI Ruizhi, MALLIDI Sri Harish, PLCHOT Oldřich, BURGET Lukáš and DEHAK Najim. Exploiting Hidden-Layer Responses of Deep Neural Networks for Language Recognition. In: Proceedings of Interspeech 2016. San Francisco: International Speech Communication Association, 2016, pp. 2365-2369. ISBN 978-1-5108-3313-5.
 MATĚJKA Pavel, GLEMBEK Ondřej, NOVOTNÝ Ondřej, PLCHOT Oldřich, GRÉZL František, BURGET Lukáš and ČERNOCKÝ Jan. Analysis Of DNN Approaches To Speaker Identification. In: Proceedings of the 41th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2016), 2016. Shanghai: IEEE Signal Processing Society, 2016, pp. 5100-5104. ISBN 978-1-4799-9988-0.
 NOVOTNÝ Ondřej, MATĚJKA Pavel, GLEMBEK Ondřej, PLCHOT Oldřich, GRÉZL František, BURGET Lukáš and ČERNOCKÝ Jan. Analysis of the DNN-Based SRE Systems in Multi-language Conditions. In: Proceedings of SLT 2016. San Diego: IEEE Signal Processing Society, 2016, pp. 199-204. ISBN 978-1-5090-4903-5.
 NOVOTNÝ Ondřej, MATĚJKA Pavel, PLCHOT Oldřich, GLEMBEK Ondřej, BURGET Lukáš and ČERNOCKÝ Jan. Analysis of Speaker Recognition Systems in Realistic Scenarios of the SITW 2016 Challenge. In: Proceedings of Interspeech 2016. San Francisco: International Speech Communication Association, 2016, pp. 828-832. ISBN 978-1-5108-3313-5.
 PEŠÁN Jan, BURGET Lukáš and ČERNOCKÝ Jan. Sequence Summarizing Neural Networks for Spoken Language Recognition. In: Proceedings of Interspeech 2016. San Francisco: International Speech Communication Association, 2016, pp. 3285-3289. ISBN 978-1-5108-3313-5.
 PLCHOT Oldřich, BURGET Lukáš, ARONOWITZ Hagai and MATĚJKA Pavel. Audio Enhancing With DNN Autoencoder For Speaker Recognition. In: Proceedings of the 41th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2016), 2016. Shanghai: IEEE Signal Processing Society, 2016, pp. 5090-5094. ISBN 978-1-4799-9988-0.
 PLCHOT Oldřich, MATĚJKA Pavel, FÉR Radek, GLEMBEK Ondřej, NOVOTNÝ Ondřej, PEŠÁN Jan, VESELÝ Karel, ONDEL Lucas, KARAFIÁT Martin, GRÉZL František, KESIRAJU Santosh, BURGET Lukáš, BRUMMER Niko, SWART Albert du Preez, CUMANI Sandro, MALLIDI Sri Harish and LI Ruizhi. BAT System Description for NIST LRE 2015. In: Proceedings of Odyssey 2016, The Speaker and Language Recognition Workshop. Bilbao: International Speech Communication Association, 2016, pp. 166-173. ISSN 2312-2846.
2015CUMANI Sandro, PLCHOT Oldřich and FÉR Radek. Exploiting i-vector posterior covariances for short-duration language recognition. In: Proceedings of Interspeech 2015. Dresden: International Speech Communication Association, 2015, pp. 1002-1006. ISBN 978-1-5108-1790-6. ISSN 1990-9772.
 FÉR Radek, MATĚJKA Pavel, GRÉZL František, PLCHOT Oldřich and ČERNOCKÝ Jan. Multilingual Bottleneck Features for Language Recognition. In: Proceedings of Interspeech 2015. Dresden: International Speech Communication Association, 2015, pp. 389-393. ISBN 978-1-5108-1790-6. ISSN 1990-9772.
 PEŠÁN Jan, BURGET Lukáš, HEŘMANSKÝ Hynek and VESELÝ Karel. DNN derived filters for processing of modulation spectrum of speech. In: Proceedings of Interspeech 2015. Dresden: International Speech Communication Association, 2015, pp. 1908-1911. ISBN 978-1-5108-1790-6. ISSN 1990-9772.

Your IPv4 address: 54.225.41.203
Switch to IPv6 connection

DNSSEC [dnssec]