Jan Trmal
Johns Hopkins University
64 Papers
395 Citations
Jan Trmal is an academic researcher from Johns Hopkins University. The author has contributed to research in topics: Computer science & Artificial neural network. The author has an hindex of 15, co-authored 63 publications. Previous affiliations of Jan Trmal include University of West Bohemia.
Chat about Author
Papers
Multi-Task Self-Supervised Learning for Robust Speech Recognition
Mirco Ravanelli,Jianyuan Zhong,Santiago Pascual,Pawel Swietojanski,Joao Monteiro,Jan Trmal,Yoshua Bengio +6 more
- 25 Jan 2020
TL;DR: PASE+ is proposed, an improved version of PASE that better learns short- and long-term speech dynamics with an efficient combination of recurrent and convolutional networks and learns transferable representations suitable for highly mismatched acoustic conditions.
366
Improving deep neural network acoustic models using generalized maxout networks
Xiaohui Zhang,Jan Trmal,Daniel Povey,Sanjeev Khudanpur +3 more
- 04 May 2014
TL;DR: This paper introduces two new types of generalized maxout units, which they are called p-norm and soft-maxout, and presents a method to control that instability during training when training unbounded-output nonlinearities.
343
The Fifth 'CHiME' Speech Separation and Recognition Challenge: Dataset, Task and Baselines.
Jon Barker,Shinji Watanabe,Emmanuel Vincent,Jan Trmal +3 more
- 02 Sep 2018
TL;DR: The 5th CHiME Challenge is introduced, which considers the task of distant multi-microphone conversational ASR in real home environments and describes the data collection procedure, the task, and the baseline systems for array synchronization, speech enhancement, and conventional and end-to-end ASR.
282
•Posted Content
CHiME-6 Challenge:Tackling Multispeaker Speech Recognition for Unsegmented Recordings
Shinji Watanabe,Michael I. Mandel,Jon Barker,Emmanuel Vincent,Ashish Arora,Xuankai Chang,Sanjeev Khudanpur,Vimal Manohar,Daniel Povey,Desh Raj,David Snyder,Aswin Shanmugam Subramanian,Jan Trmal,Bar Ben Yair,Christoph Boeddeker,Zhaoheng Ni,Yusuke Fujita,Shota Horiguchi,Naoyuki Kanda,Takuya Yoshioka,Neville Ryant +20 more
TL;DR: Of note, Track 2 is the first challenge activity in the community to tackle an unsegmented multispeaker speech recognition scenario with a complete set of reproducible open source baselines providing speech enhancement, speaker diarization, and speech recognition modules.
CHiME-6 Challenge: Tackling multispeaker speech recognition for unsegmented recordings
Shinji Watanabe,Michael I. Mandel,Jon Barker,Emmanuel Vincent,Ashish Arora,Xuankai Chang,Sanjeev Khudanpur,Vimal Manohar,Daniel Povey,Desh Raj,David Snyder,Aswin Shanmugam Subramanian,Jan Trmal,Bar Ben Yair,Christoph Boeddeker,Zhaoheng Ni,Yusuke Fujita,Shota Horiguchi,Naoyuki Kanda,Takuya Yoshioka,Neville Ryant +20 more
- 04 May 2020
TL;DR: The 6th CHiME Speech Separation and Recognition Challenge (CHiME-6) as mentioned in this paper was the first challenge activity in the community to tackle an unsegmented multispeaker speech recognition scenario with a complete set of reproducible open source baselines.
202