Jan Trmal

Johns Hopkins University

64 Papers

395 Citations

Jan Trmal is an academic researcher at Johns Hopkins University. The author has contributed to research on the topics of computer science and artificial neural networks, has an h-index of 15, and has co-authored 63 publications. Previous affiliations of Jan Trmal include the University of West Bohemia.

Papers

•Proceedings Article•10.1109/ICASSP40776.2020.9053569

Multi-Task Self-Supervised Learning for Robust Speech Recognition

Mirco Ravanelli, +6 more

- 25 Jan 2020

TL;DR: PASE+ is proposed, an improved version of PASE that better learns short- and long-term speech dynamics with an efficient combination of recurrent and convolutional networks and learns transferable representations suitable for highly mismatched acoustic conditions.

366

•Proceedings Article•10.1109/ICASSP.2014.6853589

Improving deep neural network acoustic models using generalized maxout networks

Xiaohui Zhang, +3 more

- 04 May 2014

TL;DR: This paper introduces two new types of generalized maxout units, called p-norm and soft-maxout, and presents a method to control the instability that arises when training unbounded-output nonlinearities.

343

•Proceedings Article•10.21437/INTERSPEECH.2018-1768

The Fifth 'CHiME' Speech Separation and Recognition Challenge: Dataset, Task and Baselines.

Jon Barker, +3 more

- 02 Sep 2018

TL;DR: The 5th CHiME Challenge is introduced, which considers the task of distant multi-microphone conversational ASR in real home environments and describes the data collection procedure, the task, and the baseline systems for array synchronization, speech enhancement, and conventional and end-to-end ASR.

282

•Posted Content

CHiME-6 Challenge: Tackling Multispeaker Speech Recognition for Unsegmented Recordings

Shinji Watanabe, +20 more

- 20 Apr 2020

- arXiv: Sound

TL;DR: Of note, Track 2 is the first challenge activity in the community to tackle an unsegmented multispeaker speech recognition scenario with a complete set of reproducible open source baselines providing speech enhancement, speaker diarization, and speech recognition modules.

267

•Proceedings Article•10.21437/CHIME.2020-1

CHiME-6 Challenge: Tackling multispeaker speech recognition for unsegmented recordings

Shinji Watanabe, +20 more

- 04 May 2020

TL;DR: The 6th CHiME Speech Separation and Recognition Challenge (CHiME-6) was the first challenge activity in the community to tackle an unsegmented multispeaker speech recognition scenario with a complete set of reproducible open source baselines.

202
