6 Papers
Jun Ma is an academic researcher from Shanghai Jiao Tong University. The author has contributed to research in topics: Computer science & Password. The author has an hindex of 1, co-authored 1 publications.
Chat about Author
Papers
Preventing DeepFake Attacks on Speaker Authentication by Dynamic Lip Movement Analysis
TL;DR: Zhang et al. as discussed by the authors proposed a new visual speaker authentication scheme based on the deep convolutional neural network (DCNN), which is composed of two functional parts, namely, the Fundamental Feature Extraction network (FFE-Net) and the Representative lip feature extraction and Classification network (RC-Net).
77
Modification and application of highly active alkaline pectin lyase
TL;DR: In this article , the pectin lyase with the highest expression activity from Bacillus clausii, modified and expressed in Escherichia coli BL21(DE3) was significantly improved.
EfficientTTS 2: Variational End-to-End Text-to-Speech Synthesis and Voice Conversion
Chenfeng Miao,Qingying Zhu,Minchuan Chen,Jun Ma,Shaojun Wang,Jing Xiao +5 more
TL;DR: EfficientTTS 2 is a high-quality end-to-end text-to-speech synthesis and voice conversion framework that is fully differentiable, highly efficient, and achieves comparable or better speech quality than baseline models.
3
Improving End-to-End Modeling For Mandarin-English Code-Switching Using Lightweight Switch-Routing Mixture-of-Experts
Fengyun Tan,Chaofeng Feng,Tao Wei,Shuai Gong,Jinqiang Leng,Wei Chu,Jun Ma,Shaojun Wang,Jing Xiao +8 more
- 20 Aug 2023
TL;DR: A lightweight Switch-Routing network is proposed, which includes two experts and a switch router, which has a better performance on the ASRU code-switching test set and requires much less inference time with RTF decreasing by 31.39 % .
3
Towards Efficiently Learning Monotonic Alignments for Attention-based End-to-End Speech Recognition
Chenfeng Miao,Kun Zou,Ziyang Zhuang,Tao Wei,Jun Ma,Shaojun Wang,Jing Xiao +6 more
- 18 Sep 2022
TL;DR: A new way to train attention-based end-to-end speech recognition models with an additional training objective, allowing the models to learn the monotonic alignments effectively and efficiently is proposed.
1