Journal Article10.1109/CC.2018.8357692
Microphone array speech enhancement based on tensor filtering methods
8
TL;DR: The experimental results indicate that the proposed tenor filtering methods have potential ability of retrieving the target signal from noisy microphone array signal and the multi-mode Wiener filtering method provides the best denoising results among the three ones.
read more
Abstract: This paper proposes a novel microphone array speech denoising scheme based on tensor filtering methods including truncated HOSVD (High-Order Singular Value Decomposition), low rank tensor approximation and multi-mode Wiener filtering. Microphone array speech signal is represented in three-order tensor space with channel, time, and spectrum modes and then tensor filtering model can be designed to process the multiway array data. As to the first method, noise can be reduced through the truncated HOSVD which is a simple scheme in tensor processing. It is more accurate to find the lower-rank approximation of the three-order tensor with Tucker model. Then MDL (Minimum Description Length) criterion is used to estimate the optimal tensor rank in the second method. Further, multimode Wiener filtering approach upon tensor analysis can be considered as the spanning of one-mode wiener filtering. How to take advantages of tensor model to obtain a set of filters is the heart of the novel scheme. The performances of the proposed three approaches are evaluated with objective indexes and listening quality test. The experimental results indicate that the proposed tenor filtering methods have potential ability of retrieving the target signal from noisy microphone array signal and the multi-mode Wiener filtering method provides the best denoising results among the three ones.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Compression of Head-Related Transfer Function Based on Tucker and Tensor Train Decomposition
TL;DR: A novel compression method combining Tucker model with tensor train decomposition based on a 5-order HRTF tensor model developed in subspaces of an ear, subject, azimuth, elevation, and frequency is proposed.
Speech enhancement via adaptive Wiener filtering and optimized deep learning framework
TL;DR: In this article , the authors proposed a new speech enhancement framework that includes training phase and testing phase, where the input signal is first given to STFT-based noise estimate and NMF-based spectra estimate during the training phase in order to compute the noise spectra and signal spectra, respectively.
4
Tensor completion for recovering multichannel audio signal with missing data
TL;DR: A low-rank Tensor completion method that is a high-order generalization of matrix completion is proposed that outperforms state-of-the-art methods and subjective listening tests indicate that better audio effects were obtained by tensor completion.
4
A new speech signal denoising algorithm using common vector approach
Erol Seke,Kemal Özkan +1 more
TL;DR: A method based on common vector approach (CVA) for reducing losses in single-channel enhancement algorithms for speech denoising and highly promising results are obtained in terms of several speech quality measures.
3
References
A Multilinear Singular Value Decomposition
TL;DR: There is a strong analogy between several properties of the matrix and the higher-order tensor decomposition; uniqueness, link with the matrix eigenvalue decomposition, first-order perturbation effects, etc., are analyzed.
Evaluation of Objective Quality Measures for Speech Enhancement
Yi Hu,Philipos C. Loizou +1 more
TL;DR: The evaluation of correlations of several objective measures with these three subjective rating scales is reported on and several new composite objective measures are also proposed by combining the individual objective measures using nonparametric and parametric regression analysis techniques.
1.9K
Third-Order Tensors as Operators on Matrices: A Theoretical and Computational Framework with Applications in Imaging
TL;DR: This paper investigates further implications including a bilinear operator on the matrices which is nearly an inner product and which leads to definitions for length ofMatrices, angle between two matrices, and orthogonality of matrices and the use of t-linear combinations to characterize the range and kernel of a mapping defined by a third-order tensor and the t-product and the quantification of the dimensions of those sets.
1.1K
Switching adaptive filters for enhancing noisy and reverberant speech from microphone array recordings
D. Van Compernolle
- 03 Apr 1990
TL;DR: Switching adaptive filters, suitable for speech beamforming, with no prior knowledge about the speech source are presented, and the most robust solution, i.e. a delay and sum beamformer that cues in on the direct path only and neglects all multipath contributions is given.
146
Optimal distributed minimum-variance beamforming approaches for speech enhancement in wireless acoustic sensor networks
TL;DR: The algorithmic challenges arising when applying the LCMV beamformer in wireless acoustic sensor networks (WASNs), which are a next-generation technology for audio acquisition and processing are addressed.
105