Source separation

Topic Tools

Papers published on a yearly basis

Papers

Journal Article•10.1162/NECO.1995.7.6.1129•

An information-maximization approach to blind separation and blind deconvolution

[...]

Anthony J. Bell¹, Terrence J. Sejnowski¹•Institutions (1)

University of California, San Diego¹

01 Nov 1995-Neural Computation

TL;DR: It is suggested that information maximization provides a unifying framework for problems in "blind" signal processing and dependencies of information transfer on time delays are derived.

...read moreread less

Abstract: We derive a new self-organizing learning algorithm that maximizes the information transferred in a network of nonlinear units. The algorithm does not assume any knowledge of the input distributions, and is defined here for the zero-noise limit. Under these conditions, information maximization has extra properties not found in the linear case (Linsker 1989). The nonlinearities in the transfer function are able to pick up higher-order moments of the input distributions and perform something akin to true redundancy reduction between units in the output representation. This enables the network to separate statistically independent components in the inputs: a higher-order generalization of principal components analysis. We apply the network to the source separation (or cocktail party) problem, successfully separating unknown mixtures of up to 10 speakers. We also show that a variant on the network architecture is able to perform blind deconvolution (cancellation of unknown echoes and reverberation in a speech signal). Finally, we derive dependencies of information transfer on time delays. We suggest that information maximization provides a unifying framework for problems in "blind" signal processing.

...read moreread less

9,907 citations

Journal Article•10.1016/S0893-6080(00)00026-5•

Independent component analysis: algorithms and applications

[...]

Aapo Hyvärinen¹, Erkki Oja¹•Institutions (1)

Helsinki University of Technology¹

01 May 2000-Neural Networks

TL;DR: The basic theory and applications of ICA are presented, and the goal is to find a linear representation of non-Gaussian data so that the components are statistically independent, or as independent as possible.

...read moreread less

9,703 citations

Journal Article•10.1109/TSA.2005.858005•

Performance measurement in blind audio source separation

[...]

Emmanuel Vincent¹, Rémi Gribonval, Cédric Févotte•Institutions (1)

Queen Mary University of London¹

01 Jul 2006-IEEE Transactions on Audio, Speech, and Language Processing

TL;DR: This paper considers four different sets of allowed distortions in blind audio source separation algorithms, from time-invariant gains to time-varying filters, and derives a global performance measure using an energy ratio, plus a separate performance measure for each error term.

...read moreread less

Abstract: In this paper, we discuss the evaluation of blind audio source separation (BASS) algorithms. Depending on the exact application, different distortions can be allowed between an estimated source and the wanted true source. We consider four different sets of such allowed distortions, from time-invariant gains to time-varying filters. In each case, we decompose the estimated source into a true source part plus error terms corresponding to interferences, additive noise, and algorithmic artifacts. Then, we derive a global performance measure using an energy ratio, plus a separate performance measure for each error term. These measures are computed and discussed on the results of several BASS problems with various difficulty levels

...read moreread less

3,467 citations

Journal Article•10.1109/78.554307•

A blind source separation technique using second-order statistics

[...]

Adel Belouchrani¹, Karim Abed-Meraim², Jean-François Cardoso³, Eric Moulines³•Institutions (3)

Villanova University¹, University of Melbourne², École Normale Supérieure³

01 Feb 1997-IEEE Transactions on Signal Processing

TL;DR: A new source separation technique exploiting the time coherence of the source signals is introduced, which relies only on stationary second-order statistics that are based on a joint diagonalization of a set of covariance matrices.

...read moreread less

Abstract: Separation of sources consists of recovering a set of signals of which only instantaneous linear mixtures are observed. In many situations, no a priori information on the mixing matrix is available: The linear mixture should be "blindly" processed. This typically occurs in narrowband array processing applications when the array manifold is unknown or distorted. This paper introduces a new source separation technique exploiting the time coherence of the source signals. In contrast with other previously reported techniques, the proposed approach relies only on stationary second-order statistics that are based on a joint diagonalization of a set of covariance matrices. Asymptotic performance analysis of this method is carried out; some numerical simulations are provided to illustrate the effectiveness of the proposed method.

...read moreread less

2,975 citations

Proceedings Article•10.1109/ICASSP.2016.7471631•

Deep clustering: Discriminative embeddings for segmentation and separation

[...]

John R. Hershey¹, Zhuo Chen², Jonathan Le Roux¹, Shinji Watanabe¹•Institutions (2)

Mitsubishi Electric Research Laboratories¹, Columbia University²

20 Mar 2016

TL;DR: In this paper, a deep network is trained to assign contrastive embedding vectors to each time-frequency region of the spectrogram in order to implicitly predict the segmentation labels of the target spectrogram from the input mixtures.

...read moreread less

Abstract: We address the problem of "cocktail-party" source separation in a deep learning framework called deep clustering. Previous deep network approaches to separation have shown promising performance in scenarios with a fixed number of sources, each belonging to a distinct signal class, such as speech and noise. However, for arbitrary source classes and number, "class-based" methods are not suitable. Instead, we train a deep network to assign contrastive embedding vectors to each time-frequency region of the spectrogram in order to implicitly predict the segmentation labels of the target spectrogram from the input mixtures. This yields a deep network-based analogue to spectral clustering, in that the embeddings form a low-rank pair-wise affinity matrix that approximates the ideal affinity matrix, while enabling much faster performance. At test time, the clustering step "decodes" the segmentation implicit in the embeddings by optimizing K-means with respect to the unknown assignments. Preliminary experiments on single-channel mixtures from multiple speakers show that a speaker-independent model trained on two-speaker mixtures can improve signal quality for mixtures of held-out speakers by an average of 6dB. More dramatically, the same model does surprisingly well with three-speaker mixtures.

...read moreread less

1,735 citations

...

Expand

Year	Papers
2025	47
2024	96
2023	139
2022	215
2021	198
2020	248

Topic Tools

Papers published on a yearly basis

Papers

An information-maximization approach to blind separation and blind deconvolution

Independent component analysis: algorithms and applications

Performance measurement in blind audio source separation

A blind source separation technique using second-order statistics

Deep clustering: Discriminative embeddings for segmentation and separation

Related Topics (5)

Performance Metrics