TL;DR: This paper presents a meta-modelling architecture for microphone Array Processing that automates the very labor-intensive and therefore time-heavy and expensive process of manually shaping Microphone Arrays for Speech Input in Automobiles.
Abstract: I. Speech Enhancement.- 1 Constant Directivity Beamforming.- 2 Superdirective Microphone Arrays.- 3 Post-Filtering Techniques.- 4 Spatial Coherence Functions for Differential Microphones in Isotropic Noise Fields.- 5 Robust Adaptive Beamforming.- 6 GSVD-Based Optimal Filtering for Multi-Microphone Speech Enhancement.- 7 Explicit Speech Modeling for Microphone Array Speech Acquisition.- II. Source Localization.- 8 Robust Localization in Reverberant Rooms.- 9 Multi-Source Localization Strategies.- 10 Joint Audio-Video Signal Processing for Object Localization and Tracking.- III. Applications.- 11 Microphone-Array Hearing Aids.- 12 Small Microphone Arrays with Postfilters for Noise and Acoustic Echo Reduction.- 13 Acoustic Echo Cancellation for Beamforming Microphone Arrays.- 14 Optimal and Adaptive Microphone Arrays for Speech Input in Automobiles.- 15 Speech Recognition with Microphone Arrays.- 16 Blind Separation of Acoustic Signals.- IV. Open Problems and Future Directions.- 17 Future Directions for Microphone Arrays.- 18 Future Directions in Microphone Array Processing.
TL;DR: A new version of the classical deconvolution method CLEAN is proposed here: CLEAN-SC, which is based on spatial source coherence, and side lobes can be removed of actually measured beam patterns of measured noise sources.
Abstract: To obtain higher resolution acoustic source plots from microphone array measurements, deconvolution techniques are becoming increasingly popular. Deconvolution algorithms aim at identifying Point Spread Functions (PSF) in source plots, and may therefore fall short when actual beam patterns of measured noise sources are not similar to synthetically obtained PSF's. To overcome this, a new version of the classical deconvolution method CLEAN is proposed here: CLEAN-SC. By this new method, which is based on spatial source coherence, side lobes can be removed of actually measured beam patterns. Essentially, CLEAN-SC iteratively removes the part of the source plot which is spatially coherent with the peak source. A feature of CLEAN-SC is its ability to extract absolute sound power levels from the source plots. The merits of CLEAN-SC were demonstrated using array measurements of airframe noise on a scale model of the Airbus A340 in the 8×6 m2 closed test section of DNW-LLF.
TL;DR: DEMAND (Diverse Environments Multi-channel Acoustic Noise Database) is provided, providing a set of 16-channel noise files recorded in a variety of indoor and outdoor settings to encourage research into algorithms beyond the stereo setup.
Abstract: Multi-microphone arrays allow for the use of spatial filtering techniques that can greatly improve noise reduction and source separation. However, for speech and audio data, work on noise reduction or separation has focused primarily on one- or two-channel systems. Because of this, databases of multichannel environmental noise are not widely available. DEMAND (Diverse Environments Multi-channel Acoustic Noise Database) addresses this problem by providing a set of 16-channel noise files recorded in a variety of indoor and outdoor settings. The data was recorded using a planar microphone array consisting of four staggered rows, with the smallest distance between microphones being 5 cm and the largest being 21.8 cm. DEMAND is freely available under a Creative Commons license to encourage research into algorithms beyond the stereo setup.
TL;DR: This paper describes a beamforming microphone array consisting of pressure microphones that are mounted on the surface of a rigid sphere based on a spherical harmonic decomposition of the soundfield that allows a simple and computationally effective, yet flexible beamformer structure.
Abstract: This paper describes a beamforming microphone array consisting of pressure microphones that are mounted on the surface of a rigid sphere. The beamformer is based on a spherical harmonic decomposition of the soundfield. We show that this allows a simple and computationally effective, yet flexible beamformer structure. The look-direction can be steered to any direction in 3-D space without changing the beampattern. In general the number of sensors and their location is quite arbitrary as long as they hold a certain orthogonality constraint that we derived. For a practical example we chose a spherical array with 32 elements. The microphones are located at the center of the faces of a truncated icosahedron. The radius of the sphere is 5 cm. With this setup we can achieve a Directivity Index of 12 dB and higher. The operating frequency range is from 100 Hz to 5 kHz.
TL;DR: In this article, a system and method for teleconferencing and recording of meetings is presented, which uses a variety of capture devices (a 360° camera, a whiteboard camera, presenter view camera, and a remote view camera) to provide a rich experience for people who want to participate in a meeting from a distance.
Abstract: A system and method for teleconferencing and recording of meetings. The system uses a variety of capture devices (a novel 360° camera, a whiteboard camera, a presenter view camera, a remote view camera, and a microphone array) to provide a rich experience for people who want to participate in a meeting from a distance. The system is also combined with speaker clustering, spatial indexing, and time compression to provide a rich experience for people who miss a meeting and want to watch it afterward.