TL;DR: A new de-essing approach based on a convolutional neural network architecture is presented that outperforms existing de-esser plugins in terms of erroneous signal attenuation and was rated favorably by audio professionals.
Abstract: De-essing is the process of attenuating vocal sibilance in audio recordings. Especially in audio mastering, conventional de-essers often degrade the clarity of the source signal due to unreliable differentiation between vocal sibilance and other high-pitched sounds. Machine learning poses a promising solution to this problem. In this context, a new de-essing approach based on a convolutional neural network architecture is presented. The introduced prototype de-esser outperforms existing de-esser plugins in terms of erroneous signal attenuation and was rated favorably by audio professionals.
TL;DR: When processing a vocal track a commonly used technique is de-essing, the process of attenuating sibilant sounds, which can be done both with a dedicated device and using a non-dedicated device.
Abstract: When processing a vocal track a commonly used technique is de-essing, the process of attenuating sibilant sounds. This can be done in different ways namely; de-essing with a dedicated device, using ...