Open Access
TarsosDSP, a Real-Time Audio Processing Framework in Java
Joren Six,Olmo Cornelis,Marc Leman +2 more
- 27 Jan 2014
TL;DR: TarsosDSP is one of a only a few frameworks that offers both analysis, processing and feature extraction in real-time, a unique feature in the Java ecosystem.
read more
Abstract: This paper presents TarsosDSP, a framework for real-time audio analysis and processing. Most libraries and frameworks offer either audio analysis and feature extraction or audio synthesis and processing. TarsosDSP is one of a only a few frameworks that offers both analysis, processing and feature extraction in real-time, a unique feature in the Java ecosystem. The framework contains practical audio processing algorithms, it can be extended easily, and has no external dependencies. Each algorithm is implemented as simple as possible thanks to a straightforward processing pipeline. TarsosDSP's features include a resampling algorithm, onset detectors, a number of pitch estimation algorithms, a time stretch algorithm, a pitch shifting algorithm, and an algorithm to calculate the Constant-Q. The framework also allows simple audio synthesis, some audio effects, and several filters. The Open Source framework is a valuable contribution to the MIR-Community and ideal fit for interactive MIR-applications on Android.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
•Proceedings Article
Panako: a scalable acoustic fingerprinting system handling time-scale and pitch modification
Joren Six,Marc Leman +1 more
- 01 Jan 2014
TL;DR: A scalable granular acoustic fingerprinting system robust against time and pitch scale modification is presented, designed to be robust against pitch shifting, time stretching and tempo changes, while remaining scalable.
50
TILES audio recorder: an unobtrusive wearable solution to track audio activity
Tiantian Feng,Amrutha Nadarajan,Colin Vaz,Brandon M. Booth,Shrikanth S. Narayanan +4 more
- 10 Jun 2018
TL;DR: The TILES Audio Recorder (TAR) is proposed - an unobtrusive and scalable solution to track audio activity using an affordable miniature mobile device with an open-source app, and shows that performing feature extraction only during speech segments greatly increases battery life.
38
Clinical feedback and technology selection of game based dysphonic rehabilitation tool
Zhihan Lv,Chantal Esteve,Javier Chirivella,Pablo Gagliardo +3 more
- 20 May 2015
TL;DR: An assistive training tool software for rehabilitation of dysphonic patients is evaluated according to the practical clinical feedback from the treatments.
36
•Posted Content
Preprint Clinical Feedback and Technology Selection of Game Based Dysphonic Rehabilitation Tool
TL;DR: In this article, an assistive training tool software for rehabilitation of dysphonic patients is evaluated according to the practical clinical feedback from the treatments, which employs a serious game as the attractive logic part and running on the tablet with normal microphone as input device.
27
Context-Aware Speech Stress Detection in Hospital Workers Using Bi-LSTM Classifiers
Amr Gaballah,Abhishek Tiwari,Shrikanth S. Narayanan,Tiago H. Falk +3 more
- 06 Jun 2021
TL;DR: In this article, a context-aware speech-based system for stress detection is presented, where the importance of context-awareness for stress level detection based on a bidirectional LSTM deep neural network is shown.
22
References
YIN, a fundamental frequency estimator for speech and music
TL;DR: An algorithm is presented for the estimation of the fundamental frequency (F0) of speech or musical sounds, based on the well-known autocorrelation method with a number of modifications that combine to prevent errors.
An overlap-add technique based on waveform similarity (WSOLA) for high quality time-scale modification of speech
Werner Verhelst,M. Roelands +1 more
- 27 Apr 1993
TL;DR: The resulting WSOLA (waveform-similarity-based synchronized overlap-add) algorithm produces high-quality speech output, is algorithmically and computationally efficient and robust, and allows for online processing with arbitrary time-scaling factors.
501
MARSYAS: a framework for audio analysis
George Tzanetakis,Perry R. Cook +1 more
TL;DR: This paper describes MARSYAS, a framework for experimenting, evaluating and integrating techniques for audio content analysis in restricted domains and a new method for temporal segmentation based on audio texture that is combined with audio analysis techniques and used for hierarchical browsing, classification and annotation of audio files.
Essentia: An Audio Analysis Library for Music Information Retrieval.
Dmitry Bogdanov,Nicolas Wack,Emilia Gómez,Sankalp Gulati,Perfecto Herrera,Oscar Mayor,Gerard Roma,Justin Salamon,Jose R. Zapata,Xavier Serra +9 more
- 04 Nov 2013
TL;DR: Comunicacio presentada a la 14th International Society for Music Information Retrieval Conference, celebrada a Curitiba (Brasil) els dies 4 a 8 de novembre de 2013.
Automatic Extraction of Tempo and Beat From Expressive Performances
TL;DR: It is shown that estimating the perceptual salience of rhythmic events significantly improves the results of a computer program which is able to estimate the tempo and the times of musical beats in expressively performed music.