Scispace (Formerly Typeset)
  1. Home
  2. Topics
  3. Pitch detection algorithm
  4. 2018
  1. Home
  2. Topics
  3. Pitch detection algorithm
  4. 2018
Showing papers on "Pitch detection algorithm published in 2018"
Proceedings Article•10.1109/ICASSP.2018.8462079•
Automatic Music Transcription Leveraging Generalized Cepstral Features and Deep Learning

[...]

Yu-Te Wu1, Berlin Chen1, Li Su2•
National Taiwan Normal University1, Academia Sinica2
10 Sep 2018
TL;DR: This paper shows that using multiple features represented in both the frequency and time domains with deep learning modeling can reduce such interference in musical signals with multiple concurrent pitches.
Abstract: Spectral features are limited in modeling musical signals with multiple concurrent pitches due to the challenge to suppress the interference of the harmonic peaks from one pitch to another. In this paper, we show that using multiple features represented in both the frequency and time domains with deep learning modeling can reduce such interference. These features are derived systematically from conventional pitch detection functions that relate to one another through the discrete Fourier transform and a nonlinear scaling function. Neural networks modeled with these features outperform state-of-the-art methods while using less training data.

15 citations

Proceedings Article•10.1109/ICASSP.2018.8461794•
A Parallel Fusion Approach to Piano Music Transcription Based on Convolutional Neural Network

[...]

Fu'ze Cong1, Shuchang Liu1, Li Guo1, Geraint A. Wiggins2•
Beijing University of Posts and Telecommunications1, Queen Mary University of London2
13 Apr 2018
TL;DR: Experimental results reveal that the proposed approach based on Convolutional Neural Networks for polyphonic piano transcription preforms better in both frame- and note-based metrics.
Abstract: In this paper, a supervised approach based on Convolutional Neural Networks (CNN) for polyphonic piano transcription is presented. The system consists of pitch detection model, onset/offset detection model, and note search model. The pitch detection model is a single-channel CNN predicting the probabilities of pitches contained in one frame of the audio. The onset/offset model based on dual-channel CNN is used for estimating the probabilities of each pitch's onset or offset in a frame. The note search model is rule-based; it integrates the outputs of the pitch model and onset/offset model to determine the final onset, offset and pitch of notes in audio. Two experiments with different dataset conditions are accomplished to compare with state-of-the-art approaches on the same datasets. Experimental results reveal that the proposed approach preforms better in both frame- and note-based metrics.

14 citations

Journal Article•10.1109/LSP.2018.2874155•
Traditional Machine Learning for Pitch Detection

[...]

Thomas Drugman1, Goeric Huybrechts1, Viacheslav Klimkov1, Alexis Moinet1•
Amazon.com1
04 Oct 2018-IEEE Signal Processing Letters
TL;DR: In this paper, the authors consider voicing detection as a classification problem and F0 contour estimation as a regression problem, and assess the discrimination power of existing and proposed features through mutual information.
Abstract: Pitch detection is a fundamental problem in speech processing as F0 is used in a large number of applications. Recent papers have proposed deep learning for robust pitch tracking. In this letter, we consider voicing detection as a classification problem and F0 contour estimation as a regression problem. For both tasks, acoustic features from multiple domains and traditional machine learning methods are used. The discrimination power of existing and proposed features is assessed through mutual information. Multiple supervised and unsupervised approaches are compared. A significant relative reduction of voicing errors over the best baseline is obtained—20% with the best clustering method ( K -means) and 45% with a multi-layer perceptron. For F0 contour estimation, the benefits of regression techniques are limited though. We investigate whether those objective gains translate in a parametric synthesis task. Clear perceptual preferences are observed for the proposed approach over two widely used baselines (robust algorithm for pitch tracking (RAPT) and distributed inline-filter operation (DIO)).

13 citations

Proceedings Article•10.1109/ICASSP.2018.8462482•
Use of Pitch Continuity for Robust Speech Activity Detection

[...]

Yiwen Shao, Qiguang Lin
15 Apr 2018
TL;DR: A novel way to integrate the pitch continuity with pitch-related features is proposed and this feature leads to better SAD performance (with an up to 39.3% relative improvement on miss rate compared to Combo-SAD).
Abstract: Speech activity detection (SAD) is an important component for various speech processing applications and has been researched extensively recently. The pitch continuity, a significant characteristic of speech, however, has not successfully played a role in existing SAD methods. In this work, we propose a novel way to integrate the pitch continuity with pitch-related features. Practice is carried out through the Combo-SAD approach: We examine three consecutive frames and assume that they all have the same pitch as the center frame due to pitch continuity. Corresponding feature values are recomputed at the adjusted pitch location and then used in the final expression. The new combo feature is evaluated with various types of additive noise at different signal-to-noise ratios (SNR). The results show that the new feature leads to better SAD performance (with an up to 39.3% relative improvement on miss rate compared to Combo-SAD). We also introduce a novel variant of the underlying autocorrelation function and illustrate how it can improve the accuracy of pitch detection.

11 citations

Proceedings Article•10.21437/INTERSPEECH.2018-45•
A Novel Normalization Method for Autocorrelation Function for Pitch Detection and for Speech Activity Detection.

[...]

Qiguang Lin, Yiwen Shao1•
Johns Hopkins University1
2 Sep 2018
TL;DR: A novel normalization method, eACF, is proposed that can both mitigate the tapering effect and minimize the overcompensation of wACF and leads to better performance both in terms of pitch detection and speech activity detection.
Abstract: Autocorrelation functions (ACF) have been used in various pitch detection algorithms (PDA) and voicing-feature based speech activity detection (SAD) techniques. Speech is assumed to be stationary over a short-term window, and a Hanning window is typically applied in the calculation of ACF. As a result of windowing, the ACF tapers as the autocorrelation lags increase. Boersma demonstrated that the tapering effect could be compensated for by dividing the ACF of the windowed signal by the autocorrelation of the windowing function itself, referred to as wACF hereafter. We recently found that wACF could cause overcompensation and therefore, result in errors in pitch detection. In this paper, a novel normalization method, eACF, is proposed that can both mitigate the tapering effect and minimize the overcompensation. The new method is evaluated on synthetic speech and on the TIMIT database with various types of additive noise at different signal-to-noise (SNR) ratios. The results show that the new method leads to better performance both in terms of pitch detection and speech activity detection. In this paper, we also investigate the scenarios where applying the wACF method is advantageous and where it is not.

8 citations

Journal Article•10.21608/MJEER.2018.64399•
Efficient Implementation of Adaptive Wiener Filter for Pitch Detection from Noisy Speech Signals

[...]

Marwa A. Nasr, Mohammed Abd-Elnaby, Adel S. El-Fishawy, Sayed El-Rabaie, Fathi E. Abd El-Samie 
1 Jan 2018

6 citations

Proceedings Article•10.1109/ISSPIT.2018.8642626•
Real-Time Polyphonic Pitch Detection on Acoustic Musical Signals

[...]

Thomas A. Goodman1, Ian Batten1•
University of Birmingham1
1 Dec 2018
TL;DR: An algorithm for fundamental frequency detection on polyphonic acoustic musical signals, based on a new ‘raking’ method over the frequency-domain spectra, which boasts a good accuracy compared to other such methods, as well as the ability to function effectively in real-time.
Abstract: This paper presents an algorithm for fundamental frequency detection on polyphonic acoustic musical signals, based on a new ‘raking’ method over the frequency-domain spectra. The algorithm is evaluated as a classifier, and boasts a good accuracy (83.20%) compared to other such methods, as well as the ability to function effectively in real-time, with a running-speed below 140ms per window evaluated. This proves to be real-time for the use-case, as the latency between an auditory stimulus and its perception by a person has been shown to be longer than this. The algorithm itself runs in linear-time, but is thus slowed by the O(nlog(n)) Fast Fourier Transform during preprocessing. Though the algorithm fails to account for certain edge-cases with overlapping harmonics as well as certain instruments, future work and improvements are also presented, paving the way for further research.

6 citations

Dissertation•
The auditory fingerprint : multidimensional characterization of individual pitch perception in musicians

[...]

J Benner
2 Feb 2018
TL;DR: In this paper, audio fingerprints were created basically based on a composite of five psychoacoustic hearing tests assessing subjective pitch perception in musicians, including 93 musicians including 49 professionals and 44 amateurs, were individually measured for: (a) pitch perception preference (holistic vs. spectral mode), (b) relative pitch perception (musical interval recognition), (c) absolute pitch perception(perfect pitch), (d) frequency discrimination threshold (just noticeable difference), and (e) auditory imagery of tone-sequences).
Abstract: Musicians have been repeatedly reported to show remarkable inter-individual differences in elementary hearing functions, sound perception mode, musical instrument preference, performance style, as well as musical abilities such as absolute- and relative pitch perception or auditory imagery (audiation). However, relevant literature in the field regarding perceptual and psychophysical aspects of sound and particularly pitch perception is highly contradictory, and subjective differences are mostly unconsidered. Moreover, it is largely unexplored how individual differences in (musical) pitch perception are related to further musical abilities and behavior. In the present work, “auditory fingerprints” were created basically based on a composite of five psychoacoustic hearing tests assessing subjective pitch perception in musicians. A total of 93 musicians, including 49 professionals and 44 amateurs, were individually measured for: (a) pitch perception preference (holistic vs. spectral mode), (b) relative pitch perception (musical interval recognition), (c) absolute pitch perception („perfect pitch“), (d) frequency discrimination threshold (just noticeable difference), and (e) auditory imagery of tone-sequences. Overall, eight psychoacoustic parameters were extracted and analyzed using statistical methods. In addition, preferences for musical instruments were evaluated. On the individual level, the results show a high inter-individual variability across the eight psychoacoustic parameters, reflecting clear individual differences in pitch perception and related musical abilities. In addition, principal component analysis (PCA) revealed four different main components to sufficiently represent coherent aspects of the psychoacoustic data, namely: tonal musicality, pitch & timbre preference, low-band sensitivity and high-band sensitivity. On the group level, multi-parametric cluster analyses revealed three sub-groups of subjects, showing significantly different results with respect to the underlying perceptional patterns. Consequently, at least three different modes of pitch perception are suggested, characterized by: 1. Pronounced analytic pattern recognition, focused on spectra / timbre and sensitive to single tones; 2. Pronounced holistic pattern recognition, focused on (missing) fundamental pitch and rather insensitive to single tones; 3. Less pronounced audiation and pitch detection abilities, linked to ambiguous multi-pitch sensations (“balanced mode”). Taken together, the findings suggest that individual “auditory fingerprints” extracted from psychoacoustic hearing tests, reflect remarkable inter-individual differences, but also typical patterns of perceiving (musical) pitch and sound. It can be concluded, that specific auditory characteristics are related to the individual musical (instrument) preference, style and performance of musicians, as well their learning abilities.

5 citations

Patent•
Three-coordinate radar plot clotting method based on 2D sliding window local extremum

[...]

Xia Yonghong, Kuang Huaxing, Ding Chun, Yao Yuan
9 Jan 2018
TL;DR: In this article, a three-coordinate radar plot clotting method based on 2D sliding window local extremum was proposed for the detection of a target in the azimuth wave position.
Abstract: The invention provides a three-coordinate radar plot clotting method based on 2D sliding window local extremum In the range dimension, range detection clotting is carried out based on the M/N criterion In the pitch dimension, pitch detection clotting is carried out through an 'I'-shaped search window with range lattices and pitch beams as units and based on a 2D sliding window local extremum method In the azimuth dimension, azimuth detection clotting is carried out through a 'nine-rectangle-grid' search window based on a 2D sliding window When that the extension of a target in the azimuthwave position ends is determined, final plot clotting information is output According to the method of the invention, through fine detection clotting based on a 2D sliding window in the pitch and azimuth dimensions, the problem that 'missing detection' is caused in azimuth detection clotting as a target cannot be 'aligned' in range and pitch in different azimuth wave positions is solved, and theprobability of target plot detection is improved

4 citations

Patent•
Musical performance evaluation system and method

[...]

Walder Samuel Speizman, Indukuri Vishnu
3 May 2018
TL;DR: In this paper, a musical performance evaluation method utilizes a musical composition database stored on a server containing musical encodings of all notes to be performed in a musical compositions, and a student user performing requested musical composition for evaluation may use a client application from a client computing device to receive audio input from the performance and compare identified frequencies to intended frequencies, and identify deviations from pitch, rhythm, and tempo throughout the performance.
Abstract: According to embodiments of the present invention, a musical performance evaluation method utilizes a musical composition database stored on a server containing musical encodings of all notes to be performed in a musical composition. A student user performing requested musical composition for evaluation may use a client application from a client computing device to receive audio input from the performance and compare identified frequencies to intended frequencies, and identify deviations from pitch, rhythm, and tempo throughout the performance. Deviations may be identified by switching between several pitch detection algorithms based upon a type of note (single note, chord, or plucked string) expected to be played by a student user in real time. Factors for deducting from a performance pitch score and a performance rhythm score are determined and used to calculate a performance pitch score and a performance rhythm score. Error data and scores generated may be transmitted to a server and stored in a historical session database. A student user and an instructor user may review the historical session database to determine the student's progress in musical education.

4 citations

Proceedings Article•10.1109/ICOMIS.2018.8645016•
Combining Zero Replacement Speech Enhancement with Lag Window Method for Pitch Detection

[...]

Sicong Du1, Yosuke Sugiura1, Tetsuya Shimamura1•
Saitama University1
1 Dec 2018
TL;DR: An anti- noise pitch detection method that combines a speech enhancement algorithm with a spectral flattening algorithm is proposed that has the lowest gross pitch error (GPE) rate among all the methods when dealing with white-noise added male speeches.
Abstract: Pitch is one of the most essential features in human speech analysis. Although numerous pitch detection methods have been developed, it is still a challenge to provide a high pitch detection performance in noisy environments. In this paper, we propose an anti-noise pitch detection method that combines a speech enhancement algorithm with a spectral flattening algorithm. In the experiments, we compare the proposed method with several widely used or state-of-the-art pitch detection methods. The results show that the proposed method has the lowest gross pitch error (GPE) rate among all the methods when dealing with white-noise added male speeches. Moreover, comparing the pitches estimated by the proposed method to those estimated by the conventional lag window method, we can see that the speech enhancement algorithm helps diminish pitch errors.
Patent•
Constant temperature digital display pitch extensibility appearance

[...]

Zhang Xudong
11 Dec 2018
TL;DR: In this paper, the utility model discloses a constant temperature digital display pitch extensibility appearance relates to a pitch detection device, which aims at solving current constant temperature display pitch Extensibility Appearance and detects the problem that the precision is low, and its technical scheme main points are, sets up the transparent plate that the lot is located basin diapire top at the basin inside seal, is formed with the cavity between transparent plate and the basin dapire, set up the lamp source in the cavity, be provided with the convex lens that are located the aque
Abstract: The utility model discloses a constant temperature digital display pitch extensibility appearance relates to a pitch detection device, aims at solving current constant temperature digital display pitch extensibility appearance and detects the problem that the precision is low, and its technical scheme main points are, sets up the transparent plate that the lot is located basin diapire top at the basin inside seal, is formed with the cavity between transparent plate and the basin diapire, sets up the lamp source in the cavity, be provided with the convex lens that are located the aqueous mediumtop, play amplification effect in the basin, reached and improved the definition of surveing, the effect that improves experimental precision.
Patent•
Real-time pitch detection for creating, practicing and sharing of musical harmonies

[...]

Andrew Goren, Michael Holroyd
12 Jun 2018
TL;DR: In this paper, a pitch detection system for real-time pitch detection of voiced musical notes is presented, where sound waves produced by a voiced rendition of one or more musical notes are converted to a time domain electronic audio signal.
Abstract: Real-time pitch detection of voiced musical notes involves converting sound waves, produced by a voiced rendition of one or more musical notes, to a time domain electronic audio signal. The electronic audio signal is processed to determine a true pitch of the time domain electronic audio signal. True pitch information is displayed in real-time, concurrent with the voiced rendition of each musical note. A pitch indicator conveys to a user information concerning the true pitch which has been determined. The true pitch is determined by segmenting the electronic audio signal into a plurality of audio signal samples and applying a constant-Q transform. Additional processing steps are applied to reduce pitch detection errors.
Journal Article•
Accuracy Improvement of MFCC Based Speech Recognition by Preventing DFT Leakage Using Pitch Segmentation

[...]

Sopon Wiriyarattanakul1, Nawapak Eua-Anant1•
Khon Kaen University1
15 Feb 2018-Journal of Telecommunication, Electronic and Computer Engineering
TL;DR: A pitch-based speech signal segmentation to reduce spectral leakage is proposed by utilizing a new technique of pitch detection based on Short-time Energy Waveform (SEW) to yield segmented speech intervals with complete periods.
Abstract: Most MFCC based speech recognition algorithms employ frame segmentation to divide a signal into fixed-size frames as the first step prior to MFCC feature extraction. Commonly used fixed frame sizes, around 20-40 ms, do not usually fit into complete periods of speech signals. Consequently, in MFCC feature extraction, spectral leakage arises after Discrete Fourier Transform is applied to these fixed-size intervals resulting in smeared spectra and reduced speech recognition performance. In this paper, a pitch-based speech signal segmentation to reduce spectral leakage is proposed by utilizing a new technique of pitch detection based on Short-time Energy Waveform (SEW) to yield segmented speech intervals with complete periods. The proposed method utilizes local minima of SEW as markers for pitch segmentation. After segmenting speech signals into pitches, MFCC feature vectors are extracted and subsequently used as raw data for speech recognition using artificial neural networks. Speech recognition experiments using artificial neural networks, applied to collect Thai language speech signals from 40 speakers, were conducted. Empirical results indicate that speech recognition using speech signals segmented into pitches yields more accurate recognition results than those using speech signals segmented into a fixed frame.
Patent•
Pitch calibration method and device for pitch-adjustable propeller

[...]

Lixiong Zhao
2 Feb 2018
TL;DR: In this paper, a pitch calibration method and device for a pitch-adjustable propeller is presented, which consists of the steps of controlling a drive mechanism to drive blades to rotate at a uniform speed, sequentially increasing (decreasing) the pitch corresponding to a pitch adjusting command, obtaining the pitch detected by a pitch detectionsensor and a load sensor until the drive mechanism load is greater (smaller) than or equal to the set maximum (minimum) drive mechanisms load value.
Abstract: The invention discloses a pitch calibration method and device for a pitch-adjustable propeller and belongs to the technical field of ship propulsion The method comprises the steps of controlling a drive mechanism to drive blades to rotate at a uniform speed; sequentially increasing (decreasing) the pitch corresponding to a pitch adjusting command, obtaining the pitch detected by a pitch detectionsensor and a drive mechanism load detected by a load sensor until the drive mechanism load is greater (smaller) than or equal to the set maximum (minimum) drive mechanism load value every time the pitch corresponding to the pitch adjusting command is increased (decreased), and regarding the pitch detected by the pitch detection sensor when increasing (decreasing) of the pitch corresponding to thepitch adjusting command is stopped as the pitch calibration value of the greatest advancing (reversing) position; and if the drive mechanism load is minimum, regarding the pitch detected by the pitchdetection sensor as the pitch calibration value of the zero-pitch position According to the pitch calibration method and device for the pitch-adjustable propeller, operation is simple and timely
Patent•
Technique determination device and recording medium

[...]

Nariyama Ryuichi1, Shuichi Matsumoto1•
Yamaha Corporation1
27 Sep 2018
TL;DR: In this article, a pitch detection unit detecting a pitch on a time-series basis based on the input sound, a sound volume detection unit based on a sound-volume detection model, and a first starting point detection unit determining whether variation of the sound volume is equal to or larger than a predetermined threshold for each predetermined period.
Abstract: a pitch detection unit detecting a pitch on a time-series basis based on the input sound, a sound-volume detection unit detecting a sound volume on the time series basis based on the input sound, a first starting-point detection unit determining whether variation of the sound volume is equal to or larger than a predetermined threshold for each predetermined period and detecting a starting point of a period in which the variation of the sound volume is equal to or larger than the threshold as a first starting point, and a technique determination unit determining a technique of the input sound based on a change of the sound volume after the first starting point and variation of the pitch after the first starting point.
Patent•
System for creating, practicing and sharing of musical harmonies

[...]

Andrew Goren, Michael Holroyd, Lindsay Ifill
12 Jun 2018
TL;DR: In this article, real-time pitch detection is used to determine a pitch of each note which is voiced by a person, and a graphic indication of the actual pitch which is sung is displayed in conjunction with the musical note indicators.
Abstract: Collaboratively creating musical harmonies includes receiving a user selection of a particular harmony. In response to this selection, there is displayed on a display screen of a computing device a plurality of musical note indicators or notes to specify a first harmony part of a musical piece to be performed. Real-time pitch detection is used to determine a pitch of each note which is voiced by a person, and a graphic indication of the actual pitch which is sung is displayed in conjunction with the musical note indicators.
Proceedings Article•10.2991/CIMNS-18.2018.7•
A Frame-Based Extended MIDI-Pitch Detection Algorithm Based on Variable Sampling Technology

[...]

Yin Feng, Mingtai Lin
1 Nov 2018
TL;DR: This paper proposes a framed-based extended MIDI-pitch detection algorithm based on variable sampling technology that has a good raw pitch accuracy rate and less computation complexity o(nlog2n).
Abstract: This paper proposes a framed-based extended MIDI-pitch detection algorithm based on variable sampling technology. Symbols P--, P-, P+ and P++ are used to present four different logarithmic frequencies around the MIDI pitch P (integer) within 100 cents interval and called as extended MIDI-pitch to enhance the expression resolution of MIDI pitch of a signal frame. Experiment shows that the algorithm has a good raw pitch accuracy rate and less computation complexity o(nlog2n). Keywords—extended MIDI pitch; singing transcription; singing signal processing
Proceedings Article•10.1109/ICSPIS.2018.8700555•
A Regularized Least Squares-Based Method for Optimal Fusion of Speech Pitch Detection Algorithms

[...]

Ziba Imani1, Seyed Jahanshah Kabudian1•
Razi University1
1 Dec 2018
TL;DR: This paper proposes a method for optimal combination of fundamental frequency estimation methods, in noisy signals, and demonstrates the effectiveness of the proposed method in comparison to state-of-the-art methods.
Abstract: Fundamental frequency estimation is one of the most important issues in the field of speech processing. An accurate estimate of the fundamental frequency plays a key role in the field of speech and music analysis. So far, various methods have been proposed in the time- and frequency-domain. However, the main challenge is the strong noises in speech signals. In this paper, to improve the accuracy of fundamental frequency estimation, we propose a method for optimal combination of fundamental frequency estimation methods, in noisy signals. In this method, to discriminate voiced frames from unvoiced frames in a better way, the Voiced/Unvoiced (V/U) scores of four pitch detection methods are combined linearly. These methods are: Autocorrelation, Yin, YAAPT and SWIPE. After identifying the Voiced/Unvoiced label of each frame, the fundamental frequency (F 0 ) of the frame is estimated using the SWIPE method. The optimal coefficients for linear combination are determined using the regularized least squares method with Tikhonov regularization. To evaluate the proposed method, 10 speech files (5 female and 5 male voices) are selected from the PTDB-TUG standard database and the results are presented in terms of SDFPE, GPE, VDE, PTE and FFE standard error criteria. The results indicate that our proposed method relatively reduced the aforementioned criteria (averaged in various SNRs) by 27.13%, 22.14%, 17.40%, and 26.74% respectively, which demonstrate the effectiveness of the proposed method in comparison to state-of-the-art methods.
Book Chapter•
Computational model of pitch detection, perceptive foundations, and application to Norwegian fiddle music

[...]

Olivier Lartillot, Hans-Hinrich Thedens, Alexander Refsum Jensenius
1 Jan 2018
Proceedings Article•10.1109/IWSSIP.2018.8439314•
Improving Pitch Extraction Performance through Laryngeal Mechanisms Background

[...]

Everton B. Lacerda1, Carlos A. B. Mello1•
Federal University of Pernambuco1
20 Jun 2018
TL;DR: The goal is to optimize the frequency range of the pitch detectors relying on the fact that each laryngeal mechanism has a frequency range, and therefore, it is not necessary to use all range of human voice to every sound, since it was produced in just one specific mechanism.
Abstract: Pitch extraction is a well-known and prominent application in signal processing. However, there are chances for improvements yet. This paper proposes a methodology based on laryngeal mechanisms background for pitch extraction. The goal is to optimize the frequency range of the pitch detectors relying on the fact that each laryngeal mechanism has a frequency range, and therefore, it is not necessary to use all range of human voice to every sound, since it was produced in just one specific mechanism. Since our proposal consists of parameter tuning, it is not limited to specific pitch extraction algorithms. Our experiments show that this adjustment on the frequency range can significantly improve pitch detection accuracy.
Proceedings Article•10.1109/IJCNN.2018.8489768•
A Pitch Extraction System Based on Laryngeal Mechanisms Classification

[...]

Everton B. Lacerda1, Carlos A. B. Mello1•
Federal University of Pernambuco1
8 Jul 2018
TL;DR: A system to estimate pitch based on laryngeal mechanisms classification based on parameter tuning, which can significantly improve pitch detection accuracy and is not limited to any specific pitch extraction method.
Abstract: Pitch extraction is one of the most important areas in signal processing. One of the reasons for this fact is because it is a key component of several speech processing, coding or synthesis systems. Many methods were proposed to date; however, there are possible improvements yet, mainly concerning the fine-tuning of parameters, since the majority of the works focus on the definition of completely new approaches. This paper proposes a system to estimate pitch based on laryngeal mechanisms classification. Currently, this classification is based on texture discrimination between visual representations of the audio signal. Therefore, at first, we improve state-of-the-art accuracy in classifying laryngeal mechanisms through changing this image generation process, based on the spectrogram of the signal, and by tuning the used classifier. After that, we optimize the frequency range of the pitch detectors relying on that classification. Such optimization is possible because each laryngeal mechanism has a frequency range, and therefore, it is not necessary to use the complete range of human voice to every sound; it was produced in just one specific mechanism. Since our system consists of parameter tuning, it is not limited to any specific pitch extraction method. Our experiments over two well-known pitch detectors show that this adjustment on the frequency range based on laryngeal mechanism can significantly improve pitch detection accuracy.
Proceedings Article•10.1109/ICCUBEA.2018.8697523•
Detection of Pitch Frequency of Indian Classical Music Based on Hilbert-Huang Transform for Automatic Note Transcription

[...]

Snehal R. Kharvatkar, Manish Sharma, D. G. Khairnar, Indraneel C. Naik
1 Aug 2018
TL;DR: A step by step algorithm for detecting pitch period from classical music signal based on Hilbert-Huang transform (HHT) is proposed and the results show that the variation of the pitch period can be accurately detected.
Abstract: The pitch detection is an integral element of automatic music transcription system. Empirical Mode Decomposition (EMD) technique plays a key part in the pitch detection. With this technique, any complicated data set comprising of frequency-amplitude points can be decomposed into small number of finite Intrinsic Mode Functions (IMF). The IMF logic is in accordance with a well-behaved and well proven Hilbert transform. In this paper, a step by step algorithm for detecting pitch period from classical music signal based on Hilbert-Huang transform (HHT) is proposed. Traditional windowing methods have two limitations namely overlapping of windows and an assumption of stationary pitch period within a window. In contrast, HHT shows no limitations on window selection and allows pitch period changing within windows. It also can be used to monitor the variation of the pitch. To validate the proposed method, the pure tone of standard pitch is used. The results show that the variation of the pitch period can be accurately detected. This demonstrates the successful application of Hilbert-Huang transform for pitch detection from Indian classical music signal.
Patent•
Two-degree-of-freedom motion attitude measurement method based on visual recognition for airborne radar antenna

[...]

Yu Wenbo, Guo Guang, Qian Jun
21 Dec 2018
TL;DR: In this article, a two-degree-of-freedom motion attitude measurement method based on visual recognition for an airborne radar antenna is proposed, which consists of the following steps: setting an orientation detection identifier on a mechanism for azimuth movement, setting a pitch detection identifier and collecting motion start and end points and a motion time of the detection identifier by an image collection device.
Abstract: The invention provides a two-degree-of-freedom motion attitude measurement method based on visual recognition for an airborne radar antenna, and belongs to the field of motion attitude measurement ofairborne radar antennas. The method comprises the following steps: setting an orientation detection identifier on a mechanism for azimuth movement, setting a pitch detection identifier on a mechanismfor azimuth and pitch movement, collecting motion start and end points and a motion time of the detection identifier by an image collection device, calculating a rotation angle according to the distance between the motion start and end points and the motion radius, and calculating angular velocities and angular accelerations for azimuth and pitch according to the rotation angle and the motion time. The method can accurately detect two-degree-of-freedom motion attitude information of the airborne radar antenna within a certain range, and can finally realize the detection of angular velocities and angular accelerations for azimuth and pitch of antennas, which is convenient, accurate and reliable.
Proceedings Article•10.1109/ICASSP.2018.8461914•
Towards Complete Polyphonic Music Transcription: Integrating Multi-Pitch Detection and Rhythm Quantization

[...]

Eita Nakamura1, Emmanouil Benetos2, Kazuyoshi Yoshii1, Simon Dixon2•
Kyoto University1, Queen Mary University of London2
15 Apr 2018
TL;DR: Systematic evaluations on commonly used classical piano data show that these treatments of polyphonic transcription improve the performance of transcription, which can be used as benchmarks for further studies.
Abstract: Most work on automatic transcription produces “piano roll” data with no musical interpretation of the rhythm or pitches We present a polyphonic transcription method that converts a music audio signal into a human-readable musical score, by integrating multi-pitch detection and rhythm quantization methods This integration is made difficult by the fact that the multi-pitch detection produces erroneous notes such as extra notes and introduces timing errors that are added to temporal deviations due to musical expression Thus, we propose a rhythm quantization method that can remove extra notes by extending the metrical hidden Markov model and optimize the model parameters We also improve the note-tracking process of multi-pitch detection by refining the treatment of repeated notes and adjustment of onset times Finally, we propose evaluation measures for transcribed scores Systematic evaluations on commonly used classical piano data show that these treatments improve the performance of transcription, which can be used as benchmarks for further studies
Posted Content•
Binning based algorithm for Pitch Detection in Hindustani Classical Music.

[...]

Malvika Singh
7 Jan 2018
TL;DR: A unique algorithm for pitch detection using the binning method as described in the paper using appropriate bin size is devised and helps to segregate the important pitches in a given musical piece.
Abstract: Speech coding forms a crucial element in speech communications. An important area concerning it lies in feature extraction which can be used for analyzing Hindustani Classical Music. An important feature in this respect is the fundamental frequency often referred to as the pitch. In this work, the terms pitch and its acoustical sensation, the frequency is used interchangeably. There exists numerous pitch detection algorithms which detect the main/ fundamental frequency in a given musical piece, but we have come up with a unique algorithm for pitch detection using the binning method as described in the paper using appropriate bin size. Previous work on this subject throws light on pitch identification for Hindustani Classical Music. Pitch Class Distribution has been employed in this work. It can be used to identify pitches in Hindustani Classical Music which is based on suitable intonations and swaras. It follows a particular ratio pattern which is a tuning for diatonic scale proposed by Ptolemy and confirmed by Zarlino is explored in this paper. We have also given our estimated of these ratios and compared the error with the above. The error produced by varying the bin size in our algorithm is investigated and an estimate for an appropriate bin size is suggested and tested. The binning algorithm thus helps to segregate the important pitches in a given musical piece.

Tools

SciSpace AgentBiomedical AgentSciSpace RecruitSciSpace for EnterpriseAgent GalleryChat with PDFLiterature ReviewAI WriterFind TopicsParaphraserCitation GeneratorExtract DataAI DetectorCitation Booster

Learn

ResourcesLive Workshops

SciSpace

CareersSupportBrowse PapersPricingSciSpace Affiliate ProgramCancellation & Refund PolicyTermsPrivacyData Sources

Directories

PapersTopicsJournalsAuthorsConferencesInstitutionsCitation StylesWriting templates

Extension & Apps

SciSpace Chrome ExtensionSciSpace Mobile App

Contact

support@scispace.com
SciSpace

© 2026 | PubGenius Inc. | Suite # 217 691 S Milpitas Blvd Milpitas CA 95035, USA

soc2
Secured by Delve