Neural Network-Based Dynamic Segmentation and Weighted Integrated Matching of Cross-Media Piano Performance Audio Recognition and Retrieval Algorithm
TL;DR: A dynamic threshold-based segmentation and weighted comprehensive matching algorithm based on neural networks for cross-media piano performance audio recognition and retrieval to solve the problems of imprecise dynamic note segmentsation and inconsistent matching templates.
read more
Abstract: This paper presents a dynamic segmentation and weighted comprehensive matching algorithm based on neural networks for cross-media piano performance audio recognition and retrieval. The 3D convolutional neural network process is separated to compress the network parameters and improve the computational speed. Skip connection and layer-wise learning rate solve the problem that the separated network is challenging to train. The piano performance audio recognition is facilitated by shuffle operation. In pattern recognition, music retrieval algorithms are gaining more and more attention due to their ease of implementation and efficiency. However, the problems of imprecise dynamic note segmentation and inconsistent matching templates directly affect the accuracy of the MIR algorithm. We propose a dynamic threshold-based segmentation and weighted comprehensive matching algorithm to solve these problems. The amplitude difference step is dynamically set, and the notes are segmented according to the changing threshold to improve the accuracy of note segmentation. A standard score frequency is used to transform the pitch template to achieve input normalization to enhance the accuracy of matching. Direct matching and DTW matching are fused to improve the adaptability and robustness of the algorithm. Finally, the effectiveness of the method is experimentally demonstrated. This paper implements the data collection and processing, audio recognition, and retrieval algorithm for cross-media piano performance big data through three main modules: the collection, processing, and storage module of cross-media piano performance big data, the building module of audio recognition of cross-media piano performance big data, and the dynamic precision module of cross-media piano performance big data.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Learning Experience of University Music Course Based on Emotional Computing
Lin-Wan Huang
TL;DR: A multimedia identification and analysis method for piano performance music based on emotional computing using recurrent neural networks (RNNs) is proposed. The system can identify and analyze various piano performances with high accuracy, precision, recall, and F1 Score.
1
Research on Musical Tone Recognition Method Based on Improved RNN for Vocal Music Teaching Network Courses
Kaiyi Long
TL;DR: The test results show that the fast Fourier process with multiple time superposition and a dimension length of 40 is most beneficial to the accuracy of the model and the improved algorithm was the most accurate in terms of F1 values and is suitable for use in vocal music teaching courses.
1
Construction and Application of a Piano Playing Pitch Recognition Model Based on Neural Network
Guobin Wu,Wei Chen +1 more
TL;DR: A piano playing intonation recognition model is constructed and the optimized result is used as the feature of piano music to realize the prediction of the music recognition of the Intonation preference, which effectively improves the musical tone recognition rate.
Optimization of Piano Performance Teaching Mode Using Network Big Data Analysis Technology
Xiang Wei,Shuo Sun +1 more
TL;DR: To optimize piano performance teaching mode, a MIDI piano teaching performance evaluation method based on bidirectional LSTM is proposed. The method utilizes a three-layer bidirectional LSTM neural network to capture useful information and improve work efficiency.
Application of GIS Technology-Supported Cross Media Fusion Method Based on Deep Learning in Landscape Performance Evaluation
TL;DR: GIS technology can provide reasonable and sustainable data support for landscape planning and ecological development and make wetland landscape planning consider the spatial layout of landscape and the optimal allocation of resources more.
References
Accurate prediction of protein structures and interactions using a three-track neural network
Minkyung Baek,Frank DiMaio,Ivan Anishchenko,Justas Dauparas,Sergey Ovchinnikov,Gyu Rie Lee,Jue Wang,Qian Cong,Lisa N. Kinch,R. Dustin Schaeffer,Claudia Millán,Hahnbeom Park,Carson Adams,Caleb R. Glassman,Andy DeGiovanni,Jose Henrique Pereira,Andria V. Rodrigues,Alberdina A. van Dijk,Ana C. Ebrecht,Diederik J. Opperman,Theo Sagmeister,Christoph Buhlheller,Christoph Buhlheller,Tea Pavkov-Keller,Manoj K. Rathinaswamy,Udit Dalwadi,Calvin K. Yip,John E. Burke,K. Christopher Garcia,Nick V. Grishin,Paul D. Adams,Paul D. Adams,Randy J. Read,David Baker +33 more
TL;DR: In this article, a three-track network is proposed to combine information at the one-dimensional (1D) sequence level, the 2D distance map level, and the 3D coordinate level.
Fully hardware-implemented memristor convolutional neural network
Peng Yao,Huaqiang Wu,Bin Gao,Jianshi Tang,Qingtian Zhang,Wenqiang Zhang,Jianhua Yang,He Qian +7 more
TL;DR: The fabrication of high-yield, high-performance and uniform memristor crossbar arrays for the implementation of CNNs and an effective hybrid-training method to adapt to device imperfections and improve the overall system performance are proposed.
1.8K
Convolutional neural network: a review of models, methodologies and applications to object detection
TL;DR: This paper mainly focus on the application of deep learning architectures to three major applications, namely (i) wild animal detection, (ii) small arm detection and (iii) human being detection.
897
Four Generations of High-Dimensional Neural Network Potentials.
TL;DR: In this article, the authors present a classification scheme for the family of high-dimensional neural network potentials (HDNNPs) and discuss the applicability and remaining limitations of these potentials along with an outlook at possible future developments.
528
Understanding the role of individual units in a deep neural network.
TL;DR: This work presents network dissection, an analytic framework to systematically identify the semantics of individual hidden units within image classification and image generation networks, and applies it to understanding adversarial attacks and to semantic image editing.
478