Neural Network-Based Dynamic Segmentation and Weighted Integrated Matching of Cross-Media Piano Performance Audio Recognition and Retrieval Algorithm

doi:10.1155/2022/9323646

Open AccessJournal Article10.1155/2022/9323646

Neural Network-Based Dynamic Segmentation and Weighted Integrated Matching of Cross-Media Piano Performance Audio Recognition and Retrieval Algorithm

Tianshu Wang

- 13 May 2022

- Computational Intelligence and Neuroscie...

- Vol. 2022, pp 1-13

8

TL;DR: A dynamic threshold-based segmentation and weighted comprehensive matching algorithm based on neural networks for cross-media piano performance audio recognition and retrieval to solve the problems of imprecise dynamic note segmentsation and inconsistent matching templates.

Abstract: This paper presents a dynamic segmentation and weighted comprehensive matching algorithm based on neural networks for cross-media piano performance audio recognition and retrieval. The 3D convolutional neural network process is separated to compress the network parameters and improve the computational speed. Skip connection and layer-wise learning rate solve the problem that the separated network is challenging to train. The piano performance audio recognition is facilitated by shuffle operation. In pattern recognition, music retrieval algorithms are gaining more and more attention due to their ease of implementation and efficiency. However, the problems of imprecise dynamic note segmentation and inconsistent matching templates directly affect the accuracy of the MIR algorithm. We propose a dynamic threshold-based segmentation and weighted comprehensive matching algorithm to solve these problems. The amplitude difference step is dynamically set, and the notes are segmented according to the changing threshold to improve the accuracy of note segmentation. A standard score frequency is used to transform the pitch template to achieve input normalization to enhance the accuracy of matching. Direct matching and DTW matching are fused to improve the adaptability and robustness of the algorithm. Finally, the effectiveness of the method is experimentally demonstrated. This paper implements the data collection and processing, audio recognition, and retrieval algorithm for cross-media piano performance big data through three main modules: the collection, processing, and storage module of cross-media piano performance big data, the building module of audio recognition of cross-media piano performance big data, and the dynamic precision module of cross-media piano performance big data.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Journal Article•10.52783/jes.684

Learning Experience of University Music Course Based on Emotional Computing

Lin-Wan Huang

- 25 Jan 2024

- Deleted Journal

TL;DR: A multimedia identification and analysis method for piano performance music based on emotional computing using recurrent neural networks (RNNs) is proposed. The system can identify and analyze various piano performances with high accuracy, precision, recall, and F1 Score.

...read moreread less

1

Journal Article•10.4018/ijwltt.327948

Research on Musical Tone Recognition Method Based on Improved RNN for Vocal Music Teaching Network Courses

Kaiyi Long

- 09 Aug 2023

- International Journal of Web-based Learn...

TL;DR: The test results show that the fast Fourier process with multiple time superposition and a dimension length of 40 is most beneficial to the accuracy of the model and the improved algorithm was the most accurate in terms of F1 values and is suitable for use in vocal music teaching courses.

...read moreread less

1

•Journal Article•10.1155/2022/8431982

Construction and Application of a Piano Playing Pitch Recognition Model Based on Neural Network

Guobin Wu, +1 more

- 17 Sep 2022

- Computational Intelligence and Neuroscie...

TL;DR: A piano playing intonation recognition model is constructed and the optimized result is used as the feature of piano music to realize the prediction of the music recognition of the Intonation preference, which effectively improves the musical tone recognition rate.

...read moreread less

Journal Article•10.4018/ijicte.341266

Optimization of Piano Performance Teaching Mode Using Network Big Data Analysis Technology

Xiang Wei, +1 more

- 26 Mar 2024

- International Journal of Information and...

TL;DR: To optimize piano performance teaching mode, a MIDI piano teaching performance evaluation method based on bidirectional LSTM is proposed. The method utilizes a three-layer bidirectional LSTM neural network to capture useful information and improve work efficiency.

...read moreread less

•Journal Article•10.1155/2022/8339895

Application of GIS Technology-Supported Cross Media Fusion Method Based on Deep Learning in Landscape Performance Evaluation

Xiaoqing Liu, +3 more

- 08 Sep 2022

- Computational Intelligence and Neuroscie...

TL;DR: GIS technology can provide reasonable and sustainable data support for landscape planning and ecological development and make wetland landscape planning consider the spatial layout of landscape and the optimal allocation of resources more.

...read moreread less

References

•Journal Article•10.1126/SCIENCE.ABJ8754

Accurate prediction of protein structures and interactions using a three-track neural network

Minkyung Baek, +33 more

- 20 Aug 2021

- Science

TL;DR: In this article, a three-track network is proposed to combine information at the one-dimensional (1D) sequence level, the 2D distance map level, and the 3D coordinate level.

...read moreread less

3.9K

Journal Article•10.1038/S41586-020-1942-4

Fully hardware-implemented memristor convolutional neural network

Peng Yao, +7 more

- 29 Jan 2020

- Nature

TL;DR: The fabrication of high-yield, high-performance and uniform memristor crossbar arrays for the implementation of CNNs and an effective hybrid-training method to adapt to device imperfections and improve the overall system performance are proposed.

...read moreread less

1.8K

Journal Article•10.1007/S13748-019-00203-0

Convolutional neural network: a review of models, methodologies and applications to object detection

Anamika Dhillon, +1 more

- 01 Jun 2020

- Progress in Artificial Intelligence

TL;DR: This paper mainly focus on the application of deep learning architectures to three major applications, namely (i) wild animal detection, (ii) small arm detection and (iii) human being detection.

...read moreread less

897

Journal Article•10.1021/ACS.CHEMREV.0C00868

Four Generations of High-Dimensional Neural Network Potentials.

Jörg Behler

- 29 Mar 2021

- Chemical Reviews

TL;DR: In this article, the authors present a classification scheme for the family of high-dimensional neural network potentials (HDNNPs) and discuss the applicability and remaining limitations of these potentials along with an outlook at possible future developments.

...read moreread less

528

•Journal Article•10.1073/PNAS.1907375117

Understanding the role of individual units in a deep neural network.

David Bau, +5 more

- 01 Sep 2020

- Proceedings of the National Academy of S...

TL;DR: This work presents network dissection, an analytic framework to systematically identify the semantics of individual hidden units within image classification and image generation networks, and applies it to understanding adversarial attacks and to semantic image editing.

...read moreread less

478