Audio Mostly Conference

Conference Tools

Papers published on a yearly basis

Papers

Proceedings Article•10.1145/2095667.2095677•

Identification of perceptual qualities in textural sounds using the repertory grid method

[...]

Thomas Grill¹, Arthur Flexer¹, Stuart Cunningham²•Institutions (2)

Austrian Research Institute for Artificial Intelligence¹, Glyndŵr University²

7 Sep 2011

TL;DR: This paper is about exploring which perceptual qualities are relevant to people listening to textural sounds by conducting mixed qualitative-quantitative interviews within the repertory grid framework and identifying ten bi-polar qualities.

...read moreread less

Abstract: This paper is about exploring which perceptual qualities are relevant to people listening to textural sounds. Knowledge about those personal constructs shall eventually lead to more intuitive interfaces for browsing large sound libraries. By conducting mixed qualitative-quantitative interviews within the repertory grid framework ten bi-polar qualities are identified. A subsequent web-based study yields measures for inter-rater agreement and mutual similarity of the perceptual qualities based on a selection of 100 textural sounds. Additionally, some initial experiments are conducted to test standard audio descriptors for their correlation with the perceptual qualities.

...read moreread less

44 citations

Proceedings Article•10.1145/2814895.2814905•

Machine Learning Algorithms for Environmental Sound Recognition: Towards Soundscape Semantics

[...]

Vasileios Bountourakis¹, Lazaros Vrysis¹, George Papanikolaou¹•Institutions (1)

Aristotle University of Thessaloniki¹

7 Oct 2015

TL;DR: This paper investigates methods aiming at the automatic recognition and classification of discrete environmental sounds, for the purpose of subsequently applying these methods to the recognition of soundscapes.

...read moreread less

Abstract: This paper investigates methods aiming at the automatic recognition and classification of discrete environmental sounds, for the purpose of subsequently applying these methods to the recognition of soundscapes. Research in audio recognition has traditionally focused on the domains of speech and music. Comparatively little research has been done towards recognizing non-speech environmental sounds. For this reason, in this paper, we apply existing techniques that have been proved efficient in the other two domains. These techniques are comprehensively compared to determine the most appropriate one for addressing the problem of environmental sound recognition.

...read moreread less

38 citations

Proceedings Article•10.1145/3243274.3243301•

Symbolic Music Similarity through a Graph-Based Representation

[...]

Federico Simonetta¹, Filippo Carnovalini¹, Nicola Orio¹, Antonio Rodà¹•Institutions (1)

University of Padua¹

12 Sep 2018

TL;DR: A novel representation system for symbolic music that could theoretically represent music both from a horizontal (contrapuntal) and from a vertical (harmonic) point of view, by keeping into account contextual and harmonic information is described.

...read moreread less

Abstract: In this work, a novel representation system for symbolic music is described. The proposed representation system is graph-based and could theoretically represent music both from a horizontal (contrapuntal) and from a vertical (harmonic) point of view, by keeping into account contextual and harmonic information. It could also include relationships between internal variations of motifs and themes. This is achieved by gradually simplifying the melodies and generating layers of reductions that include only the most important notes from a structural and harmonic viewpoint. This representation system has been tested in a music information retrieval task, namely melodic similarity, and compared to another system that performs the same task but does not consider any contextual or harmonic information, showing how the structural information is needed in order to find certain relations between musical pieces. Moreover, a new dataset consisting of more than 5000 leadsheets is presented, with additional meta-musical information taken from different web databases, including author, year of first performance, lyrics, genre and stylistic tags.

...read moreread less

38 citations

Proceedings Article•10.1145/1859799.1859809•

A preliminary experiment to assess the fear value of preselected sound parameters in a survival horror game

[...]

Tom Alexander Garner¹, Mark Grimshaw¹, Debbie Abdel Nabi¹•Institutions (1)

University of Bolton¹

15 Sep 2010

TL;DR: Results indicate that coarse manipulation of audio parameters has the potential to influence the intensity of the player's fear response whilst playing a survival horror game.

...read moreread less

Abstract: This paper presents an experiment testing which sound parameters, in a survival horror game context, most warrant further investigation as a means to control the level of fear in such games. The experiment is part of a long-term study ultimately designed to support the development of a biofeedback procedural audio engine for computer games. By this means, it is hoped to provide an enhanced gaming experience whereby sound synthesis and audio processing is conducted in real-time according to the player's affect responses and emotional state. Results indicate that coarse manipulation of audio parameters has the potential to influence the intensity of the player's fear response whilst playing a survival horror game. Evidence is also presented that supports the integration of event logging and realtime participant vocal response into an experimental design to gather unbiased, quantitative data that can be associated with qualitative emotional response.

...read moreread less

37 citations

Proceedings Article•10.1145/3411109.3411123•

A study on spoken language identification using deep neural networks

[...]

Alexandra Draghici, Jakob Abeßer, Hanna Lukashevich

15 Sep 2020

TL;DR: These findings confirm that both a convolutional neural network as well as convolutionAL recurrent neural networks are capable to learn language-specific patterns in mel spectrogram representations of speech recordings.

...read moreread less

Abstract: In this paper, we investigate a previously proposed algorithm for spoken language identification based on convolutional neural networks and convolutional recurrent neural networks. We improve the algorithm by modifying the training strategy to ensure equal class distribution and efficient memory usage. We successfully replicate previous experimental findings using a modified set of languages. Our findings confirm that both a convolutional neural network as well as convolutional recurrent neural networks are capable to learn language-specific patterns in mel spectrogram representations of speech recordings.

...read moreread less

32 citations

...

Expand

No. of papers from the Conference in previous years
Year	Papers
2023	3
2022	38
2021	44
2020	41
2019	49
2018	42

Conference Tools

Papers published on a yearly basis

Papers

Identification of perceptual qualities in textural sounds using the repertory grid method

Machine Learning Algorithms for Environmental Sound Recognition: Towards Soundscape Semantics

Symbolic Music Similarity through a Graph-Based Representation

A preliminary experiment to assess the fear value of preselected sound parameters in a survival horror game

A study on spoken language identification using deep neural networks

Performance Metrics