Unsupervised analysis of polyphonic music by sparse coding.

IEEE Trans Neural Netw

Department of Electronic Engineering, Queen Mary, University of London, London E1 4NS, UK.

Published: January 2006

AI Article Synopsis

  • The study explores a data-driven method for analyzing and transcribing polyphonic music using a probabilistic model that finds sparse linear decompositions of musical spectra.
  • The system represents each music spectrum as a combination of a few key spectra from a learned dictionary, effectively identifying individual notes in complex musical mixtures.
  • This approach uniquely integrates a probabilistic framework, a simple generative model that requires minimal assumptions about the music, and the capability to learn effective spectral representations directly from polyphonic examples without needing separate training on single-note music.

Article Abstract

We investigate a data-driven approach to the analysis and transcription of polyphonic music, using a probabilistic model which is able to find sparse linear decompositions of a sequence of short-term Fourier spectra. The resulting system represents each input spectrum as a weighted sum of a small number of "atomic" spectra chosen from a larger dictionary; this dictionary is, in turn, learned from the data in such a way as to represent the given training set in an (information theoretically) efficient way. When exposed to examples of polyphonic music, most of the dictionary elements take on the spectral characteristics of individual notes in the music, so that the sparse decomposition can be used to identify the notes in a polyphonic mixture. Our approach differs from other methods of polyphonic analysis based on spectral decomposition by combining all of the following: (a) a formulation in terms of an explicitly given probabilistic model, in which the process estimating which notes are present corresponds naturally with the inference of latent variables in the model; (b) a particularly simple generative model, motivated by very general considerations about efficient coding, that makes very few assumptions about the musical origins of the signals being processed; and (c) the ability to learn a dictionary of atomic spectra (most of which converge to harmonic spectral profiles associated with specific notes) from polyphonic examples alone-no separate training on monophonic examples is required.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TNN.2005.861031DOI Listing

Publication Analysis

Top Keywords

polyphonic music
12
music sparse
8
probabilistic model
8
notes polyphonic
8
polyphonic
6
unsupervised analysis
4
analysis polyphonic
4
music
4
sparse coding
4
coding investigate
4

Similar Publications

Predominant instrument recognition plays a vital role in music information retrieval. This task involves identifying and categorizing the dominant instruments present in a piece of music based on their distinctive time-frequency characteristics and harmonic distribution. Existing predominant instrument recognition approaches mainly focus on learning implicit mappings (such as deep neural networks) from time-domain or frequency-domain representations of music audio to instrument labels.

View Article and Find Full Text PDF

Music ensemble performance provides an ecologically valid context for investigating leadership dynamics in small group interactions. Musical texture, specifically the relative salience of simultaneously sounding ensemble parts, is a feature that can potentially alter leadership dynamics by introducing hierarchical relationships between individual parts. The present study extended previous work on quantifying interpersonal coupling in musical ensembles by examining the relationship between musical texture and leader-follower relations, operationalised as directionality of influence between co-performers' body motion in concert video recordings.

View Article and Find Full Text PDF

Lithuanian polyphonic songs : the archaic nature of their musical language in the context of global music.

Front Psychol

February 2024

Rhythmic Music Department, Faculty of Arts and Creative Technologies, Vilnius University of Applied Sciences, Vilnius, Lithuania.

Lithuanian traditional polyphonic songs, known as , are characterized by a distinctive musical language and have almost no analogues in world music. The aim of this article is to explore the peculiarities of their musical language and the socio-cultural context of their performance tradition in order to reveal their archaic origins. The archaic nature of songs is shown not by individual features of their musical language, but by the totality of these features, the peculiarities of their poetics, and performance traditions.

View Article and Find Full Text PDF

Multiple fundamental frequency estimation has been extensively used in applications such as melody extraction, music transcription, instrument identification, and source separation. This paper presents an approach based on the iterative detection and extraction of note events, which are considered to be harmonic sounds characterised by a continuous pitch trajectory. Note events are assumed to be associated with musical notes being played by a single instrument, and their pitch trajectories are iteratively estimated.

View Article and Find Full Text PDF

Music-oriented auditory attention detection (AAD) aims at determining which instrument in polyphonic music a listener is paying attention to by analyzing the listener's electroencephalogram (EEG). However, the existing linear models cannot effectively mimic the nonlinearity of the human brain, resulting in limited performance. Thus, a nonlinear music-oriented AAD model is proposed in this paper.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!