The Problem of Limited Inter-rater Agreement in Modelling Music Similarity.

J New Music Res

Austrian Research Institute for Artificial Intelligence (OFAI), Intelligent Music Processing and Machine Learning Group , Vienna , Austria .

Published: July 2016

One of the central goals of Music Information Retrieval (MIR) is the quantification of similarity between or within pieces of music. These quantitative relations should mirror the human perception of music similarity, which is however highly subjective with low inter-rater agreement. Unfortunately this principal problem has been given little attention in MIR so far. Since it is not meaningful to have computational models that go beyond the level of human agreement, these levels of inter-rater agreement present a natural upper bound for any algorithmic approach. We will illustrate this fundamental problem in the evaluation of MIR systems using results from two typical application scenarios: (i) modelling of music similarity between pieces of music; (ii) music structure analysis within pieces of music. For both applications, we derive upper bounds of performance which are due to the limited inter-rater agreement. We compare these upper bounds to the performance of state-of-the-art MIR systems and show how the upper bounds prevent further progress in developing better MIR systems.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5256035PMC
http://dx.doi.org/10.1080/09298215.2016.1200631DOI Listing

Publication Analysis

Top Keywords

inter-rater agreement
16
music similarity
12
pieces music
12
mir systems
12
upper bounds
12
limited inter-rater
8
music
8
modelling music
8
similarity pieces
8
bounds performance
8

Similar Publications

Evaluation of objective methods for analyzing ipsilateral motor evoked potentials in stroke survivors with chronic upper extremity motor impairment.

J Neural Eng

January 2025

Department of Physical Medicine and Rehabilitation, MetroHealth Medical Center, 4229 Pearl Road, Suite N4-13, Cleveland, Ohio, 44109-1998, UNITED STATES.

Ipsilateral motor evoked potentials (iMEPs) are believed to represent cortically evoked excitability of uncrossed brainstem-mediated pathways. In the event of extensive injury to (crossed) corticospinal pathways, which can occur following a stroke, uncrossed ipsilateral pathways may serve as an alternate resource to support the recovery of the paretic limb. However, iMEPs, even in neurally intact people, can be small, infrequent, and noisy, so discerning them in stroke survivors is very challenging.

View Article and Find Full Text PDF

Expert Rater Agreement for Symptoms and Diagnosis of Bipolar Disorder in Youth.

Child Psychiatry Hum Dev

January 2025

Department of Psychiatry and Behavioral Health, Stony Brook University, 101 Nicolls Road, Stony Brook, NY, USA.

The diagnosis of bipolar disorder (BD) in young children has been a topic of debate, in part owing to varied interpretation of manic-like symptoms. We examined how expert academic clinicians participating in the pediatric bipolar biobank varied in their interpretation and application of Diagnostic and Statistical Manual of Mental Disorders (DSM) criteria and diagnoses. Study co-investigators reviewed 12 standardized narratives and for each marked a visual analog scale with their confidence in the presence of manic episodes and criteria.

View Article and Find Full Text PDF

Background: Peak-width of skeletonized mean diffusivity (PSMD) is an emerging biomarker of cerebral small vessel disease (cSVD)-related vascular contributions to cognitive impairment and dementia (VCID). Higher PSMD values reflect greater white matter microstructural damage, and prior research has related PSMD to sporadic and monogenic forms of cSVD and worse cognitive function. Therefore, we proposed PSMD as a risk stratification biomarker for VCID.

View Article and Find Full Text PDF

Public Health.

Alzheimers Dement

December 2024

Department of Clinical, Educational, and Health Psychology, Division of Psychology and Language Sciences, University College London, London, United Kingdom.

Background: Non-memory-led dementias pose additional challenges to 'typical dementias' including unusual symptoms and younger onset leading to particularly high neuropsychiatric comorbidities. As part of the economic evaluation supporting the RD-talk research programme, we require the collection of participant level resource use associated with the intervention compared with usual care. The resource use measure (RUM) needs to be sufficiently comprehensive, but still focused to capture the key items of interest (e.

View Article and Find Full Text PDF

Background: Understand individuals' self-perception of aging is crucial for promoting a positive aging experience, better health with good quality of life, addressing activities participation, and can help by advocating policies and interventions that support the diverse needs of an aging population. This study aims to examine the validity and reliability of the Chinese version of BAPQ (C-BAPQ) for the healthy older people by assessing the content validity, test-retest reliability, and correlational analyses with mental health by Depression, Anxiety, and Stress Scale (DASS-21), quality of life by the Short Form 36 Health Survey (SF-36) and activity participation by the Model of Human Occupation Screening Tool (MOHOST). Moreover, to study the factor structure of the Chinese version of BAPQ (C-BAPQ) by using exploratory factor analysis.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!