Perceptually based head-related transfer function database optimization.

J Acoust Soc Am

LIMSI-CNRS, BP 133, Université Paris Sud, Orsay, France.

Published: February 2012

In the context of binaural audio rendering, choosing the best head-related transfer function (HRTF) for an individual from large databases poses several problems. This study proposes a method to reduce the size of a given HRTF database. Participants, 45 in total, were asked to rate the quality of binaural synthesis for 46 HRTFs. The lack of reciprocity in the ratings was noted. Results were used to create a perceptually optimized HRTF subset which satisfied all participants' judgments. The subset was validated using localization tests on a separate group of subjects with results showing reduced errors when subjects were given their best choice, rather than their worst choice HRTF.
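One way to picture the subset-selection step is as a set-cover problem: pick a small group of HRTFs such that every participant finds at least one of them acceptable. The sketch below uses a greedy heuristic for this. It is only an illustration under stated assumptions, not the procedure used in the study: the abstract does not describe the selection algorithm, and the boolean `acceptable` matrix, the function name `greedy_hrtf_subset`, and the idea of thresholding ratings into "acceptable" are all hypothetical.

```python
import numpy as np


def greedy_hrtf_subset(acceptable):
    """Greedy set cover over a participants-by-HRTFs boolean matrix.

    acceptable[i, j] is True when participant i judged HRTF j acceptable
    (a hypothetical thresholding of the ratings, not taken from the study).
    Returns a list of HRTF column indices covering every participant.
    """
    acceptable = np.asarray(acceptable, dtype=bool)
    n_participants = acceptable.shape[0]
    uncovered = np.ones(n_participants, dtype=bool)
    subset = []
    while uncovered.any():
        # Count how many still-uncovered participants each HRTF would satisfy.
        gains = (acceptable & uncovered[:, None]).sum(axis=0)
        best = int(gains.argmax())
        if gains[best] == 0:
            raise ValueError("at least one participant accepts no HRTF")
        subset.append(best)
        # Participants who accept the chosen HRTF are now covered.
        uncovered &= ~acceptable[:, best]
    return subset


# Toy data: 4 participants, 3 HRTFs.
ratings = [
    [1, 0, 0],
    [1, 0, 0],
    [0, 1, 0],
    [0, 1, 1],
]
subset = greedy_hrtf_subset(ratings)
```

The greedy heuristic does not guarantee the smallest possible subset, but it is simple and gives a useful upper bound on how far a database can be reduced for a given acceptability criterion.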

Source
http://dx.doi.org/10.1121/1.3672641

Similar Publications

On the generalization of accommodation to head-related transfer functions.

J Acoust Soc Am

January 2025

Dyson School of Design Engineering, Imperial College London, SW7 2DB London, United Kingdom.

To date, there is strong evidence indicating that humans with normal hearing can adapt to non-individual head-related transfer functions (HRTFs). However, less attention has been given to studying the generalization of this adaptation to untrained conditions. This study investigated how adaptation to one set of HRTFs can generalize to another set of HRTFs.

In acoustics, an artificial head generally comprises two pinnae and occasionally a torso, which are useful for recording binaural signals and acquiring head-related transfer functions (HRTFs). Currently, most artificial heads are designed based on the anthropometric parameters of specific populations. However, anthropometric parameters do not accurately express head surface shapes, and thus, typical HRTFs are difficult to generate.

Perceptually enhanced spectral distance metric for head-related transfer function quality prediction.

J Acoust Soc Am

December 2024

Key Laboratory of Speech Acoustics and Content Understanding, Institute of Acoustics, Chinese Academy of Sciences, Beijing 100190, China.

Given the substantial time and complexity involved in the perceptual evaluation of head-related transfer function (HRTF) processing, there is considerable value in adopting numerical assessment. Although many numerical methods have been introduced in recent years, monaural spectral distance metrics such as log-spectral distortion (LSD) remain widely used despite their significant limitations. In this study, listening tests were conducted to investigate the correlation between LSD and the auditory perception of HRTFs.
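For readers unfamiliar with log-spectral distortion, a common formulation is the root-mean-square, across frequency bins, of the dB difference between two magnitude responses. The sketch below follows that common formulation; the abstract does not state the exact definition used in the study, so the function name `log_spectral_distortion`, the `eps` guard, and the absence of any frequency weighting or band limiting are assumptions.

```python
import numpy as np


def log_spectral_distortion(h_ref, h_test, eps=1e-12):
    """RMS of the dB magnitude difference between two frequency responses.

    h_ref and h_test are arrays of (possibly complex) frequency-domain
    samples on the same frequency grid. eps avoids log of zero.
    """
    mag_ref = np.abs(np.asarray(h_ref)) + eps
    mag_test = np.abs(np.asarray(h_test)) + eps
    diff_db = 20.0 * np.log10(mag_ref / mag_test)
    return float(np.sqrt(np.mean(diff_db ** 2)))


# A test response scaled by a factor of 10 differs by 20 dB in every bin.
reference = np.array([1.0, 0.5, 0.25, 0.125])
lsd_same = log_spectral_distortion(reference, reference)
lsd_scaled = log_spectral_distortion(reference, 10.0 * reference)
```

Because this metric treats every frequency bin equally and looks at each ear separately, it takes no account of which spectral differences are audible, which is one reason purely numerical scores can diverge from listening-test results.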

Auditory localization: a comprehensive practical review.

Front Psychol

July 2024

Laboratory for Research on Learning and Development (LEAD), CNRS UMR, Université de Bourgogne, Dijon, France.

Auditory localization is a fundamental ability that allows one to perceive the spatial location of a sound source in the environment. The present work aims to provide a comprehensive overview of the mechanisms and acoustic cues used by the human perceptual system to achieve such accurate auditory localization. Acoustic cues are derived from the physical properties of sound waves, and many factors enable and influence auditory localization abilities.

Extended-wear hearing aids (EWHAs) are small broadband analog amplification devices placed deeply enough in the ear canal to preserve most of the cues in the head-related transfer function. However, little is known about how EWHAs affect localization accuracy for normal hearing threshold (NHT) listeners. In this study, eight NHT participants were fitted with EWHAs and localized broadband sounds of different durations (250 ms and 4 s) and stimulus intensities (40, 50, 60, 70, and 80 dBA) in a spherical speaker array.
