Publications by Ricard Marxer

Publications by authors named "Ricard Marxer"

Page 1 of 1

Automatic detection for bioacoustic research: a practical guide from and for biologists and computer scientists.

Arik Kershenbaum Çağlar Akçay Lakshmi Babu-Saheer Alex Barnhill Paul Best Ricard Marxer

Biol Rev Camb Philos Soc

October 2024

Article Synopsis

* Advances in computing and machine learning offer solutions for automatic analysis of acoustic data, but the field is still developing and faces challenges in bridging the gap between biology and technology.
* This review outlines trends in bioacoustic PAM, introduces machine learning applications, and offers a practical guide for researchers on building automatic detection systems while highlighting future directions in the field.

View Article and Find Full Text PDF

Applying machine learning to primate bioacoustics: Review and perspectives.

Jules Cauzinille Benoit Favre Ricard Marxer Arnaud Rey

Am J Primatol

October 2024

This paper provides a comprehensive review of the use of computational bioacoustics as well as signal and speech processing techniques in the analysis of primate vocal communication. We explore the potential implications of machine learning and deep learning methods, from the use of simple supervised algorithms to more recent self-supervised models, for processing and analyzing large data sets obtained within the emergence of passive acoustic monitoring approaches. In addition, we discuss the importance of automated primate vocalization analysis in tackling essential questions on animal communication and highlighting the role of comparative linguistics in bioacoustic research.

View Article and Find Full Text PDF

Deep audio embeddings for vocalisation clustering.

Paul Best Sébastien Paris Hervé Glotin Ricard Marxer

PLoS One

July 2023

The study of non-human animals' communication systems generally relies on the transcription of vocal sequences using a finite set of discrete units. This set is referred to as a vocal repertoire, which is specific to a species or a sub-group of a species. When conducted by human experts, the formal description of vocal repertoires can be laborious and/or biased.

View Article and Find Full Text PDF

Author Correction: Temporal evolution of the Mediterranean fin whale song.

Paul Best Ricard Marxer Sébastien Paris Hervé Glotin

Sci Rep

December 2022

View Article and Find Full Text PDF

Temporal evolution of the Mediterranean fin whale song.

Paul Best Ricard Marxer Sébastien Paris Hervé Glotin

Sci Rep

August 2022

We present an analysis of fin whale (Balaenoptera physalus) songs on passive acoustic recordings from the Pelagos Sanctuary (Western Mediterranean Basin). The recordings were gathered between 2008 and 2018 using 2 different hydrophone stations. We show how 20 Hz fin whale pulses can be automatically detected using a low complexity convolutional neural network (CNN) despite data variability (different recording devices exposed to diverse noises).

View Article and Find Full Text PDF

Lexical frequency effects in English and Spanish word misperceptions.

Martin Cooke María Luisa García Lecumberri Jon Barker Ricard Marxer

J Acoust Soc Am

February 2019

When listeners misperceive words in noise, do they report words that are more common? Lexical frequency differences between misperceived and target words in English and Spanish were examined for five masker types. Misperceptions had a higher lexical frequency in the presence of pure energetic maskers, but frequency effects were reduced or absent for informational maskers. The tendency to report more common words increased with the degree of energetic masking, suggesting that uncertainty about segment identity provides a role for lexical frequency.

View Article and Find Full Text PDF

A corpus of audio-visual Lombard speech with frontal and profile views.

Najwa Alghamdi Steve Maddock Ricard Marxer Jon Barker Guy J Brown

J Acoust Soc Am

June 2018

This paper presents a bi-view (front and side) audiovisual Lombard speech corpus, which is freely available for download. It contains 5400 utterances (2700 Lombard and 2700 plain reference utterances), produced by 54 talkers, with each utterance in the dataset following the same sentence format as the audiovisual "Grid" corpus [Cooke, Barker, Cunningham, and Shao (2006). J.

View Article and Find Full Text PDF

An Innovative Speech-Based User Interface for Smarthomes and IoT Solutions to Help People with Speech and Motor Disabilities.

Massimiliano Malavasi Enrico Turri Jose Joaquin Atria Heidi Christensen Ricard Marxer

Stud Health Technol Inform

April 2018

A better use of the increasing functional capabilities of home automation systems and Internet of Things (IoT) devices to support the needs of users with disability, is the subject of a research project currently conducted by Area Ausili (Assistive Technology Area), a department of Polo Tecnologico Regionale Corte Roncati of the Local Health Trust of Bologna (Italy), in collaboration with AIAS Ausilioteca Assistive Technology (AT) Team. The main aim of the project is to develop experimental low cost systems for environmental control through simplified and accessible user interfaces. Many of the activities are focused on automatic speech recognition and are developed in the framework of the CloudCAST project.

View Article and Find Full Text PDF

A corpus of noise-induced word misperceptions for English.

Ricard Marxer Jon Barker Martin Cooke Maria Luisa Garcia Lecumberri

J Acoust Soc Am

November 2016

Words spoken against a noise background often form an ambiguous percept. However, in certain conditions, a listener will mishear a noisy word but report hearing the same incorrect word as reported by other listeners. These consistent hearing errors are valuable as tests of detailed models of speech perception.

View Article and Find Full Text PDF