PHOR-in-One: A multilingual lexical database with PHonological, ORthographic and PHonographic word similarity estimates in four languages.

Behav Res Methods

Research Unit in Human Cognition, CIPsi, School of Psychology, University of Minho, Campus de Gualtar, 4710-057, Braga, Portugal.

Published: October 2023

A large body of research on how form affects lexical processing in bilinguals suggests that orthographically similar translations (e.g., English-Portuguese "paper-papel") are responded to more quickly and accurately than words with little to no overlap (e.g., English-Portuguese "house-casa"). One of the most prominent algorithms for estimating orthographic similarity, the normalized Levenshtein distance (NLD), returns an index of the proportion of identical characters between two strings, and is an efficient and invaluable tool for the selection, manipulation, and control of verbal stimuli. Despite the many advantages of NLD for second-language research, the absence of a comparable measure for phonology has led researchers to adopt different strategies for assessing the degree of interlanguage phonological similarity, with profound implications for the interpretation of results on the relative role of orthographic and phonological similarity in bilingual lexical access. In the present work, we introduce PHOR-in-One, a multilingual lexical database with a set of phonological and orthographic NLD estimates for 6160 translation equivalents in American and British English, European Portuguese, German and Spanish, for a total of 30,800 words. We also propose a new measure of phonographic NLD, a pooled index of orthographic and phonological similarity, particularly useful for researchers interested in controlling for and/or manipulating both estimates at once. PHOR-in-One includes a comprehensive characterization of its lexical entries, namely Part-of-Speech-dependent and independent frequency counts, number of letters and phonemes, and phonetic transcription. PHOR-in-One is thus a valuable tool to support bilingual and multilingual research.
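To make the NLD idea concrete, the sketch below computes a similarity score for the two word pairs mentioned in the abstract. It is only an illustration under stated assumptions, not the database's own procedure: it assumes the commonly used normalization of one minus the edit distance divided by the length of the longer string, and the `pooled_nld` helper is a hypothetical stand-in that simply averages the two scores, since the abstract does not specify how the phonographic index is pooled.

```python
def levenshtein(a, b):
    """Edit distance (insertions, deletions, substitutions) between two sequences."""
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, start=1):
        curr = [i]
        for j, y in enumerate(b, start=1):
            cost = 0 if x == y else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def nld(a, b):
    """Similarity in [0, 1]; ASSUMED form: 1 - distance / length of the longer item."""
    longest = max(len(a), len(b))
    if longest == 0:
        return 1.0  # two empty sequences are treated as identical
    return 1.0 - levenshtein(a, b) / longest


def pooled_nld(orthographic, phonological):
    """HYPOTHETICAL pooling: plain mean of the two scores (an assumption)."""
    return (orthographic + phonological) / 2


print(round(nld("paper", "papel"), 2))  # 0.8
print(round(nld("house", "casa"), 2))   # 0.2
```

Because `levenshtein` only compares elements for equality, the same function can be applied to sequences of phoneme symbols (for example, lists of transcription units) to obtain a phonological score in the same way.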


Source
http://dx.doi.org/10.3758/s13428-022-01985-3

Publication Analysis

Top Keywords

phonological similarity (12)
phor-in-one multilingual (8)
multilingual lexical (8)
lexical database (8)
phonological orthographic (8)
orthographic phonological (8)
lexical (5)
phonological (5)
orthographic (5)
similarity (5)

Similar Publications

Cracking arbitrariness: A data-driven study of auditory iconicity in spoken English.

Psychon Bull Rev

January 2025

Department of Psychology, University of Milano - Bicocca, Piazza dell'Ateneo Nuovo 1, Milan, MI, 20126, Italy.

Auditory iconic words display a phonological profile that imitates their referents' sounds. Traditionally, those words are thought to constitute a minor portion of the auditory lexicon. In this article, we challenge this assumption by assessing the pervasiveness of onomatopoeia in the English auditory vocabulary through a novel data-driven procedure.


Background/objectives: Previous studies have examined the role of working memory in cognitive tasks such as syntactic, semantic, and phonological processing, thereby contributing to our understanding of linguistic information management and retrieval. However, the real-time processing of phonological information remains relatively underexplored within the framework of Information Processing Theory (IPT), particularly for suprasegmental features like tone, whose contour represents a time-varying signal. This study aimed to address this gap by investigating the real-time processing of similar tonal information by native Cantonese speakers, thereby providing a deeper understanding of how IPT applies to auditory processing.


Rhythm perception in speech and non-speech acoustic stimuli has been shown to be affected by general acoustic biases as well as by phonological properties of the native language of the listener. The present paper extends the cross-linguistic approach in this field by testing the application of the iambic-trochaic law as an assumed general acoustic bias on rhythmic grouping of non-speech stimuli by speakers of three languages: Arabic, Hebrew and German. These languages were chosen due to relevant differences in their phonological properties on the lexical level alongside similarities on the phrasal level.


French and German poetry are classically considered to use fundamentally different linguistic structures to create rhythmic regularity, and their metrical rhythm structures are regarded in poetics as very different. However, the biophysical and neurophysiological constraints upon the speakers of these poems are highly similar.


Background: Speech-language pathologists (SLPs) in non-English speaking countries face challenges when assessing children for speech sound disorders (SSD). Exploring their clinical challenges in service delivery, along with their problem-solving approaches, may contribute to the development of instruments for clinical use in such settings.

Aim: The study aimed to explore assessment methods used by Iranian SLPs to identify children with SSD.

