We present word frequencies based on subtitles of British television programmes. We show that the SUBTLEX-UK word frequencies explain more of the variance in the lexical decision times of the British Lexicon Project than the word frequencies based on the British National Corpus and the SUBTLEX-US frequencies. In addition to the word form frequencies, we also present measures of contextual diversity part-of-speech specific word frequencies, word frequencies in children programmes, and word bigram frequencies, giving researchers of British English access to the full range of norms recently made available for other languages. Finally, we introduce a new measure of word frequency, the Zipf scale, which we hope will stop the current misunderstandings of the word frequency effect.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1080/17470218.2013.850521 | DOI Listing |
Behav Res Methods
December 2024
ETSI de Telecomunicación, Universidad Politécnica de Madrid, Avenida Complutense, 30, 28040, Madrid, Spain.
This study investigates the potential of large language models (LLMs) to estimate the familiarity of words and multi-word expressions (MWEs). We validated LLM estimates for isolated words using existing human familiarity ratings and found strong correlations. LLM familiarity estimates performed even better in predicting lexical decision and naming performance in megastudies than the best available word frequency measures.
View Article and Find Full Text PDFBehav Res Methods
December 2024
Department of Psychology, University of Milano-Bicocca, P.zza dell'Ateneo Nuovo, 1, 20126, Milano, Italy.
Despite being largely spoken and studied by language and cognitive scientists, Italian lacks large resources of language processing data. The Italian Crowdsourcing Project (ICP) is a dataset of word recognition times and accuracy including responses to 130,465 words, which makes it the largest dataset of its kind item-wise. The data were collected in an online word knowledge task in which over 156,000 native speakers of Italian took part.
View Article and Find Full Text PDFSci Rep
December 2024
Department of Endocrinology and Metabolism, Chengdu First People's Hospital, No.18 North Vientiane Road, High-Tech Zone, Chengdu, 610000, Sichuan, China.
We aimed to determine the association between anion gap-to-calcium ratio (ACR) and 30-day mortality in sepsis patients with diabetes mellitus (DM). Data for sepsis patients diagnosed with DM was extracted from Medical Information Mart for Intensive Care Database IV. After screening, 4429 eligible subjects were included in our study finally.
View Article and Find Full Text PDFTop Cogn Sci
December 2024
Institut Jean Nicod, Département d'études cognitives, ENS, EHESS, CNRS, PSL University.
Efficiency principles are increasingly called upon to study features of human language and communication. Zipf's law of abbreviation is widely seen as a classic instance of a linguistic pattern brought about by language users' search for efficient communication. The "law"-a recurrent correlation between the frequency of words and their brevity-is a near-universal principle of communication, having been found in all of the hundreds of human languages where it has been tested, and a few nonhuman communication systems as well.
View Article and Find Full Text PDFCureus
November 2024
Department of Otolaryngology, Rutgers University New Jersey Medical School, Newark, USA.
Introduction: Every year, 530,000 tonsillectomies are performed in the United States. Many patients use social media for medical advice and support. This study investigates Reddit perspectives to identify the current needs of tonsillectomy patients.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!