While a number of tools have been developed for researchers to compute the lexical characteristics of words, extant resources are limited in their useability and functionality. Specifically, some tools require users to have some prior knowledge of some aspects of the applications, and not all tools allow users to specify their own corpora. Additionally, current tools are also limited in terms of the range of metrics that they can compute. To address these methodological gaps, this article introduces LexiCAL, a fast, simple, and intuitive calculator for lexical variables. Specifically, LexiCAL is a standalone executable that provides options for users to calculate a range of theoretically influential surface, orthographic, phonological, and phonographic metrics for any alphabetic language, using any user-specified input, corpus file, and phonetic system. LexiCAL also comes with a set of well-documented Python scripts for each metric, that can be reproduced and/or modified for other research purposes.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8087072PMC
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0250891PLOS

Publication Analysis

Top Keywords

calculator lexical
8
lexical variables
8
lexical
7
lexical calculator
4
variables number
4
tools
4
number tools
4
tools developed
4
developed researchers
4
researchers compute
4

Similar Publications

Infants' language is often measured indirectly via parent reports, but mothers may underestimate or overestimate their infants' word comprehension. The current study examined estimations of mothers from diverse educational backgrounds regarding their infants' word comprehension and how these estimations are associated with their verbal input and infants' receptive vocabulary at 14 months. We compared 34 infants' looking-while-listening (LWL) performances with the mothers' Turkish Communicative Development Inventory (TCDI) reports to calculate the mothers' overestimation and underestimation.

View Article and Find Full Text PDF

Double-target self-supervised clustering with multi-feature fusion for medical question texts.

PeerJ Comput Sci

June 2024

Institute of Medical Information, Chinese Academy of Medical Sciences, Peking Union Medical College, Beijing, China.

Background: To make the question text represent more information and construct an end-to-end text clustering model, we propose a double-target self-supervised clustering with multi-feature fusion (MF-DSC) for texts which describe questions related to the medical field. Since medical question-and-answer data are unstructured texts and characterized by short characters and irregular language use, the features extracted by a single model cannot fully characterize the text content.

Methods: Firstly, word weights were obtained based on term frequency, and word vectors were generated according to lexical semantic information.

View Article and Find Full Text PDF

Background: Clinical named entity recognition (CNER) is a fundamental task in natural language processing used to extract named entities from electronic medical record texts. In recent years, with the continuous development of machine learning, deep learning models have replaced traditional machine learning and template-based methods, becoming widely applied in the CNER field. However, due to the complexity of clinical texts, the diversity and large quantity of named entity types, and the unclear boundaries between different entities, existing advanced methods rely to some extent on annotated databases and the scale of embedded dictionaries.

View Article and Find Full Text PDF

The exponential growth of digital information, particularly user-generated content on social media and blogging platforms, has underscored the importance of sentiment analysis (SA). Arabic language sentiment analysis (ASA) involves identifying the orientation of ideas, feelings, emotions, and attitudes within Arabic text to determine whether they convey a positive, negative, or neutral sentiment. This paper presents a comprehensive review of the past decade, focusing on the utilization of SA in the Arabic language.

View Article and Find Full Text PDF

Language serves as a mirror reflecting our understanding of the world and mirrors the world of literature. Pragmatics extends beyond mere literal interpretation, delving into comprehending subtle implications, varying expressions and lexical ambiguities. This deepens our exploration and appreciation of the world of literature.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!