Word embeddings based on a conditional model are commonly used in Natural Language Processing (NLP) tasks to embed the words of a dictionary in a low dimensional linear space. Their computation is based on the maximization of the likelihood of a conditional probability distribution for each word of the dictionary. These distributions form a Riemannian statistical manifold, where word embeddings can be interpreted as vectors in the tangent space of a specific reference measure on the manifold. A novel family of word embeddings, called α-embeddings have been recently introduced as deriving from the geometrical deformation of the simplex of probabilities through a parameter α, using notions from Information Geometry. After introducing the α-embeddings, we show how the deformation of the simplex, controlled by α, provides an extra handle to increase the performances of several intrinsic and extrinsic tasks in NLP. We test the α-embeddings on different tasks with models of increasing complexity, showing that the advantages associated with the use of α-embeddings are present also for models with a large number of parameters. Finally, we show that tuning α allows for higher performances compared to the use of larger models in which additionally a transformation of the embeddings is learned during training, as experimentally verified in attention models.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7996742 | PMC |
http://dx.doi.org/10.3390/e23030287 | DOI Listing |
Front Comput Neurosci
December 2024
Department of Radiology, The Affiliated Hospital of Yangzhou University, Yangzhou University, Yangzhou, Jiangsu, China.
It is a universal phenomenon for patients who do not know which clinical department to register in large general hospitals. Although triage nurses can help patients, due to the larger number of patients, they have to stand in a queue for minutes to consult. Recently, there have already been some efforts to devote deep-learning techniques or pre-trained language models (PLMs) to triage recommendations.
View Article and Find Full Text PDFBehav Res Methods
December 2024
Key Laboratory of Brain Functional Genomics (MOE & STCSM), Affiliated Mental Health Center (ECNU), Institute of Brain and Education Innovation, School of Psychology and Cognitive Science, East China Normal University, Shanghai, China.
Word associations are among the most direct ways to measure word meaning in human minds, capturing various relationships, even those formed by non-linguistic experiences. Although large-scale word associations exist for Dutch, English, and Spanish, there is a lack of data for Mandarin Chinese, the most widely spoken language from a distinct language family. Here we present the Small World of Words-Zhongwen (Chinese) (SWOW-ZH), a word association dataset of Mandarin Chinese derived from a three-response word association task.
View Article and Find Full Text PDFSci Rep
December 2024
SDU Health Informatics and Technology, The Maersk Mc-Kinney Moller Institute, University of Southern Denmark, Odense, Denmark.
The increasing development of technology has led to the increase of digital data in various fields, such as medication-related texts. Sentiment Analysis (SA) in medication is essential to give clinicians insights into patients' feedback about the treatment procedure. Therefore, this study intends to develop Artificial Intelligence (AI) models to predict patients' sentiments.
View Article and Find Full Text PDFJMIR Aging
December 2024
Department of Health & Wellness Design, School of Public Health- Bloomington, Indiana University, Bloomington, IN, United States.
Background: As Alzheimer disease (AD) and AD-related dementias (ADRD) progress, individuals increasingly require assistance from unpaid, informal caregivers to support them in activities of daily living. These caregivers may experience high levels of financial, mental, and physical strain associated with providing care. CareVirtue is a web-based tool created to connect and support multiple individuals across a care network to coordinate care activities and share important information, thereby reducing care burden.
View Article and Find Full Text PDFPsychon Bull Rev
December 2024
Laboratoire Cognition Langage & Développement (LCLD), Centre de Recherche Cognition et Neurosciences (CRCN), Université Libre de Bruxelles (ULB), Av. F. Roosevelt, 50 /CP 191, 1050, Brussels, Belgium.
Lexical competition between newly acquired and already established representations of written words is considered a marker of word integration into the mental lexicon. To date, studies about the emergence of lexical competition involved mostly artificial training procedures based on overexposure and explicit instructions for memorization. Yet, in real life, novel word encounters occur mostly without explicit learning intent, through reading texts with words appearing rarely.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!