Background: Applications of Natural Language Processing (NLP) technology to biomedical texts have generated significant interest in recent years. In this paper we identify and investigate the phenomenon of linguistic subdomain variation within the biomedical domain, i.e., the extent to which different subject areas of biomedicine are characterised by different linguistic behaviour. While variation at a coarser domain level such as between newswire and biomedical text is well-studied and known to affect the portability of NLP systems, we are the first to conduct an extensive investigation into more fine-grained levels of variation.
Results: Using the large OpenPMC text corpus, which spans the many subdomains of biomedicine, we investigate variation across a number of lexical, syntactic, semantic and discourse-related dimensions. These dimensions are chosen for their relevance to the performance of NLP systems. We use clustering techniques to analyse commonalities and distinctions among the subdomains.
Conclusions: We find that while patterns of inter-subdomain variation differ somewhat from one feature set to another, robust clusters can be identified that correspond to intuitive distinctions such as that between clinical and laboratory subjects. In particular, subdomains relating to genetics and molecular biology, which are the most common sources of material for training and evaluating biomedical NLP tools, are not representative of all biomedical subdomains. We conclude that an awareness of subdomain variation is important when considering the practical use of language processing applications by biomedical researchers.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3118171 | PMC |
http://dx.doi.org/10.1186/1471-2105-12-212 | DOI Listing |
Front Aging Neurosci
January 2025
Affiliated Mental Health Center & Hangzhou Seventh People's Hospital, Zhejiang University School of Medicine, Hangzhou, China.
Objectives: This study seeks to delineate the sleep architecture characteristics in older adults with short-term insomnia and mild cognitive impairment (MCI) and to explore their association with cognitive performance.
Methods: Ninety elderly individuals with short-term insomnia were enrolled and stratified into two cohorts based on their Montreal Cognitive Assessment (MoCA) scores: the Short-Term Insomnia Group (STID) comprising 35 participants and the Short-Term Insomnia with Cognitive Impairment Group (STID-MCI) with 55 participants. Demographic data, Pittsburgh Sleep Quality Index (PSQI), MoCA, Hamilton Depression Rating Scale (HAMD-17), Hamilton Anxiety Rating Scale (HAMA), and polysomnography (PSG) parameters were compared between groups.
Rev Sci Instrum
January 2025
School of Computer Science and Technology, Xi'an University of Posts and Telecommunications, Xi'an, Shaanxi 710121, China.
Emotion recognition based on electroencephalogram (EEG) has always been a research hotspot. However, due to significant individual variations in EEG signals, cross-subject emotion recognition based on EEG remains a challenging issue to address. In this article, we propose a dynamic domain-adaptive EEG emotion recognition method based on multi-source selection.
View Article and Find Full Text PDFGenes (Basel)
December 2024
Receptor Biology Section, National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, MD 20892, USA.
Neuroligins (NLGNs) are postsynaptic adhesion molecules critical for neuronal development that are highly associated with autism spectrum disorder (ASD). Here, we provide an overview of the literature on rare variants. In addition, we introduce a new approach to analyze human variation within genes to identify sensitive regions that have an increased frequency of ASD-associated variants to better understand NLGN function.
View Article and Find Full Text PDFBMC Med Educ
December 2024
Department of Ophthalmology, King Abdul-Aziz Specialist Hospital, Sakaka, Saudi Arabia.
Background And Aim: The present survey aimed to explore northern Saudi medical students' perceptions and sociodemographic influencing factors in choosing ophthalmology as a career specialty to determine if they are in alignment with the health sector transformation program of the Vision 2030 initiative.
Methods: This study was conducted among 384 medical students from two universities in northern Saudi Arabia (KSA). We used a standard and validated questionnaire to collect the required data.
medRxiv
November 2024
National Institutes of Health, National Institute of Mental Health, Bethesda, MD, United States.
Sex chromosome trisomies (SCTs) are a group of genetic disorders characterized by presence of a supernumerary sex chromosome, resulting in karyotypes other than XX or XY. These include XXX (Trisomy X), XXY (Klinefelter syndrome), and XYY (Jacobs syndrome). Sex chromosome trisomies have been linked to increased risk for psychopathology; however, this relationship warrants additional research.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!