Respiratory diseases pose a significant global health burden, with challenges in early and accurate diagnosis due to overlapping clinical symptoms, which often leads to misdiagnosis or delayed treatment. To address this issue, we developed , an artificial intelligence (AI)-based diagnostic system that utilizes natural language processing (NLP) to extract key clinical features from electronic health records (EHRs) for the accurate classification of respiratory diseases. This study employed a large cohort of 31,267 EHRs from multiple centers for model training and internal testing.
View Article and Find Full Text PDFBackground: Chinese medical entities have not been organized comprehensively due to the lack of well-developed terminology systems, which poses a challenge to processing Chinese medical texts for fine-grained medical knowledge representation. To unify Chinese medical terminologies, mapping Chinese medical entities to their English counterparts in the Unified Medical Language System (UMLS) is an efficient solution. However, their mappings have not been investigated sufficiently in former research.
View Article and Find Full Text PDFMedical entity normalization is an important task for medical information processing. The Unified Medical Language System (UMLS), a well-developed medical terminology system, is crucial for medical entity normalization. However, the UMLS primarily consists of English medical terms.
View Article and Find Full Text PDFThe Influenza A (H1N1) pdm09 virus caused a global pandemic in 2009 and has circulated seasonally ever since. As the continual genetic evolution of hemagglutinin in this virus leads to antigenic drift, rapid identification of antigenic variants and characterization of the antigenic evolution are needed. In this study, we developed PREDAC-H1pdm, a model to predict antigenic relationships between H1N1pdm viruses and identify antigenic clusters for post-2009 pandemic H1N1 strains.
View Article and Find Full Text PDFBackground: Phenotype information in electronic health records (EHRs) is mainly recorded in unstructured free text, which cannot be directly used for clinical research. EHR-based deep-phenotyping methods can structure phenotype information in EHRs with high fidelity, making it the focus of medical informatics. However, developing a deep-phenotyping method for non-English EHRs (ie, Chinese EHRs) is challenging.
View Article and Find Full Text PDFIEEE J Biomed Health Inform
August 2022
Electronic health record (EHR) resources are valuable but remain underexplored because most clinical information, especially phenotype information, is buried in the free text of EHRs. An intelligent annotation tool plays an important role in unlocking the full potential of EHRs by transforming free-text phenotype information into a computer-readable form. Deep phenotyping has shown its advantage in representing phenotype information in EHRs with high fidelity; however, most existing annotation tools are not suitable for the deep phenotyping task.
View Article and Find Full Text PDFFront Endocrinol (Lausanne)
April 2022
Background: Phenotypes characterize the clinical manifestations of diseases and provide important information for diagnosis. Therefore, the construction of phenotype knowledge graphs for diseases is valuable to the development of artificial intelligence in medicine. However, phenotype knowledge graphs in current knowledge bases such as WikiData and DBpedia are coarse-grained knowledge graphs because they only consider the core concepts of phenotypes while neglecting the details (attributes) associated with these phenotypes.
View Article and Find Full Text PDFSince coronavirus disease 2019 (COVID-19) might circulate in the following seasons, it is essential to understand how COVID-19 influences other respiratory diseases, especially influenza. In this study, we analyzed the influenza activity from mid-November 2019 to March 2020 in Chinese mainland and found that the influenza season ended much earlier than previous seasons for all subtypes and lineages, which may have resulted from the circulation of COVID-19 and measures such as travel control and personal protection. These findings provide rudimentary knowledge of the co-circulation patterns of the two types of viruses.
View Article and Find Full Text PDFMotivation: Newly emerging influenza viruses keep challenging global public health. To evaluate the potential risk of the viruses, it is critical to rapidly determine the phenotypes of the viruses, including the antigenicity, host, virulence and drug resistance.
Results: Here, we built FluPhenotype, a one-stop platform to rapidly determinate the phenotypes of the influenza A viruses.
Background: A wealth of clinical information is buried in free text of electronic health records (EHR), and converting clinical information to machine-understandable form is crucial for the secondary use of EHRs. Laboratory test results, as one of the most important types of clinical information, are written in various styles in free text of EHRs. This has brought great difficulties for data integration and utilization of EHRs.
View Article and Find Full Text PDFMany host specific mutations have been detected in influenza A viruses (IAVs). However, their effects on hydrogen bond (H-bond) variations have rarely been investigated. In this study, 60 host specific sites were identified in the internal proteins of avian and human IAVs, 27 of which contained mutations with effects on H-bonds.
View Article and Find Full Text PDFMotivation: Previously, we developed a computational model to identify genomic co-occurrence networks that was applied to capture the coevolution patterns within genomes of influenza viruses. To facilitate easy public use of this model, an R package 'cooccurNet' is presented here.
Results: 'cooccurNet' includes functionalities of construction and analysis of residues (e.
Motivation: Timely surveillance of the antigenic dynamics of the influenza virus is critical for accurate selection of vaccine strains, which is important for effective prevention of viral spread and infection.
Results: Here, we provide a computational platform, called PREDAC-H3, for antigenic surveillance of human influenza A(H3N2) virus based on the sequence of surface protein hemagglutinin (HA). PREDAC-H3 not only determines the antigenic variants and antigenic cluster (grouped for similar antigenicity) to which the virus belongs, based on HA sequences, but also allows visualization of the spatial distribution and temporal dynamics of antigenic clusters of viruses isolated from around the world, thus assisting in antigenic surveillance of human influenza A(H3N2) virus.
The notion that animals can detect the Earth's magnetic field was once ridiculed, but is now well established. Yet the biological nature of such magnetosensing phenomenon remains unknown. Here, we report a putative magnetic receptor (Drosophila CG8198, here named MagR) and a multimeric magnetosensing rod-like protein complex, identified by theoretical postulation and genome-wide screening, and validated with cellular, biochemical, structural and biophysical methods.
View Article and Find Full Text PDFMotivation: Protein domains are fundamental units of protein structure, function and evolution; thus, it is critical to gain a deep understanding of protein domain organization. Previous works have attempted to identify key residues involved in organization of domain architecture. Because one of the most important characteristics of domain architecture is the arrangement of secondary structure elements (SSEs), here we present a picture of domain organization through an integrated consideration of SSE arrangements and residue contact networks.
View Article and Find Full Text PDFMany template-based modeling (TBM) methods have been developed over the recent years that allow for protein structure prediction and for the study of structure-function relationships for proteins. One major problem all TBM algorithms face, however, is their unsatisfactory performance when proteins under consideration are low-homology. To improve the performance of TBM methods for such targets, a novel model evaluation method was developed here, and named MEFTop.
View Article and Find Full Text PDF