Text mining has been shown to be an auxiliary but key driver for modeling, data harmonization, and interpretation in bio-medicine. Scientific literature holds a wealth of information and embodies cumulative knowledge and remains the core basis on which mechanistic pathways, molecular databases, and models are built and refined. Text mining provides the necessary tools to automatically harness the potential of text. In this study, we show the potential of large-scale text mining for deriving novel insights, with a focus on the growing field of microbiome. We first collected the complete set of abstracts relevant to the microbiome from PubMed and used our text mining and intelligence platform Taxila for analysis. We drive the usefulness of text mining using two case studies. First, we analyze the geographical distribution of research and study locations for the field of microbiome by extracting geo mentions from text. Using this analysis, we were able to draw useful insights on the state of research in microbiome w. r.t geographical distributions and economic drivers. Next, to understand the relationships between diseases, microbiome, and food which are central to the field, we construct semantic relationship networks between these different concepts central to the field of microbiome. We show how such networks can be useful to derive useful insight with no prior knowledge encoded.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9473635 | PMC |
http://dx.doi.org/10.3389/fphys.2022.933069 | DOI Listing |
mSystems
December 2024
Laboratory of Microbiology, Immunology, and Metabolism, Diprobio (Shanghai) Co., Limited, Shanghai, China.
Unlabelled: The gut microbiota plays a crucial role in infant health, with its development during the first 1,000 days influencing health outcomes. Understanding the relationships within the microbiota is essential to linking its maturation process to these outcomes. Several network-based methods have been developed to analyze the developing patterns of infant microbiota, but evaluating the reliability and effectiveness of these approaches remains a challenge.
View Article and Find Full Text PDFJ Biomol Struct Dyn
January 2025
Division of Agricultural Bioinformatics, ICAR-Indian Agricultural Statistics Research Institute, Pusa, New Delhi, India.
Rice blast disease, instigated by (), significantly impedes global rice production. Targeting the signaling protein, cAMP-Protein Kinase A (CPKA), which facilitates appressorium development and host penetration, this study explores the potential inhibitory effects of natural compounds. Virtual screening, molecular docking and text mining approaches were used to find the nimonol and curcumin that inhibit the CPKA protein.
View Article and Find Full Text PDFClin Trials
January 2025
Central Monitoring and Data Analytics, GSK, Brentford, UK.
Background: Clinical trials handle a huge amount of data which can be used during the trial to improve the ongoing study conduct. It is suggested by regulators to implement the remote approach to evaluate clinical trials by analysing collected data. Central statistical monitoring helps to achieve that by employing quantitative methods, the results of which are a basis for decision-making on quality issues.
View Article and Find Full Text PDFCancer research has been significantly advanced by the integration of transcriptomic data through high-throughput sequencing technologies like RNA sequencing (RNA-seq). This paper reviews the transformative impact of transcriptomics on understanding cancer biology, focusing on the use of extensive datasets such as The Cancer Genome Atlas (TCGA) and Genotype-Tissue Expression (GTEx). While transcriptomic data provides crucial insights into gene expression patterns and disease mechanisms, the analysis is fraught with technical and biological biases.
View Article and Find Full Text PDFMol Biol Res Commun
January 2025
Department of Medical Genetics, School of Medicine, Tehran University of Medical Sciences, Tehran, Iran.
Long non-coding RNAs (lncRNAs) have recently emerged as critical regulators of oncogenic or tumor-suppressive pathways in human cancers. LINC01133 is a lncRNA that has exhibited dichotomous roles in various malignancies but to the best of our knowledge, the role of LINC01133 in laryngeal squamous cell carcinoma (LSCC) has not been previously investigated. This study aimed to investigate the expression, clinical significance, and potential functions of the LINC01133 in LSCC.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!