The analysis of omic data depends on machine-readable information about protein interactions, modifications, and activities as found in protein interaction networks, databases of post-translational modifications, and curated models of gene and protein function. These resources typically depend heavily on human curation. Natural language processing systems that read the primary literature have the potential to substantially extend knowledge resources while reducing the burden on human curators. However, machine-reading systems are limited by high error rates and commonly generate fragmentary and redundant information. Here, we describe an approach to precisely assemble molecular mechanisms at scale using multiple natural language processing systems and the Integrated Network and Dynamical Reasoning Assembler (INDRA). INDRA identifies full and partial overlaps in information extracted from published papers and pathway databases, uses predictive models to improve the reliability of machine reading, and thereby assembles individual pieces of information into non-redundant and broadly usable mechanistic knowledge. Using INDRA to create high-quality corpora of causal knowledge we show it is possible to extend protein-protein interaction databases and explain co-dependencies in the Cancer Dependency Map.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10167483PMC
http://dx.doi.org/10.15252/msb.202211325DOI Listing

Publication Analysis

Top Keywords

molecular mechanisms
8
mechanisms scale
8
natural language
8
language processing
8
processing systems
8
automated assembly
4
assembly molecular
4
scale text
4
text mining
4
mining curated
4

Similar Publications

Digging deeper into necrotizing enterocolitis: bridging clinical, microbial, and molecular perspectives.

Gut Microbes

December 2025

Department of Pediatrics, Key Laboratory of Birth Defects and Related Diseases of Women and Children (Ministry of Education), West China Second University Hospital, Sichuan University, Chengdu, China.

Necrotizing Enterocolitis (NEC) is a severe, life-threatening inflammatory condition of the gastrointestinal tract, especially affecting preterm infants. This review consolidates evidence from various biomedical disciplines to elucidate the complex pathogenesis of NEC, integrating insights from clinical, microbial, and molecular perspectives. It emphasizes the modulation of NEC-associated inflammatory pathways by probiotics and novel biologics, highlighting their therapeutic potential.

View Article and Find Full Text PDF

TRPV4 as a Novel Regulator of Ferroptosis in Colon Adenocarcinoma: Implications for Prognosis and Therapeutic Targeting.

Dig Dis Sci

January 2025

Ningxia Medical University, Xing Qing Block, Shengli Street No.1160, Yin Chuan City, 750004, Ningxia Province, People's Republic of China.

Background: Colon adenocarcinoma (COAD) is a leading cause of cancer-related mortality worldwide. Transient receptor potential vanilloid 4 (TRPV4), a calcium-permeable non-selective cation channel, has been implicated in various cancers, including COAD. This study investigates the role of TRPV4 in colon adenocarcinoma and elucidates its potential mechanism via the ferroptosis pathway.

View Article and Find Full Text PDF

The nutrient germinant receptors (GRs) in spores of Bacillus species consist of a cluster of three proteins- designated A, B, and C subunits- that play a critical role in initiating the germination of dormant spores in response to specific nutrient molecules. The Bacillus cereus GerI GR is essential for inosine-induced germination; however, the roles of the individual subunits and the mechanism by which germinant binding activates GR function remain unclear. In this study, we report the backbone chemical shift assignments of the N-terminal domain (NTD) of the A subunit of GerI (GerIA).

View Article and Find Full Text PDF

Objective: Rheumatoid arthritis (RA) is an autoimmune condition that causes severe joint deformities and impaired functionality, affecting the well-being and daily life of individuals. Consequently, there is a pressing demand for identifying viable therapeutic targets for treating RA. This study aimed to explore the molecular mechanisms of osteoclast differentiation in PBMC from patients with RA through transcriptome sequencing and bioinformatics analysis.

View Article and Find Full Text PDF

Integrating the milk microbiome signatures in mastitis: milk-omics and functional implications.

World J Microbiol Biotechnol

January 2025

Area of Biochemistry and Molecular Biology, OneHealth-UR Research Group, University of La Rioja, 26006, Logroño, Spain.

Mammalian milk contains a variety of complex bioactive and nutritional components and microorganisms. These microorganisms have diverse compositions and functional roles that impact host health and disease pathophysiology, especially mastitis. The advent and use of high throughput omics technologies, including metagenomics, metatranscriptomics, metaproteomics, metametabolomics, as well as culturomics in milk microbiome studies suggest strong relationships between host phenotype and milk microbiome signatures in mastitis.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!