Literature-based discovery (LBD) summarizes information and generates insight from large text corpuses. The SemNet framework utilizes a large heterogeneous information network or "knowledge graph" of nodes and edges to compute relatedness and rank concepts pertinent to a user-specified target. SemNet provides a way to perform multi-factorial and multi-scalar analysis of complex disease etiology and therapeutic identification using the 33+ million articles in PubMed. The present work improves the efficacy and efficiency of LBD for end users by augmenting SemNet to create SemNet 2.0. A custom Python data structure replaced reliance on Neo4j to improve knowledge graph query times by several orders of magnitude. Additionally, two randomized algorithms were built to optimize the HeteSim metric calculation for computing metapath similarity. The unsupervised learning algorithm for rank aggregation (ULARA), which ranks concepts with respect to the user-specified target, was reconstructed using derived mathematical proofs of correctness and probabilistic performance guarantees for optimization. The upgraded ULARA is generalizable to other rank aggregation problems outside of SemNet. In summary, SemNet 2.0 is a comprehensive open-source software for significantly faster, more effective, and user-friendly means of automated biomedical LBD. An example case is performed to rank relationships between Alzheimer's disease and metabolic co-morbidities.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9351549PMC
http://dx.doi.org/10.3390/bdcc6010027DOI Listing

Publication Analysis

Top Keywords

user-specified target
8
rank aggregation
8
semnet
7
optimizations computing
4
computing relatedness
4
relatedness biomedical
4
biomedical heterogeneous
4
heterogeneous networks
4
networks semnet
4
semnet literature-based
4

Similar Publications

Understanding the influence of the cellular environment on protein conformations is crucial for elucidating protein functions within living cells. In studies using molecular dynamics (MD) simulation, carbon nanotubes and hydrophobic cages have been widely used to emulate the cellular environment inside specific large biomolecules such as ribosome tunnels and chaperones. However, recent studies suggest that these uniform hydrophobic models may not adequately capture the environmental effects inside each biomolecule.

View Article and Find Full Text PDF

Compound identification is at the center of metabolomics, usually by comparing experimental mass spectra against library spectra. However, most compounds are not commercially available to generate library spectra. Hence, for such compounds, MS/MS spectra need to be predicted.

View Article and Find Full Text PDF

A dosimetrically motivated pathfinding approach for non-isocentric dynamic trajectory radiotherapy.

Phys Med Biol

September 2024

Division of Medical Radiation Physics and Department of Radiation Oncology, Inselspital, Bern University Hospital, and University of Bern, Bern 3010, Switzerland.

Article Synopsis
  • Non-isocentric dynamic trajectory radiotherapy (DTRT) is a new way to target radiation therapy that uses special movements of the machine to hit the tumor from different angles without being stuck in one spot.
  • Researchers are creating a technique that helps decide the best path for the radiation beams, making sure they avoid healthy organs while still reaching the tumor effectively.
  • Comparisons of different treatment plans show that this new method can give better radiation distribution to the tumor and lower the radiation dose to nearby healthy organs.
View Article and Find Full Text PDF
Article Synopsis
  • Researchers are developing new data-driven methods to create realistic lung lesions for better imaging assessments and virtual clinical trials.
  • They proposed a generative adversarial network (GAN) that generates lung lesions based on size and solidity, utilizing two discriminators focused on lesion volume and radiomics features.
  • The evaluation confirms that the generated lesions closely resemble real ones, maintaining consistent characteristics and distributions, making this approach valuable for medical imaging assessments.
View Article and Find Full Text PDF

Background: with a rich history of traditional medicinal use, has garnered significant attention in contemporary research for its potential therapeutic applications in various human diseases, including pain, inflammation, cancer, and osteoarthritis. However, the specific molecular targets and mechanisms underlying the synergistic effects of its diverse phytochemical constituents remain elusive. Understanding these mechanisms is crucial for developing targeted, effective cannabis-based therapies.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!