Predicting potential target genes in molecular biology experiments using machine learning and multifaceted data sources.

iScience

Department of Physiological Chemistry, Graduate School of Pharmaceutical Science, The University of Tokyo, Tokyo 113-0033, Japan.

Published: March 2024

Experimental analysis of functionally related genes is key to understanding biological phenomena. The selection of genes to study is a crucial and challenging step, as it requires extensive knowledge of the literature and diverse biomedical data resources. Although software tools that predict relationships between genes are available to accelerate this process, they do not directly incorporate experiment information derived from the literature. Here, we develop LEXAS, a target gene suggestion system for molecular biology experiments. LEXAS is based on machine learning models trained with diverse information sources, including 24 million experiment descriptions extracted from full-text articles in PubMed Central by using a deep-learning-based natural language processing model. By integrating the extracted experiment contexts with biomedical data sources, LEXAS suggests potential target genes for upcoming experiments, complementing existing tools like STRING, FunCoup, and GOSemSim. A simple web interface enables biologists to consider newly derived gene information while planning experiments.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10933549PMC
http://dx.doi.org/10.1016/j.isci.2024.109309DOI Listing

Publication Analysis

Top Keywords

potential target
8
target genes
8
molecular biology
8
biology experiments
8
machine learning
8
data sources
8
biomedical data
8
genes
5
predicting potential
4
genes molecular
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!