From tradition to innovation: conventional and deep learning frameworks in genome annotation.

Brief Bioinform

National Key Laboratory for Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, Guangzhou 518120, China.

Published: March 2024

Following the milestone success of the Human Genome Project, the Encyclopedia of DNA Elements (ENCODE) initiative was launched in 2003 to uncover information about the numerous functional elements within the genome. This endeavor coincided with the emergence of numerous novel technologies, which supplied vast amounts of whole-genome sequence and high-throughput data such as ChIP-Seq and RNA-Seq. Extracting biologically meaningful information from these massive datasets has become a critical aspect of many recent studies, particularly in annotating and predicting the functions of unknown genes. The core idea behind genome annotation is to identify genes and other functional elements within the genome sequence and to infer their biological functions. Traditional wet-lab experimental methods still require extensive effort for functional verification, while early bioinformatics algorithms and software primarily employed shallow learning techniques, so their capacity for data characterization and feature learning was limited. With the widespread adoption of RNA-Seq technology, scientists in the biological community began to harness the potential of machine learning and deep learning approaches for gene structure prediction and functional annotation. In this context, we reviewed both conventional methods and contemporary deep learning frameworks and highlighted novel perspectives on the challenges arising during annotation, underscoring the dynamic nature of this evolving scientific landscape.

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10998533
DOI: http://dx.doi.org/10.1093/bib/bbae138
