Genome annotation techniques: new approaches and challenges.

Drug Discov Today

European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK.

Published: June 2002

As more of the human genome draft sequence is finished, and genomes from other organisms begin to be sequenced, the demand for accurate and reliable genome annotation will increase significantly. To facilitate this industrial-scale genome annotation, automated bioinformatics solutions are increasingly required. As a result, automatic genome annotation systems have become more important in gene discovery in recent years. The design of such large-scale bioinformatics systems is an evolving and dynamic field, built around core sets of bioinformatics software tools and relational databases. Not only must these systems efficiently manage and integrate large volumes of genomic data, but they must also deliver accurate gene predictions and effectively distribute annotation data to the biosciences community.

DOI: http://dx.doi.org/10.1016/s1359-6446(02)02289-4

Similar Publications

Genome-wide association studies are enriched for interacting genes.

BioData Min

January 2025

The Department of Computational Biomedicine, Cedars-Sinai Medical Center, Los Angeles, CA, 90069, USA.

Background: With recent advances in single cell technology, high-throughput methods provide unique insight into disease mechanisms and more importantly, cell type origin. Here, we used multi-omics data to understand how genetic variants from genome-wide association studies influence development of disease. We show in principle how to use genetic algorithms with normal, matching pairs of single-nucleus RNA- and ATAC-seq, genome annotations, and protein-protein interaction data to describe the genes and cell types collectively and their contribution to increased risk.


TOM40 as a prognostic oncogene for oral squamous cell carcinoma prognosis.

BMC Cancer

January 2025

Department of Otorhinolaryngology, Shenzhen Key Laboratory of Otorhinolaryngology, Longgang Otorhinolaryngology Hospital, Shenzhen Institute of Otorhinolaryngology, No. 3004 Longgang Avenue, Shenzhen, Guangdong, China.

Background: To investigate the role of the translocase of the outer mitochondrial membrane 40 (TOM40) in oral squamous cell carcinoma (OSCC) with the aim of identifying new biomarkers or potential therapeutic targets.

Methods: The TOM40 expression level in OSCC was evaluated using datasets downloaded from The Cancer Genome Atlas (TCGA), as well as clinical data. Correlations between the TOM40 expression level and clinicopathological parameters and survival were analyzed in TCGA.


Environmental gradients shape genetic variation in the desert moss, Syntrichia caninervis Mitt. (Pottiaceae).

Sci Rep

January 2025

Department of Biological Sciences, California State University Los Angeles, 5151 State University Dr, Los Angeles, CA, 90032, USA.

The moss Syntrichia caninervis Mitt. is distributed throughout drylands globally, and often anchors ecologically significant communities known as biological soil crusts (biocrusts). The species occupies a variety of dryland habitats with varying levels of drought and temperature stress, suggesting the potential for ecological specialization within S.


Massively parallel characterization of transcriptional regulatory elements.

Nature

January 2025

Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA, USA.

The human genome contains millions of candidate cis-regulatory elements (cCREs) with cell-type-specific activities that shape both health and many disease states. However, we lack a functional understanding of the sequence features that control the activity and cell-type-specific features of these cCREs. Here we used lentivirus-based massively parallel reporter assays (lentiMPRAs) to test the regulatory activity of more than 680,000 sequences, representing an extensive set of annotated cCREs among three cell types (HepG2, K562 and WTC11), and found that 41.


Chromosome-level genome assembly and annotation of largemouth bronze gudgeon (Coreius guichenoti).

Sci Data

January 2025

Key Laboratory of Freshwater Biodiversity Conservation, Ministry of Agriculture and Rural Affairs, Yangtze River Fisheries Research Institute, Chinese Academy of Fishery Sciences, Wuhan, 430223, China.

Coreius guichenoti, mainly distributed in the upstream regions of the Yangtze River, China, is currently on the brink of extinction and is listed as a national secondary protected animal. In this study, we aimed to obtain the chromosome-level genome of C. guichenoti using PacBio and Hi-C techniques.

