AI Article Synopsis

  • Large-scale gene annotations often mispredict transcription start sites, making it hard to reliably identify promoter regions.
  • Researchers have created databases (Human Upstream and Mouse Upstream) using data from human and mouse genomes along with expressed sequence tags (dbEST) to improve the accuracy of identifying these regions.
  • This method utilizes the ENSEMBL genome annotation system for more accurate transcript start site detection and provides free access to the databases for researchers.

Article Abstract

Large-scale genome annotations, based largely on gene prediction programs, may be inaccurate in their predictions of transcription start sites, so that the identification of promoter regions remains unreliable. Here we focus on the identification of reliable gene promoter regions, critical to the understanding of transcriptional regulation. We report the construction of databases of upstream sequences Human Upstream and Mouse Upstream based on information from both the human and mouse genomes and the database of expressed sequence tags (dbEST). Using the ENSEMBL generic genome annotation system, our approach allows more reliable identification of transcript start sites, and therefore extraction of more reliable promoters regions. The Human Upstream and Human Upstream databases are available free of charge.

Download full-text PDF

Source
http://dx.doi.org/10.1089/omi.2005.9.220DOI Listing

Publication Analysis

Top Keywords

human upstream
12
sequences human
8
human mouse
8
mouse genomes
8
start sites
8
promoter regions
8
human
5
upstream
5
humanupstream mouseupstream
4
mouseupstream databases
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!