Background: Single-pass, partial sequencing of complementary DNA (cDNA) libraries generates thousands of chromatograms that are processed into high quality expressed sequence tags (ESTs), and then assembled into contigs representative of putative genes. Usually, to be of value, ESTs and contigs must be associated with meaningful annotations, and made available to end-users.

Results: A web application, Expressed Sequence Tag Information Management and Annotation (ESTIMA), has been created to meet the EST annotation and data management requirements of multiple high-throughput EST sequencing projects. It is anchored on individual ESTs and organized around different properties of ESTs including chromatograms, base-calling quality scores, structure of assembled transcripts, and multiple sources of comparison to infer functional annotation, Gene Ontology associations, and cDNA library information. ESTIMA consists of a relational database schema and a set of interactive query interfaces. These are integrated with a suite of web-based tools that allow a user to query and retrieve information. Further, query results are interconnected among the various EST properties. ESTIMA has several unique features. Users may run their own EST processing pipeline, search against preferred reference genomes, and use any clustering and assembly algorithm. The ESTIMA database schema is very flexible and accepts output from any EST processing and assembly pipeline. ESTIMA has been used for the management of EST projects of many species, including honeybee (Apis mellifera), cattle (Bos taurus), songbird (Taeniopygia guttata), corn rootworm (Diabrotica vergifera), catfish (Ictalurus punctatus, Ictalurus furcatus), and apple (Malus x domestica). The entire resource may be downloaded and used as is, or readily adapted to fit the unique needs of other cDNA sequencing projects.

Conclusions: The scripts used to create the ESTIMA interface are freely available to academic users in an archived format from http://titan.biotec.uiuc.edu/ESTIMA/. The entity-relationship (E-R) diagrams and the programs used to generate the Oracle database tables are also available. We have also provided detailed installation instructions and a tutorial at the same website. Presently the chromatograms, EST databases and their annotations have been made available for cattle and honeybee brain EST projects. Non-academic users need to contact the W.M. Keck Center for Functional and Comparative Genomics, University of Illinois at Urbana-Champaign, Urbana, IL, for licensing information.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC533868PMC
http://dx.doi.org/10.1186/1471-2105-5-176DOI Listing

Publication Analysis

Top Keywords

est
9
expressed sequence
8
database schema
8
est processing
8
est projects
8
estima
7
estima tool
4
tool est
4
management
4
est management
4

Similar Publications

Background: Advanced pancreatic ductal adenocarcinoma (aPDAC) is often accompanied by significant muscle mass loss, contributing to poor prognosis. SarcAPACaP, an ancillary study of the GERCOR-APACaP phase III trial, evaluated the role of adapted physical activity (APA) in aPDAC Western patients receiving first-line chemotherapy. The study aimed to assess (1) the potential impact of computed tomography (CT)-quantified muscle mass before and during treatments on health-related quality of life (HRQoL) and overall survival (OS) and (2) the role of APA in mitigating muscle mass loss.

View Article and Find Full Text PDF

In response to the increasing emergence of zoonotic pathogens, flexible, multisectoral surveillance systems capable of generating alerts thanks to rapid, nonspecific detection, are crucial before pathogens reach human populations. Syndromic surveillance has proven to be a breakthrough for near real-time disease surveillance in the public health sector. It relies on existing nonspecific data, usually collected for other purposes.

View Article and Find Full Text PDF

There is an urgent need to develop effective and sustainable methods to decrease sulfonamide (SA) contamination of soil. Herein, a non-homogeneous system of zero-valent metal-biochar-based composites was proposed and tested for persulfate (PS) activation. This system employed zero-valent iron (Fe) as an electron donor to catalyze the cleavage of the OO bond in PS, thereby generating reactive oxygen species (ROS) that degrade SAs.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!