Gene prediction and gene classes in Arabidopsis thaliana.

J Biotechnol

Laboratorium voor Genetica, Department of Genetics, Flanders Interuniversity Institute for Biotechnology (VIB), Universiteit Gent, B-9000, Gent, Belgium.

Published: March 2000

AI Article Synopsis

  • Gene prediction methods for eukaryotic genomes, like those used for Arabidopsis thaliana, are still not fully effective, but using multiple gene models can enhance accuracy.
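  • Researchers classified genes into two classes (CU(1) and CU(2)) based on statistical features and built a GeneMark Markov model for each class (GM-CU(1), GM-CU(2)) as well as one for both classes combined (GM-all).
  • On a test set of 168 genes, GM-CU(1) was more sensitive than GM-CU(2), which appeared to be more specific to one gene type; GM-all performed no better than GM-CU(1), and combining the results of GM-CU(1) and GM-CU(2) greatly improved prediction efficiency compared with GM-all alone.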

Article Abstract

Gene prediction methods for eukaryotic genomes are still not fully satisfactory. One way to improve gene prediction accuracy, already shown to be relevant for prokaryotes, is to consider more than one model of genes. We therefore used our classification of Arabidopsis thaliana genes into two classes (CU(1) and CU(2)), previously delineated according to statistical features, in the GeneMark gene identification program. For each gene class, as well as for the two classes combined, a Markov model was developed (GM-CU(1), GM-CU(2) and GM-all, respectively), and the three models were run on a test set of 168 genes to compare their efficiency. We concluded from this analysis that GM-CU(1) is more sensitive than GM-CU(2), which seems to be more specific to a gene type. Moreover, GM-all does not give better results than GM-CU(1), and combining results from GM-CU(1) and GM-CU(2) greatly improves prediction efficiency in comparison with predictions made with GM-all only. Thus, this work confirms the need to consider more than one gene model for gene prediction in eukaryotic genomes, and to look for gene classes in order to build these models.
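
The idea of training one Markov model per gene class and then combining their predictions can be illustrated with a minimal sketch. This is not GeneMark's implementation: it assumes a simple homogeneous second-order Markov chain over uppercase A/C/G/T, add-one smoothing, and a log-odds score against a background model. The function names (train_markov, log_likelihood, classify) and all toy sequences are hypothetical and exist only for illustration.

```python
import math
from collections import defaultdict
from itertools import product

ALPHABET = "ACGT"  # assumption: sequences contain only uppercase A/C/G/T


def train_markov(seqs, order=2, pseudocount=1.0):
    """Estimate log P(base | previous `order` bases) with additive smoothing."""
    counts = defaultdict(lambda: defaultdict(float))
    for seq in seqs:
        for i in range(order, len(seq)):
            counts[seq[i - order:i]][seq[i]] += 1.0
    model = {}
    for ctx in ("".join(p) for p in product(ALPHABET, repeat=order)):
        total = sum(counts[ctx].values()) + pseudocount * len(ALPHABET)
        model[ctx] = {
            b: math.log((counts[ctx][b] + pseudocount) / total) for b in ALPHABET
        }
    return model


def log_likelihood(model, seq, order=2):
    """Sum of log transition probabilities along the sequence."""
    return sum(model[seq[i - order:i]][seq[i]] for i in range(order, len(seq)))


def classify(seq, class_models, background, order=2):
    """Score seq with every class model and keep the best log-odds.

    Returns (label, score); label is None when no class model beats
    the background, i.e. the sequence is not called as a gene.
    """
    bg = log_likelihood(background, seq, order)
    scores = {
        name: log_likelihood(m, seq, order) - bg for name, m in class_models.items()
    }
    best = max(scores, key=scores.get)
    return (best, scores[best]) if scores[best] > 0 else (None, scores[best])


# Toy training data (invented): one composition-biased set per class.
cu1_train = ["GCGCGCGGCCGCGCGGCGCC", "GGCCGCGCGCCGGCGCGCGG"]
cu2_train = ["ATATTAATATTTAATATAAT", "TTAATATATAATTTATATAA"]
background_train = ["ACGTACGTTGCATGCAACGT", "TGCATCGATCGTAGCTAGCA"]

class_models = {
    "CU1": train_markov(cu1_train),
    "CU2": train_markov(cu2_train),
}
background = train_markov(background_train)

for candidate in ("GCGCGGCGCC", "ATATTAATAT", "ACGTACGT"):
    label, score = classify(candidate, class_models, background)
    print(candidate, label, round(score, 2))
```

Taking the best score over the class-specific models means a sequence is accepted if either model recognises it, which is one simple way to combine two models; the abstract does not specify how the GM-CU(1) and GM-CU(2) results were actually merged.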

Source
http://dx.doi.org/10.1016/s0168-1656(00)00196-6
