Applying genetic programming to the prediction of alternative mRNA splice variants.

Genomics

Institut für Genetik, Universität zu Köln, Zülpicher Strasse 47, 50674 Köln, Germany.

Published: April 2007

Genetic programming (GP) can be used to classify a given gene sequence as either constitutively or alternatively spliced. We describe the principles of GP and apply it to a well-defined data set of alternatively spliced genes. A feature matrix of sequence properties, such as nucleotide composition or exon length, was passed to the GP system "Discipulus." To test its performance we concentrated on cassette exons (SCE) and retained introns (SIR). We analyzed 27,519 constitutively spliced and 9641 cassette exons including their neighboring introns; in addition we analyzed 33,316 constitutively spliced introns compared to 2712 retained introns. We find that the classifier yields highly accurate predictions on the SIR data with a sensitivity of 92.1% and a specificity of 79.2%. Prediction accuracies on the SCE data are lower, 47.3% (sensitivity) and 70.9% (specificity), indicating that alternative splicing of introns can be better captured by sequence properties than that of exons.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.ygeno.2007.01.001DOI Listing

Publication Analysis

Top Keywords

genetic programming
8
alternatively spliced
8
sequence properties
8
cassette exons
8
retained introns
8
constitutively spliced
8
introns
5
applying genetic
4
programming prediction
4
prediction alternative
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!