Modelling splice sites with locality-sensitive sequence features.

Int J Data Min Bioinform

Graduate School of Engineering Science and Technology, National Yunlin University of Science and Technology, 123 University Road, Section 3, Touliu, Yunlin, Taiwan 640, ROC.

Published: April 2013

The splice sites are essential for pre-mRNA maturation and crucial for Splice Site Modelling (SSM); however, there are gaps between the splicing signals and the computationally identified sequence features. In this paper, the Locality Sensitive Features (LSFs) are proposed to reduce the gaps by homogenising their contexts. Under the skewness-kurtosis based statistics and data analysis, SSM attributed with LSFs is fulfilled by double-boundary outlier filters. The LSF-based SSM had been applied to six model organisms of diverse species; by the accuracy and Receiver Operating Characteristic (ROC) analysis, the promising results show the proposed methodology is versatile and robust for the splice-site classification. It is prospective the LSF-based SSM can serve as a new infrastructure for developing effective splice-site prediction methods and have the potential to be applied to other sequence prediction problems.

Download full-text PDF

Source
http://dx.doi.org/10.1504/ijdmb.2013.050979DOI Listing

Publication Analysis

Top Keywords

splice sites
8
sequence features
8
lsf-based ssm
8
modelling splice
4
sites locality-sensitive
4
locality-sensitive sequence
4
features splice
4
sites essential
4
essential pre-mrna
4
pre-mrna maturation
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!