Splice sites detection using chaos game representation and neural network.

Genomics

Department of Mathematical Sciences, Tsinghua University, Beijing 100084, P.R. China. Electronic address:

Published: March 2020

A novel method is proposed to detect the acceptor and donor splice sites using chaos game representation and artificial neural network. In order to achieve high accuracy, inputs to the neural network, or feature vector, shall reflect the true nature of the DNA segments. Therefore it is important to have one-to-one numerical representation, i.e. a feature vector should be able to represent the original data. Chaos game representation (CGR) is an iterative mapping technique that assigns each nucleotide in a DNA sequence to a respective position on the plane in a one-to-one manner. Using CGR, a DNA sequence can be mapped to a numerical sequence that reflects the true nature of the original sequence. In this research, we propose to use CGR as feature input to a neural network to detect splice sites on the NN269 dataset. Computational experiments indicate that this approach gives good accuracy while being simpler than other methods in the literature, with only one neural network component. The code and data for our method can be accessed from this link: https://github.com/thoang3/portfolio/tree/SpliceSites_ANN_CGR.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.ygeno.2019.10.018DOI Listing

Publication Analysis

Top Keywords

neural network
20
splice sites
12
chaos game
12
game representation
12
feature vector
8
true nature
8
dna sequence
8
neural
5
network
5
sites detection
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!