A PHP Error was encountered

Severity: Warning

Message: file_get_contents(https://...@pubfacts.com&api_key=b8daa3ad693db53b1410957c26c9a51b4908&a=1): Failed to open stream: HTTP request failed! HTTP/1.1 429 Too Many Requests

Filename: helpers/my_audit_helper.php

Line Number: 176

Backtrace:

File: /var/www/html/application/helpers/my_audit_helper.php
Line: 176
Function: file_get_contents

File: /var/www/html/application/helpers/my_audit_helper.php
Line: 250
Function: simplexml_load_file_from_url

File: /var/www/html/application/helpers/my_audit_helper.php
Line: 1034
Function: getPubMedXML

File: /var/www/html/application/helpers/my_audit_helper.php
Line: 3152
Function: GetPubMedArticleOutput_2016

File: /var/www/html/application/controllers/Detail.php
Line: 575
Function: pubMedSearch_Global

File: /var/www/html/application/controllers/Detail.php
Line: 489
Function: pubMedGetRelatedKeyword

File: /var/www/html/index.php
Line: 316
Function: require_once

Using the Chou's 5-steps rule to predict splice junctions with interpretable bidirectional long short-term memory networks. | LitMetric

AI Article Synopsis

  • Neural models have shown excellent performance in genome sequence prediction tasks by automatically learning important features from nucleotide sequences, but interpreting these features remains difficult.
  • This study evaluates various visualization techniques to extract relevant sequence information learned by a recurrent neural network (RNN) for identifying splice junctions, using genomic data at various nucleotide levels.
  • Results demonstrate that different visualization methods provide comparable results for branchpoint detection, with perturbation techniques outperforming back-propagation for canonical motifs, while the opposite is true for non-canonical motifs; the tool for this visualization is available on GitHub.

Article Abstract

Neural models have been able to obtain state-of-the-art performances on several genome sequence-based prediction tasks. Such models take only nucleotide sequences as input and learn relevant features on their own. However, extracting the interpretable motifs from the model remains a challenge. This work explores various existing visualization techniques in their ability to infer relevant sequence information learnt by a recurrent neural network (RNN) on the task of splice junction identification. The visualization techniques have been modulated to suit the genome sequences as input. The visualizations inspect genomic regions at the level of a single nucleotide as well as a span of consecutive nucleotides. This inspection is performed based on the modification of input sequences (perturbation based) or the embedding space (back-propagation based). We infer features pertaining to both canonical and non-canonical splicing from a single neural model. Results indicate that the visualization techniques produce comparable performances for branchpoint detection. However, in the case of canonical donor and acceptor junction motifs, perturbation based visualizations perform better than back-propagation based visualizations, and vice-versa for non-canonical motifs. The source code of our stand-alone SpliceVisuL tool is available at https://github.com/aaiitggrp/SpliceVisuL.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.compbiomed.2019.103558DOI Listing

Publication Analysis

Top Keywords

visualization techniques
12
sequences input
8
perturbation based
8
back-propagation based
8
based visualizations
8
based
5
chou's 5-steps
4
5-steps rule
4
rule predict
4
predict splice
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!