A PHP Error was encountered

Severity: Warning

Message: file_get_contents(https://...@pubfacts.com&api_key=b8daa3ad693db53b1410957c26c9a51b4908&a=1): Failed to open stream: HTTP request failed! HTTP/1.1 429 Too Many Requests

Filename: helpers/my_audit_helper.php

Line Number: 176

Backtrace:

File: /var/www/html/application/helpers/my_audit_helper.php
Line: 176
Function: file_get_contents

File: /var/www/html/application/helpers/my_audit_helper.php
Line: 250
Function: simplexml_load_file_from_url

File: /var/www/html/application/helpers/my_audit_helper.php
Line: 1034
Function: getPubMedXML

File: /var/www/html/application/helpers/my_audit_helper.php
Line: 3152
Function: GetPubMedArticleOutput_2016

File: /var/www/html/application/controllers/Detail.php
Line: 575
Function: pubMedSearch_Global

File: /var/www/html/application/controllers/Detail.php
Line: 489
Function: pubMedGetRelatedKeyword

File: /var/www/html/index.php
Line: 316
Function: require_once

gene prediction for protein-coding regions. | LitMetric

gene prediction for protein-coding regions.

Bioinform Adv

Department of Physics and Optical Science, University of North Carolina at Charlotte, NC 28223, United States.

Published: August 2023

Motivation: gene prediction in nonmodel organisms is a difficult task. While many methods have been developed, their average accuracy over long segments of a genome, and especially when assessed over a wide range of species, generally yields results with sensitivity and specificity levels in the low 60% range. A common weakness of most methods is the tendency to learn patterns that are species-specific to varying degrees. The need exists for methods to extract genetic features that can distinguish coding and noncoding regions that are not sensitive to specific organism characteristics.

Results: A new method based on a neural network (NN) that uses a collection of sensors to create input features is presented. It is shown that accurate predictions are achieved even when trained on organisms that are significantly different phylogenetically than test organisms. A consensus prediction algorithm for a CoDing Sequence (CDS) is subsequently applied to the first nucleotide level of NN predictions that boosts accuracy through a data-driven procedure that optimizes a CDS/non-CDS threshold. An aggregate accuracy benchmark at the nucleotide level shows that this new approach performs better than existing methods, while requiring significantly less training data.

Availability And Implementation: https://github.com/BioMolecularPhysicsGroup-UNCC/MachineLearning.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10448985PMC
http://dx.doi.org/10.1093/bioadv/vbad105DOI Listing

Publication Analysis

Top Keywords

gene prediction
8
nucleotide level
8
prediction protein-coding
4
protein-coding regions
4
regions motivation
4
motivation gene
4
prediction nonmodel
4
nonmodel organisms
4
organisms difficult
4
difficult task
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!