A PHP Error was encountered

Severity: Warning

Message: file_get_contents(https://...@pubfacts.com&api_key=b8daa3ad693db53b1410957c26c9a51b4908&a=1): Failed to open stream: HTTP request failed! HTTP/1.1 429 Too Many Requests

Filename: helpers/my_audit_helper.php

Line Number: 176

Backtrace:

File: /var/www/html/application/helpers/my_audit_helper.php
Line: 176
Function: file_get_contents

File: /var/www/html/application/helpers/my_audit_helper.php
Line: 250
Function: simplexml_load_file_from_url

File: /var/www/html/application/helpers/my_audit_helper.php
Line: 1034
Function: getPubMedXML

File: /var/www/html/application/helpers/my_audit_helper.php
Line: 3152
Function: GetPubMedArticleOutput_2016

File: /var/www/html/application/controllers/Detail.php
Line: 575
Function: pubMedSearch_Global

File: /var/www/html/application/controllers/Detail.php
Line: 489
Function: pubMedGetRelatedKeyword

File: /var/www/html/index.php
Line: 316
Function: require_once

A Pipeline for Classifying Deleterious Coding Mutations in Agricultural Plants. | LitMetric

AI Article Synopsis

  • Deleterious mutations in plants are not fully understood in terms of their impact on fitness and crop productivity, largely due to insufficient mutational datasets.
  • A novel machine learning classifier was developed using 18 features to distinguish deleterious mutations from neutral ones, with Random Forest showing the best accuracy compared to other methods and the PolyPhen-2 tool.
  • The classifier demonstrated high accuracy in predicting deleterious mutations across different plant species and has the potential to enhance the identification of functional mutations, aiding in breeding improvements and the creation of new cultivars.

Article Abstract

The impact of deleterious variation on both plant fitness and crop productivity is not completely understood and is a hot topic of debates. The deleterious mutations in plants have been solely predicted using sequence conservation methods rather than function-based classifiers due to lack of well-annotated mutational datasets in these organisms. Here, we developed a machine learning classifier based on a dataset of deleterious and neutral mutations in by extracting 18 informative features that discriminate deleterious mutations from neutral, including 9 novel features not used in previous studies. We examined linear SVM, Gaussian SVM, and Random Forest classifiers, with the latter performing best. Random Forest classifiers exhibited a markedly higher accuracy than the popular PolyPhen-2 tool in the dataset. Additionally, we tested whether the Random Forest, trained on the dataset, accurately predicts deleterious mutations in and and observed satisfactory levels of performance accuracy (87% and 93%, respectively) higher than obtained by the PolyPhen-2. Application of Transfer learning in classifiers did not improve their performance. To additionally test the performance of the Random Forest classifier across different angiosperm species, we applied it to annotate deleterious mutations in and validated them using population frequency data. Overall, we devised a classifier with the potential to improve the annotation of putative functional mutations in QTL and GWAS hit regions, as well as for the evolutionary analysis of proliferation of deleterious mutations during plant domestication; thus optimizing breeding improvement and development of new cultivars.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6279870PMC
http://dx.doi.org/10.3389/fpls.2018.01734DOI Listing

Publication Analysis

Top Keywords

deleterious mutations
20
random forest
16
deleterious
8
mutations
8
forest classifiers
8
pipeline classifying
4
classifying deleterious
4
deleterious coding
4
coding mutations
4
mutations agricultural
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!