Accurate eQTL prioritization with an ensemble-based framework.

Hum Mutat

Computer Science and Artificial Intelligence Lab, Massachusetts Institute of Technology, Cambridge, Massachusetts.

Published: September 2017

We present a novel ensemble-based computational framework, EnsembleExpr, that achieved the best performance in the Fourth Critical Assessment of Genome Interpretation "expression quantitative trait locus (eQTL)-causal SNPs" challenge for identifying eQTLs and prioritizing their gene expression effects. eQTLs are genome sequence variants that result in gene expression changes and are thus prime suspects in the search for causal contributors to complex traits. When EnsembleExpr is trained on data from massively parallel reporter assays, it accurately predicts reporter expression levels from unseen regulatory sequences and identifies sequence variants that exhibit significant changes in reporter expression. Compared with other state-of-the-art methods, EnsembleExpr achieved competitive performance when applied to eQTL datasets determined by other protocols. We envision EnsembleExpr as a resource to help interpret noncoding regulatory variants and prioritize disease-associated mutations for downstream validation.
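The general idea of training several predictors on reporter-assay measurements, averaging their outputs, and ranking variants by the predicted expression difference between alleles can be sketched as follows. This is a minimal illustration under stated assumptions, not the EnsembleExpr implementation: the two sequence features (GC content and a count of the motif TGACTCA), the per-feature linear fits, the unweighted averaging, the class and function names, and all sequences and expression values are hypothetical and chosen only to keep the example self-contained.

```python
from statistics import mean

# Hypothetical sequence features; the abstract does not say which features are used.
def gc_content(seq):
    """Fraction of G/C bases in the sequence."""
    return sum(base in "GC" for base in seq) / len(seq)

def motif_count(seq, motif="TGACTCA"):
    """Number of (possibly overlapping) occurrences of an assumed motif."""
    k = len(motif)
    return sum(seq[i:i + k] == motif for i in range(len(seq) - k + 1))

def fit_line(xs, ys):
    """Ordinary least squares for y = slope * x + intercept on one feature."""
    mx, my = mean(xs), mean(ys)
    var = sum((x - mx) ** 2 for x in xs)
    slope = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / var if var else 0.0
    return slope, my - slope * mx

class ToyEnsemble:
    """Averages one single-feature linear model per feature (an assumed design)."""

    def __init__(self, features):
        self.features = features
        self.params = []

    def fit(self, seqs, expression):
        # Each base model is fitted independently on the same training data.
        self.params = [
            fit_line([f(s) for s in seqs], expression) for f in self.features
        ]
        return self

    def predict(self, seq):
        # Ensemble prediction = unweighted mean of the base-model predictions.
        return mean(a * f(seq) + b for f, (a, b) in zip(self.features, self.params))

def rank_variants(model, variants):
    """Score each (name, ref, alt) by predicted alt - ref expression and
    sort by absolute effect, largest first."""
    scored = [(name, model.predict(alt) - model.predict(ref))
              for name, ref, alt in variants]
    return sorted(scored, key=lambda item: abs(item[1]), reverse=True)

# Synthetic stand-in for reporter-assay training data (not real measurements).
train_seqs = ["AAAAAAAAAA", "AAAAAGGGGG", "GGGGGGGGGG", "TGACTCAAAA"]
train_expr = [0.0, 1.0, 2.0, 3.0]
model = ToyEnsemble([gc_content, motif_count]).fit(train_seqs, train_expr)

# Synthetic variants: one disrupts the assumed motif, one changes neither feature.
variants = [
    ("v_neutral", "AAAAAAAAAA", "AAAATAAAAA"),
    ("v_motif", "TGACTCAAAA", "TGAATCAAAA"),
]
ranked = rank_variants(model, variants)
for name, effect in ranked:
    print(name, round(effect, 3))
```

In this toy setting the motif-disrupting variant is ranked first with a negative predicted effect, while the variant that changes neither feature receives an effect of zero. A realistic pipeline would replace the toy features and linear fits with models trained on actual assay data and would assess the significance of each predicted change.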


Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5561514
DOI: http://dx.doi.org/10.1002/humu.23198
