Challenges and Opportunities for Bayesian Statistics in Proteomics.

J Proteome Res

Department of Statistics, University of Oxford, Oxford OX1 3LB, United Kingdom.

Published: April 2022

Proteomics is a data-rich science with complex experimental designs and an intricate measurement process. To obtain insights from the large data sets produced, statistical methods, including machine learning, are routinely applied. For a quantity of interest, many of these approaches only produce a point estimate, such as a mean, leaving little room for more nuanced interpretations. By contrast, Bayesian statistics allows quantification of uncertainty through the use of probability distributions. These probability distributions enable scientists to ask complex questions of their proteomics data. Bayesian statistics also offers a modular framework for data analysis by making dependencies between data and parameters explicit. Hence, specifying complex hierarchies of parameter dependencies is straightforward in the Bayesian framework. This allows us to use a statistical methodology which equals, rather than neglects, the sophistication of experimental design and instrumentation present in proteomics. Here, we review Bayesian methods applied to proteomics, demonstrating their potential power, alongside the challenges posed by adopting this new statistical framework. To illustrate our review, we give a walk-through of the development of a Bayesian model for dynamic organic orthogonal phase-separation (OOPS) data.
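
As a minimal sketch of the contrast between a point estimate and a posterior distribution described above, the following Python example performs a conjugate Normal-Normal update for a single protein's log2 fold change. It is not the model from the review: the data values, the known measurement standard deviation `sigma`, the Normal(0, 1) prior, and the helper name `normal_posterior` are all assumptions chosen for illustration.

```python
from statistics import NormalDist, mean


def normal_posterior(data, sigma, prior_mean=0.0, prior_sd=1.0):
    """Posterior for the mean of Normal data with known sd `sigma`,
    under a Normal(prior_mean, prior_sd) prior (conjugate update)."""
    n = len(data)
    prior_prec = 1.0 / prior_sd**2      # precision of the prior
    data_prec = n / sigma**2            # precision contributed by the data
    post_var = 1.0 / (prior_prec + data_prec)
    post_mean = post_var * (prior_prec * prior_mean + data_prec * mean(data))
    return NormalDist(post_mean, post_var**0.5)


# Hypothetical log2 fold changes for one protein across four replicates.
log2_fc = [0.8, 1.1, 0.6, 0.9]

# Assumed known measurement sd of 0.5 on the log2 scale.
posterior = normal_posterior(log2_fc, sigma=0.5)

# A point estimate summarises the data with a single number ...
point_estimate = mean(log2_fc)

# ... whereas the posterior supports interval and probability statements.
ci_low, ci_high = posterior.inv_cdf(0.025), posterior.inv_cdf(0.975)
prob_up = 1.0 - posterior.cdf(0.0)  # P(log2 fold change > 0 | data)

print(f"point estimate:        {point_estimate:.3f}")
print(f"posterior mean:        {posterior.mean:.3f}")
print(f"95% credible interval: ({ci_low:.3f}, {ci_high:.3f})")
print(f"P(effect > 0 | data):  {prob_up:.4f}")
```

Under these assumptions the posterior mean is pulled slightly from the sample mean toward the prior mean, and the question "how probable is it that this protein is up-regulated?" has a direct answer, which a point estimate alone cannot give.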

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8982455
DOI: http://dx.doi.org/10.1021/acs.jproteome.1c00859
