Unlabelled: Confounding variables are a recurrent challenge for causal discovery and inference. In many situations, complex causal mechanisms only manifest themselves in extreme events, or take simpler forms in the extremes. Stimulated by data on extreme river flows and precipitation, we introduce a new causal discovery methodology for heavy-tailed variables that allows the effect of a known potential confounder to be almost entirely removed when the variables have comparable tails, and also decreases it sufficiently to enable correct causal inference when the confounder has a heavier tail. We also introduce a new parametric estimator for the existing causal tail coefficient and a permutation test. Simulations show that the methods work well and the ideas are applied to the motivating dataset.

Supplementary Information: The online version contains supplementary material available at 10.1007/s10687-022-00456-4.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10423152PMC
http://dx.doi.org/10.1007/s10687-022-00456-4DOI Listing

Publication Analysis

Top Keywords

heavy-tailed variables
8
causal discovery
8
causal
6
causal modelling
4
modelling heavy-tailed
4
variables
4
variables confounders
4
confounders application
4
application river
4
river flow
4

Similar Publications

On the Bayesian generalized extreme value mixture autoregressive model with adjusted SNR in non-standard actuarial data.

MethodsX

June 2025

Department of Statistics, Institut Teknologi Sepuluh Nopember, Surabaya 60111 Indonesia.

This research introduces the Generalized Extreme Value Mixture Autoregressive (GEVMAR) model as an innovative approach for examining non-standard actuarial datasets within general insurance. Information concerning claim reserves often reveals notable volatility and multimodal distributions, attributes that standard models, including previous method such as the Gaussian Mixture Autoregressive (GMAR) model and other autoregressive methodologies, find problematic to manage effectively. The GEVMAR model integrates the Generalized Extreme Value (GEV) distribution alongside Bayesian estimation techniques, augmented by a modified Signal-to-Noise Ratio (SNR) metric to improve predictive accuracy.

View Article and Find Full Text PDF

Simulations of large-scale brain dynamics are often impacted by overexcitation resulting from heavy-tailed structural network distributions, leading to biologically implausible simulation results. We implement a homeodynamic plasticity mechanism, known from other modeling work, in the widely used Jansen-Rit neural mass model for The Virtual Brain (TVB) simulation framework. We aim at heterogeneously adjusting the inhibitory coupling weights to reach desired dynamic regimes in each brain region.

View Article and Find Full Text PDF

Robust double machine learning model with application to omics data.

BMC Bioinformatics

November 2024

Department of Biostatistics, Key Laboratory of Public Health Safety of Ministry of Education, Key Laboratory for Health Technology Assessment, National Commission of Health, School of Public Health, Fudan University, Shanghai, China.

Background: Recently, there has been a growing interest in combining causal inference with machine learning algorithms. Double machine learning model (DML), as an implementation of this combination, has received widespread attention for their expertise in estimating causal effects within high-dimensional complex data. However, the DML model is sensitive to the presence of outliers and heavy-tailed noise in the outcome variable.

View Article and Find Full Text PDF

Structural and hydrodynamic controls on fluid travel time distributions across fracture networks.

Proc Natl Acad Sci U S A

November 2024

Fractory, Geosciences UMR 6118, Univ Rennes, CNRS, Rennes 35042, France.

Fracture networks are preferential flow paths playing a critical role in a wide range of environmental and industrial problems. Their complex multiscale structure leads to broad distributions of fluid travel times, affecting many biogeochemical processes. Yet, the relationship between the fracture network structures, their hydrodynamic properties, and the resulting anomalous transport dynamics remains unclear.

View Article and Find Full Text PDF
Article Synopsis
  • Variable selection methods are crucial in cancer genomics for identifying key features related to disease traits, but their reliability is questioned without proper techniques to measure uncertainty.
  • This article reviews both frequentist and Bayesian methods that provide uncertainty quantification, discussing how they connect through specific frameworks for regularization and analysis.
  • The authors emphasize the benefits of robust Bayesian variable selection in cancer studies, showing superior results and valid statistical inference, particularly in cases with heavy-tailed errors and outliers.
View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!