Inferring causal relationships from observational data is a key challenge in understanding the interpretability of Machine Learning models. Given the ever-increasing amount of observational data available in many areas, Machine Learning algorithms used for forecasting have become more complex, leading to a less understandable path of how a decision is made by the model. To address this issue, we propose leveraging ensemble models, e.g., Random Forest, to assess which input features the trained model prioritizes when making a forecast and, in this way, establish causal relationships between the variables. The advantage of these algorithms lies in their ability to provide feature importance, which allows us to build the causal network. We present our methodology to estimate causality in time series from oil field production. As it is difficult to extract causal relations from a real field, we also included a synthetic oil production dataset and a weather dataset, which is also synthetic, to provide the ground truth. We aim to perform causal discovery, i.e., establish the existing connections between the variables in each dataset. Through an iterative process of improving the forecasting of a target's value, we evaluate whether the forecasting improves by adding information from a new potential driver; if so, we state that the driver causally affects the target. On the oil field-related datasets, our causal analysis results agree with the interwell connections already confirmed by tracer information; whenever the tracer data are available, we used it as our ground truth. This consistency between both estimated and confirmed connections provides us the confidence about the effectiveness of our proposed methodology. To our knowledge, this is the first time causal analysis using solely production data is employed to discover interwell connections in an oil field dataset.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10349147PMC
http://dx.doi.org/10.1038/s41598-023-37929-wDOI Listing

Publication Analysis

Top Keywords

causal relationships
12
time series
8
causal
8
ensemble models
8
observational data
8
machine learning
8
oil field
8
ground truth
8
causal analysis
8
interwell connections
8

Similar Publications

Background: Gastrointestinal bleeding (GIB) is a severe and potentially life-threatening complication in patients with acute myocardial infarction (AMI), significantly affecting prognosis during hospitalization. Early identification of high-risk patients is essential to reduce complications, improve outcomes, and guide clinical decision-making.

Objective: This study aimed to develop and validate a machine learning (ML)-based model for predicting in-hospital GIB in patients with AMI, identify key risk factors, and evaluate the clinical applicability of the model for risk stratification and decision support.

View Article and Find Full Text PDF

The ionizable lipid component of lipid nanoparticle (LNP) formulations is essential for mRNA delivery by facilitating endosomal escape. Conventionally, these lipids are synthesized through complex, multistep chemical processes that are both time-consuming and require significant engineering. Furthermore, the development of new ionizable lipids is hindered by a limited understanding of the structure-activity relationships essential for effective mRNA delivery.

View Article and Find Full Text PDF

Long-term Double-J stenting is superior to short-term Single-J stenting in kidney transplantation.

PLoS One

January 2025

Division of Hepatobiliary and Transplantation Surgery, Department of Surgery, Erasmus MC Transplant Institute, University Medical Center Rotterdam, Rotterdam, The Netherlands.

Background And Objectives: Urological complications after kidney transplantation, due to the ureteroneocystostomy, are associated with significant morbidity, prolonged hospital stay and even mortality. Ureteral stents can minimize the number of complications but are not consistently used, as previous studies were retrospective in nature. We aim to prospectively determine the most effective stenting approach.

View Article and Find Full Text PDF

Influenza virus pandemics and seasonal epidemics have claimed countless lives. Recurrent zoonotic spillovers of influenza viruses with pandemic potential underscore the need for effective countermeasures. In this study, we show that pre-exposure prophylaxis with broadly neutralizing antibody (bnAb) MEDI8852 is highly effective in protecting cynomolgus macaques from severe disease caused by aerosolized highly pathogenic avian influenza H5N1 virus infection.

View Article and Find Full Text PDF

Relationships between parasites, host physiology, and behaviours are complex. Parasites can influence host hormonal microenvironment and behaviour through "sickness behaviours" that generally conserve energy. Using a parasite removal experiment, we examined the effects of gastrointestinal parasites on fecal glucocorticoid metabolites (fGC) and behaviours of vervet monkeys (Chlorocebus pygerythrus) at Lake Nabugabo, Uganda.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!