This article proposes an exploration technique for multiagent reinforcement learning (MARL) with graph-based communication among agents. We assume that the individual rewards received by the agents are independent of the actions by the other agents, while their policies are coupled. In the proposed framework, neighboring agents collaborate to estimate the uncertainty about the state-action space in order to execute more efficient explorative behavior.
View Article and Find Full Text PDFThis work focuses on the development of a new family of decision-making algorithms for adaptation and learning, which are specifically tailored to decision problems and are constructed by building up on first principles from decision theory. A key observation is that estimation and decision problems are structurally different and, therefore, algorithms that have proven successful for the former need not perform well when adjusted for the latter. Exploiting classical tools from quickest detection, we propose a tailored version of Page's test, referred to as BLLR (barrier log-likelihood ratio) test, and demonstrate its applicability to real-data from the COVID-19 pandemic in Italy.
View Article and Find Full Text PDFPatterns of brain structural connectivity (SC) and functional connectivity (FC) are known to be related. In SC-FC comparisons, FC has classically been evaluated from between functional time series, and more recently from or their unnormalized version encoded in the matrix. The latter FC metrics yield more meaningful comparisons to SC because they capture 'direct' statistical dependencies, that is, discarding the effects of mediators, but their use has been limited because of estimation issues.
View Article and Find Full Text PDFBackground: The tumor suppressor protein p53 plays important roles in DNA damage repair, cell cycle arrest and apoptosis. Due to its critical functions, the level of p53 is tightly regulated by a negative feedback mechanism to increase its tolerance towards fluctuations and disturbances. Interestingly, the p53 level is controlled by post-translational regulation rather than transcriptional regulation in this feedback mechanism.
View Article and Find Full Text PDFAnnu Int Conf IEEE Eng Med Biol Soc
July 2013
Using the transient interleukin (IL)-2 secretion of effector T helper (T(eff)) cells as an example, we show that self-organizing multicellular behavior can be modeled and predicted by an adaptive gene network model. Incorporating an adaptation algorithm we established previously, we construct a network model that has the parameter values iteratively updated to cope with environmental change governed by diffusion and cell-cell interactions. In contrast to non-adaptive models, we find that the proposed adaptive model for individual T(eff) cells can generate transient IL-2 secretory behavior that is observed experimentally at the population level.
View Article and Find Full Text PDFBiological systems are often treated as time-invariant by computational models that use fixed parameter values. In this study, we demonstrate that the behavior of the p53-MDM2 gene network in individual cells can be tracked using adaptive filtering algorithms and the resulting time-variant models can approximate experimental measurements more accurately than time-invariant models. Adaptive models with time-variant parameters can help reduce modeling complexity and can more realistically represent biological systems.
View Article and Find Full Text PDFBackground: Vasospasm is a common complication of aneurismal subarachnoid hemorrhage (SAH) that may lead to cerebral ischemia and death. The standard method for detection of vasospasm is conventional cerebral angiography, which is invasive and does not allow continuous monitoring of arterial radius. Monitoring of vasospasm is typically performed by measuring Cerebral Blood Flow Velocity (CBFV) in the major cerebral arteries and calculating the Lindegaard ratio.
View Article and Find Full Text PDF