Active reinforcement learning enables dynamic prediction and control, where one should not only maximize rewards but also minimize costs such as of inference, decisions, actions, and time. For an embodied agent such as a human, decisions are also shaped by physical aspects of actions. Beyond the effects of reward outcomes on learning processes, to what extent can modeling of behavior in a reinforcement-learning task be complicated by other sources of variance in sequential action choices? What of the effects of action bias (for actions per se) and action hysteresis determined by the history of actions chosen previously? The present study addressed these questions with incremental assembly of models for the sequential choice data from a task with hierarchical structure for additional complexity in learning.
View Article and Find Full Text PDFParabolic equations are among the most popular numerical techniques in many fields of physics. This article considers extra-wide-angle parabolic equations, wide-angle parabolic equations, and narrow-angle parabolic equations (EWAPEs, WAPEs, and NAPEs, respectively) for sound propagation in moving inhomogeneous media with arbitrarily large variations in the sound speed and Mach number of the (subsonic) wind speed. Within their ranges of applicability, these parabolic equations exactly describe the phase of the sound waves and are, thus, termed the phase-preserving EWAPE, WAPE, and NAPE.
View Article and Find Full Text PDFNoise generated by wind turbines is significantly impacted by its propagation in the atmosphere. Hence, for annoyance issues, an accurate prediction of sound propagation is critical to determine noise levels around wind turbines. This study presents a method to predict wind turbine sound propagation based on linearized Euler equations.
View Article and Find Full Text PDFHeuristics can inform human decision making in complex environments through a reduction of computational requirements (accuracy-resource trade-off) and a robustness to overparameterisation (less-is-more). However, tasks capturing the efficiency of heuristics typically ignore action proficiency in determining rewards. The requisite movement parameterisation in sensorimotor control questions whether heuristics preserve efficiency when actions are nontrivial.
View Article and Find Full Text PDF