Using a Stochastic Agent Model to Optimize Performance in Divergent Interest Tacit Coordination Games.

Sensors (Basel)

Department of Industrial Engineering and Management, Ariel University, Ariel 40700, Israel.

Published: December 2020

In recent years collaborative robots have become major market drivers in industry 5.0, which aims to incorporate them alongside humans in a wide array of settings ranging from welding to rehabilitation. Improving human-machine collaboration entails using computational algorithms that will save processing as well as communication cost. In this study we have constructed an agent that can choose when to cooperate using an optimal strategy. The agent was designed to operate in the context of divergent interest tacit coordination games in which communication between the players is not possible and the payoff is not symmetric. The agent's model was based on a behavioral model that can predict the probability of a player converging on prominent solutions with salient features (e.g., focal points) based on the player's Social Value Orientation (SVO) and the specific game features. The SVO theory pertains to the preferences of decision makers when allocating joint resources between themselves and another player in the context of behavioral game theory. The agent selected stochastically between one of two possible policies, a greedy or a cooperative policy, based on the probability of a player to converge on a focal point. The distribution of the number of points obtained by the autonomous agent incorporating the SVO in the model was better than the results obtained by the human players who played against each other (i.e., the distribution associated with the agent had a higher mean value). Moreover, the distribution of points gained by the agent was better than any of the separate strategies the agent could choose from, namely, always choosing a greedy or a focal point solution. To the best of our knowledge, this is the first attempt to construct an intelligent agent that maximizes its utility by incorporating the belief system of the player in the context of tacit bargaining. This reward-maximizing strategy selection process based on the SVO can also be potentially applied in other human-machine contexts, including multiagent systems.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7763831PMC
http://dx.doi.org/10.3390/s20247026DOI Listing

Publication Analysis

Top Keywords

divergent interest
8
interest tacit
8
tacit coordination
8
coordination games
8
agent
8
agent choose
8
probability player
8
player context
8
focal point
8
stochastic agent
4

Similar Publications

Alkylated polycyclic aromatic hydrocarbons (PAHs) are abundant constituents of many PAH mixtures and contribute to risk at contaminated sites. Despite their abundance, the movement of alkylated PAHs remains understudied relative to unsubstituted PAHs. In the present study, passive sampling devices were deployed in the air, water, and sediments at 11 locations across multiple seasons to capture spatial and temporal variability in the abundance and movement of alkylated PAHs at a Brownsfield creosote site in Oregon, USA.

View Article and Find Full Text PDF

Unlabelled: The increased rate of anterior cruciate ligament (ACL) tears has led to a greater number of revisions. Revision surgery can be performed in one or two stages. Single-stage revision ACL reconstruction (ssRACLR) may be performed when prior tunnels can be re-used or bypassed whereas a two-stage procedure is indicated when bone grafting of dilated tunnels prior to revision is necessary.

View Article and Find Full Text PDF

is a parasitic nematode of domestic and wild canids of the world. This nematode induces esophageal spirocercosis and may eventually lead to carcinomas, aortic aneurisms, and death of the animal. Two genotypes of have been described based on specimens from Europe, Asia, Africa, and Oceania, but no profound analysis has been conducted with from the Americas.

View Article and Find Full Text PDF

Control of protein levels is vital to cellular homeostasis, for maintaining a steady state, to coordinate changes during differentiation and other roles. In African trypanosomes surface proteins contribute to immune evasion, drug sensitivity and environmental sensing. The trypanosome surface is dominated by the GPI-anchored variant surface glycoprotein, but additional GPI-anchored and -membrane domain proteins are present with known roles as nutrient receptors and signal transducers.

View Article and Find Full Text PDF

Adaptive divergence and increased genetic differentiation among populations can lead to reproductive isolation. In Lake Constance, Germany, a population of invasive three-spined stickleback () is currently diverging into littoral and pelagic ecotypes, which both nest in the littoral zone. We hypothesized that assortative mating behaviour contributes to reproductive isolation between these ecotypes and performed a behavioural experiment in which females could choose between two nest-guarding males.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!