We consider concurrent games played on graphs. At every round of a game, each player simultaneously and independently selects a move; the moves jointly determine the transition to a successor state. Two basic objectives are the safety objective to stay forever in a given set of states, and its dual, the reachability objective to reach a given set of states. First, we present a simple proof of the fact that in concurrent reachability games, for all [Formula: see text], memoryless -optimal strategies exist. A memoryless strategy is independent of the history of plays, and an -optimal strategy achieves the objective with probability within of the value of the game. In contrast to previous proofs of this fact, our proof is more elementary and more combinatorial. Second, we present a strategy-improvement (a.k.a. policy-iteration) algorithm for concurrent games with reachability objectives. Finally, we present a strategy-improvement algorithm for turn-based stochastic games (where each player selects moves in turns) with safety objectives. Our algorithms yield sequences of player-1 strategies which ensure probabilities of winning that converge monotonically (from below) to the value of the game.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4579905PMC
http://dx.doi.org/10.1016/j.jcss.2012.12.001DOI Listing

Publication Analysis

Top Keywords

concurrent reachability
8
turn-based stochastic
8
concurrent games
8
set states
8
games
5
strategy improvement
4
concurrent
4
improvement concurrent
4
reachability
4
reachability turn-based
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!