[PDF] Symmetry warrants rational cooperation by co-action in Social Dilemmas

Abstract

Is it rational for selfish individuals to cooperate? The conventional answer based on analysis of games such as the Prisoners Dilemma (PD) is that it is not, even though mutual cooperation results in a better outcome for all. This incompatibility between individual rationality and collective benefit lies at the heart of questions about the evolution of cooperation, as illustrated by PD and similar games. Here, we argue that this apparent incompatibility is due to an inconsistency in the standard Nash framework for analyzing non-cooperative games and propose a new paradigm, that of the co-action equilibrium. As in the Nash solution, agents know that others are just as rational as them and taking this into account leads them to realize that others will independently adopt the same strategy, in contrast to the idea of unilateral deviation central to Nash equilibrium thinking. Co-action equilibrium results in better collective outcomes for games representing social dilemmas, with relatively "nicer" strategies being chosen by rational selfish individuals. In particular, the dilemma of PD gets resolved within this framework, suggesting that cooperation can evolve in nature as the rational outcome even for selfish agents, without having to take recourse to additional mechanisms for promoting it.

Full PDF

aa r X i v : . [ phy s i c s . s o c - ph ] J u l Symmetry warrants rational cooperation byco-action in Social Dilemmas

V. Sasidevan and Sitabhra Sinha The Institute of Mathematical Sciences, CIT Campus, Taramani, Chennai 600113, India. The Institute of Mathematical Sciences, CIT Campus, Taramani, Chennai 600113, India. * [email protected], [email protected] ABSTRACT

Is it rational for selﬁsh individuals to cooperate? The conventional answer based on analysis of games such as the PrisonersDilemma (PD) is that it is not, even though mutual cooperation results in a better outcome for all. This incompatibility betweenindividual rationality and collective beneﬁt lies at the heart of questions about the evolution of cooperation, as illustrated byPD and similar games. Here, we argue that this apparent incompatibility is due to an inconsistency in the standard Nashframework for analyzing non-cooperative games and propose a new paradigm, that of the co-action equilibrium. As in theNash solution, agents know that others are just as rational as them and taking this into account leads them to realize thatothers will independently adopt the same strategy, in contrast to the idea of unilateral deviation central to Nash equilibriumthinking. Co-action equilibrium results in better collective outcomes for games representing social dilemmas, with relatively“nicer” strategies being chosen by rational selﬁsh individuals. In particular, the dilemma of PD gets resolved within thisframework, suggesting that cooperation can evolve in nature as the rational outcome even for selﬁsh agents, without havingto take recourse to additional mechanisms for promoting it.

Introduction

Strategic interactions occur all around us in a multitude of forms between autonomous agents . These interacting agentscould correspond to individual humans or animals or even computer algorithms, as well as, collective entities such as groups,organizations or nations. Analyzing their interactions in terms of games is a promising approach for understanding thebehavior of a wide variety of socio-economic and biological systems, and ﬁnds applications in ﬁelds ranging from economicsand political science to computer science and evolutionary biology. A game is described by the set of all possible actions bya speciﬁed number of agents, where each possible combination of actions is associated with a payoff for each agent. Thus,the payoff received by an agent depends on her choice of action, as well as that of others. Agents are assumed to be rationaland selﬁsh, who want to maximize their individual payoffs. In addition, every agent knows that all agents satisfy these criteria(for a detailed discussion of these ideas see, e.g., Ref. ). Each of these assumptions is crucial in determining the outcome ofa game. While they may or may not hold in speciﬁc real-life scenarios, the agent behavior embodied by these assumptionsprovides a crucial benchmark for strategic behavior.In order to solve a game, i.e., to ﬁnd the set of actions that the agents will employ given the structure of the game, oneneeds a solution concept that will form the basis for strategy selection by the agents. For non-cooperative games, whereagents choose their actions independently without communicating with other agents, the canonical solution concept employedis that of the Nash equilibrium. It is deﬁned informally as the set of actions chosen by the agents where no agent cangain by unilaterally deviating from this equilibrium. Nash equilibria exist for all games having a ﬁnite number of agentschoosing from a ﬁnite set of actions, making it a very general concept that has wide applicability. Indeed, the concept hasbeen central to various attempts at developing quantitative descriptions of socio-economic phenomena. However, analyzingspeciﬁc games using the concept of Nash equilibrium can raise the following issues: (i) A game may have more than oneNash equilibria and hence, deciding which of these will be adopted by rational agents is a non-trivial problem. Additionalcriteria need to be provided for selecting an equilibrium; however their success is not always guaranteed. (ii) The Nashequilibrium of a game may sometimes be inferior to an alternative choice of actions by the agents in which all the partiesget higher payoff. This gives rise to apparently paradoxical situations in games representative of social dilemmas, such asthe Prisoner’s Dilemma (PD), the Traveler’s Dilemma, etc. For example, in PD, where each agent has the option toeither cooperate with the other agents or defect, mutual defection is the only Nash equilibrium, although mutual cooperationwill result in higher payoffs for all agents. Results of experimental realizations of such games also show deviation from theNash solutions. That rational action by individual agents can result in an undesirable collective outcome for the agentsis a long-standing puzzle. In particular, it raises questions about how cooperation could have evolved and is maintained inatural populations. Here we argue that the genesis of this problem can be traced to a mutual inconsistency between the assumptions underlyingthe Nash equilibrium for symmetric game situations. One of these assumptions is that each agent is equally capable ofanalyzing the game situation and that all of them are aware of this. However, it is also assumed that agents can make unilateral deviations in their strategy, which is used to obtain a dominant strategy in games like PD. In other words, eachagent looks only at the payoff structure of the game and takes a decision that is independent of how other agents decide. Thisis inconsistent with the earlier assumption because if the agents are aware that the others are also rational, they should takethis (rational decision-making by the other agents) into account. To put it informally, the player will argue that “if the otherplayer is like me, then she will be independently choosing the same strategy (although not necessarily the same action if it isa mixed strategy) as I, because we are faced with the same situation.” In this paper we present a novel solution paradigm forpayoff-symmetric games, referred to as co-action equilibrium , that resolves this inconsistency, building on a concept originallyintroduced in the context of minority games. As we shall see, the optimal action of rational agents in co-action equilibriumis markedly different from Nash equilibrium and leads to better collective outcomes, solving various social dilemmas such asPD.The mutual inconsistency between (i) the assumption of players being aware that all of them are rational and (ii) thepossibility of a dominant strategy, had been earlier pointed out informally in the speciﬁc context of PD - although, to thebest of our knowledge, there have been no attempts to develop a quantitative framework that addresses this problem. Inwhat is possibly the earliest statement about the rationality of cooperation in PD, Rapoport had argued that because of thesymmetry of the game, rational players will choose the same action - and as it involves a higher payoff, they will alwaysopt for mutual cooperation. This argument has been independently put forward by Hofstadter in the context of a N -personPD. The response of conventional game theory to this line of reasoning, as set forth at length by Binmore, centers on theargument that these approaches crucially rely on constraining the set of feasible outcomes of the game to the main diagonalof the payoff matrix, thereby making it effectively a collective decision-making process. As outlined in detail below, theco-action approach presented here allows the agents access to the full set of outcomes in the game matrix and the solutionis obtained without restricting their choices of action. It is also general, applying to all symmetric non-cooperative games.As the theory of strategic interactions is central to the analysis of many phenomena across economics, social sciences andevolutionary biology, the co-action concept could potentially lead to new insights across a broad range of disciplines.In this paper we analyze single-stage games with two actions per agent, where the payoff structure is unchanged onexchanging the identities of the agents (payoff symmetry). We primarily focus on two-person games, with agents playingthe game once (in contrast to repeated games where agents can interact many times in an iterative manner) and analyze indetail three well-known instances, viz., PD, Chicken (also referred to as snow-drift or Hawk-Dove) and Stag Hunt. Thesegames model a wide variety of conﬂict situations in nature where cooperation may emerge under certain circumstances. We describe the co-action solution for these games which, in general, leads to “nicer” strategies being selected by the agentscompared to the Nash solution. For example, the co-action equilibrium in PD corresponds to full cooperation among agentsat lower values of temptation to defect, while for higher temptation each agent employs a probabilistic strategy. Thus, co-action typically results in more globally efﬁcient outcomes, reconciling the apparent conﬂict between individual rationalityand collective beneﬁt. Further, the co-action equilibrium is unique and therefore, agents are not faced with the problem ofequilibrium selection. The concept can be extended to other scenarios, such as, symmetric games involving several players,or even non-symmetric games when agents can be grouped into clusters with symmetry holding within each. In fact, the lattercase can be seen as deﬁning a new class of games between players, where each “player” represents a group of agents whoindependently choose the same strategy.

The Co-action equilibrium

To describe the co-action solution concept, we consider the general case of a payoff-symmetric, two-person game whereeach agent (say, A and B ) has two possible actions (Action 1 and Action 2) available to her. Each agent receives a payoffcorresponding to the pair of choices made by them. If both agents choose the same option, Action 1 (or 2), each receives thepayoff R (or P , respectively), while if they opt for different choices, the agent choosing Action 1 receives payoff S while theother receives T . Thus, the game can be represented by a payoff matrix that speciﬁes all possible outcomes (Fig. 1). An agentmay employ a mixed strategy, in which she randomly selects her options, choosing Action 1 with some probability p (say)and Action 2 with probability ( − p ) . A pure strategy corresponds to p being either 0 or 1. A Nash equilibrium for a gamecan be in pure strategies or in mixed strategies. As noted earlier, a given game may have more than one Nash equilibrium,possibly involving mixed strategies. Assuming that agent A ( B ) chooses Action 1 with probability p ( p ) and Action 2 with igure 1. A generic representation of the payoff matrix for a two-person symmetric game where each agent has two actionsavailable to her. For each pair of actions, the ﬁrst entry in each payoff pair belongs to Agent A while the second belongs toAgent B . Different games discussed in the text, such as PD, Chicken and Stag-hunt, are deﬁned in terms of differenthierarchical relations among the elements T , R , P and S .probability 1 − p (1 − p , respectively), their expected payoffs are, W A = p ( p ( R + P − T − S ) + S − P ) + p ( T − P ) + P , W B = p ( p ( R + P − T − S ) + S − P ) + p ( T − P ) + P . (1)The symmetry of the game is reﬂected in the fact that W A and W B are interchanged on exchanging p with p . It is easily seenthat if a mixed strategy Nash equilibrium exists, it is the same for both agents and given by the probabilities p ∗ = p ∗ = P − S ( R + P − T − S ) . (2)The Nash solution assumes that all agents are rational and that each agent knows the planned equilibrium strategies ofthe other agents. Furthermore, a unilateral deviation in strategy by one of them will not change the strategy choice of others(who are assumed to be just as rational as the one who deviated!). This is implicit in Eq. 1 where each agent maximizesher payoff independent of the strategy of the other agent. In other words, while making a choice the agents do not take intoaccount the fact that the other agents (who are assumed to have identical capabilities) are also deciding simultaneously ontheir choice and that they are all aware of this. Although this latter assumption is deeply embedded in standard game theory,it is inconsistent with the assumption that every agent is aware that all other agents are just as rational as them. By contrast, inthe co-action concept, by virtue of the symmetry of the game, each agent will argue that whatever complicated processes sheemploys in arriving at the optimal decision, the other agents will choose the same strategy as they have the same informationand capabilities. It is important to note that this does not require any communication between the agents nor does it invoke theexistence of trust or other extraneous concepts. Rather, it arises from the fact that both agents are equally rational and being ina symmetric situation, will reach the same conclusion about the choice of strategy; moreover, they realize and consider this inmaking their decision . It is important to note that the co-action concept does not imply that both agents will necessarily endup choosing the same action. For instance, the co-action solution for the single-stage PD is not to always cooperate - whichdistinguishes the present approach from the earlier arguments of Rapoport and Hofstadter where all agents always choosethe same action - but to resort to a mixed strategy when the temptation to defect is sufﬁciently high.In the co-action concept, each agent maximizes her payoff assuming that all other agents in a symmetric situation willbe making the same decision. Formally this amounts to optimizing the expected payoff functions of each of the two agents,which in this case are identical: W A , B = W = p ( R + P − T − S ) + p ( T + S − P ) + P . (3) ere p is the probability with which each of the agents A and B chooses Action 1. Under the co-action concept, the equilibriumstrategy p ∗ of the agents is obtained by maximizing W with respect to p ∈ [ , ] . If the maximum of function W in [ , ] occursat one of the ends (i.e., p = W has a maximum inside ( , ) then the co-action equilibrium is a non-trivial mixed strategy, viz., p ∗ = P − ( T + S ) ( R + P − T − S ) . (4)The existence of the co-action equilibrium for all symmetric games is guaranteed from the smoothness of polynomial functionssuch as Eq. 3. Also, unlike the Nash equilibrium, the co-action equilibrium is unique and thus, for a given symmetric gamethere is no ambiguity about the optimal choice of action for the agents. Case studies

Having described the concept of co-action equilibrium, we will now apply it to three well-known two-person symmetricgames, illustrating in each case the differences between the co-action and Nash equilibria. Each of these games is deﬁned interms of a speciﬁc hierarchical relationship between the payoffs R , S , T and P (using the terminology of the payoff matrixshown in Fig. 1). Prisoner’s Dilemma

PD is one of the most well-studied games in the literature of strategic choices in social sciences and evolutionary biology.

It is the canonical paradigm for analyzing the problems associated with evolution of cooperation among selﬁsh individuals. The game represents a strategic interaction between two agents who have to choose between cooperation (Action 1) anddefection (Action 2). If both players decide to cooperate, each receives a “reward” payoff R and if both players decide todefect, then each receives a “punishment” payoff P . If one of the players decides to defect and the other to cooperate, thenthe former gets a payoff T (often termed as the “temptation” to defect) and the latter gets the “sucker’s payoff” S .In PD the hierarchical relation between the different payoffs is T > R > P > S . The only Nash equilibrium for this game isboth agents choose defection (each receiving payoff P ), as unilateral deviation by an agent would yield a lower payoff ( S ) forher. Note that, mutual defection is the only Nash solution even if the game is repeatedly played between the players a ﬁnitenumber of times. However, it is easy to see that mutual cooperation would have resulted in a higher payoff ( R ) for both agents.This illustrates the apparently paradoxical aspect of the Nash solution for PD where pursuit of self-interest by rational agentsleads to a less preferable outcome for all parties involved. The failure on the part of the agents - who have been referred toas “rational fools” - to see the obviously better strategy is at the core of the dilemma and has important implications for thesocial sciences, including economists’ assumptions about the efﬁciency of markets. Further, experimental realizations ofPD show that some degree of cooperation is achieved when the game is played by human subjects, which is at variance withthe Nash solution.

In more general terms, PD raises questions about how cooperation can emerge in a society of rational individuals pursuingtheir self-interest and there have been several proposals to address this issue. These have mostly been in the context of theiterative PD (rather than the single-stage game that we are considering here) and typically involve going beyond the standardstructure of the game, e.g., by introducing behavioral rules such as direct or indirect reciprocity, assuming informationalasymmetry, etc. By contrast, in the co-action solution, rational selﬁsh agents achieve non-zero levels of cooperation in thestandard single-stage PD, with the degree of cooperation depending on the ratio of temptation T to reward R .To obtain the co-action solution of PD, we use the formalism described earlier with the value of the lowest payoff S assumed to be zero without loss of generality. From Eq. 3 and using the hierarchical relation among the payoffs T , R and P forPD, it follows that when T ≤ R , the optimal strategy for the agents is p ∗ =

1, i.e., both agents always cooperate. On the otherhand, when the temptation to defect T > R , the optimal strategy is a mixed one with the probability of cooperation [Eq. 4], p ∗ = T − P ( T − R − P ) , (5)i.e., the agents randomly choose between the available actions, defecting with probability 1 − p ∗ . As temptation keeps increas-ing, the probability of cooperation decreases and in the limit T → ¥ , p ∗ → /

2, i.e., the agents choose to cooperate or defectwith equal probability, receiving an expected payoff W ∗ → T /

4. Thus, unlike the Nash solution of PD where cooperation isnot possible, the co-action solution of the game always allows a non-zero level of cooperation, with 1 / < p ∗ < -essentially a collective rationality argument - which suggests that rational agents will always cooperate. To the best of our igure 2. The variation of the optimal strategy - probability of choosing Action 1, p ∗ - under the co-action solution conceptfor the games (a) Prisoner’s Dilemma (PD) and (b) Chicken, as a function of the payoff matrix elements T , P , R and S . Inboth games, for low values of T (corresponding to temptation for defection in PD and for being aggressive in Chicken), theagents always opt for Action 1 (corresponding to cooperation in PD and being docile in Chicken). However, as T increases,agents opt for a mixed strategy, where Action 1 is chosen with decreasing probability. In both cases, in the limit of very high T , the agent strategy becomes fully random with the two actions being chosen with equal probability. Note that in PD, theoptimal strategy also has a very weak dependence on P (corresponding to punishment payoff for mutual defection).knowledge, co-action is the ﬁrst solution concept which allows probabilistic cooperation by the players in the single-stage PD.The existence of non-zero level of cooperation in the co-action solution means that there is no longer any incompatibilitybetween the individual actions of rational agents trying to maximize their payoffs and achieving the best possible collectiveoutcome, thereby resolving the “dilemma” in PD. The co-action concept may be used to solve other games involving similardilemmas such as traveler’s dilemma. It is of interest to note in this context that in the various experimental realizations ofPD, the level of cooperation observed is neither zero (as in the Nash solution) nor complete - the average being about 50% butwith signiﬁcant variation across experiments. While it is unclear if such realistic game conditions conform to the idealizedassumption of rational agents, the co-action solution does provide a benchmark strategy for these situations.

Chicken

Chicken (also referred to as Snowdrift or Hawk-Dove) is a two-person game that has been extensively investigated in thecontext of the study of social interactions and evolutionary biology.

It represents a strategic interaction between twoagents who have to choose between being docile (Action 1) or being aggressive (Action 2). If both agents decide to be docile,they receive the payoff R , while if one is docile when the other resorts to aggression, the former - considered the “loser” -receives a lower payoff S ( < R ) and the latter - the “winner” - receives a higher payoff T ( > R ). However, the worst possibleoutcome corresponds to when both players choose to be aggressive, presumably resulting in severe damage to both, which isassociated with the lowest payoff P . Thus, the hierarchical relation between the different payoffs in Chicken is T > R > S > P .Note that it differs from PD in that the payoff S is higher than P . Therefore, an agent beneﬁts by being aggressive only if theother is docile but is better off being docile otherwise, as the cost of mutual aggression is high.The game has three Nash equilibria, of which two correspond to pure strategies where one agent is docile while the otheris aggressive. The mixed strategy Nash equilibrium p ∗ = p ∗ = S / ( T + S − R ) is given by Eq. (2), where it is assumed thatthe lowest of the possible payoffs P is zero [see Fig. 2 (b)]. As in many other non-cooperative games with multiple Nashequilibria, one has to invoke additional criteria (viz., equilibrium reﬁnements ) to decide which of these solutions will beselected by the agents. In Chicken, a commonly used reﬁnement concept is that of evolutionarily stable strategy (ESS) -an important concept in evolutionary game theory - which, in this game, gives the mixed strategy Nash equilibrium as theunique solution.To obtain the co-action solution for Chicken, we note that under this solution concept, agents choose their actions so as tooptimize the payoff function Eq. (3). Using the hierarchical relation of the payoffs for Chicken (assuming the lowest payoff P is zero without loss of generality), it is easy to see that for 2 R ≥ T + S , p ∗ = R < T + S , agents choose to be docile with a probability [Eq. (4)], p ∗ = T + S ( T + S − R ) . (6)Thus, for low values of T , both agents decide to be docile (non-aggressive) always and avoid damaging each other, whereas,when the stakes are high (for large T ) they randomly choose between the available actions, being docile with probability p ∗ and aggressive with probability 1 − p ∗ . As in PD, in the limit of large T , i.e., T → ¥ , the optimal strategy is p ∗ → /

2, wherethe agents choose to be aggressive or docile with equal probability, receiving an expected payoff W ∗ → T / T → ¥ , the ESS suggests that both agents should resort to mutual aggression [i.e., p ∗ = p ∗ → P . Compared to this, the co-action concept yields a a signiﬁcantly better outcome for both agents, as noted above. Thisdifference is remarkable as the co-action solution shows that “nice” behavior among rational agents can occur even in a highlycompetitive environment. As in PD, experimental realizations of the single-stage game have reported a signiﬁcant level (about50%) of cooperative behavior. Stag Hunt

The last of the two-person games we discuss here is the Stag Hunt which is used to describe many social situations wherecooperation is required to achieve the best possible outcome. The game represents a strategic interaction between twoagents who have to choose between a high-risk strategy having potentially large reward, viz., hunting for stag (Action 1) ora relatively low-risk, but poor-yield, strategy, viz., hunting for hare (Action 2). The agents can catch a stag (which is worthmore than a hare) only if they both opt for it, i.e., cooperate, thereby receiving the highest payoff R . However, being unsure ofwhat the other will do, they may both choose the safer option of hunting hare, which can be done alone, so that each receives alower payoff P . However, if one agent chooses to hunt stag while the other decides to hunt hare, the former being unsuccessfulin the hunt receives the lowest possible payoff S , while the latter (who succeeds in catching hare) gets the payoff T . Thus, thehierarchical relation between the payoffs in Stag Hunt is R > T ≥ P > S .As in Chicken, the game has three Nash equilibria, of which two correspond to pure strategies where both agents opt forhunting stag or both choose to hunt hare. Note that both strategies are also evolutionarily stable, so that the ESS reﬁnement,unlike in Chicken, does not yield a unique solution for this game. The mixed strategy Nash equilibrium p ∗ = p ∗ = P / ( P + R − T ) is given by Eq. (2) where it is assumed that the lowest of the possible payoffs S is zero.The co-action solution for Stag Hunt is obtained by noting that as R is greater than T , the payoff function [Eq. (3)] increasesmonotonically in the interval [ , ] . Thus, the co-action payoff W = p ( R + P − T ) + p ( T − P ) + P is optimized when p ∗ = R , T and P . Therefore, the solution of the game under the co-action concept is unique, with bothagents opting to hunt stag, resulting in the best outcome for them. It may be of interest to note that experiments in single-stageStag Hunt have reported that players tend to choose to coordinate on the higher payoff outcome in the majority of cases. Unlike in the previous two case-studies, there is no conﬂict of interest among the agents playing Stag Hunt, who are insteadtrying to coordinate their actions in the absence of any communication. Thus, it can be viewed as a problem of equilibriumselection, with the co-action solution corresponding to the better one.

Discussion

In this paper we have shown that the conﬂict between pursuit of individual self-interest and occurrence of collective outcomesthat are mutually beneﬁcial in the context of social dilemmas such as PD may only be an apparent one. The co-actionconcept presented here resolves this conﬂict by making mutually consistent assumptions about the behavior of rational agents.The different games that are analyzed in detail here show that the co-action solution concept leads to strategies that arerelatively “nicer” and globally more efﬁcient compared to the standard Nash equilibrium concept. In particular, it resolvesthe dilemma in PD as the mutually beneﬁcial action, viz., cooperation, always has a signiﬁcant probability ( ≥ /

2) of beingchosen by both agents. Similarly, co-action yields more cooperative outcomes in the other games, i.e., agents playing Chickenresort to non-aggressive strategies and agents achieve perfect coordination to receive the highest possible payoff in Stag Hunt.Thus, this solution concept reconciles the idea of individual self-interest pursued by rational agents with the achievement ofcollective outcomes that are mutually beneﬁcial, even for single-stage games. While we do not claim that co-action is the onlymechanism by which cooperation may originate and be maintained in nature, it certainly shows that cooperation can evolveamong selﬁsh rational agents. Note that our results do not depend on the speciﬁc deﬁnition of rationality one uses, as long asthe same deﬁnition applies to all agents.For an N -player ( N >

2) game, if it can be considered as the set of all pair-wise interactions between agents who aresymmetric in every respect, it is easy to see that the optimal co-action strategy will be exactly the same as that of the two- erson game. The co-action solution concept can be generalized even to cases where the symmetry assumption does not holdacross all agents. If the agents are aware that some of the other agents are different from them, one can still apply co-actionwithin each cluster of agents (group) whose members consider each other to be identical (i.e., the symmetry assumptionholds). For agents belonging to different groups, however, the payoffs are not invariant under interchanging the identitiesof the players. Thus, the symmetry of agents is broken across groups. For a population of agents whose members can beconsidered as belonging to two groups, one can treat the game as a two-player Nash-like scenario where each “player” isnow a group of agents. However, unlike the standard Nash setting where one cannot have a mixed strategy as a stable Nashequilibrium, it is now possible for mixed strategy equilibria to be stable. In general, one can consider a game with N agents,clustered into M symmetry groups, who have to choose between two actions. Assuming that the size of each group i is n i ( S i n i = N ), the payoff for an agent belonging to the i -th group is a polynomial of degree n i in p i ( i = , . . . , M ), where each p i is the probability of agents in that group to choose one of the actions. By contrast, the corresponding formulation of the gamein terms of Nash solution concept will involve N variables with the payoffs being linear in each of these variables. Therefore,this deﬁnes a novel class of games between multiple clusters of agents, with agents independently choosing the same strategyas the other members of the cluster they belong to. The co-action results for such games may have potential implications formulti-agent strategic interactions, as in the tragedy of commons. While the results discussed here are in the context of idealized situations involving rational selﬁsh agents, one may askunder what conditions would the co-action framework apply in real life. As we have outlined above, symmetry is a crucialingredient for co-action thinking to apply. Such symmetry is more likely to be realized among members of a given communitywho share the same beliefs and a common identity. It has indeed been observed that cooperation is more common withinan in-group than between agents belonging to different groups. The signiﬁcant levels of cooperative behavior reported inexperimental realizations of social dilemmas (e.g., see Refs. for PD and Ref. for its N -person generalization, i.e., thepublic goods game, Ref. for Chicken and Refs. for stag-hunt) could, to some extent, be explained by players ascribingto other players the same reasoning process as themselves and therefore resorting to co-action-like thinking. Experimentswith human subjects playing PD have shown that the level of cooperation depends on the actual values of payoffs and ingeneral decreases with the ratio of temptation for defection to reward for cooperation - in line with the co-action solution.Also, players are known to employ non-deterministic strategies in PD realizations, similar to what agents do in the co-actionequilibrium for sufﬁciently high temptation. Game situations that allow “cheap talk” (i.e., communication between agents thatdoes not directly affect payoff) which presumably allow players to afﬁrm shared set of values - and thereby promote co-actionthinking - have been shown to increase the level of cooperation in experiments. In other experimental realizations, whereplayers in a public goods game indicated their preferred contributions for different average levels of contribution by othergroup members, about half the players were observed to match what the others would do. Such “conditional cooperation”could be an illustration of the symmetry considerations that players might engage in. Also, the co-action framework couldprovide a natural setting for the emergence of tag-based cooperation schemes among “sufﬁciently similar” agents. The ideaof “social projection” provides yet another instance where such considerations may be relevant. We note that there havebeen other approaches towards explaining cooperation in social dilemmas based on symmetry of the game situation, e.g., arecent model in which agents decide their strategies based on their most optimistic forecast about how the game would beplayed if they formed coalitions. In this paper we have focused on single-stage games but the co-action concept discussed here applies also to repeatedgames where information about the choices made by agents in the past are used to decide their future action. In this situation,the co-action solution developed in the context of single-stage games is applied at each iteration, with the past actions ofagents used to deﬁne the different symmetry groups. This is inherently a dynamical process, as the membership of thesegroups can evolve in time. For example, in iterative PD with N agents having memory of the choices made in the previousiteration, all agents who made the same decision in the last round will belong to the same symmetry group and will behaveidentically. The resulting solution can allow coexistence of cooperators and defectors in the game, which we will discuss in afuture publication.To conclude, we have introduced here a solution framework for non-cooperative games that resolves the apparent conﬂictbetween rationality of individual agents and globally efﬁcient outcomes. It suggests that cooperation can evolve in nature asthe rational outcome even with selﬁsh agents, without having to take recourse to additional mechanisms for promoting it. Inpractice, the co-action and Nash solutions could represent two extreme benchmark strategies for non-cooperative games, thelatter applying when the agents cannot be considered to be “sufﬁciently similar”. While we do not address here the questionof which concept is more appropriate for a given situation, it is conceivable that agent behavior in reality may be describedby a strategy between these two extremes and can potentially be represented by a combination of them. Although we havediscussed co-action in the context of the evolution of cooperation among rational agents, the concept is far more generaland could provide a mechanism for understanding strategic interactions across groups of sufﬁciently similar agents in manydifferent settings. cknowledgements This work was partially supported by the IMSc Econophysics project funded by the Department of Atomic Energy, Govern-ment of India. We thank Deepak Dhar for useful discussions and Shakti N. Menon for useful comments on the manuscript.

References Morgenstern, O. & Von Neumann, J.

Theory of games and economic behavior (Princeton University Press, Princeton,1944). Colman, A. M.

Game theory and its applications in the social and biological sciences (Routledge, New York, 1999). Hargreaves Heap, S. P. & Varoufakis, Y.

Game theory: A critical text (Routledge, London, 2002). Osborne, M. J. & Rubenstein, A.

A course in game theory (MIT Press, Cambridge, Mass. 1994). Nash, J. F. Equilibrium points in n-person games.

Proc. Natl. Acad. Sci. USA. Holt, C. A.& Roth, A. E. The Nash equilibrium: A perspective.

Proc. Natl. Acad. Sci. USA. Harsanyi, J. C. & Selten, R.

A General Theory of Equilibrium Selection in Games (MIT Press, Cambridge, 2003). Carlsson, H. & van Damme, E. Equilibrium selection in stag hunt games in

Frontiers of Game Theory (eds Binmore, K.G., Kirman, A. P. & Tani, P.) (MIT Press, Cambridge MA, 1993). Rapoport, A. & Chammah, A. M.

Prisoners Dilemma (Univ of Michigan Press, Ann Arbor, MI, 1965).

Basu, K. The traveler’s dilemma: Paradoxes of rationality in game theory.

The American Economic Review

Basu, K. The traveler’s dilemma.

Scientiﬁc American Magazine

Andreoni, J. & Miller, J. Rational cooperation in the ﬁnitely repeated prisoner’s dilemma: Experimental evidence.

TheEconomic Journal

Kollock, P. Social dilemmas: The anatomy of cooperation.

Annual Review of Sociology

Axelrod, R.

The Evolution of Cooperation (Basic Books, New York, 1984).

Sasidevan, V. & Dhar, D. Strategy switches and co-action equilibria in a minority game.

Physica A: Statistical Mechanicsand its Applications

Rapoport, A.

Two-person game theory: The essential ideas (University of Michigan Press, Ann Arbor, 1966).

Hofstadter, D. The calculus of cooperation is tested through a lottery.

Scientiﬁc American

Metamagical Themas

Binmore, K. G.

Game theory and the social contract. Vol 1. Playing fair (MIT Press, Cambridge, 1994).

McMahon, C.

Collective Rationality and Collective Reasoning (Cambridge University Press, Cambridge, 2001).

Archetti, M. & Scheuring, I. Review: Game theory of public goods in one-shot social dilemmas without assortment.

Journal of Theoretical Biology

Rapoport, A. & Chammah, A. M. The game of chicken.

American Behavioral Scientist

Nowak, M. A. & Sigmund, K. Evolutionary dynamics of biological games.

Science

Sen, A. Rational fools: A critique of the behavioral foundations of economic theory.

Philosophy and Public Affairs Morgan, M. S.

The World in the Model (Cambridge University Press, New York, 2012).

Sally, D. Conversation and Cooperation in Social Dilemmas A Meta-Analysis of Experiments from 1958 to 1992.

Ratio-nality and Society

Sigmund, K.

The Calculus of Selﬁshness (Princeton University Press, Princeton, NJ, 2010).

Kreps, D., Milgrom, P., Roberts, J. & Wilson, R. Rational cooperation in the ﬁnitely repeated prisoners’ dilemma.

Journalof Economic Theory

Smith, J. M. & Price, G. R. The Logic of Animal Conﬂict.

Nature

Hofbauer, J. & Sigmund, K.

Evolutionary Games and Population Dynamics (Cambridge University Press, Cambridge,1998). Neugebauer, T., Poulsen, A. & Schram, A. Fairness and reciprocity in the Hawk–Dove game.

Journal of EconomicBehavior & Organization

Skyrms, B.

The Stag Hunt and the Evolution of Social Structure (Cambridge University Press, Cambridge, MA, 2004).

Battalio, R., Samelson, L. & Huyck, J. V. Optimization incentives and coordination failure in laboratory Stag Hunt games.

Econometrica

69 (3),

Schimdt, D., Shupp, R., Walker, J. M. & Ostrom, E. Playing safe in coordination games: the roles of risk dominance,payoff dominance, and history of play.

Games and Economic Behavior

Stability of a solution can be deﬁned with respect to small changes in strategy parameters. We discuss this in Sasidevan,V. & Sinha, S. A Dynamical view of different solution paradigms in two-person symmetric games: Nash versus co-actionequilibria in

Econophysics and Data Driven Modelling of Market Dynamics (eds Abergel, F. et al.) (Springer, Milan,2015), where Nash and co-action equilibria for two-person games are viewed in a dynamical systems framework.

Hardin, G. The tragedy of the commons.

Science

Balliet, D., Wu, J. & De Dreu, C. K. W. Ingroup favoritism in cooperation: a meta-analysis.

Psychol. Bull.

Ledyard, J. O. Public goods: A survey of experimental resarch in

Handbook of Experimental Economics (eds Kagel, J. &Roth, A.) (Princeton University Press, Princeton NJ, 1995).

Cooper, R., Dejong, D. V., Forsythe, R & Ross, T. W. Cooperation without reputation: Experimental evidence fromPrisoner’s Dilemma games.

Games and Economic Behavior

Farrell, J. & Rabin M. Cheap talk.

Journal of Economic Perspectives

Crawford, V. A survey of experiments on communication via cheap talk.

Journal of Economic Theory

Fischbacher, U., Gachter, S. & Fehr, E. Are people conditionally cooperative? Evidence from a public goods experiment.

Economics Letters

Riolo, R. L., Cohen, M. D. & Axelrod, R. Evolution of cooperation without reciprocity.

Nature : 441-443 (2001).

Krueger, J. I., DiDonato, T. E. & Freestone, D. Social projection can solve social dilemmas.

Psychological Inquiry

Capraro, V. A model of human cooperation in social dilemmas.

Plos One e72427 (2013).e72427 (2013).