[PDF] Dynamic Multi-Agent Path Finding based on Conflict Resolution using Answer Set Programming

Abstract

We study a dynamic version of multi-agent path finding problem (called D-MAPF) where existing agents may leave and new agents may join the team at different times. We introduce a new method to solve D-MAPF based on conflict-resolution. The idea is, when a set of new agents joins the team and there are conflicts, instead of replanning for the whole team, to replan only for a minimal subset of agents whose plans conflict with each other. We utilize answer set programming as part of our method for planning, replanning and identifying minimal set of conflicts.

Full PDF

FF. Ricca, A. Russo et al. (Eds.): Proc. 36th International Conferenceon Logic Programming (Technical Communications) 2020 (ICLP 2020)EPTCS 325, 2020, pp. 223–229, doi:10.4204/EPTCS.325.27 c (cid:13)

B. Atiq, V. Patoglu, E. ErdemThis work is licensed under theCreative Commons Attribution License.

Dynamic Multi-Agent Path Finding based onConﬂict Resolution using Answer Set Programming

Basem Atiq and Volkan Patoglu and Esra Erdem

Faculty of Engineering and Natural Sciences, Sabanci University, Istanbul, Turkey { basem, vpatoglu, esra.erdem } @sabanciuniv.edu We study a dynamic version of multi-agent path ﬁnding problem (called

D-MAPF ) where existingagents may leave and new agents may join the team at different times. We introduce a new methodto solve

D-MAPF based on conﬂict-resolution. The idea is, when a set of new agents joins the teamand there are conﬂicts, instead of replanning for the whole team, to replan only for a minimal subsetof agents whose plans conﬂict with each other. We utilize answer set programming as part of ourmethod for planning, replanning and identifying minimal set of conﬂicts.

Multi-agent path ﬁnding (

MAPF ) problem aims to ﬁnd paths for multiple agents from their initial loca-tions to destinations such that no two agents collide with each other while they follow these paths. Thisproblem has been studied under various constraints (e.g., where an upper bound is given on the planlength) or attempting to reach a certain objective (e.g., minimizing the total time taken for all agents toreach their goals, or minimizing the maximum time taken for each agent to reach its goal location). Allthese variants are NP-hard [8, 10].We study a dynamic version of the

MAPF problem that emerges when changes in our environmentbegin to take place, e.g., when new agents are added to the team at different times with their own initialand goal locations, or when some obstacles are placed into the environment. We refer to this problemas Dynamic Multi-Agent Path Finding (

D-MAPF ) problem.

D-MAPF has many direct applications inautomated warehouses, where teams of hundreds of robots are utilized to prepare dynamic orders in anevery changing environment [12].We propose a new method to solve the

D-MAPF problem, which involves replanning for a smallset of agents that conﬂict with each other. When several new agents join the team, if some conﬂictsoccur, our objective is to minimize the number of agents that are required to replan to resolve theseconﬂicts. In this way, we avoid having to replan for all agents and rather, keep the plans of as many ofthe existing agents ﬁxed. We identify a minimal set of agents whose paths should be replanned by meansof identifying conﬂicts and then resolving them by replanning.The proposed method utilizes Answer Set Programming (ASP) [6, 7, 5] (based on answer sets [3, 4])for planning, replanning and identifying a minimal set of agents with conﬂicts. The ASP formulationused for planning is presented in our earlier study [2], to which we refer the reader for details. In thefollowing, we will focus more on the use of ASP for the latter two problems.

D-MAPF can be thought of as a generalization of the

MAPF problem. In the case of

D-MAPF , we dealwith changes that take place with the passage of time. These changes can include, but are not be limited24

Dynamic MAPF based on Conﬂict Resolution using ASP (a) (b) (c)

Figure 1: An illustrative example. (a) 2 agents with their already determined respective paths. (b) A newagent a is added to the environment but cannot ﬁnd a collision-free path. (c) All agents replan theirsolutions to ﬁnd a collision free path.to, the addition of obstacles into the environment, the addition of new agents into the environment, andthe changes in the objectives of each agent for a given problem.The inputs to a D-MAPF problem are the same as that for the

MAPF problem which includesthe initial and goal positions of each agent, the updates or modiﬁcations that have taken place in theenvironment, a restriction on the makespan of each agent, and the paths of existing agents. Figure 1above gives an example of a

D-MAPF problem where a new agent a is added to the existing environmentwhich consists of two agents a and a who already have their paths determined as in Figure 1(a). Witha makespan of each agent restricted to 3, agent a is unable to ﬁnd a collision free solution as shown inFigure 1(b) for the given instance.As the main objective of any MAPF problem is to ﬁnd a collision free solution for all agents, replan-ning is attempted for all agents in the example as shown in Figure 1(c).

We introduce a new method to solve

D-MAPF where replanning for all agents is avoided most of thetime. This method keeps track of two sets of agents throughout the program: nonConﬂictSet , whichcontains the set of agents (and their plans) that do not conﬂict with each other and, ideally, remain asthey are despite the changes in the environment; and conﬂictSet , which contains the set of agents (andtheir plans) that conﬂict with each other, and, ideally, replanning for a minimal subset of this set wouldresolve conﬂicts.Our algorithm applies when some new agents join the team, as the existing agents are executing theirplans.1. When a set of new agents join the set of existing agents, then try to ﬁnd a

MAPF solution for thenew agents so that they do not conﬂict with each other or the existing agents.2. If such a solution exists, then include the new agents (with their plans) in nonConﬂictSet .3. Otherwise, include the new agents (with their plans) in conﬂictSet .4. While there is some conﬂict to resolve do the following:(a) Try to ﬁnd a minimal(-cardinality) subset of agents in conﬂictSet , such that replanning forthem resolves the conﬂicts in conﬂictSet . . Atiq, V. Patoglu, E. Erdem conﬂictSet into nonConﬂictSet .(c) Otherwise, some conﬂicts exist between some agents in nonConﬂictSet and conﬂictSet , ex-pand conﬂictSet by a set of agents (and their plans) from nonConﬂictSet that cause the mini-mum number of conﬂicts.(d) Meanwhile, move the agents from conﬂictSet that are not involved in these conﬂicts to non-ConﬂictSet .Note that, in the worst case, the algorithm above replans for all agents.We use a slight variation of the ASP formulation Π for MAPF from our earlier studies [2] to ﬁnd a

MAPF solution for the new agents in Step 1 above, by generating plans for the new agents only and byincorporating the plans of the existing agents as facts.In Step 4(a), we enumerate all subsets of conﬂictSet with cardinality 2,3,... incrementally, and use aslight variation of Π for replanning for each subset of agents only and by incorporating the plans of theother agents in conﬂictSet as facts.In Step 4(c), we expand the conﬂictSet by utilizing ASP’s noteworthy feature of weak constraints.In particular, we identify the minimum number of conﬂicts between agents in the conﬂictSet and thosein the nonConﬂictSet : ∼ ←− plan ( t , a , x , y ) , path ( t , a , x , y ) , conﬂictSet ( a ) , nonConﬂictSet ( a ) [ , a , a , t ] ( a (cid:54) = a ) . Here, a penalty of 1 is assigned each time such a conﬂict is detected. ASP solver generates severalsolutions with the addition of this weak constraint, however, a solution with the lowest penalty cost ischosen. In addition to the weak constraints, note that we still include hard constraints to prevent collisionsbetween agents within the conﬂict set: ← path ( t , a , x , y ) , path ( t , a , x , y ) , conﬂictSet ( a ) , conﬂictSet ( a ) ( a (cid:54) = a ) . Figure 2 below gives an example of a scenario where our algorithm manages to ﬁnd a collision freesolution for the agents in the environment without having to replan for all agents. The existing agents( a , a , a , a ) are added to the nonConﬂictSet and their existing paths are stored. The three new agents( a , a , a ) are added to the environment.The algorithm ﬁrst attempts to ﬁnd a solution for the new agents while keeping the paths of the pre-existing agents ﬁxed. Unable to ﬁnd a solution, it places the new agents in conﬂictSet . It further triesto resolve conﬂicts within conﬂictSet . Unable to resolve the conﬂicts, the algorithm tries to expand theconﬂict set by trying to ﬁnd a minimum set of conﬂicts between agents in conﬂictSet and the agents in nonConﬂictSet . The algorithm ﬁnds, as shown in Figure 2(c), that the agents a , a , a , a and a conﬂictamongst each other. Then, conﬂictSet is updated to contain these agents only, while a is moved to nonConﬂictSet .The algorithm then proceeds to resolve the conﬂicts within conﬂictSet = { a , a , a , a , a } . It enu-merates all subsets of size ≥ conﬂictSet : { a , a } , { a , a } , { a , a } , { a , a } , { a , a } , { a , a } , { a , a } , { a , a } , { a , a } , { a , a } . Each subset is selected one at a time, and the algorithm pro-ceeds to determine whether a solution can be found by replanning only for those two agents in the givensubset. In this particular case, the algorithm is unable to ﬁnd a solution for any of the 10 subsets of size2. Then the algorithm enumerates all subsets of size 3: { a , a , a } , { a , a , a } , { a , a , a } , { a , a , a } , { a , a , a } , { a , a , a } , { a , a , a } , { a , a , a } , { a , a , a } , { a , a , a } . Once again the algorithm at-tempts to replan for each subset one at a time until a solution is found. Fortunately, this time the al-gorithm is able to ﬁnd a solution for the subset { a , a , a } and replanning is performed only for thoseagents to devise a collision free solution for all agents as shown in Figure 2(d).26 Dynamic MAPF based on Conﬂict Resolution using ASP (a) (b) (c) (d)

Figure 2: An illustrative example. (a) 4 agents with their already determined respective paths are addedto the nonConﬂictSet . (b) 3 new agents a , a , a are added to the conﬂictSet and a solution is attempted.(c) Regions of conﬂicts amongst agents in the conﬂictSet and the nonConﬂictSet are shown. (d) Newplans for all agents are presented. We have compared our algorithm to solve

D-MAPF with the straightforward approach of replanning,by means of some experiments. The algorithm described in the previous section has been implementedusing Python 3.6.4 and C

LINGO t = conﬂictSet along with the size of the subset for which a solution is found is also shownfor better analysis.To serve as an example, let us look at the forth instance in Table 1 with 38 agents on a 50 ×

50 grid,with a makespan of 98. When four new agents are added to the environment, a new solution is computedby our algorithm in 394.36 seconds whereas replanning for all agents requires 2766.37 seconds. Thenumber of agents that were conﬂicting with each other in this case were 6 and the size of the subset forwhich a solution was found was 3. . Atiq, V. Patoglu, E. Erdem

Initial ×

20 grid 4 38 142.66 Y 56.57 Y 8 4 1201 58 7.34 Y 180.63 Y 2 2 12 2 58 18.07 Y 186.77 Y 4 2 328 agents 3 58 57.90 Y 247.68 Y 6 3 1630 ×

30 grid 4 58 551.74 Y 261.07 Y 8 4 1201 78 21.83 Y > > > ×

40 grid 4 78 62.59 Y 1440.14 Y 5 2 11 98 39.67 Y > > > ×

50 grid 4 98 394.36 Y 2766.37 Y 6 3 16546 agents 4 138 23408.82 Y - N 8 4 8570 ×

70 grid

For small grid sizes, replanning for all agents outperformed our implementation. This was expected,however, as calling the ASP program with weak constraints generates many more possible conﬁgurationsand for such small instances, it is more efﬁcient to replan for all agents. The results get more interestingas the grid size and the number of agents increase. When the grid size increases, we obtain the results asexpected in almost all of the remaining test cases.There were exceptions to the efﬁciency of our algorithm as shown by the last test case for 20 × ×

40 and 50 ×

50 instances show how much more effective it can be to replan onlyfor a subset of agents. For the test case with a grid size of 40 ×

40 and 4 new agents, our algorithm wasat least 20 times as efﬁcient as replanning for all agents.As the grid size and the number of agents increases further, there is a notable difference between thetime taken to ﬁnd a solution by our algorithm and replanning. For the largest grid size of 70 ×

70, ouralgorithm found a solution in about 6 hours. However, replanning for all agents was not possible due tothe sheer size of the input.From these results, we observe that the underlying idea of reusing existing solutions may be quiteefﬁcient in terms of computation time, in particular, for large instances with a large makespan.

Regarding conﬂicts : A sort of conﬂict-resolution has been utilized by the Conﬂict-Based Search Algo-rithm (CBS) [9] introduced to solve the

MAPF problem. The approach attempts to decompose a

MAPF problem into several constrained single-agent path ﬁnding problems. At the high level, the algorithmmaintains a binary tree referred to as a Conﬂict Tree (CT) which detects conﬂicts and adds a set of con-straints for every agent to each node of the tree. At the low level, the shortest path for every agent withrespect to its constraints are searched for. The algorithm then checks to determine whether any conﬂictsarise with the new paths computed at that node. If conﬂicts do arise, the algorithm declares the currentnode as a non-goal node. What is interesting about their approach is the way that they deal with conﬂicts.While we generate all possible subsets of the conﬂicting agents, attempt to replan for each subset until a28

Dynamic MAPF based on Conﬂict Resolution using ASP solution is found or expand the conﬂict set, [9] splits the node at which a conﬂict arises into its two chil-dren nodes and both these nodes are then checked to see if a solution exists. If a conﬂict exists betweentwo agents α i and α j , each child node contains an additional constraint to its parent node for either α i or α j . Search is then performed for only the agent which is associated with the new constraint while thepaths of all other agents are kept ﬁxed. When conﬂicts are generated amongst more than 2 agents, focusis placed on the ﬁrst two agents and the same procedure as described above is followed. Further conﬂictsare dealt with at a deeper level of the tree. Regarding dynamic

MAPF : Online

MAPF [11] considers the addition of new agents to the teamwhile a plan is being executed, under the assumptions that agents disappear when they reach their goaland that new agents may wait before entering their initial location in the environment. These assumptionsrelax the

D-MAPF problem: the new agents may enter the environment one at a time, and they providemore space for the other agents when they disappear. To solve online

MAPF with these assumptions,Svancara et al. investigate algorithms that rely on replanning (e.g., for all agents) and conﬂict-resolution(e.g., planning for the new agents one at a time ignoring others, and then resolving conﬂicts by replan-ning). Our approach does not rely on such assumptions and tries to resolve conﬂicts by identifying theminimal set of agents that cause conﬂicts.In an earlier study [1], we introduce an alternative method for

D-MAPF . It does not rely on conﬂict-resolution. The idea is to revise the traversals of paths of the existing agents (up to the given upper boundon makespan) while computing new plans for the new agents so that there is no conﬂict between any twoagents. If a solution cannot be found, then replanning is applied for all agents. We plan to use the methodbased on conﬂict-resolution, in combination with the revise and augment method to further reduce thenumber of replannings as part of our ongoing studies.

Our approach to minimizing the number of agents that are required to replan their solutions has beenshown to be very efﬁcient as detailed above. Replanning for all agents tends to become expensive veryquickly once our environment becomes larger or more congested. An alternate approach as described byour algorithm can help reduce the cost of performing such a search while minimizing the modiﬁcationsapplied to the paths of agents that already exist.

Acknowledgements

This work has been partially supported by Tubitak Grant 188E931.

References [1] Aysu Bogatarkan, Volkan Patoglu & Esra Erdem (2019):

A Declarative Method for Dynamic Multi-AgentPath Finding . In:

Proc. of GCAI , 65, pp. 54–67, doi:10.1609/aaai.v33i01.33017732.[2] Esra Erdem, Doga Gizem Kisa, Umut Oztok & Peter Schueller (2013):

A General Formal Framework forPathﬁnding Problems with Multiple Agents . In:

Proc. of AAAI . https://dl.acm.org/doi/10.5555/2891460.2891501 .[3] M. Gelfond & V. Lifschitz (1988): The stable model semantics for logic programming . In:

Proc. of ICLP ,MIT Press, pp. 1070–1080. . Atiq, V. Patoglu, E. Erdem [4] Michael Gelfond & Vladimir Lifschitz (1991):

Classical negation in logic programs and disjunctivedatabases . New Generation Computing

9, pp. 365–385, doi:10.1007/bf03037169.[5] Vladimir Lifschitz (2002):

Answer set programming and plan generation . Artiﬁcial Intelligence

Stable models and an alternative logic programmingparadigm . In:

The Logic Programming Paradigm: a 25-Year Perspective , Springer Verlag, pp. 375–398,doi:10.1007/978-3-642-60085-2 17.[7] Ilkka Niemel¨a (1999):

Logic programs with stable model semantics as a constraint programming paradigm . Annals of Mathematics and Artiﬁcial Intelligence

25, pp. 241–273, doi:10.1023/A:1018930122475.[8] Daniel Ratner & Manfred K. Warmuth (1986):

Finding a Shortest Solution for the N × N Extension of the 15-PUZZLE Is Intractable . In:

Proc. of AAAI , pp. 168–172. https://dl.acm.org/doi/10.5555/2887770.2887797 .[9] Guni Sharon, Roni Stern, Ariel Felner & Nathan R. Sturtevant (2015):

Conﬂict-based search for optimalmulti-agent pathﬁnding . Artif. Intell.

An Optimization Variant of Multi-Robot Path Planning Is Intractable . In:

Proc. ofAAAI . https://dl.acm.org/doi/10.5555/2898607.2898808 .[11] Jiri Svancara, Marek Vlk, Roni Stern, Dor Atzmon & Roman Bartak (2019): Online Multi-Agent Pathﬁnding .In:

Proc. of AAAI , doi:10.1609/aaai.v33i01.33017732.[12] Peter Wurman, Raffaello D’Andrea & Mick Mountz (2008):