ABIDES: Towards High-Fidelity Market Simulation for AI Research

Abstract

We introduce ABIDES, an Agent-Based Interactive Discrete Event Simulation environment. ABIDES is designed from the ground up to support AI agent research in market applications. While simulations are certainly available within trading firms for their own internal use, there are no broadly available high-fidelity market simulation environments. We hope that the availability of such a platform will facilitate AI research in this important area. ABIDES currently enables the simulation of tens of thousands of trading agents interacting with an exchange agent to facilitate transactions. It supports configurable pairwise network latencies between each individual agent as well as the exchange. Our simulator's message-based design is modeled after NASDAQ's published equity trading protocols ITCH and OUCH. We introduce the design of the simulator and illustrate its use and configuration with sample code, validating the environment with example trading scenarios. The utility of ABIDES is illustrated through experiments to develop a market impact model. We close with discussion of future experimental problems it can be used to explore, such as the development of ML-based trading algorithms.

A PREPRINT

David Byrd

School of Interactive Computing, Georgia Institute of Technology, Atlanta, GA 30308 [email protected]

Maria Hybinette

Department of Computer Science, The University of Georgia, Athens, GA 30303 [email protected]

Tucker Hybinette Balch

School of Interactive Computing, Georgia Institute of Technology, Atlanta, GA 30308 [email protected]


We have developed ABIDES, an agent-based interactive discrete event simulation, to facilitate the creation, deployment, and study of strategic agents in a highly configurable market environment. We were inspired by Daniel Friedman’s view that simulation provides a powerful tool to analyze individual participant behavior as well as overall market outcomes that emerge from the interaction of the individual agents. In Friedman’s review of empirical approaches to the analysis of continuous double auction (CDA) markets such as NASDAQ and the New York Stock Exchange, he outlines the strengths and weaknesses of three major approaches:

1. Field studies of actual operating markets,
2. Laboratory studies of small controlled markets,
3. Computer simulation of markets.

Friedman concludes that field studies are clearly relevant, but do not provide experimental access to all relevant information; laboratory studies improve control and observation, but are of necessity small and expensive; and computer simulations feature perfect control and observation. However, “trader’s strategies are not endogenously chosen, but rather must be specified exogenously” [2].

As Friedman observed, market simulations provide an attractive platform for research in equity trading questions. This has led to the development of a number of simulation platforms such as those on which X. Wang and Wellman [9] and J. Wang et al. [8] have reported their results. We developed ABIDES as a fresh implementation to incorporate lessons learned from the deployment of prior platforms. With ABIDES, we aim to address Friedman’s primary concern regarding computerized market simulations (that strategies must be exogenously specified) with a platform enabling powerful learning agents to easily participate in a realistically structured market via a common framework. We believe this is necessary to properly investigate the behavior and impact of intelligent agents interacting in a complex market environment.

ABIDES is intended to be a curated, collaborative open-source project that provides researchers with tools that support the rapid prototyping and evaluation of complex market agents. With it, we hope to further empower researchers of financial markets to undertake studies which would be difficult or impossible in the field, due to the absence of fine-grained data identifiable to individual traders (see Figure 1), a lack of knowledge concerning participant motivation, and an inability to run controlled “what if” studies against particular historical dates.

Figure 1: Simulation allows agent-identifiable data which is lost in the flow of real-world orders.

We acknowledge existing high-quality academically-targeted multi-agent market simulators such as that used by X. Wang and Wellman. In a recent study they used their simulation platform to study spoofing agents in a market environment populated by zero intelligence (ZI) and heuristic belief learning (HBL) traders [9]. Their approach analyzes the results from an empirical game-theoretic view [10]. We believe ABIDES makes a complementary contribution through its experimental focus on the “market physics” of the real world including:

• Support for continuous double-auction trading at the same nanosecond time resolution as real markets such as NASDAQ;
• Ability to simulate specific dates in market history with gated access to historical data;
• Variable electronic network latency and agent computation delays;
• Requirement that all agents intercommunicate solely by means of standardized message protocols;
• Easy implementation of complex agents through a full-featured hierarchy of base agent classes.

The focus on these features should enable an expanded range of experimental studies. We believe ABIDES is also the first full-featured, modern market simulator to be shared with the community as an open source project.

ABIDES can support a number of different kinds of investigations into market behavior that are not easily conducted using historical data or live experiments.

• The benefits of co-location: In the past 20 years hedge funds and other market participants have invested in the deployment of computing resources co-located at major exchanges [11]. This so-called “co-location” enables quicker access to market information than if the trading server were located further away. It is not feasible to investigate the value of the advantage co-location provides with available historical data, because it does not include information about the geographic location, network latency, or network reliability of each actor. A platform that does not require formal arm's-length messaging over a realistic network model cannot simulate the effects of these factors even when they are known. ABIDES provides a network model and mandatory messaging protocol that enables detailed experiments in this area: creating a population of agents with known distribution of network latency and reliability, conducting trials in which one agent is incrementally shifted from a co-location facility out to a great distance, and evaluating the impact of this shift on each agent’s profitability while otherwise pursuing the same strategies.

• The impact of large orders on price: The very act of trading, and even of placing orders in a market, may affect the price. For instance, if there is significant selling pressure evidenced by a large volume of sell orders, it is generally expected that the price will go down. The extent to which the price moves because of an order is referred to as market impact. Market participants of course want to minimize such impact, because the market usually moves contrary to their profit incentives. In a market field study, it is not feasible to perform controlled A/B tests. One cannot place a market buy at the NYSE for one million shares of IBM at 10 AM on Oct 22, 2018, and then also not place that order, and compare the difference. Without the “control”, any observed result from the large order could be attributable to some other factor. A key feature of ABIDES is the ability to re-simulate the same historical market day with known, limited changes while holding all other factors constant, thus enabling the desired experimental control population.

• Cost-benefit analysis of AI: When analyzing historical market data, we cannot know the logic behind individual trader actions. In a simulation without a model for computational time delays that directly impact time-to-market for the resulting orders, we cannot readily study the trade-off between simpler, faster predictors and slower, more powerful predictors. ABIDES introduces a flexible, integrated model for computation delay that permits the “speed” of each agent’s thought process to be represented, and to have that representation affect the timing of all outbound messages as well as the next time at which the agent can be roused for participation. Thus heavier thinkers will take longer to deliver a resulting order to the exchange and will be unable to act as frequently.

• Explanation of learning agent behavior: A current key area of AI research across all application fields is explainability, once taken for granted in classic knowledge-based AI, but now increasingly difficult with “black box” ML algorithms. By providing a platform with centralized, time-synced event logging for all agents, we envision a “clear box” in which each agent’s decision, intent, behavior, and result for every action are fully visible. We hope to use this ability to dive deeply into the why of learned policy functions.

The ABIDES framework includes a customizable configuration system, a simulation kernel, and a rich hierarchy of agent functionality as illustrated in Figure 2.

Figure 2: Class relations within the ABIDES simulation framework.

The simulation is built around a discrete event-based kernel [1]. The kernel resides in the Kernel class in the default package and is required in all simulations. All agent messages must pass through the kernel’s event queue. The kernel supports simulation of geography, computation time, and network latency. It also acts as enforcer of simulation “physics”, maintaining the current simulation time, tracking a separate “current time” for each agent, and ensuring there is no inappropriate time travel. Key features of the ABIDES kernel include:

• Historical date simulation: All simulation occurs on a configurable historical date or sequence of dates. This permits “real” historical information to be seamlessly injected into the simulation at appropriate times when required for a particular experiment.

• Nanosecond resolution: Because we seek to emulate real markets, we simulate time at the same resolution as an example exchange: the NASDAQ exchange. All simulation times are represented as Pandas Timestamp objects with nanosecond resolution. This allows a mixture of agents to participate in the simulation on very different time scales with minimal developer overhead. Events that occur simultaneously (in the same nanosecond) will be executed in arbitrary order.

• Global Virtual Time (GVT): GVT is the latest simulated time for which all messages are guaranteed to have been processed. The kernel tracks GVT as the simulation progresses. It is usually the case that GVT advances much more quickly than wall clock time, but for very complex scenarios, it may not. The GVT value is not available to the agents.

• Current time per agent: The kernel tracks a “current time” per individual participating agent which is incremented upon return from any call to Agent.receiveMessage() or Agent.wakeup(). In situations where the current time for the agent is “in the future” (i.e., larger than GVT), the kernel will delay delivery of messages or wakeup calls to this agent until GVT catches up.

• Computation delay: The kernel stores a computation delay per agent which is added to the agent’s “current time” after each activity. The delay is also added to the sent time and delivery time of any outbound message from an agent to account for the agent’s computation effort. Agents may alter this computation delay to account for different sorts of computation events.

• Configurable network latency: The kernel maintains a pairwise agent latency matrix and a latency noise model which are applied to all messages between agents. This permits simulation of network conditions and agent location, including co-location.

• Deterministic but random execution: The kernel accepts a single pseudo-random number generator (PRNG) seed at initialization. This PRNG is then used to generate seeds for an individual PRNG object per agent, which must rely solely on that object for stochastic methods. Since our system is single-threaded, this allows the entire simulation to be guaranteed identical when the same seed is initialized within the same experimental configuration. Seed-level determinism alone would not ordinarily permit the desired A/B testing, because the “agent of change” might consume an additional pseudo-random number from the sequence and thus change the stochastic source for all subsequent agents. Because of our careful use of the primary PRNG only to generate subsidiary PRNGs per agent, the “agent of change” in an ABIDES A/B experiment will not alter the set of pseudo-random numbers given to any other agent throughout the simulation, even if it uses more or fewer such inputs for its changed activity. In this way, changes in the behavior of other agents will be caused by a changed simulation environment (e.g. stock prices) and not simple stochastic perturbation.
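The per-agent PRNG scheme described above can be illustrated with a short sketch. This is not the ABIDES source: the helper name make_agent_rngs and the choice of numpy.random.RandomState are assumptions made for the example. It shows that when one agent consumes extra random numbers, the streams seen by the other agents are unchanged.

```python
import numpy as np

def make_agent_rngs(master_seed, num_agents):
    """Derive one PRNG per agent from a single master seed.

    Hypothetical helper: the master PRNG is used only to draw seeds and
    never supplies random numbers to agents directly.
    """
    master = np.random.RandomState(seed=master_seed)
    seeds = master.randint(low=0, high=2**32 - 1, size=num_agents, dtype=np.uint64)
    return [np.random.RandomState(seed=int(s)) for s in seeds]

# Baseline run: every agent draws three numbers.
baseline = make_agent_rngs(1234, 3)
base_draws = [rng.normal(size=3) for rng in baseline]

# Changed run: agent 0 (the "agent of change") consumes extra numbers first.
changed = make_agent_rngs(1234, 3)
changed[0].normal(size=10)
changed_draws = [rng.normal(size=3) for rng in changed]

# Agents 1 and 2 see exactly the same random inputs in both runs.
others_unchanged = all(
    np.array_equal(base_draws[i], changed_draws[i]) for i in (1, 2)
)
```

With a single shared generator, the extra draws by agent 0 would instead shift every later agent's inputs, which is the perturbation the design avoids.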

During a simulation, the kernel follows a series of life cycle phases. All except the event queue processing phase consist entirely of sending the relevant event notification to all agents, and are described in the Agent subsection below. The event queue processing phase is elaborated upon here:

1. Kernel Initializing
2. Kernel Starting
3. Repeat until the event queue is empty or currentTime > stopTime:
   - Extract next scheduled event and set currentTime = event.deliveryTime
   - If agentTimes[event.target] > currentTime:
       · event.deliveryTime = agentTimes[event.target]
       · Place event back in queue and goto 3
   - agentTimes[event.target] = event.deliveryTime
   - Call target.wakeup() or target.receiveMessage()
   - agentTimes[event.target] += computationDelay[event.target]
4. Kernel Stopping
5. Kernel Terminating

The kernel additionally supports a few critical methods upon which agents depend:

• sendMessage(sender, recipient, message, delay) - Schedules message to be transmitted from sender to recipient with an (optional) non-negative additional delay. The “sent time” will be the sender’s current time, plus its computation delay, plus any requested extra delay. The “delivery time” will be the sent time plus network latency plus jitter, as determined by configured parameters for the experiment.

• setWakeup(sender, requestedTime) - Schedules a wakeup call for the sender at the requested future time.

• findAgentByType(type) - Returns the numeric identifier of an agent of the requested type if one can be found. If multiple agents of the type exist, one is selected arbitrarily. It is not possible for an agent to obtain a reference to another agent (and thus bypass the kernel in the future).

• writeLog(sender, dfLog) - Called by an agent to request that its log be archived to disk for analysis. The log is expected to be a Pandas DataFrame with index type DatetimeIndex.

All simulator agents are defined in the agent package. All participants in a simulation must inherit from the base agent.Agent class, which implements a number of required methods that allow basic participation in the full life cycle of the simulation.

The following methods must be supported by all simulation agents and will be called exactly one time per agent by the kernel. The order in which agents are activated in each life cycle phase is arbitrary.

• kernelInitializing(kernel) - The kernel has just started running. The existence of other agents should not be assumed. There is no “current time”. The base Agent simply retains the given kernel reference.

• kernelStarting(startTime) - Event queue processing is about to begin. All other agents are now guaranteed to exist. There is no “current time”. startTime contains what will be the initial simulation timestamp. The base Agent requests a wakeup call for this initial timestamp.

• kernelStopping() - Event queue processing has just ended. All other agents are still guaranteed to exist. There is no longer a “current time”. The base Agent takes no action.

• kernelTerminating() - The kernel is about to shut down. The existence of other agents should not be assumed. There is no longer a “current time”. Agents are expected to log any final data and clean up. The base Agent passes off its individual event log, if there are entries, to the kernel for archival.

The following methods must be supported by all simulation agents. They will be called by the kernel in order of increasing delivery timestamp of queued messages and wakeup calls. In both cases, the base Agent simply updates its internal current time and displays an informative message.

• receiveMessage(currentTime, msg) - The kernel is delivering a message from another agent. currentTime is the current simulation time as a Pandas Timestamp (nanosecond resolution). msg is an instance of class message.Message which the agent must interpret.

• wakeup(currentTime) - The kernel is delivering a previously-scheduled “wakeup call” to the agent. currentTime is the current simulation time. No message is delivered, thus the agent must use internal state and logic to determine what it should do next.

While not required by the simulation kernel, the base Agent class also provides logEvent(eventType, event), which can be called by any agent to append to an individual timestamped log of events. As noted above, by default this log is reported to the kernel for archival during the kernelTerminating life cycle phase.
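The event queue processing phase (step 3 of the kernel life cycle above) can be sketched with a priority queue. This is a simplified illustration under stated assumptions, not the ABIDES kernel: the ToyKernel class, its method names, and the use of plain integer nanoseconds in place of Pandas Timestamps are all choices made for the example.

```python
import heapq
import itertools

class ToyKernel:
    """Toy event loop with a per-agent "current time" and computation delay."""

    def __init__(self, agents, computation_delay, stop_time):
        self.agents = agents                        # agent id -> callable(current_time)
        self.computation_delay = computation_delay  # agent id -> delay in ns
        self.agent_times = {a: 0 for a in agents}   # per-agent current time
        self.stop_time = stop_time
        self.queue = []
        self._tiebreak = itertools.count()          # keeps heap entries comparable

    def schedule(self, delivery_time, target):
        heapq.heappush(self.queue, (delivery_time, next(self._tiebreak), target))

    def run(self):
        while self.queue:
            delivery_time, _, target = heapq.heappop(self.queue)
            if delivery_time > self.stop_time:
                break
            # The target is still busy computing: postpone the event.
            if self.agent_times[target] > delivery_time:
                self.schedule(self.agent_times[target], target)
                continue
            self.agent_times[target] = delivery_time
            self.agents[target](delivery_time)
            self.agent_times[target] += self.computation_delay[target]

log = []
kernel = ToyKernel(
    agents={
        "fast": lambda t: log.append(("fast", t)),
        "slow": lambda t: log.append(("slow", t)),
    },
    computation_delay={"fast": 1, "slow": 50},
    stop_time=1_000,
)
for t in (10, 20):
    kernel.schedule(t, "fast")
    kernel.schedule(t, "slow")
kernel.run()
```

In this run the slow agent's second event, nominally due at time 20, is delivered at time 60, once its 50 ns computation delay from the first event has elapsed, while the fast agent acts at both 10 and 20.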

The agent.ExchangeAgent class inherits from agent.Agent and represents a stock exchange such as NASDAQ. The message protocols supported by this agent are based on NASDAQ’s published ITCH and OUCH protocols [5, 6]. The exchange is initialized with market opening and closing times, which it will enforce. These are not required to match the simulation start and stop times. The exchange agent is not privileged in any way; it must participate in the simulation just as any other agent. The ExchangeAgent understands how to respond to these types of messages:

• Market Open Time: Returns the timestamp at which the exchange will begin processing order-related messages.

• Market Close Time: Returns the timestamp at which the exchange will stop processing order-related messages.

• Query Last Trade: Returns the last trade price for a requested symbol. Until the first trade of the day, the exchange reports the oracle open price (historical or generated data) as the “last trade price”. The exchange does not yet implement the opening cross auction.

• Query Spread / Depth: Returns a list of the N best bid and best ask prices for a requested symbol and the aggregate volume available at each price point. With a requested depth of one, this is equivalent to querying “the spread”.

• Limit Order: Forwards the attached limit order to the requested symbol’s order book for matching or acceptance. Agents currently simulate market orders using a limit order with an arbitrarily high or low limit price.

• Cancel Order: Forwards the attached order to the requested symbol’s order book to attempt cancellation.

Outside of market hours, the exchange will only honor messages relating to market hour inquiries and final trade prices (after the close). The exchange sends a “market closed” message to any agent which contacts it with disallowed messages outside of market hours.

The exchange agent demonstrates one use of the inbuilt Kernel logging facility, recording either the full order stream or snapshots of its order books at a requested frequency, enabling extremely detailed visualization and analysis of the order book at any time during simulation. For example, Figure 3 shows a time window surrounding one “high impact” market buy order, which drives prices upward immediately and has a follow-on effect on other agents’ value beliefs.

Within an Exchange Agent, an order book tracks all open orders, plus the last trade price, for a single stock symbol. All order book activity is logged through the exchange agent. The order book implements the following functionality:

• Order Matching: Attempts to match the incoming order against the appropriate side of the order book. The best price match is selected. In the case of multiple orders at the same price, the oldest order is selected.

• Partial Execution: Either the incoming order or the matched limit order may be partially executed. When the matched limit order is partially executed, the order is left in the book with its quantity reduced. When the incoming order is partially executed, its quantity is reduced and a new round of matching begins. Participants receive one “order executed” message, sent via the exchange, per partial execution noting the fill price of each. When the incoming order is executed in multiple parts, the average price per share is recorded as the last trade price for the symbol.

• Order Acceptance: When the incoming limit order has remaining quantity after all possible matches have been executed, it will be added to the order book for later fulfillment, and an “order accepted” message will be sent via the exchange.

• Order Cancellation: The order book locates the requested order by unique order id, removes any remaining unfilled quantity from the order book, and sends an “order cancelled” message via the exchange.

One might reasonably expect the order book in a market simulation to include a model for slippage. We assert that our platform produces realistic slippage naturally, without the need for such a model. Orders directed to the exchange suffer dynamic computation and network delays, during which time other orders are being executed.
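The matching and partial execution rules above amount to price-time priority. The following sketch matches one incoming buy order against resting sell orders. It is an illustration only: the function name match_buy, the dictionary-of-deques layout, and the use of integer cents for prices are assumptions, not the ABIDES order book implementation.

```python
from collections import deque

def match_buy(asks, limit_price, quantity):
    """Match an incoming buy limit order against resting sell orders.

    asks: dict mapping price (cents) -> deque of [order_id, remaining_qty],
          each deque ordered oldest first.
    Returns (fills, unfilled_qty); fills is a list of (order_id, price, qty).
    """
    fills = []
    while quantity > 0 and asks:
        best = min(asks)                 # best (lowest) ask price
        if best > limit_price:
            break                        # no remaining price the buyer accepts
        resting = asks[best][0]          # oldest order at the best price
        traded = min(quantity, resting[1])
        fills.append((resting[0], best, traded))
        quantity -= traded
        resting[1] -= traded             # a partially filled order stays in the book
        if resting[1] == 0:
            asks[best].popleft()
            if not asks[best]:
                del asks[best]
    return fills, quantity

asks = {
    10000: deque([["A", 50], ["B", 50]]),
    10005: deque([["C", 200]]),
}
fills, unfilled = match_buy(asks, limit_price=10005, quantity=120)

# Average price per share across the partial executions, which the text
# says is recorded as the last trade price.
avg_price = sum(price * qty for _, price, qty in fills) / sum(q for _, _, q in fills)
```

Any unfilled quantity returned by the function corresponds to the remainder that would be accepted into the book as a new resting order.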

Figure 3: Example of order book visualization around the time of a high impact trade.

The agent.TradingAgent class inherits from agent.Agent and represents the base class for a financial trading agent. It implements a number of additional features beyond the basic simulator Agent, upon which subclassed strategy agents may rely:

• Portfolio: The base trading agent maintains an equity portfolio including a cash position. It automatically updates this portfolio in response to “order executed” messages.

• Open Orders: The trading agent keeps a list of unfilled orders that is automatically updated upon receipt of “order executed” and “order cancelled” messages, and when new orders are originated.

• Last Known Symbol Info: The trading agent tracks known information about all symbols in its awareness, including the most recent trade prices, daily close prices (after the close), and order book spread or depth. These are automatically updated when receiving related messages.

• Market Status: Upon initially waking at simulation start, the trading agent automatically locates an exchange agent, requests market open and close times, and schedules a second wakeup call for the time of market open. It also maintains and provides a simple “market closed” flag for the benefit of subclassing agents.

• Mark to Market: The trading agent understands how to mark its portfolio to market at any time, using its most current knowledge of equity pricing. It automatically marks to market at the end of the day.

• Messages: The trading agent knows how to originate all of the messages the exchange understands, and to usefully interpret and store all of the possible responses from the exchange.

• Logging: The trading agent logs all significant activity: when it places orders; receives notification of order acceptance, execution, or cancellation; when its holdings change for any reason; or when it marks to market at the end of the day.
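The portfolio and mark-to-market features above can be illustrated with a small sketch. The function names, the "CASH" key, and the use of integer cents are assumptions made for the example and do not reflect the actual TradingAgent internals.

```python
def apply_execution(holdings, symbol, quantity, price, is_buy):
    """Update holdings in response to one "order executed" message."""
    signed = quantity if is_buy else -quantity
    holdings[symbol] = holdings.get(symbol, 0) + signed
    holdings["CASH"] = holdings.get("CASH", 0) - signed * price

def mark_to_market(holdings, last_trade):
    """Value a portfolio using the most recent known trade prices.

    holdings:   dict of symbol -> signed share count, plus a "CASH" entry.
    last_trade: dict of symbol -> last known trade price.
    """
    value = holdings.get("CASH", 0)
    for symbol, shares in holdings.items():
        if symbol != "CASH":
            value += shares * last_trade[symbol]
    return value

# Start with 1,000,000 cents in cash and buy 100 shares at 5,000 cents.
holdings = {"CASH": 1_000_000}
apply_execution(holdings, "IBM", 100, 5_000, is_buy=True)

# The last known trade price then rises to 5,100 cents.
portfolio_value = mark_to_market(holdings, {"IBM": 5_100})
```

Because short positions are stored as negative share counts, the same valuation loop handles long and short holdings.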

The ABIDES simulator is implemented using Python, currently 3.6, and the data analytical libraries NumPy [7] and Pandas [4]. It makes use of a virtual environment to provide platform independence and straightforward deployment. It is built to facilitate quick reconfiguration of varying agent populations, market conditions, exchange rules, and agent hyperparameters.

Basic execution of the simulation can be as simple as: python abides.py -c config, where config is the name of an experimental configuration file. Additional command line parameters are forwarded to the configuration code for processing, so each experimental configuration can add its own required parameters to a standard interface. Complex experimental configuration can be performed directly within the config file, since it is simply Python code; however, the inclusion of command line arguments is beneficial for coarse-grained parallelization of multiple experiments of the same type, but with varied simulation parameters.

A typical configuration file will specify a historical date to simulate and a simulation start and stop time as nanosecond-precision pandas.Timestamp objects. It will then initialize a population of agents for the experiment, configuring each as desired. For example, an experiment could involve 1,000 background agents (perhaps Zero Intelligence agents or Heuristic Belief Learning agents), 100 high-frequency trading agents, and one impact agent with various initialization parameters to control their behavior. Each agent will at least be given a unique identifier and name. The configuration file will also construct a latency matrix (pairwise between all agents at nanosecond precision) and latency noise model which will be applied to all inter-agent communications. If a “data oracle”, a utility with access to a data source outside the simulation, is required for the experiment, the configuration file will initialize one. Finally, a simulation kernel will be initialized and run, passing it the agent population, oracle, and other simulation parameters.

Note that there is nothing finance-specific about the bootstrapper, configuration template, simulation kernel, or the base Agent class. All are appropriate for use in any continuous-time discrete event simulation.
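As a rough illustration of what a configuration might compute, the sketch below builds a pairwise latency matrix and applies the kernel's stated timing rule (sent time = current time + computation delay + extra delay; delivery time = sent time + latency + jitter). The function names, parameters, and latency values are hypothetical and are not taken from an ABIDES configuration file.

```python
import numpy as np
import pandas as pd

def build_latency_matrix(num_agents, exchange_id, colocated, near_ns, far_ns):
    """Pairwise latency in nanoseconds (illustrative construction only)."""
    latency = np.full((num_agents, num_agents), far_ns, dtype=np.int64)
    np.fill_diagonal(latency, 0)
    for agent_id in colocated:
        # Co-located agents get a short path to and from the exchange.
        latency[agent_id, exchange_id] = near_ns
        latency[exchange_id, agent_id] = near_ns
    return latency

def delivery_time(current_time, computation_delay_ns, extra_delay_ns,
                  latency, sender, recipient, jitter_ns=0):
    """Apply the sent-time and delivery-time rule described for sendMessage."""
    sent = current_time + pd.Timedelta(computation_delay_ns + extra_delay_ns, unit="ns")
    return sent + pd.Timedelta(int(latency[sender, recipient]) + jitter_ns, unit="ns")

# Agent 0 is the exchange; agent 1 is co-located; agent 2 is distant.
latency = build_latency_matrix(num_agents=4, exchange_id=0, colocated=[1],
                               near_ns=200, far_ns=20_000_000)
now = pd.Timestamp("2008-09-30 10:00:00")
near = delivery_time(now, 1_000, 0, latency, sender=1, recipient=0)
far = delivery_time(now, 1_000, 0, latency, sender=2, recipient=0)
```

With identical computation delays, the co-located agent's order arrives 1,200 ns after the decision time and the distant agent's arrives about 20 ms later, which is the kind of difference a co-location experiment would vary.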

To highlight the simplicity of creating a functional trading agent in our simulated environment, we present the code for a basic momentum trader. It wakes each minute during the day, queries the last trade price, projects a future price using linear regression over a configurable last N data points, and places a market order based on this projection. Following is the complete source, excluding the import of TradingAgent from the agent package:

    import numpy as np
    import pandas as pd

    class MomentumAgent(TradingAgent):

        def __init__(self, id, name, symbol, startingCash, lookback):
            super().__init__(id, name, startingCash)
            self.symbol = symbol
            self.lookback = lookback
            self.state = "AWAITING_WAKEUP"
            self.trades = []

        def wakeup(self, currentTime):
            # The base class reports whether the market is open for trading.
            can_trade = super().wakeup(currentTime)
            if not can_trade:
                return
            self.getLastTrade(self.symbol)
            self.state = "AWAITING_LAST_TRADE"

        def receiveMessage(self, currentTime, msg):
            super().receiveMessage(currentTime, msg)
            if self.state == "AWAITING_LAST_TRADE" and \
               msg.type == "QUERY_LAST_TRADE":
                last = self.last_trade[self.symbol]
                # Keep only the most recent `lookback` observations.
                self.trades = (self.trades + [last])[-self.lookback:]
                if len(self.trades) >= self.lookback:
                    # Fit a line to the window and project one step ahead.
                    m, b = np.polyfit(range(len(self.trades)), self.trades, 1)
                    pred = self.lookback * m + b
                    holdings = self.getHoldings(self.symbol)
                    if pred > last:
                        # Move to a position of +100 shares.
                        self.placeLimitOrder(self.symbol, 100 - holdings,
                                             True, self.MKT_BUY)
                    else:
                        # Move to a position of -100 shares.
                        self.placeLimitOrder(self.symbol, 100 + holdings,
                                             False, self.MKT_SELL)
                # Wake again in one minute.
                self.setWakeup(currentTime + pd.Timedelta("1min"))
                self.state = "AWAITING_WAKEUP"
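The projection step of the momentum trader can be checked in isolation. The helper below is a standalone sketch of the same linear-regression idea, outside any agent class; the function name project_next_price is an assumption for the example.

```python
import numpy as np

def project_next_price(trades, lookback):
    """Fit a line to the last `lookback` prices and extrapolate one step.

    Returns None until enough observations have been collected.
    """
    window = trades[-lookback:]
    if len(window) < lookback:
        return None
    slope, intercept = np.polyfit(range(len(window)), window, 1)
    # The window occupies x = 0 .. lookback - 1, so x = lookback is the next step.
    return lookback * slope + intercept

prices = [100.0, 101.0, 102.0, 103.0]
pred = project_next_price(prices, lookback=3)    # uses 101, 102, 103
too_few = project_next_price([100.0], lookback=3)
```

For a perfectly linear window rising by one per step, the projection is simply the next value on the line, so the agent would buy when the fitted trend points above the last trade and sell otherwise.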

A long-term goal is to produce realistic but possibly noisy re-simulations of particular days in history to play out various“what if” scenarios. The idea is to populate the simulation with a large number of trading agents that provide a realisticenvironment into which experimental agents can be injected. (a) IBM: September 30, 2008 (b) MSFT: June 24, 2016

Figure 4: Simulated trades versus historical trades on two days.Our initial effort towards this goal involves the introduction of a data oracle with access to ﬁne-resolution historicaltrade information, and the creation of a “background” agent which is able to request a noisy observation of the mostrecent historical trade as of the agent’s current simulated time. The approach is meant to reproduce the behavior ofa trader whose beliefs regarding the fundamental value of a stock are informed by interpretations of news and otherincoming information. It was inspired by the concept of a stock’s “fundamental value” as used in the work of Wang andWellman. [9] Our approach is similar, but it uses historical data as a baseline rather than a mean-reverting stochasticprocess.A common baseline agent in the continuous double auction literature is the Zero Intelligence (ZI) trader [3] whichsubmits random bids and offers to the market, usually drawn from some stochastic distribution around a central valuebelief for the underlying instrument.Our agent.BackgroundAgent class follows the general spirit of the ZI trader, but with two important distinctions:The central value belief at any time is a mixture of the prior belief with a noisy observation of a historical trade; and theagent implements an extremely basic arbitrage strategy between the last simulated trade price and its internal belief.Thus the valuation is inﬂuenced by random factors, but the direction of limit orders placed is then rational, with theagent assuming the simulated price will converge to its value belief over time. Each background agent trades only asingle symbol on a single exchange.In our current conﬁguration, a background agent typically follows the following basic logic, given some wake frequency F in some unit of time (microseconds, seconds, etc): 9BIDES: Towards High-Fidelity Market Simulation for AI Research A PREPRINT

1. Request an initial wakeup time selected randomly from a uniform distribution across the ﬁrst F interval aftermarket open2. On wakeup, cancel any unﬁlled orders and wait for conﬁrmation.3. Query the exchange for the last trade price of this agent’s symbol of interest and wait for the response.4. Request a new noisy historical observation from the data oracle, and mix this observation with any prior beliefto obtain a new posterior value belief.5. Determine the direction from the simulated last trade to this agent’s value belief. Place a limit order to bringthe agent’s holdings in line with a presumed proﬁtable position: entering, exiting, or reversing position asnecessary.6. Request a new wakeup call for the current time plus approximately F .Figure 4 compares the behavior of 100 background agents interacting in ABIDES with the actual intra-day price ontwo separate days in history. Ideally, we will see a price history that closely resembles the day in history, with similarstatistical properties. One area in which we believe simulation can add signiﬁcant value to the current state of knowledge in ﬁnance is moreaccurate models of the market impact of large trades. Each order placed at the exchange potentially “moves the market”due to the nature of the market microstructure within the order book: arriving orders can add liquidity at a better price,altering the spread; or can match existing orders and remove liquidity from the market. See Figure 5 for an example ofmechanical market impact. Figure 5: Example of mechanical market impact.Models that rely on historical data encounter limitations stemming from the inability to repeat history while introducingan experimental change and allowing subsequent events to be altered by that change. 
Models can attempt to compare “similar” days in history, but no two market days are ever the same.

If one could instead create a multi-agent simulation of a particular date in history such that a near approximation of historical trades emerged in the absence of any significant change, but the trading agents would realistically react to any such changes, a more accurate understanding of large trade impact could be attained. Here we present a preliminary investigation of this idea.

We begin each simulation with a population of background agents and at least one exchange agent. For this experiment, we add a single experimental agent, agent.ImpactAgent, which simply places a single large market order at a predetermined time of day. The experimental parameter for the agent is its “greed”; that is, the proportion of available order book liquidity near the spread it consumes at the time of trade. For example, a long impact agent with greed = 0 . will place a market buy order for of the shares on offer.

Our experiment includes 100 background agents and one exchange agent handling an order book for a set of symbols including IBM. In Figure 6, the blue line represents each trade made by our population of background agents in the absence of an impact trader. The orange line shows each trade made by the simulated trading agents given the introduction of a single impact agent with varying “greed”, acting one time with one trade at 10:00 AM on September 30, 2008. Both series are smoothed to improve visibility of the differences.

Figure 6: Market impact of trades at 10:00 AM. Panels: (a) MARKET BUY 1232 IBM; (b) MARKET BUY 2874 IBM; (c) MARKET BUY 5338 IBM; (d) MARKET BUY 7801 IBM.

Figure 7: Market impact event studies. Panels: (a) Impact agent with greed 0.5; (b) Impact agent with greed 0.1.
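As a rough illustration of the greed parameter and of mechanical market impact, the sketch below sizes a market buy as a fraction of the shares offered near the spread and then matches it against a toy ask book. It is not the agent.ImpactAgent implementation: the function names, the (price, shares) book layout, and the depth cutoff used to define "near the spread" are all assumptions made for illustration.

```python
from typing import List, Tuple

Level = Tuple[float, int]  # (ask price, shares on offer); hypothetical book layout


def impact_order_size(asks: List[Level], greed: float, depth: int = 5) -> int:
    """Size a market buy as `greed` times the shares in the `depth` best ask levels."""
    if greed < 0:
        raise ValueError("greed must be non-negative")
    near_spread = sorted(asks)[:depth]  # lowest-priced asks sit nearest the spread
    available = sum(shares for _, shares in near_spread)
    return int(round(greed * available))


def execute_market_buy(asks: List[Level], quantity: int) -> Tuple[List[Level], List[Level]]:
    """Match a market buy against the asks; return (fills, remaining book)."""
    fills: List[Level] = []
    remaining: List[Level] = []
    left = quantity
    for price, shares in sorted(asks):
        take = min(left, shares)
        if take > 0:
            fills.append((price, take))
            left -= take
        if shares > take:
            remaining.append((price, shares - take))
    return fills, remaining


if __name__ == "__main__":
    book = [(100.0, 200), (100.1, 300), (100.2, 500)]
    qty = impact_order_size(book, greed=0.5, depth=3)
    fills, book_after = execute_market_buy(book, qty)
    print("order size:", qty)
    print("fills:", fills)
    print("best ask after the trade:", book_after[0][0])
```

In this toy book the order removes the two best ask levels, so the best ask moves up before any other agent reacts. That immediate shift is the mechanical part of the impact; the longer-lived effect studied in the experiment comes from how the background agents respond afterwards.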

The impact trader has a clear effect on the market, despite the background agents’ central tendency to arbitrage the price toward historical levels, and the impact grows larger proportionally with its market bid size. The change is particularly noticeable in the cyclical peaks of the auction. Due to the price elevation it caused, the impact trader’s total profit increased with the size of its bid from an average of $2,633 with greed = 0 . to $12,502 with greed = 1 . . However, its profit per share declined from $2.14 to $1.60. We found a correlation between profit per share and trade size of r = − . across sixty experimental trials.

It is useful to consider these market impacts in aggregate across multiple experimental examples. ABIDES makes it easy to produce study plots from logged simulation data. Figure 7 shows a time-aligned event study of many impact trades at different times, on different days, to illustrate the range of likely price effects after the time of impact.

We presented the design and implementation of ABIDES, a high-fidelity equity market simulator. ABIDES provides an environment within which complex research questions regarding trading agents and market behavior can be investigated. The simulation is demonstrated in two case studies. The first case study shows how previous intra-day transaction histories are closely reproduced by a population of interacting background trading agents communicating with an exchange agent. These background agents are designed to provide a realistic market environment into which experimental agents can be injected. The second case study illustrates how large market orders impact simulated prices not just immediately, but for a significant period after the order arrives at the exchange. It is also intended to demonstrate the experimental potential of the ABIDES platform.

We now have a robust simulation environment in which to develop and experiment with more complex trading agents, including those based on approaches in machine learning and artificial intelligence.
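The time-aligned event study behind Figure 7 can be approximated with a short, dependency-free sketch. It does not use the ABIDES log format or plotting utilities: the (timestamp, price) trade tuples, the function names, and the choice to measure returns relative to the last trade at or before the event time are assumptions made for illustration.

```python
from bisect import bisect_right
from statistics import mean
from typing import List, Sequence, Tuple

Trade = Tuple[float, float]  # (timestamp in seconds, trade price); assumed layout


def price_at(trades: Sequence[Trade], t: float) -> float:
    """Last trade price at or before time t (trades sorted by timestamp)."""
    times = [ts for ts, _ in trades]
    i = bisect_right(times, t)
    if i == 0:
        raise ValueError("no trade at or before the requested time")
    return trades[i - 1][1]


def event_window(trades: Sequence[Trade], event_time: float,
                 offsets: Sequence[float]) -> List[float]:
    """Relative price change versus the price at the event time, one value per offset."""
    base = price_at(trades, event_time)
    return [price_at(trades, event_time + dt) / base - 1.0 for dt in offsets]


def event_study(runs: Sequence[Tuple[Sequence[Trade], float]],
                offsets: Sequence[float]) -> List[float]:
    """Average the time-aligned windows across many (trades, event_time) runs."""
    windows = [event_window(trades, t0, offsets) for trades, t0 in runs]
    return [mean(column) for column in zip(*windows)]


if __name__ == "__main__":
    run_a = ([(0, 100.0), (10, 100.0), (20, 101.0), (30, 102.0)], 10)
    run_b = ([(0, 50.0), (5, 50.0), (15, 50.5), (25, 51.5)], 5)
    print(event_study([run_a, run_b], offsets=[0, 10, 20]))
```

Expressing each run as a return relative to its own event-time price is what lets trades at different times, on different days, and at different price levels be overlaid on a single axis.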

ABIDES is available through GitHub at https://github.com/abides-sim/abides under the BSD 3-clause license.

This material is based upon research supported by the National Science Foundation under Grant No. 1741026.

References

[1] Banks, J. Handbook of simulation: principles, methodology, advances, applications, and practice. John Wiley & Sons, 1998.
[2] Freidman, D. The double auction market institution: A survey. The Double Auction Market Institutions, Theories and Evidence, Addison Wesley (1993).
[3] Gode, D. K., and Sunder, S. Allocative efficiency of markets with zero-intelligence traders: Market as a partial substitute for individual rationality. Journal of political economy 101, 1 (1993), 119–137.
[4] McKinney, W., et al. Data structures for statistical computing in python. In Proceedings of the 9th Python in Science Conference (2010), vol. 445, Austin, TX, pp. 51–56.
[5] NASDAQ OMX Group. NASDAQ TotalView - ITCH 5.0. Accessed: 2018-10-25.
[6] NASDAQ OMX Group. O*U*C*H Version 4.2. Accessed: 2018-10-25.
[7] Oliphant, T. E. A guide to NumPy, vol. 1. Trelgol Publishing USA, 2006.
[8] Wang, J., George, V., Balch, T., and Hybinette, M. Stockyard: A discrete event-based stock market exchange simulator. In Simulation Conference (WSC), 2017 Winter (2017), IEEE, pp. 1193–1203.
[9] Wang, X., and Wellman, M. P. Spoofing the limit order book: An agent-based model. In Proceedings of the 16th Conference on Autonomous Agents and MultiAgent Systems (2017), International Foundation for Autonomous Agents and Multiagent Systems, pp. 651–659.
[10] Wellman, M. P. Methods for empirical game-theoretic analysis. In AAAI (2006), pp. 1552–1556.
[11] Zook, M., and Grote, M. H. The microgeographies of global finance: High-frequency trading and the construction of information inequality. Environment and Planning A: Economy and Space 49, 1 (2017), 121–140.