[PDF] Design and Analysis of a Synthetic Prediction Market using Dynamic Convex Sets

Abstract

We present a synthetic prediction market whose agent purchase logic is defined using a sigmoid transformation of a convex semi-algebraic set defined in feature space. Asset prices are determined by a logarithmic scoring market rule. Time varying asset prices affect the structure of the semi-algebraic sets leading to time-varying agent purchase rules. We show that under certain assumptions on the underlying geometry, the resulting synthetic prediction market can be used to arbitrarily closely approximate a binary function defined on a set of input data. We also provide sufficient conditions for market convergence and show that under certain instances markets can exhibit limit cycles in asset spot price. We provide an evolutionary algorithm for training agent parameters to allow a market to model the distribution of a given data set and illustrate the market approximation using two open source data sets. Results are compared to standard machine learning methods.

Full PDF

DDesign and Analysis of a Synthetic Prediction Market usingDynamic Convex Sets

Nishanth Nakshatri ∗ Arjun Menon ∗ C. Lee Giles † Sarah Rajtmajer † Christopher Griﬃn ‡ January 7, 2021

Abstract

We present a synthetic prediction market whose agent purchase logic is deﬁned using a sigmoid trans-formation of a convex semi-algebraic set deﬁned in feature space. Asset prices are determined by alogarithmic scoring market rule. Time varying asset prices aﬀect the structure of the semi-algebraic setsleading to time-varying agent purchase rules. We show that under certain assumptions on the underlyinggeometry, the resulting synthetic prediction market can be used to arbitrarily closely approximate a bi-nary function deﬁned on a set of input data. We also provide suﬃcient conditions for market convergenceand show that under certain instances markets can exhibit limit cycles in asset spot price. We providean evolutionary algorithm for training agent parameters to allow a market to model the distribution ofa given data set and illustrate the market approximation using two open source data sets. Results arecompared to standard machine learning methods.

Prediction markets in their current form trace their roots to the original studies by Hanson [1–4] and sincethen have been studied and used extensively [5–11]. For a survey of work in this area through 2007 see [12].In these markets, assets corresponding to future events (e.g., elections [13], sports outcomes [14] etc.) canbe bought and sold thereby manipulating underlying asset prices. These asset prices can be interpreted asprobabilities [7, 15] thereby providing a mechanism for event forecasting. Recent applications of predictionmarkets include forecasting infectious disease activity [16], evaluating scientiﬁc hypotheses [17], predictingthe reproducibility of scientiﬁc work [18], and aggregation of employee wisdom in a corporate setting [19,20].In practice, many of these markets have been remarkably successful in eﬃciently aggregating informationabout uncertain future events [21]. There are a number of compelling explanations for this. Financialstakes incentivize participants to search for better information [22] and the forecasts of more conﬁdentagents are weighted more heavily, where conﬁdence is measured as willingness to risk more money [23].The eﬃcient markets hypothesis suggests that the market price reﬂects available information at least aswell as any competing method [24], although some have suggested that this hypothesis is not upheld inprediction markets [7]. Work has explored speciﬁc concerns about liquidity, price manipulation, outcomemanipulation, bias, and their respective impacts on market eﬃciency [15, 25–29]. A separate thread of thisresearch has studied the accuracy of prediction markets based on real versus play money, to disentanglethe speciﬁc role of ﬁnancial incentives (see, e.g., [6, 30–32]). The arrival of blockchain technologies hasfacilitated the development of decentralized prediction markets (e.g., [33–35]), which beneﬁt from the trustand transparency inherent in these ownerless peer-to-peer systems. Blockchain-based prediction marketsoﬀer anonymity for their traders [36,37], support broad participation, and reduce single points of failure [38].Design of decentralized prediction markets is an ongoing area of research [39–41]. ∗ Dept. of Computer Science and Engineering, Pennsylvania State University, University Park, PA 16802 † College of Information Sciences and Technology, Pennsylvania State University, University Park, PA 16802 ‡ Applied Research Laboratory, Pennsylvania State University, University Park, PA 16802 a r X i v : . [ c s . C E ] J a n ver the last decade, a body of work has emerged on so-called artiﬁcial (equivalently, synthetic) predictionmarkets. These are numerically simulated markets populated by artiﬁcial participants (agents) for thepurpose of supervised learning of probability estimators [42]. Like their human-populated counterparts,artiﬁcial prediction markets have found a number of applications, including lymph node detection from CTscans [43] and early stage detection of epidemics from crowd-sourced data [44]. The theoretical promise ofartiﬁcial markets was ﬁrst explored by Chen and colleagues [45–47]. They highlight the deep mathematicalconnections between prediction markets and learning, demonstrating that any cost function based predictionmarket with bounded loss can be interpreted as a no-regret learning algorithm [46]. And, that every convexcost function based prediction market can be interpreted as a Follow the Regularized Leader algorithm witha convex regularizer [47].In an initial construction put forward by Barbu and Lay [42] patterned after the Iowa Electronic Markets[5], each agent is represented as a budget and a simple betting function. During training, each agent’sbudget is updated based on the accuracy of its prediction for each training data point. The contract pricefor an outcome is an estimator of its class-conditional probability. These markets, authors found, were ableto outperform random forest and implicit online learning in benchmark classiﬁcation tasks. In follow-upwork [48], the same authors generalized the market framework to support regression and reported similargains in performance. Storkey and colleagues [49, 50] develop an artiﬁcial prediction market with a diﬀerentmarket mechanism, the so-called machine learning market. In their formulation, each agent purchasescontracts for possible outcomes in order to maximize its own utility function. The equilibrium price of thecontracts is computed by an optimization procedure. The market is shown to outperform standard classiﬁerson a number of machine learning benchmarks. A 2014 extension of this work [51] models agents using staticrisk measures. The authors demonstrate that the resulting market approaches a global objective, formallyasserting the potential of the market to solve problems in machine learning. More recently, authors haveproposed continuous artiﬁcial prediction markets [52] for online regression. These markets consider agentswith adaptive trading strategies, using reinforcement learning to dynamically identify actions that maximizetheir own reward.In this paper we study synthetic prediction markets in which the agents’ purchase logic is governed bytime-varying semi-algebraic sets. For the purposes of this work, we focus on convex semi-algebraic setsdeﬁned by ellipsoids in R n . Time variation of the set volume is governed by asset prices in the market.Agents specialize in the purchase of a single asset class and will only purchase an asset at time t if an inputfeature vector is contained in the (time-varying) set deﬁning the agent. We show the following:1. Given an arbitrarily large but ﬁnite labeled data set, we show how to construct a market that willperfectly assign to each input the appropriate output. This allows us to derive a form of universalapproximation for our market structure.2. We provide a suﬃcient condition in terms of the underlying geometric structures for a market toconverge to a single ﬁnal price for all assets.3. We show that the market can exhibit limit cycles and these limit cycles correspond to input data thatlie near decision boundaries of agents.4. We develop an evolutionary algorithm for training agent behavior in a market to represent a set ofinput data.5. We illustrate this algorithm using three open source data sets.Our results are complementary to the existing synthetic prediction market literature and establish a geometricfoundation for building more complex prediction markets.The remainder of this paper is organized as follows: In Section 2 we discuss the synthetic predictionmarket model and establish relevant notation. Theoretical results on the prediction market are establishedin Section 3. We discuss an algorithm for training a market to classify samples from a speciﬁc data setin Section 4. In Section 5 we show empirical results on three open source machine learning data sets.Conclusions and future directions of research are presented in Section 6.2 Binary Market Model

Let Z + be the positive integers. Assume we have a binary option market with the two options denoted asAssets 0 and 1. Assume q t = ( q t , q t ) ∈ Z units of (Asset 0, Asset 1) at time t have been sold. A (binaryoption) market [15] M consists of a set of agents A = { a , . . . , a n } who buy (and sell) Assets 0 and 1 usingpolicies { γ , . . . , γ n } . If agent purchase policy γ i is conditioned on exogenous information x ∈ D ⊆ R n then, γ i : ( q t , x ) (cid:55)→ ( r , r ) and Agent i purchases r units of Asset 0 and r units of Asset 1, thus causing a stateupdate. When the market is conditioned on x ∈ D we denote it M x .Assuming time passes discretely (is epochal) and we have an input x ∈ D , market M x is a dynamicalsystem ( Z , Z + , Γ x ) where the dynamic Γ x : Z → Z arises from the interaction of the individual policies { γ , . . . , γ n } and the conditional information x . At any time t , the state q t can be mapped into a pair ofasset prices p t = ( p t , p t ) that may be used in the policies of the agents in place of q t . For the remainder of this paper, we will assume that Γ x is ﬁxed when given x and that an initial state q isgiven. We use the Logarithmic Market Scoring Rule (LMSR) [53] to aggregate estimates from a set of agents A = { a , . . . , a n } and determine asset prices. Given state ( q t , q t ), the current asset prices are computedusing LMSR: p t = exp ( βq t )exp ( βq t ) + exp ( βq t ) p t = exp ( βq t )exp ( βq t ) + exp ( βq t ) . This is the softmax function (Boltzmann distribution with constant β = k/T for ﬁxed k and T ) of the inputs( q t , q t ). The β term is a liquidity factor [54] that adjusts the amount the price will increase or decreasegiven a change in the asset quantities. By using a Boltzmann distribution, the prices can be interpreted asprobabilities.The true asset purchase prices (trade costs) are not given by p t , since LMSR incorporates a marketmaker cost. The trade costs are given by: κ t (∆ q ) = 1 β log (cid:18) exp[ β ( q t + ∆ q )] + exp[ βq t ]exp[ βq t ] + exp[ βq t ] (cid:19) κ t (∆ q ) = 1 β log (cid:18) exp[ βq t ] + exp[ β ( q t + ∆ q )]exp[ βq t ] + exp[ βq t ] (cid:19) , where ∆ q i is the change in the quantify of Asset i as a result of purchases deﬁned by Γ x .Let P ( x , t ) = p t ( x ) assuming ﬁxed q and Γ x . The market converges to a price pair ¯ p if:lim t →∞ P ( x , t ) = ¯ p . (1)Convergence is not necessarily guaranteed in all markets, however for the markets we consider, we will showsuﬃcient conditions for convergence to occur.Let φ : D ⊆ R n → [0 ,

1] be a binary function. Our objective is to construct Γ x , which deﬁnes a market M and agents A , so that: (cid:90) D | φ ( x ) − ¯ p ( x ) | d x < (cid:15), (2)where ¯ p ( x ) is the long-run price of Asset 1 and (cid:15) > L error when the price of Asset 1 is used as an approximation function for φ . We make this more precisein subsequent sections. 3 .2 Agent Purchase Policies Let f ( x ; θ ) be a quasi-concave function parameterized by θ with maximum at . By this we mean a functionthat satisﬁes the inequality: f ( λ x + (1 − λ ) x ; θ ) ≥ max { f ( x ; θ ) , f ( x ; θ ) } . (3)If Θ is a positive deﬁnite, diagonal matrix, then the quadratic function: g ( x ; θ ) = 1 − x T Θx = 1 − (cid:88) j θ i x i (4)is such a function and the set: E Q = { x ∈ R n : g ( x ; Q ) ≤ } (5)is an ellipsoid centered at and oriented along the standard basis.For the chosen quasi-concave function, deﬁne the translated function: f ( x ; h , θ ) = f ( x − h ; θ ) (6)In terms of the quadratic function this is just: g ( x ; h , Θ ) = ( x − h ) T Θ ( x − h ) . (7)Under these assumptions, Θ deﬁnes a simple local metric that is used to determine how close the conditioningpoint x is to a reference point h .Assume we are given a set of labeled training data H = { h i } Ni =1 with labels y = { y i } Ni =1 with y i ∈ { , } .For each data point h i in H (or possibly an appropriate subset of H ) with label y i deﬁne Agent i who buys only Asset y i ( i = 0 , i specializes in buying y i . Given an input featurevector x , Agent i estimates the value of Asset y i using the formula: π it ( x , p y i t ; h i ; θ i , α i , w ip ) = σ [ α i · f ( x ; h i , θ i ) + θ i + w ip ( p y i t − p y i )] , (8)where p y i t is the price of Asset y i at time t , θ is a bias, α is a scaling factor and σ is the logistic sigmoidfunction . When using an ellipsoidal function, the exact formula is: π it ( x , p y i t ; h i , θ i , α i , w ip ) = σ  α i ·  − (cid:88) j θ ij ( x − h ij )  + w ip ( p y i t − p y i ) + θ i  . (9)We note that if θ ij = 0, then the ellipsoid structure is replaced (eﬀectively) with a cylinder in R n .We assume Agent i can only buy one unit of Asset y i at a time (per epoch). The agent logic deﬁning γ i is then:1. For ∆ q y i = 1, if 1 κ y i t (cid:0) π it (cid:0) x , p y i t ; h i ; θ i , α i , w ip (cid:1) − κ y i t [∆ q y i ] (cid:1) ≥ τ, then the agent purchases a single unit of Asset y i . Here τ ∈ [0 ,

1) determines the opportunity costconsidered by the agent. When τ = 0, the agent purchases an asset precisely when it has suﬃcientfunds and when it’s estimated price is higher than the actual asset price.2. Otherwise, the agent purchases nothing.For our model, each agent only buys when the conditioning data x ∈ D is close enough (in the derivedmetric) to its initialized data point h i . Thus, we are using the data set H to construct a covering of the set D and then using that covering to construct the market and its dynamics. A unit step function could be substituted with minimal change to the sequel. Properties of the Market

In this section, we study the theoretical properties of markets in which agents have unlimited funds.

Proposition 1.

Let H = (cid:8) h i (cid:9) Ni =1 be a ﬁnite but arbitrarily large data set with labels y = { y i } Ni =1 . Assumethe data are separable; i.e., if h i = h j , then y i = y j . For all (cid:15) > , there is a market M with agents A = { a , . . . , a N } such that for all i = 1 , . . . , N : lim t →∞ | p ( h i , t ) − y i | < (cid:15), (10) where p ( x , t ) is the price of Asset 1 in the market (the market spot price).Proof. Set τ = 0. The fact that H is ﬁnite implies there is a set of open spheres centered at h , . . . , h N withradii r , . . . , r N so that: h j ∈ B r i ( h i ) ⇐⇒ h j = h i (11)From Eq. (9), for all i and j , deﬁne θ ij = 1 /r i . For all i set w ip = θ i = 0. Assume that Agent i purchasesonly Asset y i . For Agent i using Eq. (9) the estimated price given h i is constant and given by: π i ( h i ) = π it ( h i , p y i t ; h i , θ i , α i , w ip ) = σ ( α i ) > . (12)Likewise, it is clear that for h j (cid:54) = h i : π i ( h j ) = π it ( h j , p y i t ; h i , θ i , α i , w ip ) < , (13)since by construction: 1 − (cid:88) k (cid:16) h jk − h ik (cid:17) r i < . Set α i = α so that (by choice of α ) for all i, j : π i ( h i ) > − δπ i ( h j ) < δ, for a δ ∈ (0 , (cid:15)/ α must exist because σ is monotonic and bounded between 0 and 1. When h i isused as the market input (i.e., x = h i ), then Agent i will purchase one share of Asset y i per epoch until theﬁrst time t (1) when: 1 − δ < π i ( h i ) < κ y i t (1) . Choose β small enough to ensure that at this point:1 − δ < p y i t (1) = e βt (1) e βt (1) < κ y i t (1) . (14)There are two possibilities. Case I : For all j : π j ( h i ) < δ < κ − y i t (1) . In this case, the market converges to price p y i t (1) > − δ > − (cid:15) as required. Case II:

There is at least one j so that π j ( h i ) > κ − y i t (1) . t (1) all such agents will purchase shares of asset (1 − y i ) and will continue to do so until t (2) at whichpoint either Case I holds or Agent i purchases again. In each case, assume β is chosen small enough so thatat time t (2) : p y i t (2) > − δ. (15)This ensures that the purchases of the other agents cannot drive the price too far from 1 − δ . Such a β mustexist because asset price moves are monotonically decreasing in β . Since π i ( h i ) and π j ( h i ) are ﬁxed for alltime and H is ﬁnite, a smallest ﬁxed value of β must exist to make Eqs. (14) and (15) true for all time. (SeeFig. 1.) We repeat the above logic to see that for time t ≥ t (1) , p y i t ∈ (1 − δ, − δ ) and Eq. (10) holds. Thiscompletes the proof.Figure 1: We illustrate the diﬀerence between the spot price p t and the purchase price κ t (∆ q ) for asset oneunder varying values of q and q with ∆ q = 1. The value of β = 1 / β decreases, the diﬀerence κ t (∆ q ) − p t →

0. Thus ensuring Eqs. (14) and (15) .Using the prior result, it is straightforward to see that if D ⊂ R n is a simply connected closed and boundedset and χ D ( x ) is its characteristic function, then if (cid:15) >

0, there is a market M with agents A = { a , . . . , a N } (for some possibly large N) so that: (cid:90) D (cid:12)(cid:12) χ D ( x ) − ¯ p ( x ) (cid:12)(cid:12) d x < (cid:15). (16)To see this, choose a large but ﬁnite sample of points from D and add to this an appropriately large sampleof points near the boundary of D . Call this set H and apply an argument like the one for Proposition 1 toconstruct the market. From this we conclude: Proposition 2. If D is a ﬁnite union of simply connected closed and bounded subsets of R n and (cid:15) > , thenthere is a market M and a ﬁnite (but large) set of agents so that Eq. (16) holds. We eﬀectively illustrate Proposition 2 in Section 5.3.

Let: Ω it = (cid:26) x ∈ R n : 1 κ y i t (cid:0) π it (cid:0) x , p y i t ; h i ; θ i , α i , w ip (cid:1) − κ y i t [∆ q y i ] (cid:1) ≥ τ (cid:27) (17)the following proposition provides a suﬃcient condition for the convergence of the market price to a singlevalue. 6 roposition 3. Consider a market M with agent set A = { a , . . . , a N } and a ﬁxed β , τ . Given an input x ∈ R n , if there is a time t ∗ and an index set I ∗ = { i , . . . , i k } ⊂ { , . . . , N } so that for all t ≥ t ∗ : x ∈ (cid:92) i ∈ I ∗ Ω it , (18) and if j (cid:54)∈ I ∗ , then x (cid:54)∈ Ω jt , then the price p t converges to a ﬁxed value.Proof. Suppose there is a t ∗ and I ∗ = ∅ . Then no agent purchases occur at time t ≥ t ∗ and the market price p t remains constant at the value p t − . If I ∗ is not empty, then assume there are r ≥ I ∗ and s ≥ I ∗ . Then for all time the spot price for Asset1 is given by: p t = exp (cid:2) β (cid:0) q t − + rt (cid:1)(cid:3) exp (cid:2) β (cid:0) q t − + rt (cid:1)(cid:3) + exp (cid:2) β (cid:0) q t − + st (cid:1)(cid:3) , (19)because at all future times the agents in I ∗ will purchase 1 unit of the appropriate asset. Taking the limitat t → ∞ yields: ¯ p =  s > r s < r exp ( βq ) exp( βq )+exp( βq ) if r = s (20)This completes the proof.We note that when each agent is given a ﬁnite bank account, then convergence of the market is ensuredand the decision logic must be amended to include a test for suﬃcient funds.It is easy to construct an example in which the market does not converge to a ﬁxed point. To see this,consider a market with a two dimensional feature space and two agents with h = (0 ,

0) and h = (2 , x = (1 . , τ = 0, β = 1 / i = 1 , α i = 3, w ip = 2. If we assume both agents have r ij = 1 .

015 (i.e., agent geometry is circular), then this market will oscillate in price forever as illustrated inFig. 2. The oscillation in the price is caused by the oscillation in the geometric structure of the sets Ω t and A ss e t P r i c e Figure 2: An example of an oscillating market price for an input near the decision boundary.Ω t . As the market price varies in time, each agent oscillates between determining the price is too high orlow enough to purchase. Thus, when input information is close to a decision boundary we see that marketprices may exhibit a limit cycle. Establishing suﬃcient conditions for the emergence of a limit cycle in themarket is left to future work. However, as we have illustrated limit cycles will emerge when input (test)points are near multiple agent decision boundaries in feature space and thus can indicate indecision if themarket is used as a machine learning model. 7 Training Agents within a Market

In this section we discuss a practical implementation of the prediction market described above and detail amethod to train such a market to approximate a data set. For practical purposes, we make three simplifyingimplementation changes:1. We assume time is ﬁnite. That is, the market will terminate after a ﬁxed large time.2. We assume all agents have a ﬁnite bank account.3. We assume that agents recurrently arrive at the market to buy assets with inter-arrival times governedby an exponential distribution. Thus not all agents interact with the market simultaneously.The third assumptions is made to increase the execution speed of the market and to ensure a suﬃcientnumber of training epochs can be executed in a reasonable amount of wall-clock time.

Let each training data-point be denoted as ( x i , y i ), where y i denotes the output label. Let m be the totalnumber of training data-points. Deﬁne: C k = (cid:8) i ∈ { , . . . , m } : y i = k (cid:9) , (21)for k = 0 ,

1. Training will proceed in batches. Assume a batch size of b , where b < m , to denote the numberof data-points used to train the model in one pass. Thus, there will be a total of (cid:100) m/b (cid:101) batches. A set of n Agents are initialized for every data-point x i in a batch B j where i ∈ [1 , b ] and j ∈ [1 , (cid:100) m/b (cid:101) ]]. The agentsare initialized as hyperspheres centered at h i = x i . To determine the initial radius, let r i = arg min j ∈C yi (cid:13)(cid:13) x i − x j (cid:13)(cid:13) r i = 12 · arg min j ∈C − yi (cid:13)(cid:13) x i − x j (cid:13)(cid:13) . These are the distances to the nearest point with similar classiﬁcation and half the distance to the nearestpoint with opposite classiﬁcation. Then set: p i = max (cid:8) r , min (cid:8) r i , r i (cid:9)(cid:9) q i = max (cid:8) r (cid:48) , r i (cid:9) where r and r (cid:48) are default values. The radius of the hyper-sphere is initialized with: r i ∼ U ( p i , q i ) , where U is a uniform distribution. That is, we model each agent with an ellipsoid so all axial radii areinitialized to r i . The initial value for w ip is chosen from a standard normal distribution for each i . Finally,if Agent i is centered at x i = h i with class y i , then that agent will only purchase assets of Class y i . Each market run is parameterized by an input feature vector x shared by all agents. This feature vectoris used in agent purchase logic. Agents are initialized with a ﬁnite bank. During an execution of themarket, each agent is seeded with an initial time it will interact with the market drawn from an exponentialdistribution. The next time of execution is set when the agent interacts with the market and uses the sameexponential distribution. All agents have a common exponential distribution. Agents buy assets accordingto the decision logic discussed above and keep track of purchased assets and the price paid. There is a globalclock that is updated to determine when agents participate. At market completion (after a ﬁxed time haspassed), agent proﬁts and losses are calculated assuming assets that match the ground truth class y i areworth 1 and other assets are valued at 0. 8 .3 Evolutionary Algorithm The evolutionary algorithm deﬁned below is used to identify parameters θ i , w ip , and α . For the purposesof this work, we assume that θ i = 0 is ﬁxed for all i , we set β = 1 /

100 and τ = . Optimizing theseparameters is a subject of future work. Evolutionary AlgorithmInput:

Feature vectors X = (cid:8) x i (cid:9) Ni =1 , ground truth labels (cid:8) y i (cid:9) Ni =1

1. For each data point x i with label y i create n agents centered at h i = x i and an initial random radius r i ∼ U ( p i , q i ), a random scale parameter α i ∼ U (0 . ,

5) and a random w ip ∼ N (0 , y i .2. Run N markets one for each input x i ∈ X .3. For each market, sort all agents into three groups (i) those that did not participate, (ii) those thatmade a proﬁt and (iii) those that had a loss.(a) For each center h i :i. If no agent with center h i participated, continue.ii. Among the agents who participated retain l < n agents who had the highest proﬁt (or lowestloss).iii. Delete the n − l under-performing agents.iv. Create n − l new agents from the agent pool centered at h i using mutation and crossover ofthe parameters α , θ and w p . Speciﬁcally, mutation is carried out as follows:A. Compute σ = 2 (cid:114) n Σ mi =1 (cid:0) y i − ¯ p y i (cid:1) . (22)B. Update r i ← r i + σ · U ( p i − r i , q i − r i ).C. Update w ip ← w ip + σ · N (0 , g generations.Because each agent is modeled by an ellipsoid with a ﬁnite volume, not every agent will participate in everymarket. In particular, if x i (cid:54)∈ Ω it for any time t , then Agent i will not participate. Of those agents that doparticipate in a given market, those that are most successful are preserved and replicate with mutation andcrossover. The mutation rate is controlled by the current root mean-square error of the approximation. Asthis value decreases, the mutation decreases. This section discusses the results obtained by the application of the proposed market model on standarddatasets such as IRIS Dataset and Heart Disease Dataset. We also apply the model to perform the recordlinkage task of disambiguating inventor records from the USPTO PatentsView database as a real-worldapplication usecase. For all the experiments, we have chosen n = 5 (agent replicants), l = 3 (retainedagents) and g = 20 (generations). We study the standard IRIS data set [55], which consists of features describing three species of iris plants -Iris setosa, Iris virginica and Iris versicolor. The data set contains 50 instances of feature vectors from eachclass. It is known that Iris Setosa is linearly separable from the other two classes. However, Iris Versicolorand Iris Virginica are not linearly separable from each other. We use four attributes, length and width ofsepals and petals, to classify an instance into one of the three classes.The proposed market model is generalized to be a binary classiﬁer. However, the dataset consists ofthree classes. Therefore, we take the union of two classes and train the model on the one-against-two binaryclassiﬁcation problem. We used a train-test split of 75:25.9 nion of Iris Setosa and Iris Versicolor

We combined the two classes, Iris Setosa and Iris Versicolor,and represented them as Class 0. Class 1 was composed of data from to Iris Virginica. A test accuracy of94.6% was obtained in this case. A detailed analysis is shown in Table 1.

Class Precision Recall F1-Score

Class 0 (Setosa/Versicolor) 1.00 0.91 0.95Class 1 (Virginica) 0.88 1.00 0.95Table 1: Shows the test performance when instances of Iris Setosa and Iris Versicolor are combined togetheras Class0.

Union of Iris Setosa and Iris Virginica

We combined the instances of Iris Setosa and Iris Virginicaas Class 0. Class 1 contains data from Iris Versicolor. A test accuracy of 97.29% was observed and Table 2shows a detailed analysis.

Class Precision Recall F1-Score

Class 0 (Setosa/Virginica) 0.96 1.00 0.98Class 1 (Versicolor) 1.00 0.93 0.96Table 2: Shows the test performance when instances of Iris Setosa and Iris Virginca are combined togetheras Class0.

Union of Iris Versicolor and Iris Virginica

We combined the instances of Iris Versicolor and IrisVirginica as Class 1. Class 0 consists of data from Iris Setosa. A test accuracy of 100.0% was observedTable 3 shows a detailed analysis. We have to note that instances of Iris Setosa are linearly separable fromthe other two classes and thus, the model is able to separate the two classes with 100% accuracy in this case.

Class Precision Recall F1-Score

Class 0 (Setosa) 1.00 1.00 1.00Class 1 (Versicolor/Virginica) 1.00 1.00 1.00Table 3: Shows the test performance when instances of Iris Setosa and Iris Virginca are combined togetheras Class0.

This is a publicly available dataset [56] provided by UCI. There are four databases available for use withinthe dataset. Published experiments in Machine Learning use the Cleveland database with a maximum of14 of the 76 available attributes which are known to be considerably linked to heart disease. We use thefollowing 14 numerical attributes to train the market model to classify patients to one of the targets; presenceof heart disease, no heart disease.1. Age2. Sex: male, female3. Chest pain type: typical angina (angina), atypical angina (abnang), non-anginal pain (notang), asymp-tomatic (asymp)4. Trestbps: resting blood pressure on admission5. Chol: serum cholestrol 10. Fbs: indicates whether fasting blood sugar is greater than 120 mg/dl7. Restecg: normal(norm), abnormal(abn): ST-T wave abnormality, ventricular hypertrophy (hyp)8. Thalach: maximum heart rate achieved9. Exang: exercise induced angina10. Oldpeak: ST depression induced by exercise relative to rest11. Slope: upsloping, ﬂat, downsloping: the slope characteristics of the peak exercise ST segment12. Ca: number of ﬂuoroscopy colored major vessels13. Thal: normal, ﬁxed defect, reversible defect - the heart status14. Class/target labelThe data set has a total of 303 data points. To evaluate the performance of the market (M), we split thedata using an 80%-20% ratio. The market was tested on 20% of the randomly sampled data. A total of60 data points were used for testing the model. For one of the randomly chosen split, we obtained a testaccuracy of 86.66%. The confusion matrix associated with the test data is shown in Fig. 3. +HDUW'LVHDVH 1R+HDUW'LVHDVH 3UHGLFWHG + HD U W ' L V HD V H 1 R + HD U W ' L V HD V H $ F W XD O Figure 3: Confusion Matrix obtained for the test set.The obtained results from the market model are compared with the output obtained from a RandomForest (RF) classiﬁer for the same split. The RF classiﬁer obtained a test accuracy of 96.66%. Table 4compares the F1-Score obtained for both the models. We see that Random Forest outperformed the marketin this case.

Model No Heart Disease (%) Heart Disease (%)RF

96 97 M

84 89Table 4: Shows the F1 scores for each class for both the classiﬁers.To measure the sensitivity of the model with respect to the variation in inputs, we performed the exper-iment with six randomly sampled data splits. A train-test ratio of 80%-20% was retained for all the splits.Fig. 4 shows the comparison of the Market model with RF Classiﬁer. We observe variations in the twomodels with respect to changing input data. The RF classiﬁer outperformed the market in ﬁve out of sixcases. Market performance was comparable to RF classiﬁer for the ﬁfth split. Fig. 5 shows the box plot ofF1-scores associated with each class for the two models.The lower performance of the market can be attributed to a lack of generalization using the underlyinggeometry. The use of simple geometric agents allows us to quantify this. Fig. 6 shows the number of markets11 'DWD)ROG $ FF X U D F\ &RPSDULVRQRI5DQGRP)RUHVWZLWK0DUNHW0RGHO 0DUNHW5DQGRP)RUHVW Figure 4: Comparison of the Market Model with Random Forest Classiﬁer for six diﬀerent data splits. 0DUNHW+HDUW'LVHDVH 5)+HDUW'LVHDVH ) 6 F R U H 9DULDWLRQLQ)6FRUHIRUFODVV+HDUW'LVHDVH (a) Class - Heart Disease 0DUNHW+HDUW'LVHDVH 5)+HDUW'LVHDVH ) 6 F R U H 9DULDWLRQLQ)6FRUHIRUFODVV+HDUW'LVHDVH (b) Class - No Heart Disease Figure 5: Variation in F1-Scores for both models in each class. 'DWD)ROGV 1 RQ 3 D U W L F L SDQ W 0 D U N H W V 1XPEHURI1RQ3DUWLFLSDQW0DUNHWVYV'DWD6SOLWV Figure 6: Shows the number of markets with No-Agent Participation for six diﬀerent data splits.12ith no agent participation for various data splits. We observed that accuracy increased with increase inagent participation across all markets. The highest obtained accuracy of 86.66%, as seen in Fig. 4, hadonly six markets with no agent participation. For future work, we will include agents whose decision logic ischaracterized by either ellipsoids or a convex cone, which generalizes a hyperplane separator but also remainsinterpretable. The challenge in this will be to alter the evolutionary algorithm for account for agents withmultiple geometries.

We curate a subset of the publicly available database that was released as part of the PatentsView InventorDisambiguation Workshop. A random sample of labeled inventor records from the database containing 346patent records of 74 distinct inventors is selected. We then build a data set based on pairwise similarityvectors of records in the sample to train the prediction market classiﬁer. We have three distinct types ofpairs for the inventor/patent records in the data set which yields a total of 2646 patent record pairs:1. Positive pairs: We leverage the labeled records from the database relating to distinct inventors tocreate inventor clusters where all possible pairs within a cluster are assigned a label 1 indicating thatthey are the same person.2. Similar negative pairs: For each inventor cluster in the sample, we retrieve patent records from thedatabase such that the ﬁrst name and the last name of the inventor are an exact match, but they area diﬀerent person; i.e., the record does not belong to this inventor cluster. These are prime candidatesthat make disambiguation necessary and have a label 0.3. Random negative pairs: We generate random pairs by using candidates that belong to diﬀerent clustersthat are assigned a label 0.We compute the similarity and distance measures outlined in Table 5 for each patent record pair for therespective inventor features in the sample.

Type Measure Feature

Token Cosine, Jaccard Similarity TitleSectionSubsectionGroupSub-groupOrganizationString Jaro-Winkler, Soundex First NameLast NameCityStateGeographic distance Haversine distance Latitude | Longitude

Table 5: USPTO Inventor Disambiguation pairwise feature classesFig. 7 oﬀers a visualization of the data set in two dimensions obtained using t-SNE [57] for nonlineardimensional reduction. It showcases a general sense of the topology intrinsic to the data. Cluster sizes maynot mean anything, nor does the distance between identiﬁed clusters or within them as discussed in [58].However, we see that there exists a non-linear, complex separation in the overall topological structure, con-sistent across diﬀerent perplexity settings. The two classes have cases where there are clearly identiﬁableclusters and some less so with overlapping instances that require complex decision boundaries. The occur-rences of outlier instances of a class in clusters predominantly composed of instances of the other class isof particular interest. These cases are a great test to validate the performance of the synthetic prediction odel Precision Recall F1-Score RF 0.996 1.0 0.998M 0.996 1.0 0.998

Table 6: F1 score for each model for USPTO inventor disambiguation.market classiﬁer, which can utilize the subtle variances in local geometry to diﬀerentiate between the twoclasses. This also illustrates Proposition 2. This data set is an ideal candidate to showcase the market’sability to distinguish non-linearly separable data. We hypothesized that the agent initialization as discussedin Section 4.1 would enable the market model to perform well by oﬀering a good covering of the data setand is experimentally conﬁrmed as discussed in subsequent results.Figure 7: Two dimensional t-SNE plots of the pairwise feature space for diﬀerent perplexity conﬁgurations.Left to right: 40, 60, and 70.To evaluate the performance of the market, we split the data using an 80%-20% ratio. Table 6 comparesthe performance of our model against a classic machine learning model (Random Forest Classiﬁer). We seethat both models perform very well for this dataset and obtain a classiﬁcation accuracy of 99.81%.

In this paper we study a speciﬁc class of synthetic binary prediction markets in which agent decision logic isspeciﬁed by a convex semi-algebraic set. We showed that these prediction markets satisfy certain universalapproximation properties and gave suﬃcient conditions for the market to converge to a ﬁnal set of asset prices.We also showed that these markets can enter limit cycles, which indicate the the conditioning data may benear a decision boundary. We provided an evolutionary algorithm for training such a market on a given dataset and illustrated this process on three example data sets. While the market under-performed the best-in-class random forest algorithms for some data sets, we were able to show consistent or equal performance tothe random forest method in all tests. In addition, we use the underlying geometric structures to infer thereason for the under-performance and devised an approach to mitigate this in future work.For future work, we will introduce agents whose decision rules are characterized by convex cones. Theseagents will work along side the existing agents (who use ellipsoidal regions) to characterize data sets, thusimproving generalization. We will also study the possible dynamics of these markets and determine whethera more robust stability theorem can be proven.

Acknowledgement

Portions of this work were sponsored by the DARPA SCORE Program (Cooperative Agreement W911NF-19-2-0272.) 14 eferences [1] Robin Hanson. Market-based foresight-a proposal.

Foresight Update , 10(1):3, 1990.[2] R Hanson. More market-based foresight.

Foresight Update , 11(11), 1991.[3] Robin Hanson. Could gambling save science? encouraging an honest consensus. 1995.[4] Russ Ray. Idea futures: Gambling on science.

The Futurist , 31(1):25, 1997.[5] Justin Wolfers and Eric Zitzewitz. Prediction markets.

Journal of economic perspectives , 18(2):107–126,2004.[6] Emile Servan-Schreiber, Justin Wolfers, David M Pennock, and Brian Galebach. Prediction markets:Does money matter?

Electronic markets , 14(3):243–251, 2004.[7] Charles F Manski. Interpreting the predictions of prediction markets. economics letters , 91(3):425–429,2006.[8] Joyce E Berg and Thomas A Rietz. Prediction markets as decision support systems.

Informationsystems frontiers , 5(1):79–93, 2003.[9] Justin Wolfers and Eric Zitzewitz. Prediction markets in theory and practice. Technical report, nationalbureau of economic research, 2006.[10] Min Dai, Yanwei Jia, and Steven Kou. The wisdom of the crowd and prediction markets.

Journal ofEconometrics , 2020.[11] Mithun Chakraborty and Sanmay Das. Trading on a rigged game: Outcome manipulation in predictionmarkets. In

IJCAI , pages 158–164, 2016.[12] George Tziralis and Ilias Tatsiopoulos. Prediction markets: An extended literature review.

The journalof prediction markets , 1(1):75–91, 2007.[13] Joyce Berg, Robert Forsythe, and Thomas Rietz. What makes markets predict well? evidence from theiowa electronic markets. In

Understanding Strategic Interaction , pages 444–463. Springer, 1997.[14] Richard H Thaler and William T Ziemba. Anomalies: Parimutuel betting markets: Racetracks andlotteries.

Journal of Economic perspectives , 2(2):161–174, 1988.[15] Justin Wolfers and Eric Zitzewitz. Interpreting prediction market prices as probabilities. Technicalreport, National Bureau of Economic Research, 2006.[16] Philip M Polgreen, Forrest D Nelson, George R Neumann, and Robert A Weinstein. Use of predictionmarkets to forecast infectious disease activity.

Clinical Infectious Diseases , 44(2):272–279, 2007.[17] Johan Almenberg, Ken Kittlitz, and Thomas Pfeiﬀer. An experiment on prediction markets in science.

PLoS One , 4(12):e8500, 2009.[18] Anna Dreber, Thomas Pfeiﬀer, Johan Almenberg, Siri Isaksson, Brad Wilson, Yiling Chen, Brian ANosek, and Magnus Johannesson. Using prediction markets to estimate the reproducibility of scientiﬁcresearch.

Proceedings of the National Academy of Sciences , 112(50):15343–15347, 2015.[19] Bo Cowgill, Justin Wolfers, and Eric Zitzewitz. Using prediction markets to track information ﬂows:Evidence from google. In amma , page 3, 2009.[20] Benjamin J Gillen, Charles R Plott, and Matthew Shum. Information aggregation mechanisms in theﬁeld: Sales forecasting inside intel. Technical report, Working paper, 2012.[21] Vernon L Smith. Constructivist and ecological rationality in economics.

American economic review ,93(3):465–508, 2003. 1522] Kenneth J Arrow, Robert Forsythe, Michael Gorham, Robert Hahn, Robin Hanson, John O Ledyard,Saul Levmore, Robert Litan, Paul Milgrom, Forrest D Nelson, et al. The promise of prediction markets.

Science , 320(5878):877, 2008.[23] Sharad Goel, Daniel M Reeves, Duncan J Watts, and David M Pennock. Prediction without markets.In

Proceedings of the 11th ACM conference on Electronic commerce , pages 357–366, 2010.[24] Rational Expectations. the theory of price movements.

Econometrica , 29(3):315–35, 1961.[25] Paul C Tetlock. Liquidity and prediction market eﬃciency.

Available at SSRN 929916 , 2008.[26] Robin Hanson and Ryan Oprea. Manipulators increase information market accuracy.

George MasonUniversity , 2004.[27] Paul C Tetlock and Robert W Hahn. Optimal liquidity provision for decision makers.

AEI-BrookingsJoint Center Working Paper , (06-18), 2007.[28] Cass R Sunstein.

Infotopia: How many minds produce knowledge . Oxford University Press, 2006.[29] Marco Ottaviani and Peter Norman Sørensen. Outcome manipulation in corporate prediction markets.

Journal of the European Economic Association , 5(2-3):554–563, 2007.[30] David M Pennock, Steve Lawrence, C Lee Giles, Finn Arup Nielsen, et al. The real power of artiﬁcialmarkets.

Science , 291(5506):987–988, 2001.[31] Earl S Rosenbloom and William Notz. Statistical tests of real-money versus play-money predictionmarkets.

Electronic Markets , 16(1):63–69, 2006.[32] Thomas S Gruca, Joyce E Berg, and Michael Cipriano. Incentive and accuracy issues in movie predictionmarkets.

The Journal of Prediction Markets , 2(1):29–43, 2008.[33] Augur: Your global, no-limit betting platform. . Accessed: 2020-10-04.[34] Gnosis: Redistribute the future. https://gnosis.io . Accessed: 2020-10-04.[35] Stox: The blockchain prediction markets platform. . Accessed: 2020-10-04.[36] Jeremy Clark, Joseph Bonneau, Edward W Felten, Joshua A Kroll, Andrew Miller, and ArvindNarayanan. On decentralizing prediction markets and order books. In

Workshop on the Economicsof Information Security, State College, Pennsylvania , volume 188, 2014.[37] Ethan Heilman, Foteini Baldimtsi, and Sharon Goldberg. Blindly signed contracts: Anonymous on-blockchain and oﬀ-blockchain bitcoin transactions. In

International conference on ﬁnancial cryptographyand data security , pages 43–60. Springer, 2016.[38] Iddo Bentov, Alex Mizrahi, and Meni Rosenfeld. Decentralized prediction market without arbiters. In

International Conference on Financial Cryptography and Data Security , pages 199–217. Springer, 2017.[39] Jack Peterson and Joseph Krug. Augur: a decentralized, open-source platform for prediction markets. arXiv preprint arXiv:1501.01042 , 2015.[40] Hemang Subramanian. Decentralized blockchain-based electronic marketplaces.

Communications of theACM , 61(1):78–84, 2017.[41] Shuai Wang, Xiaochun Ni, Yong Yuan, Fei-Yue Wang, Xiao Wang, and Liwei Ouyang. A preliminary re-search of prediction markets based on blockchain powered smart contracts. In ,pages 1287–1293. IEEE, 2018.[42] Adrian Barbu and Nathan Lay. An introduction to artiﬁcial prediction markets for classiﬁcation.

TheJournal of Machine Learning Research , 13(1):2177–2204, 2012.1643] Adrian Barbu and Nathan Lay. Artiﬁcial prediction markets for lymph node detection. In , pages 1–7. IEEE, 2013.[44] Fatemeh Jahedpari, Julian Padget, Marina De Vos, and Benjamin Hirsch. Artiﬁcial prediction marketsas a tool for syndromic surveillance.

Crowd Intelligence: Foundations, Methods and Practices , 2014.[45] Yiling Chen, Lance Fortnow, Nicolas Lambert, David M Pennock, and Jennifer Wortman. Complexityof combinatorial market makers. In

Proceedings of the 9th ACM conference on Electronic commerce ,pages 190–199, 2008.[46] Yiling Chen and Jennifer Wortman Vaughan. A new understanding of prediction markets via no-regretlearning. In

Proceedings of the 11th ACM conference on Electronic commerce , pages 189–198, 2010.[47] Jacob Abernethy, Yiling Chen, and Jennifer Wortman Vaughan. An optimization-based framework forautomated market-making. In

Proceedings of the 12th ACM conference on Electronic commerce , pages297–306, 2011.[48] Nathan Lay and Adrian Barbu. The artiﬁcial regression market. arXiv preprint arXiv:1204.4154 , 2012.[49] Amos Storkey. Machine learning markets. In

Proceedings of the Fourteenth International Conferenceon Artiﬁcial Intelligence and Statistics , pages 716–724, 2011.[50] Amos Storkey, Jono Millin, and Krzysztof Geras. Isoelastic agents and wealth updates in machinelearning markets. arXiv preprint arXiv:1206.6443 , 2012.[51] Jinli Hu and Amos Storkey. Multi-period trading prediction markets with connections to machinelearning. In

International Conference on Machine Learning , pages 1773–1781, 2014.[52] Fatemeh Jahedpari, Talal Rahwan, Sattar Hashemi, Tomasz P Michalak, Marina De Vos, Julian Padget,and Wei Lee Woon. Online prediction via continuous artiﬁcial prediction markets.

IEEE IntelligentSystems , 32(1):61–68, 2017.[53] Robin Hanson. Logarithmic markets scoring rules for modular combinatorial information aggregation.

The Journal of Prediction Markets , 1(1):3–15, 2007.[54] Suparerk Lekwijit and Daricha Sutivong. Optimizing the liquidity parameter of logarithmic marketscoring rules prediction markets.

Journal of Modelling in Management , 2018.[55] Hedyeh A Kholerdi, Nima TaheriNejad, and Axel Jantsch. Enhancement of classiﬁcation of small datasets using self-awareness—an iris ﬂower case-study. In , pages 1–5. IEEE, 2018.[56] Dheeru Dua and Casey Graﬀ. UCI machine learning repository, 2017.[57] L. V. D. Maaten and Geoﬀrey E. Hinton. Visualizing data using t-sne.