[PDF] All You Need is DAG

Abstract

We present DAG-Rider, the first asynchronous Byzantine Atomic Broadcast protocol that achieves optimal resilience, optimal amortized communication complexity, and optimal time complexity. DAG-Rider is post-quantum safe and ensures that all messages proposed by correct processes eventually get decided. We construct DAG-Rider in two layers: In the first layer, processes reliably broadcast their proposals and build a structured Directed Acyclic Graph (DAG) of the communication among them. In the second layer, processes locally observe their DAGs and totally order all proposals with no extra communication.

Full PDF

AAll You Need is DAG

IDIT KEIDAR,

Technion

ELEFTHERIOS KOKORIS-KOGIAS,

IST Austria and Novi Research

ODED NAOR,

Technion ∗ ALEXANDER SPIEGELMAN,

Novi Research

We present

DAG-Rider , the first asynchronous Byzantine Atomic Broadcast protocol that achieves optimal resilience, optimal amortizedcommunication complexity, and optimal time complexity. DAG-Rider is post-quantum safe and ensures that all messages proposed bycorrect processes eventually get decided. We construct DAG-Rider in two layers: In the first layer, processes reliably broadcast theirproposals and build a structured Directed Acyclic Graph (DAG) of the communication among them. In the second layer, processeslocally observe their DAGs and totally order all proposals with no extra communication.

The amplified need in scalable geo-replicated Byzantine fault-tolerant reliability systems has motivated an enormousamount of study on the Byzantine State Machine Replication (SMR) problem [17, 31]. Many variants of the problemwere defined in recent years [28, 32, 43] to capture the needs of blockchain systems. To address the fairness issues thatnaturally arise in interorganizational deployments, we focus on the classic long-lived Byzantine Atomic Broadcast(BAB) problem [13, 19], which in addition to total order, progress, and validity, guarantees eventual fairness . That is, all proposals by correct processes are eventually included.Up until recently, asynchronous protocols for the Byzantine consensus problem [13, 16, 26] have been consideredtoo costly or complicated to be used in practical SMR solutions. However, two recent single-shot Byzantine consensuspapers, VABA [1] and later Dumbo [35], presented asynchronous solutions with (1) optimal resilience, (2) expectedconstant time complexity, and (3) optimal quadratic communication and optimal amortized linear communicationcomplexity (for the latter). In this paper, we follow this recent line of work and present

DAG-Rider : the first asynchronousBAB protocol with optimal resilience, optimal round complexity, and optimal amortized communication complexity. Inaddition, given a perfect shared coin abstraction, our protocol does not use signatures and does not rely on asymmetriccryptographic assumptions. Therefore, when using a deterministic threshold-based coin implementation with aninformation theoretical agreement guarantee [14, 34], the safety properties of our BAB protocol are post-quantumsecure.

Overview.

We construct

DAG-Rider in two layers: a communication layer and a zero-overhead ordering layer. In thecommunication layer, processes reliably broadcast their proposals with some meta-data that help them form a

DirectedAcyclic Graph (DAG) of the messages they deliver. That is, the DAG consists of rounds s.t. every process broadcasts atmost one message in every round and every message has 𝑂 ( 𝑛 ) references to messages in previous rounds, where 𝑛 isthe total number of processes. The ordering layer does not require any extra communication. Instead, processes observetheir local DAGs and with the help of a little randomization (one coin flip per 𝑂 ( 𝑛 ) decisions on values proposed bydifferent processes) locally order all the delivered messages in their local DAGs.A nice feature of DAG-Rider is that the propose operation is simply a single reliable broadcast. The agreementproperty of the reliable broadcast ensures that all correct processes eventually see the same DAG. Moreover, the Validity ∗ Part of the work done while at Novi Research. 1 a r X i v : . [ c s . D C ] F e b dit Keidar, Eleftherios Kokoris-Kogias, Oded Naor, and Alexander Spiegelman Communication Expected time Post-Quantum EventualComplexity Complexity Safety FairnessVABA SMR 𝑂 ( 𝑛 ) 𝑂 ( log ( 𝑛 )) no noDumbo SMR amortized 𝑂 ( 𝑛 ) 𝑂 ( log ( 𝑛 )) no noDAG-Rider + [11] amortized 𝑂 ( 𝑛 ) 𝑂 ( ) yes yesDAG-Rider + [25] amortized 𝑂 ( 𝑛 log ( 𝑛 )) 𝑂 (cid:16) log ( 𝑛 ) log ( log ( 𝑛 )) (cid:17) yes (1- 𝜖 )-fairDAG-Rider + [15] amortized O(n) 𝑂 ( ) yes yes Table 1. A comparison between our protocol with different reliable broadcast instantiations and VABA and Dumbo based SMRprotocols. property of the reliable broadcast guarantees that all broadcast messages by correct processes are eventually included inthe DAG. As a result, in contrast to the VABA and Dumbo protocols that retroactively ignore half the protocol messagesand commit one value out of 𝑂 ( 𝑛 ) proposals, DAG-Rider does not waste any of the messages and all proposed valuesby correct processes are eventually ordered (i.e., there is no need to re-propose). Complexity.

We measure time complexity as the asynchronous time [16] required to commit 𝑂 ( 𝑛 ) values proposedby different correct processes, and we measure communication complexity by the number of bits processes send tocommit a single value. To compare DAG-Rider to the state-of-the-art asynchronous Byzantine agreement protocols, weconsider SMR implementations that run an unbounded sequence of the VABA or Dumbo protocols to independentlyagree on every slot. To compare apples to apples in respect to our time complexity definition, we allow VABA andDumbo based SMRs to run up to 𝑛 slots concurrently. Note, however, that to satisfy external validity (e.g., no doublespend) processes must output the slot decisions in a sequential order (no gaps). Therefore, based on the proof in [6], thetime complexity of VABA and Dumbo based SMRs is 𝑂 ( log ( 𝑛 )) . Table 1 compares DAG-Rider to VABA and Dumbobased SMRs.Since our protocol uses a reliable broadcast abstraction as a basic building block, different instantiations yield differentcomplexity. For example, if we use the classic Bracha broadcast [11] to propose a single value in each message, we get acommunication complexity of 𝑂 ( 𝑛 ) per decision. This is because the Bracha broadcast complexity is 𝑂 ( 𝑛 ) , and inorder to form a DAG each message has to include an 𝑂 ( 𝑛 ) references to previous messages. If we are willing to allow aprobability 𝜖 to violate progress, then we can use Guerraoui et al.’s broadcast protocol [25] and reduce the complexityto 𝑂 ( 𝑛 log ( 𝑛 )) per decision.Now, just as Dumbo amortizes VABA’s communication complexity from quadratic to linear by using batchingand adding a phase of erasure coding to more economically distribute the data, we can amortize our communicationcomplexity to be linear per decision as well. First, since we are anyway including a vector of 𝑂 ( 𝑛 ) references in everybroadcast, batching 𝑂 ( 𝑛 ) proposals in each message shaves a factor of 𝑛 of the total communication complexity evenwith Bracha broadcast. To arrive at the optimal linear complexity, we can replace the reliable broadcast with theasynchronous verifiable information dispersal of Cachin and Tessaro [15]. The communication complexity of thatprotocol is 𝑂 ( 𝑛 log ( 𝑛 ) + 𝑛 | 𝑉 |) , where | 𝑉 | is the message size, which allows us to batch 𝑂 ( 𝑛 log ( 𝑛 )) proposals to achieveoptimal amortized communication complexity.A final feature of our protocol, which is sometimes underestimated and cannot be presented in a table, is elegance: (1)DAG-Rider’s modularity clearly separates the communication layer from the ordering logic; (2) the reliable broadcast ll You Need is DAG abstraction’s different instantiations yield protocols with different trade-offs, and; (3) the entire detailed pseudocode ofthe ordering logic spans less than 30 lines.The rest of this paper is structured as follows: §2 describes the model and the building blocks used for DAG-Rider; §3formally defines the BAB problem; §4 describes the DAG construction layer; §5 specifies the DAG-Rider protocol on topof the DAG layer; §6 proves the correctness of the protocol and analyzes its performance; §7 describes related work;and lastly, §8 concludes the paper. The system consists of a set Π = { 𝑝 , . . . , 𝑝 𝑛 } of 𝑛 processes, up to 𝑓 < 𝑛 / of which can act arbitrarily, i.e., be Byzantine .For simplicity, we consider a total of 𝑛 = 𝑓 + processes. The link between every two correct processes is reliable.Namely, when a correct process sends a message to another correct process, the message eventually arrives and therecipient can verify the sender’s identity. The communication is asynchronous, i.e., there is no bound on the messagedelivery time.We consider an adaptive adversary that can dynamically corrupt up to 𝑓 processes during the run. Once the adversarycorrupts a process, it can drop undelivered messages previously sent from that process to others. The adversary controlsthe arrival times of messages.As part of the construction, we use two building blocks: a reliable broadcast layer and a delayed global perfect coin,which we describe next. Reliable broadcast.

We use a reliable broadcast [12] abstraction: Each sender process 𝑝 𝑘 can send messages by calling r_bcast 𝑘 ( 𝑚, 𝑟 ) , where 𝑚 is a message, 𝑟 ∈ N is a round number. Every process 𝑝 𝑖 has an output deliver 𝑖 ( 𝑚, 𝑟, 𝑝 𝑘 ) , where 𝑚 is a message, 𝑟 is a round number, and 𝑝 𝑘 is the process that called the corresponding r_bcast 𝑘 ( 𝑚, 𝑟 ) . The reliablebroadcast abstraction guarantees the following properties: Agreement

If a correct processes 𝑝 𝑖 outputs deliver 𝑖 ( 𝑚, 𝑟, 𝑝 𝑘 ) , then every other correct process 𝑝 𝑗 eventually outputs deliver 𝑗 ( 𝑚, 𝑟, 𝑝 𝑘 ) . Integrity

For each round 𝑟 ∈ N and process 𝑝 𝑘 ∈ Π , a correct process 𝑝 𝑖 outputs deliver 𝑖 ( 𝑚, 𝑟, 𝑝 𝑘 ) at most once. Validity

If a correct process 𝑝 𝑘 calls r_bcast 𝑘 ( 𝑚, 𝑟 ) , then every correct processes 𝑝 𝑖 eventually outputs deliver 𝑖 ( 𝑚, 𝑟, 𝑘 ) .There are known algorithms such as Bracha broadcast [11] to realize the reliable broadcast abstraction in theasynchronous network model. There are also efficient gossip protocols [9, 10, 25, 27] that provide reliable broadcast whpat a sub-quadratic communication cost in the number of processes, and asynchronous verifiable information dispersalprotocols [15, 35] that use erasure codes to efficiently batch the broadcast values. Global perfect coin.

We use a global perfect coin , which is unpredictable by the adversary. An instance 𝑤 , 𝑤 ∈ N ,of the coin is invoked by process 𝑝 𝑖 ∈ Π by calling choose_leader 𝑖 ( 𝑤 ) . This call returns a process 𝑝 𝑗 ∈ Π , which isthe chosen leader for instance 𝑤 . Let 𝑋 𝑤 be the random variable that represents the probability that the coin returnsprocess 𝑝 𝑗 as the return value of the call choose_leader 𝑖 ( 𝑤 ) . The global perfect coin has the following guarantees: Agreement

If two correct processes call choose_leader 𝑖 ( 𝑤 ) and choose_leader 𝑗 ( 𝑤 ) with respective return values 𝑝 and 𝑝 , then 𝑝 = 𝑝 . Termination

If at least 𝑓 + processes call choose_leader ( 𝑤 ) , then every choose_leader ( 𝑤 ) call eventually returns. dit Keidar, Eleftherios Kokoris-Kogias, Oded Naor, and Alexander Spiegelman Unpredictability

As long as less than 𝑓 + processes call choose_leader ( 𝑤 ) , the return value is indistinguishablefrom a random value except with negligible probability 𝜖 . Namely, the probability 𝑝𝑟 that the adversary can guessthe returned process 𝑝 𝑗 of the call choose_leader ( 𝑤 ) is 𝑝𝑟 ≤ Pr [ 𝑋 𝑤 = 𝑝 𝑗 ] + 𝜖 . Fairness

The coin is fair, i.e., ∀ 𝑤 ∈ N , ∀ 𝑝 𝑗 ∈ Π : Pr [ 𝑋 𝑤 = 𝑝 𝑗 ] = / 𝑛 .Such coins were used as part of previous Byzantine Agreement protocols such as [1, 7, 14, 35]. Implementationexamples can be found in [14, 34]. One way to implement a global perfect coin is by using PKI and a threshold signaturescheme [8, 33, 42] with a threshold of ( 𝑓 + ) -of- 𝑛 . When a process invokes an instance 𝑤 of the coin, it signs 𝑤 withits private key and sends the share to all the processes. Once a process receives 𝑓 + shares, it can combine them to getthe threshold signature and hash it to get a random process. Since the threshold signature value is deterministicallydetermined by the instance name 𝑤 such that any 𝑓 + shares reveal it (e.g., the schema in [42] is based on Shamir’ssecret sharing [41]), the coin is perfect (all process agree on the leader) and its agreement property has informationtheoretical guarantee. However, to ensure unpredictability, the PKI must be trusted to ensure that the adversary cannotgenerate enough shares to reveal the randomness before a correct process produces them. Usually, one assumes thata trusted dealer is used to set up the random keys for all processes. However, this assumption can be relaxed byexecuting an 𝑂 ( 𝑛 ) message complexity Asynchronous Distributed Key Generation protocol [30]. Either way, thisscheme remains unpredictable only if the adversary is computationally bounded. However, since DAG-Rider relies onthe unpredictability property of the coin only for liveness, its safety properties are post-quantum secure. The problem we solve is

Byzantine Atomic Broadcast (BAB) , which allows processes to agree on a sequence of messagesas needed for State Machine Replication (SMR). To capture the practical settings of a system, we require externalvalidity [13], whereby a well-known predicate validate ( 𝑡𝑥 ) ∈ { True , False } returns true if and only if a transaction 𝑡𝑥 is eligible to be included in the SMR. For instance, in a blockchain system that is used to log payments betweenusers, a transaction is valid if there is no double-spending and the signature is correct. We say that a value 𝑣 is valid if validate ( 𝑣 ) returns true.Due to the FLP result [23], BAB cannot be solved deterministically in the asynchronous setting, and therefore we usethe global perfect coin to provide randomness that ensures liveness with probability 1.Atomic broadcast is usually defined in terms of broadcast and deliver events. To avoid confusion with the events ofthe underlying reliable broadcast abstraction, we rename those actions propose and decide . Definition 3.1 (Byzantine Atomic Broadcast) . Each correct process 𝑝 𝑖 ∈ Π can call propose 𝑖 ( 𝑣 ) and output decide 𝑖 ( ℓ, 𝑣 ) , ℓ ∈ N . We say that 𝑝 𝑖 proposes a value 𝑣 when propose 𝑖 ( 𝑣 ) is called, and decides 𝑣 in slot ℓ when decide 𝑖 ( ℓ, 𝑣 ) is output.An Atomic Broadcast protocol provides the following guarantees: Eventual Fairness

If a correct process proposes a valid value 𝑣 , then all correct processes eventually decide 𝑣 withprobability 1. External validity

If a correct process decides 𝑣 , then validate ( 𝑣 ) = True . Termination

If a correct process decides in slot ℓ , then all correct processes eventually decide in slot ℓ withprobability 1. Agreement

If two correct processes 𝑝 𝑖 , 𝑝 𝑗 decide values 𝑣 , 𝑣 , respectively in the same slot ℓ , then 𝑣 = 𝑣 . Integrity

For a single slot number ℓ , a correct process 𝑝 𝑖 decides at most one value 𝑣 . ll You Need is DAG v round 6 7Source:1234 5 v Fig. 1.

Illustration of the DAG structure at process 1.

Strong edges are illustrated as solid black arrows, the weak edge is illustratedas the dotted arrow.

Note that the Eventual Fairness property requires that if a correct process proposes a valid value 𝑣 , then eventually 𝑣 gets decided. In contrast, many Byzantine State Machine Replication (SMR) protocols [17, 37, 43] require that in aninfinite run, an infinite number of decisions are made, but not every proposed valid value has to be decided. Communication measurement.

For the communication analysis, we say that a value 𝑣 is ordered when all honestparties decide 𝑣 . We measure communication complexity as the total number of bits sent by honest processes to order asingle proposal. To be able to measure the asynchronous running time we follow [16] and define a time unit for everyexecution 𝑟 to be the maximum time delay of all messages among correct processes in 𝑟 . We measure time complexity as the expected number of time units it takes for a correct party to decide on 𝑂 ( 𝑛 ) values proposed by different correctprocesses starting from any point in the execution. Our BAB protocol, DAG-Rider, is based on a Directed Acyclic Graph (DAG) abstraction, which represents the commu-nication layer of the processes. In a nutshell, each vertex in the DAG represents a message from a process, and eachmessage contains, among other data, references to previously broadcast vertices. Those references are the edges ofthe DAG. All messages are reliably broadcast and each correct process maintains a copy of the DAG as it perceives it.Different correct processes might observe different states of the DAG during different times of the run, but the reliablebroadcast prevents equivocation and guarantees that all correct processes eventually deliver the same messages, sotheir views of the DAG eventually converge.For each process 𝑝 𝑖 , denote 𝑝 𝑖 ’s local view of the DAG as 𝐷𝐴𝐺 𝑖 , which is stored as an array 𝐷𝐴𝐺 𝑖 [] . As we shortlyexplain, each vertex in the DAG is associated with a unique round number and a source (its generating process). At anygiven time, 𝐷𝐴𝐺 𝑖 [ 𝑟 ] for 𝑟 ∈ N is the set of all the vertices associated with round 𝑟 that 𝑝 𝑖 is aware of. Each round hasat most 𝑛 vertices, each with a different source. Due to the reliable broadcast, no process can generate two vertices inthe same round.Each vertex 𝑣 in a round 𝑟 has two sets of outgoing edges: a set of at least 𝑓 + strong edges and a set of up to 𝑓 weak edges . Strong edges point to vertices in round 𝑟 − and weak edges point to vertices in rounds 𝑟 ′ < 𝑟 − such dit Keidar, Eleftherios Kokoris-Kogias, Oded Naor, and Alexander Spiegelman Algorithm 1

Data structures and basic utilities for process 𝑝 𝑖 Local variables: struct vertex 𝑣 : ⊲ The struct of a vertex in the DAG 𝑣. round - the round of 𝑣 in the DAG 𝑣. source - the process that broadcast 𝑣𝑣. block - a block of transactions 𝑣. strongEdges - a set of vertices in 𝑣. round − that represent strong edges 𝑣. weakEdges - a set of vertices in rounds < 𝑣. round − that represent weak edges 𝐷𝐴𝐺 [] - An array of sets of vertices, initially: 𝐷𝐴𝐺 𝑖 [ ] ← predefined hardcoded set of 𝑓 + vertices ∀ 𝑗 ≥ 𝐷𝐴𝐺 𝑖 [ 𝑗 ] ← {} blocksToPropose - A queue, initially empty, 𝑝 𝑖 enqueues valid blocks of transactions from clients procedure path ( 𝑣,𝑢 ) ⊲ Check if exists a path consisting of strong and weak edges in the DAG return exists a sequence of 𝑘 ∈ N , vertices 𝑣 , 𝑣 , . . . , 𝑣 𝑘 s.t. 𝑣 = 𝑣 , 𝑣 𝑘 = 𝑢 , and ∀ 𝑖 ∈ [ ..𝑘 ] : 𝑣 𝑖 ∈ (cid:208) 𝑟 ≥ 𝐷𝐴𝐺 𝑖 [ 𝑟 ] ∧ ( 𝑣 𝑖 ∈ 𝑣 𝑖 − . weakEdges ∪ 𝑣 𝑖 − . strongEdges ) procedure strong_path ( 𝑣,𝑢 ) ⊲ Check if exists a path consisting of only strong edges in the DAG return exists a sequence of 𝑘 ∈ N , vertices 𝑣 , 𝑣 , . . . , 𝑣 𝑘 s.t. 𝑣 = 𝑣 , 𝑣 𝑘 = 𝑢 , and ∀ 𝑖 ∈ [ ..𝑘 ] : 𝑣 𝑖 ∈ (cid:208) 𝑟 ≥ 𝐷𝐴𝐺 𝑖 [ 𝑟 ] ∧ 𝑣 𝑖 ∈ 𝑣 𝑖 − . strongEdges that otherwise there is no path from 𝑣 to them. As explained in detail in §5, strong edges are used for agreement andweak edges make sure we eventually include all vertices in the decision, to satisfy the Eventual Fairness property of theBAB problem.The data types and variables for process 𝑝 𝑖 are specified in Algorithm 1 and the DAG construction is specified inAlgorithm 2. A vertex 𝑣 is a struct that holds a round number 𝑟 , a source which is the process that created 𝑣 , a block ofvalid transactions that were previously proposed by the upper BAB protocol, strong edges to at least 𝑓 + vertices inround 𝑟 − , and weak edges to vertices in rounds 𝑟 ′ < 𝑟 − . Vertices in the DAG are reliably broadcast (Line 14), andwhen the reliable broadcast layer delivers a vertex 𝑣 (Line 20), it uses the round number 𝑟 and the source process whichare available from the reliable broadcast and adds them to 𝑣 , and then calls the is_valid_vertex procedure (Line 25) toverify that the transactions that 𝑣 holds are externally valid and that 𝑣 has strong edges to at least 𝑓 + vertices fromround 𝑟 − . If 𝑣 passes the check, it is added to a buffer.The process 𝑝 𝑖 continuously goes through the buffer to check if there is a vertex 𝑣 in it that can be added to its 𝐷𝐴𝐺 𝑖 (Line 6). A vertex 𝑣 can be added to the DAG once the DAG contains all the vertices that 𝑣 has a strong or weak edgeto (Line 7). When 𝑝 𝑖 has at least 𝑓 + vertices in the current round 𝑟 , it moves to the next round 𝑟 + (Line 9) bycreating and reliably broadcasting a new vertex 𝑣 ′ . The new vertex 𝑣 ′ includes a block of transactions for which 𝑝 𝑖 previously invoked propose (can be empty), strong edges to the vertices in 𝐷𝐴𝐺 𝑖 [ 𝑟 ] (Line 15), and weak edges to anyvertices with no path from 𝑣 ′ to them (Line 27). Note that a vertex might be delivered at 𝑝 𝑖 ’s DAG after 𝑝 𝑖 has moved toa later round. In this case, the vertex is still added to the DAG, but 𝑝 𝑖 ’s vertices do not include strong edges to it. Weakedges are possible. As noted, the weak edges are used to ensure the eventual fairness property of the BAB problem.An example of our DAG construction is illustrated in Fig. 1. The DAG in the example represents 𝐷𝐴𝐺 , i.e., theDAG at process 1, out of a total of four processes, numbered 1 to 4. On each horizontal dotted line are the verticesfrom a single source, e.g., the bottom line shows the vertices delivered from process 4. Each vertical column of verticesis a single round. Each completed round has at least 𝑓 + = vertices. Each vertex in the DAG has at least 𝑓 + strong edges to vertices from the previous round shown as black solid arrows. Each vertex can also have weak edges to ll You Need is DAG Algorithm 2 DAG Construction , pseudocode for process 𝑝 𝑖 Local variables: 𝑟 ← ⊲ round number buffer ← {} while True do for 𝑣 ∈ buffer : 𝑣. round ≤ 𝑟 do if ∀ 𝑣 ′ ∈ 𝑣. strongEdges ∪ 𝑣. weakEdges : 𝑣 ′ ∈ (cid:208) 𝑘 ≥ 𝐷𝐴𝐺 [ 𝑘 ] then ⊲ We have 𝑣 ’s predecessors 𝐷𝐴𝐺 [ 𝑣. round ] ← 𝐷𝐴𝐺 [ 𝑣. round ] ∪ { 𝑣 } if | 𝐷𝐴𝐺 [ 𝑟 ] | ≥ 𝑓 + then ⊲ Start a new round if 𝑟 mod 4 = then ⊲ If a new wave is complete wave_ready ( 𝑟 / ) ⊲ Signal to Algorithm 3 that a new wave is complete 𝑟 ← 𝑟 + 𝑣 ← create_new_vertex ( 𝑟 ) r_bcast 𝑖 ( 𝑣, 𝑟 ) procedure create_new_vertex (round) 𝑣. block ← blocksToPropose . dequeue () ⊲ Returns ⊥ if the queue is empty. Proposed blocks are enqueued (Line 32) 𝑣. strongEdges ← 𝐷𝐴𝐺 [ round − ] set_weak_edges ( 𝑣, round ) return 𝑣 upon deliver 𝑖 ( 𝑣, round , 𝑝 𝑘 ) do ⊲ The deliver output from the reliable broadcast 𝑣. source ← 𝑝 𝑘 𝑣. round ← round if is_valid_vertex ( 𝑣 ) then buffer ← buffer ∪ { 𝑣 } procedure is_valid_vertex ( 𝑣 ) return | 𝑣. strongEdges | ≥ 𝑓 + ∧ ∀ 𝑡𝑥 ∈ 𝑣. block : validate ( 𝑡𝑥 ) ⊲ validate ( 𝑡𝑥 ) checks external validity procedure set_weak_edges ( 𝑣, round ) ⊲ Add weak edges to orphan vertices 𝑣. weakEdges ← {} for 𝑟 = round − down to 1 do for every 𝑢 ∈ 𝐷𝐴𝐺 𝑖 [ 𝑟 ] s.t. ¬ path ( 𝑣,𝑢 ) do 𝑣. weakEdges ← 𝑣. weakEdges ∪ { 𝑢 } vertices in case there is no other path in the DAG to the vertex. E.g., 𝑣 in the illustration has a weak edge to 𝑣 , shownas a dotted arrow to 𝑣 . In this section, we describe the DAG-Rider protocol, by equipping the DAG from the previous section with a globalperfect coin and show how the DAG and the coin can be used to construct a locally-computed protocol for the BABproblem. That is, given our DAG and a perfect coin, DAG-Rider does not require any extra communication amongthe processes. Instead, each process 𝑝 𝑖 observes its local 𝐷𝐴𝐺 𝑖 and deduces which blocks of transactions to decideand in what order. The protocol is detailed in Algorithm 3. Below we give a high-level intuition as well as a detaileddescription of the protocol and in §6 we analyze complexity and prove the correctness of the full protocol.When a propose 𝑖 ( 𝑣 ) is invoked, 𝑝 𝑖 simply pushes 𝑣 to the DAG layer (line 33), which in turn includes it in a vertex itreliably broadcasts. To interpret the DAG, each process 𝑝 𝑖 divides its local 𝐷𝐴𝐺 𝑖 into waves, where each wave consists A possible implementation of the coin using threshold signatures is described in §2. The coin can be easily implemented as part of the DAG itself byhaving each process send its share of the threshold signature when reliably broadcasting a vertex.7 dit Keidar, Eleftherios Kokoris-Kogias, Oded Naor, and Alexander Spiegelman

Algorithm 3 DAG-Rider: Byzantine Atomic Broadcast based on DAG.

Pseudocode for process 𝑝 𝑖 Local Variables: slot ← decidedWave ← decidedVertices ← {} leadersStack ← initialize empty stack with isEmpty(), push(), and pop() functions upon propose 𝑖 ( 𝑏 ) do ⊲ The propose call of the BAB problem blocksToPropose . enqueue ( 𝑏 ) ⊲ pushes a block of transactions to Alg 2 upon wave_ready ( 𝑤 ) do ⊲ Signal from the DAG layer that a new wave is completed (Line 11) 𝑣 ← get_wave_vertex_leader ( 𝑤 ) if 𝑣 = ⊥ ∨ |{ 𝑣 ′ ∈ 𝐷𝐴𝐺 𝑖 [ round ( 𝑤, ) ] : strong_path ( 𝑣 ′ , 𝑣 ) }| < 𝑓 + then ⊲ No commit return leadersStack . push ( 𝑣 ) for wave 𝑤 ′ from 𝑤 − down to decidedWave + do 𝑣 ′ ← get_wave_vertex_leader ( 𝑤 ′ ) if 𝑣 ′ ≠ ⊥ ∧ strong_path ( 𝑣, 𝑣 ′ ) then leadersStack . push ( 𝑣 ′ ) 𝑣 ← 𝑣 ′ decidedWave ← 𝑤 order_vertices ( leadersStack ) procedure get_wave_vertex_leader ( 𝑤 ) 𝑝 𝑗 ← choose_leader 𝑖 ( 𝑤 ) if ∃ 𝑣 ∈ 𝐷𝐴𝐺 [ round ( 𝑤, ) ] s.t. 𝑣.𝑠𝑜𝑢𝑟𝑐𝑒 = 𝑝 𝑗 then return 𝑣 ⊲ There can only be one such vertex return ⊥ procedure order_vertices ( leadersStack ) while ¬ leadersStack . isEmpty () do 𝑣 ← leadersStack . pop () verticesToDecide ← { 𝑣 ′ ∈ (cid:208) 𝑟 > 𝐷𝐴𝐺 𝑖 [ 𝑟 ] | 𝑝𝑎𝑡ℎ ( 𝑣, 𝑣 ′ ) ∧ 𝑣 ′ ∉ decidedVertices } for every 𝑣 ′ ∈ verticesToDecide in some deterministic order do decide 𝑖 ( slot , 𝑣 ′ . block ) slot ← slot + decidedVertices ← decidedVertices ∪ { 𝑣 ′ } of 4 consecutive rounds. For example, 𝑝 𝑖 ’s first wave consists of 𝐷𝐴𝐺 𝑖 [ ] , 𝐷𝐴𝐺 𝑖 [ ] , 𝐷𝐴𝐺 𝑖 [ ] , and 𝐷𝐴𝐺 𝑖 [ ] . Formally,the 𝑘 -th round of wave 𝑤 , where 𝑘 ∈ [ .. ] , 𝑤 ∈ N , is defined as round ( 𝑤, 𝑘 ) ≜ ( 𝑤 − ) + 𝑘 . We also say that aprocess 𝑝 𝑖 completes round 𝑟 once 𝐷𝐴𝐺 𝑖 [ 𝑟 ] has at least 𝑓 + vertices, and a process completes wave 𝑤 once the processcompletes round ( 𝑤, ) .In a nutshell, the idea is to interpret the DAG as a wave-by-wave protocol and try to commit a randomly chosensingle leader vertex in every wave. Once the sequence of leaders is determined, processes decide on all the blocksincluded in their causal histories. While reading the high-level description below, bear in mind that due to the reliablebroadcast, Byzantine processes cannot equivocate, so two correct processes cannot have different vertices with thesame source in the same round, leading to eventually consistent DAGs among all correct processes.When wave 𝑤 completes (Line 34), we use the global perfect coin to retrospectively elect some process and considerits vertex in the wave’s first round as the leader of wave 𝑤 (Line 35). The goal of the protocol is to commit this leader,provided that it has been observed by sufficiently many processes in the wave. Note that since we advance rounds assoon as we deliver 𝑓 + of the 𝑓 + potential vertices, a process 𝑝 𝑖 might not have 𝑤 ’s leader in its local 𝐷𝐴𝐺 𝑖 when ll You Need is DAG round 7 8Source:1234 6 95 wave 2 v

10 11 12 wave 3 v Fig. 2.

Illustration of

DAG i . The highlighted vertices 𝑣 and 𝑣 are the leaders of waves 2 and 3, respectively. The commit rule isnot met in wave 2 since there are less than 𝑓 + vertices in round 8 with a strong path to 𝑣 . However, the commit rule is met inwave 3 since there are 𝑓 + vertices in round 12 with a strong path to 𝑣 . Since there is a strong path from 𝑣 to 𝑣 (highlighted), 𝑝 𝑖 commits 𝑣 before 𝑣 in wave 3. it completes 𝑤 . In this case, 𝑝 𝑖 completes 𝑤 without committing any vertex and simply proceeds to the next wave. Note,however, that some other correct process might have 𝑤 ’s leader in its local DAG and commit it in the same wave. Thus,we need to make sure that if one correct process commits the wave vertex leader 𝑣 , then all the other correct processeswill commit 𝑣 later. To this end, we use standard quorum intersection. Process 𝑝 𝑖 commits the wave 𝑤 vertex leader 𝑣 if: (cid:12)(cid:12)(cid:8) 𝑣 ′ ∈ 𝐷𝐴𝐺 𝑖 [ round ( 𝑤, )] : strong_path ( 𝑣 ′ , 𝑣 ) (cid:9)(cid:12)(cid:12) ≥ 𝑓 + (Line 36) . In addition, if 𝑝 𝑖 commits vertex 𝑣 in wave 𝑤 and there is a strong path from 𝑣 to 𝑣 ′ such that 𝑣 ′ is an uncommittedleader vertex in a wave 𝑤 ′ < 𝑤 , then 𝑝 𝑖 commits 𝑣 ′ in 𝑤 as well. The leaders committed in the same wave are orderedby their round numbers, so that leaders of earlier waves are ordered before those of later ones, meaning 𝑣 ′ is orderedbefore 𝑣 (Lines 39-43).The next lemma, which we formally prove in §6, shows that our commit rule guarantees that if a correct processcommits a wave leader vertex 𝑣 in some wave, then all wave vertex leaders in later waves in the local DAGs of allcorrect processes have a strong path to 𝑣 , ensuring the agreement property. Lemma 1.

If some process 𝑝 𝑖 commits the leader vertex 𝑣 of a wave 𝑤 , then for every leader vertex 𝑢 of a wave 𝑤 ′ > 𝑤 and for every process 𝑝 𝑗 , if 𝑢 ∈ 𝐷𝐴𝐺 𝑗 [ round ( 𝑤 ′ , )] , then strong_path ( 𝑢, 𝑣 ) returns true in wave 𝑤 ′ . We show below how we leverage the above lemma to satisfy the agreement property, but first, we give an intuitionfor liveness, i.e., the Eventual Fairness and Termination properties. Our protocol achieves progress in a constant numberof waves, in expectation, by guaranteeing that for every wave, the probability for every correct process to committhe wave leader is at least / . To ensure this, we borrow the technique from the common-core abstraction [2], whichguarantees that after three rounds of all-to-all sending and collecting accumulated sets of values, all correct processeshave at least 𝑓 + common values. The set of these values is referred to as the common-core. In respect to our DAG,we prove in §6 the following lemma: dit Keidar, Eleftherios Kokoris-Kogias, Oded Naor, and Alexander Spiegelman Lemma 2.

Let 𝑝 𝑖 be a correct process that completes wave 𝑤 . Then there is a set 𝑉 ⊆ 𝐷𝐴𝐺 𝑖 [ round ( 𝑤, )] and a set 𝑈 ⊆ 𝐷𝐴𝐺 𝑖 [ round ( 𝑤, )] s.t. | 𝑉 | ≥ 𝑓 + , | 𝑈 | ≥ 𝑓 + and ∀ 𝑣 ∈ 𝑉 , ∀ 𝑢 ∈ 𝑈 : strong_path ( 𝑢, 𝑣 ) . Note that by the commit rule, if the leader of a wave 𝑤 belongs to the set 𝑉 (from the lemma statement), then 𝑝 𝑖 commits the leader once it completes 𝑤 . So to deal with an adversary that totally controls the network, parties flip theglobal coin only after they complete 𝑤 (Line 35). Therefore, by the coin’s unpredictability property, the probability ofthe adversary to guess the wave’s leader before the point after which it cannot manipulate the set 𝑉 is less than 𝑛 + 𝜖 .Thus, with a probability of at least / − 𝜖 , 𝑤 ’s leader is in the set 𝑉 and 𝑝 𝑖 commits it. Thus, in expectation, correctprocesses commit every / waves.To satisfy agreement, we leverage the property proven in Lemma 1 to make sure all processes commit the samewaves’ leaders. Once we find a leader to commit in a wave 𝑤 we check if it is possible that some process committed awave in between 𝑤 and the previous wave we committed, let it be 𝑤 ′ . We do this iteratively in Lines 39-43, we firstcheck if it is possible that some process committed the leader of 𝑤 − . We do it by checking if there is a strong pathfrom the leader of wave 𝑤 to the leader of wave 𝑤 − in our local DAG (Line 41). If no such path exists, by Lemma 1,no correct process will ever commit 𝑤 − . Otherwise, we choose to commit 𝑤 − before 𝑤 . Now, if such a path indeedexists, we recursively check if it is possible that some process committed a wave in between 𝑤 − and 𝑤 ′ . Otherwise, ifno such path exists, we check if there is a path from the leader of wave 𝑤 to the leader of wave 𝑤 − and continuein the same way. The recursion ends once we reach a wave that we previously committed, 𝑤 ′ in our example. Anillustration of this process is given in Fig. 2.Since vertices are reliably broadcast and since we never add a vertex 𝑣 to the DAG before we add all the vertices 𝑣 points to with strong or weak edges, two correct processes always have the same causal history for any vertexthey both have in their DAGs. Therefore, once we agree on a sequence of leaders, all that is left to do is to order thecausal histories of the leaders in some deterministic order. To this end, we go through the waves’ committed leadersone-by-one and decide, in some deterministic order, on all the transaction blocks in their causal histories that we didnot previously decide on (Procedure order_vertices in Line 51). The causal history of a wave leader vertex 𝑣 in 𝐷𝐴𝐺 𝑖 isthe set { 𝑢 ∈ 𝐷𝐴𝐺 𝑖 | path ( 𝑣, 𝑢 )} .The purpose of the weak edges is to satisfy the Eventual Fairness property. Recall that strong edges might not pointto all vertices from the previous round in the DAG because we might advance the round before we deliver all thebroadcasts of that round (we advance the round once at least 𝑓 + vertices are added to the DAG). Therefore, withoutthe weak edges, slow processes may not be able to get vertices from higher rounds to point to theirs. So to satisfyEventual Fairness, each correct process, when creating a new vertex, adds weak edges to all vertices in its local DAG towhich it otherwise does not point. In §6.1 we prove the correctness of DAG-Rider, and in §6.2 we analyze the communication the time complexity.

We show that DAG-Rider achieves the properties of the BAB problem, as defined in §3.

Lemma 3.

DAG-Rider achieves the integrity property of the BAB problem. ll You Need is DAG Proof. When a correct process 𝑝 𝑖 decides on a block in a slot number ℓ (Line 56) it increases by one the slot variablein the next line, so that next time 𝑝 𝑖 decides on a block, it will be in slot number ℓ + , ensuring that 𝑝 𝑖 does not decideon more than one block in a single slot number. □ Lemma 4.

DAG-Rider achieves the external validity property of the BAB problem.

Proof. A vertex 𝑣 is added to the 𝐷𝐴𝐺 𝑖 of process 𝑝 𝑖 only after 𝑝 𝑖 checks if all transactions in 𝑣. block pass theexternal validity predicate (Line 25). Since 𝑝 𝑖 decides only on blocks in vertices that are in its DAG, then any decisionmade is on an externally valid block of transactions. □ Claim 1.

When a correct process 𝑝 𝑖 adds a vertex 𝑣 to its 𝐷𝐴𝐺 𝑖 (Line 8), all of 𝑣 ’s causal history is already in 𝐷𝐴𝐺 𝑖 . Proof. We prove this claim by induction on the execution of every correct process 𝑝 𝑖 . Denote by 𝑣 𝑘 the 𝑘 -th vertexthat 𝑝 𝑖 adds to 𝐷𝐴𝐺 𝑖 . We show that for every 𝑘 ∈ N , after 𝑣 𝑘 is added to the DAG, the causal histories of all the verticesin the set { 𝑣 , . . . , 𝑣 𝑘 } , and in particular 𝑣 𝑘 , are in 𝐷𝐴𝐺 𝑖 .In the base step of the induction, there are no vertices in the DAG, and the property vacuously holds. Next, assumethat after 𝑣 𝑘 is added to the DAG at process 𝑝 𝑖 , all the causal histories of all the vertices in the set 𝑉 = { 𝑣 , . . . , 𝑣 𝑘 } arealready in 𝐷𝐴𝐺 𝑖 .For 𝑣 𝑘 + to be added to the DAG at process 𝑝 𝑖 , its strong and weak edges must reference vertices that are already in 𝐷𝐴𝐺 𝑖 (Line 7), i.e., 𝑣 𝑘 + ’s edges are only to vertices in 𝑉 . Since all the vertices in 𝑉 already have their causal historiesin the DAG, when 𝑣 𝑘 + is added to the DAG, its causal history is in the DAG as well, and we are done. □ Claim 2.

If a correct process 𝑝 𝑖 adds a vertex 𝑣 to its 𝐷𝐴𝐺 𝑖 , then eventually all correct processes add 𝑣 to their DAG. Proof. By induction on rounds, for process 𝑝 𝑖 to add a vertex 𝑣 in round 𝑟 to its 𝐷𝐴𝐺 𝑖 , first 𝑣 needs to be deliveredto 𝑝 𝑖 by the reliable broadcast layer (Line 20), and by the agreement of the reliable broadcast, 𝑣 will be eventuallydelivered to all other correct processes.Next, 𝑣 has to be added to the buffer variable at 𝑝 𝑖 , and this is done if the process who broadcast 𝑣 added the correct 𝑣. source and 𝑣. round which are verified through the guarantees of the reliable broadcast layer (Line 23). Therefore thesechecks will also pass at any other correct process when 𝑣 is delivered to it. Vertex 𝑣 also has to pass the isValidVertex procedure (Line 25), which ensures that the proposed block of transactions passes the external validity condition, aswell as that the block references at least 𝑓 + vertices from round 𝑣. round − . If 𝑣 passes the two checks in 𝑝 𝑖 then itwill pass these two checks at any other correct process, since these checks are computed locally based on 𝑣 ’s fields( 𝑣. block and 𝑣. strongEdges ).Lastly, after 𝑣 is added to the buffer , for 𝑝 𝑖 to add 𝑣 to its 𝐷𝐴𝐺 𝑖 , 𝑝 𝑖 also checks that it has all the vertices that 𝑣 is referencing to (in 𝑉 = 𝑣. strongEdges ∪ 𝑣. weakEdges ) in its 𝐷𝐴𝐺 𝑖 as well (Line 7). By the induction assumption, allcorrect processes’ DAGs contain the same vertices in rounds < 𝑟 .Thus, this ensures that any vertex 𝑣 that appears in any round at 𝐷𝐴𝐺 𝑖 of some correct process, will eventually alsoappear in 𝐷𝐴𝐺 𝑗 of every other correct process 𝑝 𝑗 . □ Claim 3.

If for some correct process 𝑝 𝑖 there is a round 𝑟 with a set 𝑉 of at least 𝑓 + vertices in 𝐷𝐴𝐺 𝑖 [ 𝑟 ] s.t. ∀ 𝑣 ∈ 𝑉 : strong_path ( 𝑣, 𝑢 ) to some vertex 𝑢 ∈ 𝐷𝐴𝐺 𝑖 , then every other process 𝑝 𝑗 that completes round 𝑟 has a set 𝑉 ′ ⊆ 𝐷𝐴𝐺 𝑗 [ 𝑟 ] s.t. | 𝑉 ′ | ≥ 𝑓 + and ∀ 𝑣 ′ ∈ 𝑉 ′ : strong_path ( 𝑣 ′ , 𝑢 ) . Proof. Let 𝑉 ′ = 𝑉 ∩ 𝐷𝐴𝐺 𝑗 [ 𝑟 ] . Round 𝑟 is complete for 𝑝 𝑖 and 𝑝 𝑗 when their DAGs have at least 𝑓 + vertices.Therefore, when 𝑝 𝑖 and 𝑝 𝑗 complete round 𝑟 , | 𝑉 ′ | ≥ 𝑓 + by a standard quorum intersection of 𝑓 + out of 𝑓 + dit Keidar, Eleftherios Kokoris-Kogias, Oded Naor, and Alexander Spiegelman possible vertices of round 𝑟 (due to the reliable broadcast, Byzantine processes cannot equivocate). Since every 𝑣 ′ ∈ 𝑉 ′ is already in 𝐷𝐴𝐺 𝑗 when 𝑝 𝑗 completes round 𝑟 , then 𝑢 is in 𝐷𝐴𝐺 𝑗 by 𝑡 as well (by Claim 1), and there is a strong pathbetween every 𝑣 ′ ∈ 𝑉 ′ to 𝑢 in 𝐷𝐴𝐺 𝑗 . □ For the next part, we say a process commits a wave leader vertex 𝑣 when 𝑣 is popped from the stack in Line 53. Claim 4.

In every wave, at most one vertex 𝑣 can be a wave leader vertex for all correct processes. Proof. For a vertex 𝑣 to be a wave leader vertex in wave 𝑤 it has to be the return value from the get_wave_vertex_leader procedure (Line 46). The procedure gets the wave’s chosen process 𝑝 𝑗 by the global coin,and checks if the 𝐷𝐴𝐺 𝑖 at process 𝑝 𝑖 has the vertex 𝑣 from 𝑝 𝑗 in the first round of wave 𝑤 . Due to the agreementproperty of the global perfect coin, the same process 𝑝 𝑗 is chosen for all correct processes, and because of the agreementproperty of the reliable broadcast, Byzantine processes cannot equivocate. □ Claim 5.

If a correct process 𝑝 𝑖 commits wave leader vertex 𝑣 in wave 𝑤 and after that 𝑝 𝑖 commits vertex 𝑣 in wave 𝑤 , then 𝑤 < 𝑤 . Proof. A vertex is committed when it is popped from the stack (Line 53). Vertices are pushed to the stack in Lines 38and 42, which only happens in waves which vertices were not committed before, since the for loop goes down only to decidedWave + (Line 39), where decidedWave is updated each time vertices are pushed to the stack to be the maximumwave in which vertices were committed (Line 44). This means that vertices are pushed to the stack in decreasing wavenumbers.Lastly, all the vertices in the stack are popped out and committed, and this is done in reverse order to the order thatthey were pushed to the stack, therefore, the wave numbers of committed waves are in an increasing order. □ Lemma 5.

If some process 𝑝 𝑖 commits the leader vertex 𝑣 of a wave 𝑤 , then for every leader vertex 𝑢 of a wave 𝑤 ′ > 𝑤 and for every process 𝑝 𝑗 , if 𝑢 ∈ 𝐷𝐴𝐺 𝑗 [ round ( 𝑤 ′ , )] , then strong_path ( 𝑢, 𝑣 ) returns true in wave 𝑤 ′ . Proof. Since vertex 𝑣 is committed by process 𝑝 𝑖 in wave 𝑤 , the commit rule is met, i.e., at the end of wave 𝑤 thereare at least 𝑓 + vertices in 𝐷𝐴𝐺 𝑖 [ round ( 𝑤, )] with a strong path to 𝑣 . By Claim 3, every correct process 𝑝 𝑗 (whetherit committs 𝑣 in 𝑤 or not) has a set 𝑉 of at least 𝑓 + vertices in 𝐷𝐴𝐺 𝑗 [ round ( 𝑤, )] with a strong path to 𝑣 . Any futurevertex 𝑣 ′ from waves 𝑤 ′ > 𝑤 , including 𝑢 , will have a strong path to at least one vertex in 𝑉 , resulting in a strong pathbetween 𝑢 and 𝑣 . This does not matter if vertex 𝑣 ′ is created by a Byzantine process or not, since Byzantine processescannot equivocate due to the reliable broadcast guarantees. □ Lemma 6.

Algorithm 3 satisfies the agreement property of the BAB problem.

Proof. By Claim 4, each wave has only one vertex that can be committed. By Claim 5 every correct process commitsvertices in an increasing wave number. By Lemma 1, if a correct process 𝑝 𝑖 commits a vertex 𝑣 , then there is a strongpath to 𝑣 from any vertex 𝑢 in future waves that might be committed. By combining all the claims, if two correctprocesses commit the same wave leader vertices, they do so in the same order.Once a correct process commits a wave vertex leader 𝑣 , it decides on all of 𝑣 ’s causal history in some deterministicorder, which is identical for all other correct processes. By Claim 1, when 𝑣 is committed, all of 𝑣 ’s causal history isalready in the DAG. Thus, since all correct processes commit the same wave leader vertices in the same order, and sincethose vertices have the same causal histories, all correct processes that decide in some slot, decide the same value. □ ll You Need is DAG Lemma 7.

Let 𝑝 𝑖 be a correct process that completes wave 𝑤 . Then there is a set 𝑉 ⊆ 𝐷𝐴𝐺 𝑖 [ round ( 𝑤, )] and a set 𝑈 ⊆ 𝐷𝐴𝐺 𝑖 [ round ( 𝑤, )] s.t. | 𝑉 | ≥ 𝑓 + , | 𝑈 | ≥ 𝑓 + and ∀ 𝑣 ∈ 𝑉 , ∀ 𝑢 ∈ 𝑈 : strong_path ( 𝑢, 𝑣 ) . Proof. First, we show that there is a set 𝑉 , | 𝑉 | ≥ 𝑓 + s.t. when 𝑝 𝑖 completes round ( 𝑤, ) and broadcasts a newvertex 𝑣 in round ( 𝑤, ) , then 𝑣 has a strong path to all the vertices in 𝑉 .To this end, we use the common-core abstraction, that first appeared in [2], and was adapted (and proven) for theByzantine case in [20]. The model for this abstraction is identical to our model. Each correct process 𝑝 𝑖 has some inputvalue 𝑣 𝑖 , and it returns a set 𝑉 𝑖 of input values from different processes. The guarantee of the common-core abstractionis that there is a subset 𝑉 of at least 𝑓 + values, s.t. for each correct process 𝑉 ⊆ 𝑉 𝑖 , i.e., there is a common core of atleast 𝑓 + input values that appear in the returned sets of all the correct processes that complete the common-coreabstraction.The algorithm to realize the common-core abstraction consists of three rounds of communication: in the first round,each process sends its input value 𝑣 𝑖 , and then waits for 𝑓 + input values from other processes (including itself).Denote this first set at process 𝑝 𝑖 as 𝐹 𝑖 .In the second stage, each process sends its 𝐹 𝑖 set and waits until it receives 𝑓 + 𝐹 𝑗 sets from other processes(including itself). When this stage ends, process 𝑝 𝑖 creates the union of all the 𝐹 𝑗 sets it received. Denote this set of setsfor process 𝑝 𝑖 as 𝑆 𝑖 . In the third and last stage, process 𝑝 𝑖 sends the set 𝑆 𝑖 it created and again waits to receive 𝑓 + 𝑆 𝑗 sets from other processes (including itself). When this stage ends, process 𝑝 𝑖 returns the union of all the 𝑆 𝑗 sets,denoted 𝑇 𝑖 , as the output of the common-core abstraction.We show that the first three rounds of a wave 𝑤 can be mapped exactly to the three stages of the common-core algorithm. Denote 𝑟 , 𝑟 , 𝑟 , 𝑟 as round ( 𝑤, ) , round ( 𝑤, ) , round ( 𝑤, ) , round ( 𝑤, ) , respectively. When a correctprocess 𝑝 𝑖 adds the vertex 𝑣 created in 𝑟 to 𝐷𝐴𝐺 𝑖 [ 𝑟 ] , by Claim 2, eventually all other correct processes add 𝑣 to theirDAG, which can be mapped to 𝑝 𝑖 sending its input value to all other processes in the common-core algorithm. Next, 𝑝 𝑖 moves to round 𝑟 once it has at least 𝑓 + vertices in 𝑟 , which is mapped to 𝑝 𝑖 waiting for 𝑓 + input valuesfrom different processes in the common-core algorithm. When 𝑝 𝑖 enters 𝑟 it broadcasts a vertex 𝑣 that references allthe vertices it has in 𝑟 , which is equivalent to 𝑝 𝑖 sending 𝐹 𝑖 at the beginning of the second stage of the common-corealgorithm. In a similar way, when 𝑝 𝑖 completes 𝑟 and enters 𝑟 , it broadcasts 𝑣 which references all the vertices it hasin 𝑟 , which is equivalent to sending 𝑆 𝑖 (by Claim 1, when 𝑣 is added to 𝐷𝐴𝐺 𝑗 [ 𝑟 ] of some correct process 𝑝 𝑗 , then allthe vertices 𝑝 𝑖 has in 𝐷𝐴𝐺 𝑖 [ 𝑟 ] with a strong path from 𝑣 are in the 𝐷𝐴𝐺 𝑗 [ 𝑟 ] as well). To complete the mapping,when 𝑝 𝑖 completes 𝑟 and broadcasts 𝑣 in round 𝑟 , then 𝑣 has in its causal history the same values that would havebeen in 𝑇 𝑖 in the equivalent common-core algorithm.Note that since Byzantine processes cannot equivocate, and since every round in the DAG has at least 𝑓 + vertices,any vertex that 𝑝 𝑖 adds to 𝐷𝐴𝐺 𝑖 [ 𝑟 ] has to reference at least 𝑓 + vertices that 𝑣 also references, even vertices sentfrom Byzantine processes. Thus, based on the common-core guarantee, there is a set 𝑉 ⊂ 𝐷𝐴𝐺 𝑖 [ 𝑟 ] s.t. | 𝑉 | ≥ 𝑓 + and ∀ 𝑣 ∈ 𝑉 : strong_path ( 𝑣 , 𝑣 ) , and also this set 𝑉 appears in the DAG of any other correct process 𝑝 𝑗 that completesround 𝑟 . Next, when 𝑝 𝑖 completes wave 𝑤 , i.e., when it completes round 𝑟 , it has in 𝐷𝐴𝐺 𝑖 [ 𝑟 ] at least 𝑓 + vertices,and each of those vertices has a path to each of the vertices in 𝑉 , which concludes the proof. □ Claim 6.

For every correct process 𝑝 𝑖 and for every wave 𝑤 , the expected number of waves, starting from 𝑤 , until thecommit rule is met is equal to or smaller than / + 𝜖 . dit Keidar, Eleftherios Kokoris-Kogias, Oded Naor, and Alexander Spiegelman Proof. By Lemma 2, in each wave 𝑤 , the probability that for a correct process 𝑝 𝑖 the commit rule is met is at least 𝑝𝑟 = ( 𝑓 + )/( 𝑓 + ) − 𝜖 . The number of waves until the commit rule is met is geometrically distributed with asuccess probability of 𝑝𝑟 . Thus, the expected number of waves is bounded by / + 𝜖 waves. □ Lemma 8.

DAG-Rider guarantees the termination property of the BAB problem.

Proof. If a correct process 𝑝 𝑖 decides a value in slot ℓ it means there is some value in the block of some vertex 𝑢 that is in the causal history of some wave 𝑤 leader vertex 𝑣 that is decided in that slot. By Claim 6, every other correctprocess 𝑝 𝑗 that has not committed 𝑣 yet will eventually, with probability 1, have a wave 𝑤 ′ > 𝑤 in which the commitrule is met. When 𝑝 𝑗 commits 𝑤 ′ , by the proved agreement property, it will also commit 𝑣 , and thus decide on all of 𝑣 ’scausal history in the same slots, including vertex 𝑢 in slot ℓ . □ Claim 7.

Every vertex that is broadcast by a correct process is eventually added to the DAG of all correct processes.

Proof. We prove this by showing that for every correct process 𝑝 𝑖 that broadcasts a vertex 𝑣 , 𝑣 is eventually addedto 𝐷𝐴𝐺 𝑖 , and by Claim 2, 𝑣 is eventually added to the DAG of all other correct processes.When a correct process 𝑝 𝑖 broadcasts a vertex 𝑣 (Line 14) it broadcasts a valid vertex, i.e., a vertex that passes theexternal validity check, and that references vertices that are already in 𝐷𝐴𝐺 𝑖 . Because of the validity property of thereliable broadcast, 𝑜 𝑖 eventually delivers 𝑣 to itself, and when it does so, it adds 𝑣 to its own 𝐷𝐴𝐺 𝑖 . Thus, as explained,by Claim 2, 𝑣 is eventually added to the DAGs of all other correct processes. □ Lemma 9.

DAG-Rider guarantees the eventual fairness property of the BAB problem.

Proof. When a correct process 𝑝 𝑖 proposes a value, it is inserted into a queue (Line 33), and eventually will beincluded in a vertex 𝑣 created by 𝑝 𝑖 (Line 16). Vertex 𝑣 is eventually delivered to all the correct processes and added totheir DAGs (Claim 7).When a correct process broadcasts a new vertex 𝑣 in round 𝑟 it also makes sure that it has a path (either a strong pathor path that includes weak edges) to all the vertices in rounds 𝑟 ′ < 𝑟 , and if not, it adds weak edges to 𝑣 that guaranteethis (Line 27), therefore 𝑣 will eventually be included in the causal history of all correct processes. By Lemma 7, withprobability 1, eventually 𝑣 will be in the causal history of a committed wave vertex leader, and therefore decided. □ We proved that DAG-Rider achieves all the properties of the BAB problem as described in §3.

We analyze DAG-Rider in terms of expected communication complexity and expected time complexity.

Communication complexity.

We analyze the communication complexity of DAG-Rider when instantiated with Cachinand Tesero’s [15] information dispersal protocol. A similar analysis can be made for other broadcast implementationsas well. For clarity, in §4, we say that strongEdges and weakEdges are sets of vertices. However, in order to refer to avertex it is enough to only store its source and round number. We assume that any round number during an executioncan be expressed in a constant number of bits, that is, the DAG never reaches round number (note that roundnumbers grow slower than slot numbers).We count the number of bits sent by correct processes in every round of the DAG and divide it by the total numberof ordered values therein. The complexity of [15] is 𝑂 ( 𝑛 log ( 𝑛 ) + 𝑛𝑀 ) , where 𝑀 is the message (vertex) size. Each It is also possible to store vertices hashes. 14 ll You Need is DAG message includes a set of proposed values and 𝑛 references, and each reference includes a process id of size log ( 𝑛 ) .Thus, if we batch 𝑛 log ( 𝑛 ) values in every message, the bit complexity is 𝑂 ( 𝑛 log ( 𝑛 ) + 𝑛 log ( 𝑛 )) = 𝑂 ( 𝑛 log ( 𝑛 )) fora broadcast.Since each process is allowed to broadcast a single message in each round, a correct process will not participate inmore than 𝑛 reliable broadcasts in a round, and thus the total bit complexity of correct processes in a round is boundedby 𝑂 ( 𝑛 log ( 𝑛 )) . On the other hand, at least 𝑓 + = 𝑂 ( 𝑛 ) vertices are ordered in every round. Thus, 𝑂 ( 𝑛 log ( 𝑛 )) values are ordered in every round, which means that the amortized communication complexity of DAG-Rider is 𝑂 ( 𝑛 ) . Time complexity.

By Claim 6, the number of waves, in expectation, between two waves that satisfy the commit rulein

𝐷𝐴𝐺 𝑖 for a correct process 𝑝 𝑖 is expected constant. Since each wave consists of constant size chains of messages, bythe definition of time units, the number of time units, in expectation, between two 𝑝 𝑖 ’s commits is constant. Every time 𝑝 𝑖 commits a wave, it commits the wave’s leader causal history, which contains at least 𝑂 ( 𝑛 ) proposals from differentcorrect processes. Therefore, DAG-Rider’s time complexity is 𝑂 ( ) in expectation. The first asynchronous Byzantine Agreement protocols [5, 39] showed that the FLP [23] impossibility result canbe circumvented with randomization. Their communication and time complexity was exponential and a significantamount of work has been done since then in attempt to achieve optimal complexity under different assumptions.Some works consider the information theoretical settings and present protocols with polylogarithmic complexitythat tolerate adversaries with unbounded computational power [4, 26, 38]. Others follow a more practical approachand consider a computationally bounded adversary in order to be able to use cryptographic primitives to improvecomplexity [1, 13, 14, 35]. The pioneering crypto-based protocols [13, 14] were later realized in HoneyBadgerBFT, thefirst asynchronous Byzantine SMR system [36]. However, while the state-of-the-art asynchronous Byzantine Agreementprotocols VABA [1] and Dumbo [35] rely on cryptographic assumptions for both safety and liveness, DAG-Rideruses a hybrid alternative by providing safety with information theoretical guarantees and relying on cryptographicassumptions only for liveness.Many other works also presented protocols for the BAB problem in the asynchronous setting. Some works like [29, 40]use cryptographic schemes for safety, and others like [19] do not use signatures. Other works like [22] encapsulatetiming assumptions by relying on a failure detector. All these works have higher expected communication complexity.The idea of building a communication DAG and locally interpreting total order was considered before [18, 21]. To thebest of our knowledge, the only algorithms that realize this idea in the Byzantine settings are HashGraph [3] and laterAleph [24]. In contrast to DAG-Rider, HashGraph builds an unstructured DAG in which processes (unreliably) sendmessages with 2 references to previous vertices and on top of it run an inefficient binary agreement protocol, whichleads to expected exponential time complexity. Their communication complexity is not straightforward to analyzesince they did not clearly describe the mechanism that ensures that eventually all DAG information is propagated to allprocesses, and no analysis is provided. Aleph improves HashGraph’s complexity by building a round-based DAG andusing a more efficient binary agreement protocol [14]. They use an expected constant time binary agreement instanceto agree on whether to commit every vertex in a round. However, similarly to VABA and Dumbo based SMR, since theyneed all instances to terminate before they can totally order all vertices in a round, their time complexity is 𝑂 ( log ( 𝑛 )) .They do not amortize complexity and have 𝑂 ( 𝑛 ) per decision. In contrast to DAG-Rider, both HashGraph and Aleph(1) do not satisfy eventual fairness; and (2) rely on signatures for safety and thus are not post-quantum safe. dit Keidar, Eleftherios Kokoris-Kogias, Oded Naor, and Alexander Spiegelman We presented DAG-Rider: an asynchronous Byzantine Atomic Broadcast protocol with optimal resilience, optimalamortized communication complexity, and optimal time complexity. DAG-Rider does not rely on cryptographicassumptions for safety. Instead, it rules out Byzantine equivocation by relying on the reliable broadcast to guaranteethat all correct processes eventually see the same DAG. Finally, we believe that DAG-Rider’s elegant design, perfectload balancing, and modular separation of concerns make it an adequate candidate for future Byzantine SMR systems.

REFERENCES [1] Ittai Abraham, Dahlia Malkhi, and Alexander Spiegelman. 2019. Asymptotically Optimal Validated Asynchronous Byzantine Agreement. In

Proceedings of the 2019 ACM Symposium on Principles of Distributed Computing (Toronto ON, Canada) (PODC ’19) . ACM, New York, NY, USA,337–346. https://doi.org/10.1145/3293611.3331612[2] Hagit Attiya and Jennifer Welch. 2004.

Distributed computing: fundamentals, simulations, and advanced topics . Vol. 19. John Wiley & Sons.[3] Leemon Baird. 2016. The swirlds hashgraph consensus algorithm: Fair, fast, Byzantine fault tolerance.

Swirlds Tech Reports SWIRLDS-TR-2016-01,Tech. Rep (2016).[4] Laasya Bangalore, Ashish Choudhury, and Arpita Patra. 2018. Almost-surely terminating asynchronous Byzantine agreement revisited. In

Proceedingsof the 2018 ACM Symposium on Principles of Distributed Computing . 295–304.[5] Shai Ben-David, Allan Borodin, Richard Karp, Gabor Tardos, and Avi Wigderson. 1994. On the power of randomization in on-line algorithms.

Algorithmica

11, 1 (1994), 2–14.[6] Michael Ben-Or and Ran El-Yaniv. 2003. Resilient-optimal interactive consistency in constant time.

Distributed Computing

16, 4 (2003), 249–262.[7] Erica Blum, Jonathan Katz, Chen-Da Liu-Zhang, and Julian Loss. 2020. Asynchronous Byzantine Agreement with Subquadratic Communication. In

Theory of Cryptography Conference . Springer, 353–380.[8] Dan Boneh, Ben Lynn, and Hovav Shacham. 2001. Short signatures from the Weil pairing. In

International conference on the theory and application ofcryptology and information security . Springer, 514–532.[9] Edward Bortnikov, Maxim Gurevich, Idit Keidar, Gabriel Kliot, and Alexander Shraer. 2009. Brahms: Byzantine resilient random membershipsampling.

Computer Networks

53, 13 (2009), 2340–2359.[10] Stephen Boyd, Arpita Ghosh, Balaji Prabhakar, and Devavrat Shah. 2006. Randomized gossip algorithms.

IEEE/ACM Transactions on Networking(TON)

14, SI (2006), 2508–2530.[11] Gabriel Bracha. 1987. Asynchronous Byzantine agreement protocols.

Information and Computation

75, 2 (1987), 130–143.[12] Christian Cachin, Rachid Guerraoui, and Luís Rodrigues. 2011.

Introduction to reliable and secure distributed programming . Springer Science &Business Media.[13] Christian Cachin, Klaus Kursawe, Frank Petzold, and Victor Shoup. 2001. Secure and efficient asynchronous broadcast protocols. In

AnnualInternational Cryptology Conference . Springer, 524–541.[14] Christian Cachin, Klaus Kursawe, and Victor Shoup. 2005. Random oracles in Constantinople: Practical asynchronous Byzantine agreement usingcryptography.

Journal of Cryptology

18, 3 (2005), 219–246.[15] Christian Cachin and Stefano Tessaro. 2005. Asynchronous verifiable information dispersal. In . IEEE, 191–201.[16] Ran Canetti and Tal Rabin. 1993. Fast asynchronous Byzantine agreement with optimal resilience. In

Proceedings of the twenty-fifth annual ACMsymposium on Theory of computing . 42–51.[17] Miguel Castro, Barbara Liskov, et al. 1999. Practical Byzantine fault tolerance. In

OSDI , Vol. 99. 173–186.[18] Gregory V Chockler, Nabil Huleihel, and Danny Dolev. 1998. An adaptive totally ordered multicast protocol that tolerates partitions. In

Proceedingsof the seventeenth annual ACM symposium on Principles of distributed computing . 237–246.[19] Miguel Correia, Nuno Ferreira Neves, and Paulo Veríssimo. 2006. From consensus to atomic broadcast: Time-free Byzantine-resistant protocolswithout signatures.

Comput. J.

49, 1 (2006), 82–96.[20] Danny Dolev and Eli Gafni. 2016. Some garbage in-some garbage out: Asynchronous t-Byzantine as asynchronous benign t-resilient system withfixed t-trojan-horse inputs. arXiv preprint arXiv:1607.01210 (2016).[21] Danny Dolev, Shlomo Kramer, and Dalia Malki. 1993. Early delivery totally ordered multicast in asynchronous environments. In

FTCS-23 TheTwenty-Third International Symposium on Fault-Tolerant Computing . IEEE, 544–553.[22] Assia Doudou, Benoit Garbinato, and Rachid Guerraoui. 2000. Abstractions for devising Byzantine-resilient state machine replication. In

Proceedings19th IEEE Symposium on Reliable Distributed Systems SRDS-2000 . IEEE, 144–153.[23] Michael J Fischer, Nancy A Lynch, and Michael S Paterson. 1985. Impossibility of distributed consensus with one faulty process.

Journal of the ACM(JACM)

32, 2 (1985), 374–382.[24] Adam Gkagol, Damian Lesniak, Damian Straszak, and Michal Swiketek. 2019. Aleph: Efficient atomic broadcast in asynchronous networks withByzantine nodes. In

Proceedings of the 1st ACM Conference on Advances in Financial Technologies . 214–228.16 ll You Need is DAG [25] Rachid Guerraoui, Petr Kuznetsov, Matteo Monti, Matej Pavlovic, and Dragos-Adrian Seredinschi. 2019. Scalable Byzantine Reliable Broadcast. In . Schloss Dagstuhl-Leibniz-Zentrum fuer Informatik.[26] Bruce M Kapron, David Kempe, Valerie King, Jared Saia, and Vishal Sanwalani. 2010. Fast asynchronous Byzantine agreement and leader electionwith full information.

ACM Transactions on Algorithms (TALG)

6, 4 (2010), 1–28.[27] Richard Karp, Christian Schindelhauer, Scott Shenker, and Berthold Vocking. 2000. Randomized rumor spreading. In

Proceedings 41st AnnualSymposium on Foundations of Computer Science . IEEE, 565–574.[28] Mahimna Kelkar, Fan Zhang, Steven Goldfeder, and Ari Juels. 2020. Order-fairness for Byzantine consensus. In

Annual International CryptologyConference . Springer, 451–480.[29] Kim Potter Kihlstrom, Louise E Moser, and P Michael Melliar-Smith. 2001. The SecureRing group communication system.

ACM Transactions onInformation and System Security (TISSEC)

4, 4 (2001), 371–406.[30] Eleftherios Kokoris Kogias, Dahlia Malkhi, and Alexander Spiegelman. 2020. Asynchronous Distributed Key Generation for Computationally-SecureRandomness, Consensus, and Threshold Signatures.. In

Proceedings of the 2020 ACM SIGSAC Conference on Computer and Communications Security .1751–1767.[31] Ramakrishna Kotla, Lorenzo Alvisi, Mike Dahlin, Allen Clement, and Edmund Wong. 2007. Zyzzyva: speculative Byzantine fault tolerance. In

ACMSIGOPS Operating Systems Review , Vol. 41. ACM, 45–58.[32] Kfir Lev-Ari, Alexander Spiegelman, Idit Keidar, and Dahlia Malkhi. 2020. FairLedger: A Fair Blockchain Protocol for Financial Institutions. In . Schloss Dagstuhl-Leibniz-Zentrum für Informatik.[33] Benoît Libert, Marc Joye, and Moti Yung. 2016. Born and raised distributively: Fully distributed non-interactive adaptively-secure thresholdsignatures with short shares.

Theoretical Computer Science

645 (2016), 1–24.[34] Julian Loss and Tal Moran. 2018. Combining Asynchronous and Synchronous Byzantine Agreement: The Best of Both Worlds.

IACR Cryptol. ePrintArch.

Proceedings of the 39th Symposium on Principles of Distributed Computing . 129–138.[36] Andrew Miller, Yu Xia, Kyle Croman, Elaine Shi, and Dawn Song. 2016. The honey badger of BFT protocols. In

Proceedings of the 2016 ACM SIGSACConference on Computer and Communications Security . 31–42.[37] Satoshi Nakamoto. 2008.

Bitcoin: A peer-to-peer electronic cash system . Technical Report.[38] Arpita Patra, Ashish Choudhury, and C Pandu Rangan. 2014. Asynchronous Byzantine agreement with optimal resilience.

Distributed computing . IEEE, 403–409.[40] Michael K Reiter. 1994. Secure agreement protocols: Reliable and atomic group multicast in Rampart. In

Proceedings of the 2nd ACM Conference onComputer and Communications Security . 68–80.[41] Adi Shamir. 1979. How to share a secret.

Commun. ACM

22, 11 (1979), 612–613.[42] Victor Shoup. 2000. Practical threshold signatures. In

International Conference on the Theory and Applications of Cryptographic Techniques . Springer,207–220.[43] Maofan Yin, Dahlia Malkhi, MK Reiter and, Guy Golan Gueta, and Ittai Abraham. 2019. HotStuff: BFT consensus with linearity and responsiveness.In38th ACM symposium on Principles of Distributed Computing (PODC’19)