Strongly Universal Hamiltonian Simulators

Abstract

A universal family of Hamiltonians can be used to simulate any local Hamiltonian by encoding its full spectrum as the low-energy subspace of a Hamiltonian from the family. Many spin-lattice model Hamiltonians -- such as the Heisenberg or XY interaction on the 2D square lattice -- are known to be universal. However, the known encodings can be very inefficient, requiring interaction energy that scales exponentially with system size if the original Hamiltonian has higher-dimensional, long-range, or even all-to-all interactions. In this work, we provide an efficient construction by which these universal families are in fact "strongly" universal. This means that the required interaction energy and all other resources in the 2D simulator scale polynomially in the size of the target Hamiltonian and precision parameters, regardless of the target's connectivity. This exponential improvement over previous constructions is achieved by combining the tools of the quantum phase estimation algorithm and the circuit-to-Hamiltonian transformation in a non-perturbative way that only incurs polynomial overhead. The simulator Hamiltonian also possesses a certain degree of translation invariance. Furthermore, we show that even 1D Hamiltonians with nearest-neighbor interactions of 8-dimensional particles on a line are strongly universal Hamiltonian simulators, although without any translation invariance. Our results establish that analog quantum simulations of general systems can be made efficient, greatly increasing their potential as applications for near-future quantum technologies.



Leo Zhou and Dorit Aharonov
Department of Physics, Harvard University, Cambridge, MA 02138, USA
School of Computer Science and Engineering, The Hebrew University, Jerusalem 91904, Israel
(Dated: Feb 4, 2021)

I. INTRODUCTION

Building a simpler model of a quantum system while reproducing all its physical properties has many applications in physics, chemistry, and computation. This is the task of analog quantum simulation, where one simulates a Hamiltonian $H$ by another Hamiltonian $H'$ that is simpler or more easily implemented. This goal was identified as a main motivation for quantum computers as early as 1981 by Feynman [1]. Due to its less stringent requirements on error correction and controls, analog simulation is considered to be an important practical application in the era of noisy intermediate-scale quantum technology [2, 3]. Efficient implementation of analog Hamiltonian simulators allows one to probe new many-body physics, develop new materials and drugs [4], and improve the feasibility of Hamiltonian-based quantum computations such as adiabatic algorithms [5, 6]. In fact, coherent analog quantum simulation in systems as large as hundreds of qubits has already been successfully realized to solve condensed matter physics problems [7–9].

When seeking analog simulators of Hamiltonians, it is natural to consider families of such simulators that are universal, in the sense that they can simulate any local Hamiltonian. For any target Hamiltonian $H$, there should exist a Hamiltonian $H'$ in the family that can simulate $H$. The ability to implement these universal families enables analog simulation of all local Hamiltonians, much like how a universal set of quantum gates allows implementation of any unitary quantum operation. This notion of universal Hamiltonians was developed in Ref. [10], in which various simple families of quantum spin-lattice models in two dimensions with tunable nearest-neighbor interaction energy are shown to be universal. More families were shown to be universal by later works [11–13].
These can reproduce all physical properties of the target system (including time evolution, thermal states, and effects of local noise processes) to any precision.

However, the constructions given by Refs. [10–13] are not efficient in the general case. The efficiency in fact depends on the connectivity or spatial dimensionality of the target Hamiltonian. While any Hamiltonian $H$ in 2D can be simulated by a Hamiltonian from the universal family with spatially local interactions in 2D with only polynomial overhead in both the number of particles and the interaction energy, an exponential overhead in the interaction energy is required in general by these constructions if the target Hamiltonian $H$ is embedded in a higher dimension (e.g., when $H$ is 3D or has all-to-all interactions). We call such families, in which the efficiency of the simulation is not guaranteed, weakly universal. Note that using some gadgets [14], one can maintain polynomial interaction energy if one is willing to make other resources exponential: the number of particles and the degree (connectivity) of interaction. In any case, when using the constructions of [10–13], either an overhead of exponential-strength interaction or an exponential number of particles (along with exponential interaction degree) is required for simulating general Hamiltonians.

In this work, we overcome this exponential overhead and arrive at what we call strong universality. We provide a constructive method to design an analog simulator that is efficient in both the number of particles and the interaction energy, and allows simulation in 2D of any target local Hamiltonian, regardless of the geometry of its interaction graph. In fact, we show that this can be done by 2D spin-lattice models which include only a single type of nearest-neighbor interaction, with interaction energies that vary for different neighboring pairs (we call this semi-translation-invariant).
The overheads in particle number and energy both grow only polynomially with the target system size. Our results show that any of the semi-translation-invariant 2D families of Hamiltonians that have been found to be weakly universal in Ref. [10] are in fact also strongly universal, when more efficient constructions are applied.

We further show that a similar result holds even in one spatial dimension. Nevertheless, there is a caveat when restricting to simulators in 1D: the strongly universal 1D family that we construct is no longer semi-translation-invariant. The interactions in the simulating Hamiltonians take on a more complicated form that needs to vary in space, but the simulation is still efficient.

We note that these results are tight in the following sense: we cannot hope to bring the polynomial overhead in the interaction energy down to a constant while still requiring that the simulating Hamiltonian is embedded in 1D (or 2D). This is due to the existence of counterexamples [15] showing that general (i.e., universal) Hamiltonian simulation is impossible if the interaction energy is required to not increase with the system size and the simulator is set on a lattice (or any geometry with bounded degree of connectivity).

To achieve our results, we begin with a method similar to that used in our previous work [15] (and recently applied in Ref. [13]) in which we convert the target Hamiltonian to a quantum phase estimation circuit embedded in 1D. We then map this circuit back to a low-degree simulating Hamiltonian, using the Feynman-Kitaev circuit-to-Hamiltonian construction [16]. The reason for transforming Hamiltonians via circuits is that, unlike Hamiltonians, circuits can be straightforwardly made "sparse" (e.g., each qubit is only acted on by a few gates). This can be done by swapping qubits to fresh ancilla qubits after every computational gate.
In this work, we extend this method to simulate any target Hamiltonian with a 1D or 2D Hamiltonian, by embedding the circuit in a spatially local manner in 1D or 2D using techniques from earlier Hamiltonian complexity literature [6, 17]. To obtain a semi-translation-invariant simulator Hamiltonian in 2D, we borrow additional gadgets from Ref. [10]. To obtain a 1D Hamiltonian simulator, we employ a modified construction of QMA-complete 1D Hamiltonians from [18, 19] to simulate the circuit using nearest-neighbor Hamiltonian interactions on a line of particles with 8 internal dimensions. These combinations of techniques allow us to overcome the exponential overhead common to previous constructions that mostly rely on perturbative gadgets for simulations [10–12, 17].

II. BACKGROUND ON UNIVERSAL HAMILTONIANS FOR ANALOG SIMULATION

We first define what it means for a Hamiltonian to simulate another. We adopt the well-motivated definition of Ref. [10], which posits that $H'$ simulates $H$ if the full spectrum of $H$ can be encoded as the low-lying part of the spectrum of $H'$, by an encoding that preserves locality of observables. More precisely,

Definition 1 (Local encoding, adapted from [10]). Consider an encoding map $\mathcal{E}$ taking Hermitian operators on $n$ qudits ($d$-dimensional systems) into operators acting on $n' \ge n$ particles (not necessarily of the same dimension $d$). We say $\mathcal{E}$ is a local encoding if we can write

$$\mathcal{E}(H) = V (H \otimes P + \bar{H} \otimes Q) V^\dagger, \qquad (1)$$

such that $V$ is an isometry and can be written as $V = \bigotimes_i V_i$, where each $V_i$ is an isometry acting on at most 1 qudit of the original system. Furthermore, $P$ and $Q$ are locally orthogonal projectors (i.e., $\forall i$ $\exists$ orthogonal projectors $P_i$, $Q_i$ acting on the same subsystem as $V_i$ such that $P_i Q_i = 0$, $P_i P = P$ and $Q_i Q = Q$). $\bar{H}$ denotes the complex conjugate of $H$.

Definition 2 (Hamiltonian simulation, adapted from [10]). Given an $n$-qudit Hamiltonian $H$, we say a Hamiltonian $H'$ is a $(\Delta, \eta, \epsilon)$-simulation of $H$ if for some local encoding $\mathcal{E}$ of the form of Eq. (1), we have

1. There exists an isometry $\tilde{V}$ and corresponding encoding $\tilde{\mathcal{E}}(H) = \tilde{V} (H \otimes P + \bar{H} \otimes Q) \tilde{V}^\dagger$ such that $\|\tilde{V} - V\| \le \eta$ and $\tilde{\mathcal{E}}(\mathbb{1}) = P_{\le \Delta(H')}$.
2. $\|H'_{\le \Delta} - \tilde{\mathcal{E}}(H)\| \le \epsilon$.

Here, $\|\cdot\|$ is the spectral norm, $P_{\le \Delta(H')}$ is the projector onto the subspace of eigenstates of $H'$ with eigenvalue $\le \Delta$, and $H'_{\le \Delta} = H' P_{\le \Delta(H')}$ is the restriction of $H'$ onto these states. We say the simulation is efficient if both the number of particles in $H'$ and its maximum energy $\|H'\|$ are at most $O(\mathrm{poly}(n, \eta^{-1}, \epsilon^{-1}, \Delta))$, and the description of $H'$ is efficiently computable.
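As a toy illustration of Definition 2, the following sketch checks conditions 1 and 2 in the simplest possible setting. Everything in it is an assumption made for illustration: both Hamiltonians are diagonal (so the spectral norm is just the largest absolute entry), the encoding appends a single ancilla qubit in state $|0\rangle$ (so $P = \mathbb{1}$, $Q = 0$, $\tilde{V} = V$ and $\eta = 0$), and the function name is hypothetical.

```python
def simulation_error(target_energies, sim_diag, delta):
    """Return epsilon = ||H'_{<=Delta} - E~(H)|| for a diagonal toy model.

    target_energies: eigenvalues E_i of the diagonal target H.
    sim_diag: diagonal of H' in the basis |i, a>, stored at index 2*i + a,
              where a in {0, 1} is the ancilla bit.
    delta: energy cutoff Delta.
    Raises ValueError if the low-energy subspace of H' is not exactly the
    image of the encoding V|i> = |i, 0> (condition 1 of Definition 2).
    """
    n = len(target_energies)
    if len(sim_diag) != 2 * n:
        raise ValueError("H' must act on the system plus one ancilla qubit")
    eps = 0.0
    for i, energy in enumerate(target_energies):
        encoded, penalized = sim_diag[2 * i], sim_diag[2 * i + 1]
        # Condition 1: the projector onto energies <= Delta must equal V V^dagger.
        if encoded > delta or penalized <= delta:
            raise ValueError("low-energy subspace does not match the encoding")
        # Condition 2: compare H' restricted to energies <= Delta with V H V^dagger.
        eps = max(eps, abs(encoded - energy))
    return eps


target = [0.0, 1.0, 2.5]
# Ancilla-0 states carry slightly perturbed energies; ancilla-1 states are penalized.
simulator = [0.01, 50.0, 0.98, 51.0, 2.5, 52.5]
print(simulation_error(target, simulator, delta=10.0))
```

In this toy model the returned value is the $\epsilon$ of a $(\Delta, 0, \epsilon)$-simulation; genuinely non-diagonal simulators would require an eigendecomposition, which the sketch deliberately avoids.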
Under this definition, Ref. [10] showed that implementing $H'$ allows one to approximately reproduce all physical properties of $H$, so that "simulation" here covers essentially every physical aspect of the target system. Specifically, since $\mathcal{E}$ is a local encoding, all local observables $A$ with respect to $H$ are mapped to local observables $\mathcal{E}(A)$ for $H'$. Correspondingly, there is a local map $\mathcal{E}_{\rm state}(\rho)$ that maps quantum states, satisfying $\mathrm{Tr}(A\rho) = \mathrm{Tr}[\mathcal{E}(A)\,\mathcal{E}_{\rm state}(\rho)]$. Gibbs states (thermal ensembles) of $H$ are mapped to Gibbs states of $H'$, and errors are exponentially suppressed by the energy cutoff $\Delta$. Time evolution $e^{-iHt}$ can also be simulated by $e^{-iH't}$ applied to the appropriately encoded state, and the error in this simulation grows as $O(t\epsilon + \eta)$. Additionally, under a reasonable physical assumption, local noise and errors in the simulator have been shown to correspond to local noise and errors in the original system. Since any real physical system is subject to (typically local) noise, the simulator can be used to probe many of its properties without error correction.

Our goal is to understand which families of Hamiltonians can be used to (efficiently) simulate all other physical Hamiltonians, as characterized by the notion of "universal Hamiltonians" that we define below:

Definition 3 (Weak and strong universality). A family of Hamiltonians $F = \{H_m\}$ is weakly universal if given any $\Delta, \eta, \epsilon > 0$, any $O(1)$-local $n$-particle Hamiltonian can be $(\Delta, \eta, \epsilon)$-simulated by some $H_m \in F$. Such a family is strongly universal if the simulation is always efficient, i.e., $H_m$ is efficiently computable in $O(\mathrm{poly}(n))$ time, requires $n' = O(\mathrm{poly}(n, \eta^{-1}, \epsilon^{-1}, \Delta))$ particles, and $\|H_m\| = O(\mathrm{poly}(n, \eta^{-1}, \epsilon^{-1}, \Delta))$.

Following Ref. [10], we consider Hamiltonians that only involve up to 2-local interactions, which can be written in the following form:

$$H' = \sum_{\langle i,j \rangle \in E} J_{ij}\, h^{(i,j)}_{\alpha_{ij}}, \quad \text{where } \|h_{\alpha_{ij}}\| \le 1,\ J_{ij} \in \mathbb{R}. \qquad (2)$$

Here, $E$ is some set of edges describing the connectivity of the qudits, $h^{(i,j)}_{\alpha_{ij}}$ is some two-body operator $h_{\alpha_{ij}}$ acting on qudits $i$ and $j$, and $J_{ij}$ is the interaction energy. Ref. [10] studied such Hamiltonians in the case when $h_{\alpha_{ij}}$ is drawn from some set of two-body interactions $S = \{h_\alpha\}$, which could be highly restricted and sometimes even contain just a single term. In this setting, we say $H'$ is an $S$-Hamiltonian.

It is shown in Ref. [10] that many families of $S$-Hamiltonians on qubits are weakly universal, in the sense that any local Hamiltonian can be simulated by a Hamiltonian drawn from such a family. In fact, even when restricting the connectivity $E$ of the qubits to the 2D square lattice, many such $S$-Hamiltonians remain weakly universal. For example, models such as the Heisenberg interaction ($S = \{X \otimes X + Y \otimes Y + Z \otimes Z\}$) or the XY interaction ($S = \{X \otimes X + Y \otimes Y\}$) on the 2D square lattice are weakly universal. Here and below, we denote $(X, Y, Z) = (\sigma_x, \sigma_y, \sigma_z)$ as the Pauli matrices. This means that the terms in Eq. (2) are all equal up to their relative weights, so $S$ contains only a single term; yet, universality can still be achieved. We will need another definition to state this more concisely:

Definition 4 (Full and Semi-Translation-Invariance). We say that a Hamiltonian $H'$ has semi-translation-invariance (or semi-TI) if all two-body operators are the same up to scaling by $J_{ij}$, i.e., $h^{(i,j)}_{\alpha_{ij}} = h^{(i,j)}$. We say that $H'$ has full translation-invariance (or full-TI) if it has semi-translation-invariance and all the interaction energies are the same, i.e., $J_{ij} = J$.

We note that more generally, Ref. [10] has shown that any family of $S$-Hamiltonians (even when restricted to the 2D lattice) is weakly universal as long as $S$ is non-2SLD, which roughly means that the 2-local parts of all the interactions in $S$ are not simultaneously and locally (i.e., by 1-local unitaries) diagonalizable. More precisely, the property of 2SLD is defined as:

Definition 5 (2SLD interactions [10]). Suppose $S$ is a set of interactions on 2 qubits. We say $S$ is 2SLD if there exists $U \in SU(2)$ such that for each $H_i \in S$, $U^{\otimes 2} H_i (U^\dagger)^{\otimes 2} = \alpha_i Z \otimes Z + A_i \otimes \mathbb{1} + \mathbb{1} \otimes B_i$, where $\alpha_i \in \mathbb{R}$ and $A_i$, $B_i$ are any 1-local operators. Otherwise, $S$ is non-2SLD.

These results are summarized in the following theorem:

Theorem 1 ([10]). Any $S$-Hamiltonian set on a 2D square lattice of qubits is weakly universal as long as $S$ is non-2SLD.

Since there are many non-2SLD sets of interactions $S$ which contain a single interaction term, this implies that there are many semi-translation-invariant families of Hamiltonians in 2D that are weakly universal. Later works have extended these results to show weak universality of various families that use qudits [11], are embedded in higher dimensions [12], or are fully translation-invariant in 2D or 1D [12, 13]. A major question is of course whether the simulation can be made efficient, so that universality is achieved in the strong sense.

Unfortunately, the constructions used by Ref. [10] to prove Theorem 1 are only efficient if the target Hamiltonian has the same or lower spatial dimensionality as the simulator Hamiltonian. When attempting to simulate a 3D (or worse, all-to-all interacting) target Hamiltonian by a 2D simulator, however, the simulation is no longer efficient and requires exponentially large interaction energy $J_{ij} = 2^{O(\mathrm{poly}(n))}$. Alternatively, one can circumvent the exponentially large interaction by using exponentially many particles and an exponential degree of interaction [14], which also gives up spatial locality.

III. MAIN RESULTS: STRONGLY UNIVERSAL HAMILTONIANS

We are motivated by the fact that in many important situations, one is interested in studying Hamiltonians embedded in large spatial dimensions, sometimes even with all-to-all interactions such as the SYK model [20]. On the other hand, experimental implementations typically only have access to simulator Hamiltonians that are restricted in their interaction geometry.

In this work, we show how to use the families of Hamiltonians proposed in Ref. [10] to achieve not only weak universality but also strong universality for analog simulation. In order to accomplish this improved efficiency, we have applied a different constructive method for simulation than that of Ref. [10].

| Construction | Spatial dimension | Translation-invariance | Interaction energy | Particle number |
|---|---|---|---|---|
| Cubitt et al. [10] | 2D | semi | exp(poly(n)) | poly(n) |
| Piddock-Bausch [12] | 2D | full | exp(poly(n)) | exp(poly(n)) |
| Kohler et al. [13] | 1D | full* | exp(poly(n)) | poly(n) |
| Kohler et al. [13] | 1D | full | exp(poly(n)) | exp(poly(n)) |
| This work (Theorem 2) | 2D | semi | poly(n) | poly(n) |
| This work (Theorem 3) | 1D | none | poly(n) | poly(n) |

TABLE I. The properties of currently known constructions of universal families of Hamiltonians that can simulate any $O(1)$-local $n$-qudit Hamiltonian. The 1D construction of Kohler et al. [13] with full* translation-invariance uses a Hamiltonian interaction that changes depending on the target Hamiltonian.

Theorem 2.

Any $S$-Hamiltonian on the 2D square lattice is strongly universal, as long as $S$ is non-2SLD. In particular, it is sufficient for $S$ to contain only a single interaction (such as the Heisenberg or XY interaction), implying that there are semi-translation-invariant Hamiltonians in 2D that are strongly universal.

We further show that strongly universal simulation can even be achieved by 1D Hamiltonians with nearest-neighbor interactions, acting on particles of 8 internal dimensions. However, we give up any form of translation-invariance in the process.

Theorem 3.

There is a strongly universal family of 1D Hamiltonians consisting of nearest-neighbor interactions acting on a line of particles with 8 internal dimensions.

This family of 1D Hamiltonians is based on a circuit computation on a line of qubits. The interactions in the Hamiltonian family are tailored to represent the various (universal) gates that make up the computation, and thus do not have any translation-invariance.

In Table I, we summarize our results and compare them to previously known constructions of universal families of Hamiltonians, in terms of the resources required for simulating general local Hamiltonians. Importantly, our constructions only require polynomial overhead in both interaction energy and particle number for simulating general Hamiltonians, which makes them much more feasible than previous constructions that require exponential overhead in one or both resources. We can do this even when restricting our simulator to a low-dimensional spatial geometry, and imposing semi-translation-invariance in the case of 2D.
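To get a feel for the gap between the exponential and polynomial rows of Table I, the sketch below iterates the gadget recursion $J \to (J\,n^a)^c$ quoted in the proof overview (Sec. IV A) for $\lceil \log_2 n \rceil$ rounds and reports $\log_2 J$. The exponents `a` and `c`, the starting value $J = 1$, the comparison polynomial $n^3$, and the function name are illustrative assumptions, not values taken from the constructions.

```python
import math


def log2_gadget_energy(n, a=1.0, c=2.0):
    """log2 of the interaction energy after ceil(log2 n) gadget rounds.

    Each round maps J -> (J * n**a)**c; we track log2(J) to avoid overflow.
    """
    rounds = math.ceil(math.log2(n))
    log_j = 0.0  # start from J = 1
    for _ in range(rounds):
        log_j = c * (log_j + a * math.log2(n))
    return log_j


for n in (16, 64, 256, 1024):
    # Compare with a polynomial energy n**3, whose log2 is 3 * log2(n).
    print(n, log2_gadget_energy(n), 3 * math.log2(n))
```

With these assumed values, $\log_2 J$ grows roughly like $2n\log_2 n$, so $J$ itself is exponential in a polynomial of $n$, whereas a polynomial energy has $\log_2 J = O(\log n)$.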

IV. PROOF SKETCHES

A. Overview

The starting point of our proofs for both Theorems 2 and 3 is as follows. We note that previous constructions [10–12, 17] relied purely on perturbative gadgets to reduce the connectivity of qudits. Specifically, in these constructions, in order to reduce the degree of the interaction graph from $O(n)$ to $O(1)$ so that the Hamiltonian can be embedded on a finite-dimensional lattice, it is necessary to apply $O(\log n)$ rounds of perturbative gadgets, each of which roughly halves the degree. Since the required interaction energy increases polynomially with each application of a perturbative gadget (i.e., $J \to [J\,\mathrm{poly}(n)]^c$ for some constant $c > 1$), the final interaction energy scales as $J_{\rm final} = n^{c^{O(\log n)}} = 2^{O(\mathrm{poly}(n))}$ in these constructions.

To circumvent this problem, we reduce the connectivity of the qudits by first mapping the target Hamiltonian $H$ to a quantum circuit performing the phase estimation algorithm with respect to $e^{iHt}$. Using standard techniques including Trotter decomposition [21], we can embed this circuit in 1D, using only nearest-neighbor gates on a line of qubits, while still applying the desired phase estimation with sufficient accuracy. This circuit writes down the energy eigenvalue of $H$ as a bit-string in some ancilla qubits.

In the 2D case, we can utilize swap gates to make the circuit spatially sparse non-perturbatively. Here, spatial sparsity means that the qubits can be placed on a 2D plane where each qubit participates in a constant number of spatially local gates, in a sequence that traverses space in a local way (see Definition 6).
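The first requirement of spatial sparsity, that each qubit takes part in only $O(1)$ gates, can be checked by simple counting. The sketch below compares a plain 1D nearest-neighbor circuit with a hypothetical column-by-column layout in which every gate acts on a fresh column of qubits and the data is then swapped one column over. The layout, the gate representation, and the function names are assumptions for illustration, and the sketch does not check the second requirement (spatial locality of consecutive gates).

```python
from collections import Counter


def gates_per_qubit(gates):
    """Count how many gates touch each qubit; a gate is a tuple of qubit labels."""
    counts = Counter()
    for qubits in gates:
        for q in qubits:
            counts[q] += 1
    return counts


def nn_rounds_1d(n, rounds):
    """Rounds of nearest-neighbor gates (0,1), (1,2), ..., (n-2,n-1) on a line."""
    return [(i, i + 1) for _ in range(rounds) for i in range(n - 1)]


def column_layout(gates_1d, n):
    """Gate g acts in column g; then every row is swapped into column g + 1.

    Physical qubits are labelled (row, column).
    """
    out = []
    for g, (i, j) in enumerate(gates_1d):
        out.append(((i, g), (j, g)))          # computational gate in column g
        for r in range(n):
            out.append(((r, g), (r, g + 1)))  # swap row r into the fresh column
    return out


line = nn_rounds_1d(n=4, rounds=5)
print(max(gates_per_qubit(line).values()))                    # grows with rounds
print(max(gates_per_qubit(column_layout(line, 4)).values()))  # stays bounded
```

In the column layout every physical qubit sees at most one computational gate, one swap in, and one swap out, at the price of using a number of qubits proportional to the number of gates.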
Note that a circuit that uses only nearest-neighbor gates in 1D is not spatially sparse if each qubit participates in $O(\mathrm{poly}(n))$ gates.

Applying the Feynman-Kitaev circuit-to-Hamiltonian mapping [16] to the spatially sparse circuit gives us a spatially sparse Hamiltonian $H_{\rm circuit}$ which has an exponentially large degeneracy of ground states: for each eigenstate of $H$, we have a ground state of $H_{\rm circuit}$ corresponding to the computational history of the phase estimation circuit running on that eigenstate as input. We then restore the spectral features of $H$ in $H_{\rm circuit}$ by imposing bit-wise energy penalties on the energy bit-string ancilla qubits to match the energy of the eigenstate. The incoherence induced by the different computational histories of different eigenstates can be repaired by the tricks of "uncomputing" and "idling." Finally, $H_{\rm circuit}$, which is spatially sparse in 2D, can then be converted to a semi-translation-invariant Hamiltonian $H'$ on a 2D square lattice using gadgets from Ref. [10]. This means that such a family of semi-TI 2D Hamiltonians is strongly universal, as all steps of our construction incur only polynomial overhead in the interaction energy and the number of qubits.

FIG. 1. Overview of our constructions of 1D and 2D universal families of Hamiltonians. NN = nearest-neighbor. [Flowchart: any $O(1)$-local Hamiltonian; (Proposition 1) phase estimation circuit using NN gates in 1D; from there either (Theorem 3) 1D NN Hamiltonian on 8-dimensional particles, or (Proposition 2) 2D spatially sparse Hamiltonian followed by (Theorem 2) 2D semi-TI Hamiltonian on a square lattice.]

Likewise, in the 1D case, we construct a family of Hamiltonians that simulate the 1D phase estimation circuits. We combine the tools of uncomputing and idling with previously known circuit-to-Hamiltonian constructions deriving QMA-complete 1D Hamiltonians [18, 19] to achieve this result.
The interactions in this Hamiltonian family are nearest-neighbor operators that enforce a set of transition rules, so that the zero-energy eigenstates are those corresponding to performing the circuit computation correctly. The spectral properties of $H$ are similarly recovered by imposing bit-wise energy penalties. We have to employ some additional tricks to modify the transition rules so that the original qudits of $H$ are encoded locally in the new 1D Hamiltonian.

In both cases, our non-perturbative techniques avoid the exponential blow-up in the required interaction energy, which brings the scaling of the energy overhead $J_{\rm final}$ down to only $O(\mathrm{poly}(n))$.

Figure 1 explains the proof structure for constructing both the 2D and 1D universal Hamiltonians. The proofs diverge only after the first step, Proposition 1, in which the target Hamiltonian is replaced by a phase estimation circuit in 1D. The 1D and 2D cases differ in the way we map the resultant 1D circuit back to a Hamiltonian.

B. The 1D phase estimation circuit: Proposition 1

We first show (and this is fairly standard) that one can construct a phase estimation circuit $U^{\rm NN}_{\rm PE}$ that takes eigenstates of $H$ as input and (approximately) writes down their energy on ancilla qubits, using only nearest-neighbor gates acting on a line of qubits. Note that if $H$ is an $O(1)$-local $n$-qudit Hamiltonian where $d$ is a constant, we can easily convert it to an $O(1)$-local $O(n)$-qubit Hamiltonian by simply encoding each qudit in the subspace of a group of $\lceil \log_2 d \rceil$ qubits. We can separate the extra states in this redundant encoding (when $d$ is not a power of 2) from the spectrum by adding to the Hamiltonian a local energy penalty term on each group with $\|H\| = O(\mathrm{poly}(n))$ magnitude. Hence, for simplicity of the discussion that follows, we will assume that this conversion is always performed first, and $H$ is an $O(1)$-local $n$-qubit Hamiltonian whose interaction energy is at most $O(\mathrm{poly}(n))$.

FIG. 2. Illustration of converting a circuit to a spatially sparse one. (a) Adding swap gates to make a 2-qubit gate (red) acting on distant qubits nearest-neighbor. (b) Adding new qubits so that the execution of gates is in a spatially local sequence. [Axis label: flow of time.]

Given such a Hamiltonian $H$, let us write $H = \sum_\mu E_\mu |\psi_\mu\rangle\langle\psi_\mu|$ in its energy eigenbasis, where $0 \le E_\mu \le E_{\max}$ without loss of generality. Note that the upper bound $E_{\max} = O(\mathrm{poly}(n))$ can be computed without knowledge of the energy eigenvalues, e.g., by adding up the spectral norms of the individual local terms of $H$.

Ideally, we want a phase estimation circuit $U^{\rm ideal}_{\rm PE}$ that acts on any input state of the form $|\psi\rangle |0^m\rangle = \sum_\mu c_\mu |\psi_\mu\rangle |0^m\rangle$ in the following way:

$$U^{\rm ideal}_{\rm PE} \sum_\mu c_\mu |\psi_\mu\rangle |0^m\rangle = \sum_\mu c_\mu |\psi_\mu\rangle |E_\mu\rangle, \qquad (3)$$

where the first $s = O(\log(n))$ qubits of the $m$-qubit ancilla register encode the energy $E_\mu$ as $|E_\mu\rangle = |\varphi_1 \varphi_2 \ldots \varphi_s\rangle \otimes |{\rm rest}_\mu\rangle$. Here, $\varphi_\mu = 0.\varphi_1 \varphi_2 \ldots$ is the binary representation of the real number $\varphi_\mu = E_\mu / E_{\max}$.

Nevertheless, we want to implement $U^{\rm ideal}_{\rm PE}$ with only a polynomial number of local gates, which can be done using the Trotter decomposition [21]. We also want to use a discrete set of 2-qubit universal gates, which can be done by invoking the Solovay-Kitaev theorem [22]. This gives us $U^{\rm local}_{\rm PE}$, which will have some error $\zeta = \|(U^{\rm local}_{\rm PE} - U^{\rm ideal}_{\rm PE}) |\psi\rangle |0^m\rangle\|$. This error $\zeta$ can be made small using only $O(\mathrm{poly}(n, \zeta^{-1}))$ resources (see Appendix A). Then, as shown in Fig. 2(a), we make all gates nearest-neighbor on a line by adding swap gates, obtaining $U^{\rm NN}_{\rm PE}$. This fact is summarized as Proposition 1:

Proposition 1.

Given any $n$-qubit $O(1)$-local Hamiltonian $H = \sum_\mu E_\mu |\psi_\mu\rangle\langle\psi_\mu|$, one can construct a circuit $U^{\rm NN}_{\rm PE}$ consisting of $\mathrm{poly}(n, \zeta^{-1})$ gates acting on $n + m$ qubits with 1- or 2-qubit nearest-neighbor gates chosen from a universal gate set, where $m = \mathrm{poly}(n)$, such that for any normalized input state $\sum_\mu c_\mu |\psi_\mu\rangle$,

$$\Big\| U^{\rm NN}_{\rm PE} \sum_\mu c_\mu |\psi_\mu\rangle |0^m\rangle - \sum_\mu c_\mu |\psi_\mu\rangle |E_\mu\rangle \Big\| \le \zeta. \qquad (4)$$

C. Constructing 2D semi-TI strongly universal Hamiltonians

To prove Theorem 2, we want to simulate any local Hamiltonian $H$ with an $S$-Hamiltonian on a 2D square lattice with polynomial overhead, for any non-2SLD $S$. We start by first constructing a spatially sparse circuit Hamiltonian $H_{\rm circuit}$ that simulates our original target Hamiltonian $H$ (Proposition 2). Then the (semi-translation-invariant) $S$-Hamiltonian simulator on the 2D lattice is obtained from $H_{\rm circuit}$ by applying a series of gadgets. The full technical proof is given in Appendix B, but we will sketch the essential ideas below. To that end, we borrow the notion of spatial sparsity from Refs. [10, 17] and generalize it to circuits:

Definition 6 (Spatial sparsity of Hamiltonians and circuits (adapted from [17])). A Hamiltonian on $n$ qudits is spatially sparse if its interaction hypergraph is one where (i) every vertex participates in $O(1)$ hyper-edges, and (ii) there is a straight-line drawing in the plane such that every hyper-edge overlaps with $O(1)$ other hyper-edges, and the surface covered by every hyper-edge is $O(1)$. Moreover, we say a quantum circuit $U = \prod_{t=1}^{T} U_t$ is spatially sparse if there is a placement of the qudits in the two-dimensional plane such that (i) each qudit participates in $O(1)$ gates, and (ii) the spatial supports of $U_t$ and $U_{t+1}$ are only $O(1)$ distance apart for all $t$, each covering $O(1)$ contiguous area.

With the definition of spatial sparsity in hand, we now formally state the Proposition we want to show:

Proposition 2.

Given any $O(1)$-local $n$-qudit Hamiltonian $H$ with $\|H\| = O(\mathrm{poly}(n))$, one can construct a spatially sparse -local Hamiltonian $H_{\rm circuit}$ that efficiently simulates $H$ to precision $(\Delta, \eta, \epsilon)$, with $\Delta = O(\epsilon^{-1}\|H\| + \eta^{-1}\|H\|)$. $H_{\rm circuit}$ has $O(\mathrm{poly}(n, \epsilon^{-1}))$ terms and qubits, and interaction energy at most $O(\mathrm{poly}(n, \eta^{-1}, \epsilon^{-1}))$.

To prove this Proposition, we start from the 1D phase estimation circuit obtained using Proposition 1 and construct a spatially sparse circuit using swap gates. To do this, we first turn $U^{\rm NN}_{\rm PE}$ into a spatially sparse circuit $U^{\rm sparse}_{\rm PE}$ by moving it from the line to a 2D grid. As visualized in Fig. 2(b), starting from the leftmost column of qubits, we apply just one nearest-neighbor gate before swapping all qubits to the next column and applying the next gate, which gives us $U^{\rm sparse}_{\rm PE}$. By ordering the gates in a snake-like fashion similar to Refs. [6, 17], we make sure that each qubit participates in only a constant number of gates, and that temporally proximate gates in the sequence have spatially proximate support.

Before converting the circuit back to a Hamiltonian, we need to address the issue of the entanglement between each energy eigenstate and the ancilla register that holds the energy bit-string, as evident in Eq. (3). This incoherence between different eigenstates causes a large error in the simulation if left unchecked. We repair this error by running the circuit backwards ("uncomputing") and then adding $L$ identity gates at the end ("idling"), so that most of the computational history of each eigenstate is simply the state itself. We then get a new circuit $U_{\rm circuit} = (\mathbb{1})^L (U^{\rm sparse}_{\rm PE})^\dagger (\mathbb{1})^s U^{\rm sparse}_{\rm PE}$, which is spatially sparse as long as $U^{\rm sparse}_{\rm PE}$ is.
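The structure of $U_{\rm circuit}$ can be sketched as a plain gate list. In the snippet below a gate is a hypothetical `(name, qubits, is_inverse)` tuple; the function appends $s$ identity steps, the reversed and inverted gate sequence, and $L$ identity steps, in time order (the operator product for $U_{\rm circuit}$ is read right to left). It also reports the fraction of clock states at which uncomputation has already finished, which is the quantity that idling is meant to push toward 1. The representation and function names are assumptions for illustration only.

```python
def uncompute_and_idle(gates, s, idle_len):
    """Time-ordered gate list for 1^L (U)^dagger 1^s U.

    gates: list of (name, qubits) tuples making up U, in time order.
    Returns a list of (name, qubits, is_inverse) tuples.
    """
    forward = [(name, qubits, False) for name, qubits in gates]
    wait = [("I", (), False)] * s            # window for reading the energy bits
    # The inverse of a product of gates is the reversed product of inverses.
    backward = [(name, qubits, True) for name, qubits in reversed(gates)]
    idle = [("I", (), False)] * idle_len     # idling after uncomputing
    return forward + wait + backward + idle


def returned_fraction(num_gates, s, idle_len):
    """Fraction of the T + 1 clock states (t = 0..T) reached after uncomputing."""
    total_steps = 2 * num_gates + s + idle_len
    return (idle_len + 1) / (total_steps + 1)


u = [("H", (0,)), ("CNOT", (0, 1))]
circuit = uncompute_and_idle(u, s=2, idle_len=3)
print(len(circuit), returned_fraction(len(u), s=2, idle_len=1000))
```

Choosing the idle length much larger than the number of gates makes the returned fraction close to 1, in line with the statement that most of the computational history is the input state itself.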
Note that we inserted $s$ identity gates before applying $(U^{\rm sparse}_{\rm PE})^\dagger$ so that we can examine the energy bit-string bit-by-bit before it is uncomputed.

Subsequently, we apply Kitaev's circuit-to-Hamiltonian mapping [16] to convert the spatially sparse, "uncomputed" circuit $U_{\rm circuit}$ to a spatially sparse Hamiltonian $H_{\rm circuit}$. The energies of the eigenstates of $H$ are restored by adding an appropriate bit-wise energy penalty on the $s$ qubits where the energy is written, while we idle between $U^{\rm sparse}_{\rm PE}$ and $(U^{\rm sparse}_{\rm PE})^\dagger$ [see Eq. (B7)]. We then use a perturbative argument (such as those shown in Ref. [23]) to show that $H_{\rm circuit}$ simulates $H$ with polynomially small error, with only polynomial overheads. This proves Proposition 2.

Finally, to prove Theorem 2, we map the spatially sparse Hamiltonian $H_{\rm circuit}$ to a Hamiltonian in a universal family on the 2D square lattice with additional polynomial overhead. This is done via a sequence of reductions, with known techniques [10, 17]. We first convert $H_{\rm circuit}$ to a real-valued Hamiltonian by doubling the number of qubits and encoding any Pauli $Y$'s into a pair $Y \otimes Y$. We then remove all Pauli $Y$'s, and reduce the locality of the Hamiltonian to 2-local via applications of perturbative gadgets. This is subsequently converted to a spatially sparse $S_0$-Hamiltonian where $S_0 = \{XX + YY + ZZ\}$ or $\{XX + YY\}$. Throughout this sequence of reductions, the spatial sparsity of the Hamiltonian is preserved, and both the interaction energy and the qubit number increase only polynomially in the original system size and the target precision parameters $(\Delta, \eta, \epsilon)$. This spatially sparse $S_0$-Hamiltonian involving only Pauli interactions without $Y$'s can then be mapped to an $S_0$-Hamiltonian on the 2D lattice. Since the input Hamiltonian is spatially sparse, this mapping only incurs polynomial overhead in the interaction energy, as shown in Refs. [10, 17]. Furthermore, Ref. [10] has shown that an $S_0$-Hamiltonian on a 2D square lattice can be simulated by an $S$-Hamiltonian on a 2D square lattice for any non-2SLD $S$, with polynomial overhead. We have thus provided a construction that allows any such family of $S$-Hamiltonians on the 2D square lattice to efficiently simulate any local Hamiltonian with arbitrary geometry.

D. Constructing 1D universal Hamiltonians

We now show how to construct a strongly universal family of Hamiltonian simulators in 1D. To prove this result as stated in Theorem 3, we extend the circuit-to-1D-Hamiltonian constructions in Refs. [18, 19]. These constructions were originally used to show QMA-completeness of Hamiltonians involving nearest-neighbor interactions on a 1D line of particles, by using them to encode the outcome of any circuit computation in their ground state energy.

The basic idea in these constructions is as follows. Suppose we are given any quantum circuit consisting of $R$ rounds of nearest-neighbor gates on $n$ qubits in 1D such as $U^{\rm NN}_{\rm PE}$, where each round is of the form $U_{n-1,n} \cdots U_{2,3} U_{1,2}$ (some may be identity). We consider an equivalent circuit on a line of $2nR$ particles, divided into $R$ blocks, where each block encodes the computational state of the original $n$ qubits. In this equivalent circuit, a single round of nearest-neighbor gates is applied in each block before the qubits are moved to the next block, where subsequent gates can be performed. The 8 internal dimensions of each particle are necessary to store both the computational state of the original qubit and marker states that allow us to locally distinguish different stages of the computation of each particle (e.g., whether a gate has already been applied or needs to be applied).

We want to apply these constructions to simulate general Hamiltonians. Following the same idea as in the proof of Theorem 2, we first convert the target Hamiltonian $H$ to a phase estimation circuit $U^{\rm NN}_{\rm PE}$ using nearest-neighbor gates on a line of qubits as in Proposition 1. Then, we use the method in Ref. [19] as well as our tricks of "uncomputing" and "idling" to map the circuit to a 1D Hamiltonian of nearest-neighbor interactions, and penalize the energy bit-string so that we simulate the full spectral properties of $H$.
However, a naïve application of this method would yield a highly non-local encoding, since the idling part of the circuit corresponds to simply moving the computational qubits down the line, causing the encoded eigenstates of $H$ to be delocalized over many blocks of particles. To circumvent this issue, we modify the method so that the idling step is done without moving the computational qubits, while maintaining the consistency of all transition rules so that the legal computational history states are spectrally gapped from the rest. Consequently, the eigenstates of $H$ are encoded in the qubit-subspace of some 8-dimensional qudits from just one block of the line, yielding a local encoding. For the full proof with technical details, see Appendix C.

In this construction, our 1D simulator does not have any form of translation-invariance, since its nearest-neighbor terms vary from block to block, where different gates from $U^{\rm NN}_{\rm PE}$ are applied. Note the semi-translation-invariance in our 2D simulator is achieved (as done in Refs. [10–12]) using gadgets to simulate general interactions with a single type of interaction. Since these gadgets require ancilla particles placed in more than one dimension, it appears we cannot apply them to make our 1D construction semi-translation-invariant.

V. DISCUSSION

We have significantly improved the prospects of universal analog quantum simulation by showing that a strong notion of universality is possible with simple families of Hamiltonians embedded in constant dimensions. Unlike previous works [10–13], our results show that only polynomial overheads in both particle number and interaction energy are sufficient to simulate any local Hamiltonian with arbitrary connectivity by some universal Hamiltonians embedded in 1D or 2D. Our results are tight in the sense that the overhead in the interaction energy cannot be brought down to a constant using constant-dimensional Hamiltonian simulators, due to an earlier result [15] that gave counterexamples showing such simulations are impossible. We remark that even though there are weakly universal, fully translation-invariant families of Hamiltonians [12, 13], the interaction energy in those Hamiltonians has to scale exponentially with the size of the target Hamiltonian, since the target Hamiltonian's spectrum is encoded in an exponentially vanishing fraction of the spectrum of the simulator.

An interesting open question is: Are there strongly universal semi-translation-invariant Hamiltonians in 1D? Since the known gadgets to simulate general interactions with a single type of interaction seem to require ancilla particles placed in more than one dimension, we would likely need to invent new gadgetry.

We can also ask if there are strongly universal Hamiltonians with full translation-invariance in any constant dimension. This is impossible if translation-invariance is required in the strongest sense, where the only free parameter of the Hamiltonians is the number of particles $n'$; since such Hamiltonians can be described by $O(\log n')$ bits of information, they cannot represent general Hamiltonians on $n$ qudits that are described by $\mathrm{poly}(n)$ bits unless $n' = \exp(\mathrm{poly}(n))$. Indeed, Refs. [12, 13] constructed such families of Hamiltonians that are weakly universal, but cannot be strongly universal. This implies that not all weakly universal families are strongly universal. To move towards full-TI strong universality, we may consider relaxing the notion of simulation: for example, Ref. [24] constructed a 1D fully translation-invariant family of Hamiltonians that can simulate any Hamiltonian by allowing for polynomial-sized encoding and decoding circuits. However, the desirable properties of analog Hamiltonian simulations, such as preserving locality of observables and noise, will no longer hold once we have encoding circuits that induce non-local correlations. Alternatively, one can consider relaxing translation-invariance by letting Hamiltonian interactions have more free parameters to encode the target Hamiltonian. Ref. [13] has done this to keep the number of particles in the simulator $O(\mathrm{poly}(n))$, but their construction still requires exponential overhead in the interaction energy. Nevertheless, it is likely possible to efficiently simulate any full-TI Hamiltonian by a family of full-TI Hamiltonians, where the descriptions of both Hamiltonians are $O(\log n)$ bits. It would be worthwhile to investigate whether these constructions can be improved, or to show that strong universality of full-TI Hamiltonians is impossible.

While this work has established that efficient universal analog quantum simulation using simple 1D or 2D systems is possible, the constructions we have provided here are far from optimal. Although we have shown that resources scaling only polynomially in the target system size are sufficient for analog simulation of any local Hamiltonian, there is much room for improvement in the scaling for practical applications.
As experimental realizations of analog quantum simulators develop rapidly, we hope our work provides a starting point for researchers to develop methods to expand their scope to simulate all physical systems and tackle classically intractable problems.

[1] R. P. Feynman, International Journal of Theoretical Physics, 467 (1982).
[2] J. I. Cirac and P. Zoller, Nature Physics, 264 (2012).
[3] J. Preskill, Quantum, 79 (2018), arXiv:1801.00862.
[4] J. Argüello-Luengo, A. González-Tudela, T. Shi, P. Zoller, and J. I. Cirac, Nature, 215 (2019).
[5] E. Farhi, J. Goldstone, S. Gutmann, and M. Sipser, (2000), arXiv:quant-ph/0001106.
[6] D. Aharonov, W. van Dam, J. Kempe, Z. Landau, S. Lloyd, and O. Regev, SIAM J. Comput., 166 (2007).
[7] J. Y. Choi, S. Hild, J. Zeiher, P. Schauß, A. Rubio-Abadal, T. Yefsah, V. Khemani, D. A. Huse, I. Bloch, and C. Gross, Science, 1547 (2016), arXiv:1604.04178.
[8] D. Bluvstein, A. Omran, H. Levine, A. Keesling, G. Semeghini, S. Ebadi, T. T. Wang, A. A. Michailidis, N. Maskara, W. W. Ho, S. Choi, M. Serbyn, M. Greiner, V. Vuletic, and M. D. Lukin, (2020), arXiv:2012.12276.
[9] S. Ebadi, T. T. Wang, H. Levine, A. Keesling, G. Semeghini, A. Omran, D. Bluvstein, R. Samajdar, H. Pichler, W. W. Ho, S. Choi, S. Sachdev, M. Greiner, V. Vuletic, and M. D. Lukin, (2020), arXiv:2012.12281.
[10] T. S. Cubitt, A. Montanaro, and S. Piddock, Proceedings of the National Academy of Sciences, 9497 (2018).
[11] S. Piddock and A. Montanaro, (2018), arXiv:1802.07130.
[12] S. Piddock and J. Bausch, (2020), arXiv:2001.08050.
[13] T. Kohler, S. Piddock, J. Bausch, and T. Cubitt, (2020), arXiv:2003.13753.
[14] Y. Cao and D. Nagaj, Quantum Inf. Comput., 1197 (2015).
[15] D. Aharonov and L. Zhou, in Proceedings of the 2019 ACM Conference on Innovations in Theoretical Computer Science, ITCS '19 (2019), arXiv:1804.11084.
[16] A. Y. Kitaev, A. Shen, and M. N. Vyalyi, Classical and Quantum Computation (American Mathematical Society, 2002).
[17] R. Oliveira and B. M. Terhal, Quantum Inf. Comput., 900 (2008).
[18] D. Aharonov, D. Gottesman, S. Irani, and J. Kempe, Communications in Mathematical Physics, 41 (2009).
[19] S. Hallgren, D. Nagaj, and S. Narayanaswami, Quantum Info. Comput., 721 (2013).
[20] S. Sachdev and J. Ye, Phys. Rev. Lett., 3339 (1993); A. Kitaev, A simple model of quantum holography (KITP strings seminar and Entanglement 2015 program, 2015).
[21] H. F. Trotter, Proceedings of the American Mathematical Society, 545 (1959).
[22] C. M. Dawson and M. A. Nielsen, Quantum Info. Comput., 81 (2006).
[23] S. Bravyi and M. Hastings, (2014), arXiv:1410.0703.
[24] T. C. Bohdanowicz and F. G. Brandão, (2017), arXiv:1710.02625.
[25] M. A. Nielsen and I. L. Chuang, Quantum Computation and Quantum Information, 10th ed. (Cambridge University Press, New York, NY, USA, 2011).
[26] J. Kempe, A. Kitaev, and O. Regev, SIAM J. Comput., 1070 (2006).

Appendix A: A 1D nearest-neighbor implementation of phase estimation circuit

In this appendix, we show how, given any local Hamiltonian $H$, to construct a phase estimation circuit such that the energy of any input eigenstate of $H$ can be written down as bits on some ancilla qubits to $O(\log n)$-bit precision with $O(1/\mathrm{poly}(n))$ error in the state. In particular, we show that this can be done with a circuit acting on a line of qubits with nearest-neighbor gates. This will serve as the backbone of our efficient construction of universal Hamiltonian simulators.

Proposition 1 (formal). Consider any $O(1)$-local Hamiltonian $H = \sum_a H_a = \sum_\mu E_\mu |\psi_\mu\rangle\langle\psi_\mu|$ acting on $n$ qubits, where we assume w.l.o.g. that $0 \le E_\mu \le E_{\max}$ for some known number $E_{\max} = O(\mathrm{poly}(n))$. For any $s = O(\log n)$ and $\zeta > 0$, we can construct a phase estimation circuit $U^{\rm NN}_{\rm PE}$ consisting of $O(\mathrm{poly}(n, \zeta^{-1}))$ 1- or 2-qubit nearest-neighbor gates drawn from any universal gate set, acting on a line of $n + m$ qubits, where $m = O(\mathrm{poly}(n))$. For any normalized state $\sum_\mu c_\mu |\psi_\mu\rangle$, the circuit $U^{\rm NN}_{\rm PE}$ satisfies

$$\Big\| U^{\rm NN}_{\rm PE} \sum_\mu c_\mu |\psi_\mu\rangle |0^m\rangle - \sum_\mu c_\mu |\psi_\mu\rangle |\tilde{E}_\mu\rangle |{\rm rest}_\mu\rangle \Big\| \le \zeta \tag{A1}$$

where $|\tilde{E}_\mu\rangle = |\varphi_{\mu,1} \varphi_{\mu,2} \varphi_{\mu,3} \cdots \varphi_{\mu,s}\rangle$ is the $s$-bit truncated representation of $\varphi_\mu = E_\mu / E_{\max} = 0.\varphi_{\mu,1}\varphi_{\mu,2}\varphi_{\mu,3}\cdots$ with $\varphi_{\mu,j} \in \{0, 1\}$, and $|{\rm rest}_\mu\rangle$ is some unimportant state on the remaining ancilla qubits.

Proof. Let us first consider the standard implementation of the quantum phase estimation circuit $U_{\rm PE}$. Here, the circuit uses the evolution operators $u_j = e^{iH\tau 2^{j-1}}$ under $H$, where $\tau = 2\pi / E_{\max}$, and writes the phases of the eigenvalues of $u = e^{iH\tau}$ on some ancilla qubits. Note the eigenvalues of $u$ are $e^{i 2\pi \varphi_\mu}$, where $\varphi_\mu = E_\mu \tau / (2\pi)$. Since $0 \le \varphi_\mu \le 1$, we can write it in binary as $\varphi_\mu = 0.\varphi_{\mu,1}\varphi_{\mu,2}\varphi_{\mu,3}\cdots$.

Ideally, the action of the phase estimation circuit on input states $\{|\psi_\mu\rangle |0^m\rangle\}_{\mu=1}^{2^n}$ is

$$U^{\rm ideal}_{\rm PE} |\psi_\mu\rangle |0^m\rangle = |\psi_\mu\rangle |\tilde{E}_\mu\rangle |{\rm rest}_\mu\rangle. \tag{A2}$$

Correspondingly, let us denote $\tilde{E}_\mu = 2\pi \tilde{\varphi}_\mu / \tau$ as approximate values of the energy $E_\mu$, where $\tilde{\varphi}_\mu = 0.\varphi_{\mu,1}\varphi_{\mu,2}\varphi_{\mu,3}\cdots\varphi_{\mu,s}$. In the ideal case, $E_\mu = \tilde{E}_\mu$ for some sufficiently large $s$.

In reality, there are two sources of errors that cause the phase estimation circuit to deviate from $U^{\rm ideal}_{\rm PE}$.

Error 1: finite-bit-precision. The first is due to the fact that the energy eigenvalues don't generally have a finite-bit-precision representation, i.e., $|E_\mu - \tilde{E}_\mu| = O(2^{-s})$ is non-zero. In other words, since $\varphi_\mu \neq \tilde{\varphi}_\mu$, there is additional error from imprecise phase estimation. Let us consider a phase estimation circuit $U_{\rm PE}$ implemented to $p$-bit precision, where $p > s$.
Let $b_\mu$ be the integer in the range $[0, 2^p - 1]$ such that $0 \le \varphi_\mu - b_\mu / 2^p \le 2^{-p}$. It is well-known [25] that the action of $U_{\rm PE}$ on any input state $|\psi_\mu\rangle |0^m\rangle$ results in the following state

$$U_{\rm PE} |\psi_\mu\rangle |0^m\rangle = |\psi_\mu\rangle |{\rm rest}'_\mu\rangle \otimes \frac{1}{2^p} \sum_{k,\ell=0}^{2^p-1} e^{-i 2\pi k \ell / 2^p} e^{i 2\pi \varphi_\mu k} |\ell\rangle = |\psi_\mu\rangle |{\rm rest}'_\mu\rangle \otimes \sum_{\ell=0}^{2^p-1} \alpha_{\mu\ell} |\ell\rangle \tag{A3}$$

where $|\ell\rangle = |\ell_1 \cdots \ell_p\rangle$ is the binary representation of $\ell$, and

$$\alpha_{\mu\ell} = \frac{1}{2^p} \sum_{k=0}^{2^p-1} \left[ e^{i 2\pi (\varphi_\mu - \ell / 2^p)} \right]^k = \frac{1}{2^p} \left[ \frac{1 - e^{i 2\pi (2^p \varphi_\mu - \ell)}}{1 - e^{i 2\pi (\varphi_\mu - \ell / 2^p)}} \right]. \tag{A4}$$

The analysis from Sec. 5.2.1 in Ref. [25] shows that the probability of getting an outcome $\ell$ that is more than $e$ integers away from $b_\mu$ is

$$p^{\rm error}_\mu(e) \equiv \sum_{|\ell - b_\mu| > e} |\alpha_{\mu\ell}|^2 \le \frac{1}{2(e - 1)}. \tag{A5}$$

Note that we only care about the first $s < p$ bits, so we can choose $e = 2^{p-s} - 1$. Then

$$U_{\rm PE} |\psi_\mu\rangle |0^m\rangle = |\psi_\mu\rangle |{\rm rest}'_\mu\rangle \otimes \Big[ \sum_{|\ell - b_\mu| \le e} \alpha_{\mu\ell} |\ell\rangle + \sum_{|\ell - b_\mu| > e} \alpha_{\mu\ell} |\ell\rangle \Big] = |\psi_\mu\rangle |{\rm rest}'_\mu\rangle \otimes \Big( \sqrt{1 - p^{\rm error}_\mu}\, |\tilde{E}_\mu\rangle |{\rm rest}''_\mu\rangle + \sqrt{p^{\rm error}_\mu}\, |{\rm rest}'''_\mu\rangle \Big). \tag{A6}$$

Comparing this with the idealized output in Eq. (A2), we can identify $|{\rm rest}_\mu\rangle = |{\rm rest}'_\mu\rangle |{\rm rest}''_\mu\rangle$, and observe that

$$(U_{\rm PE} - U^{\rm ideal}_{\rm PE}) |\psi_\mu\rangle |0^m\rangle = |\psi_\mu\rangle |{\rm error}_\mu\rangle, \quad \text{where } \| |{\rm error}_\mu\rangle \| \le p^{\rm error}_\mu = O(2^{-(p-s)}). \tag{A7}$$

Thus, for any normalized state $|\psi\rangle = \sum_\mu c_\mu |\psi_\mu\rangle$, we have

$$\Big\| (U_{\rm PE} - U^{\rm ideal}_{\rm PE}) \sum_\mu c_\mu |\psi_\mu\rangle |0^m\rangle \Big\| = O(2^{-(p-s)}) \le O(\zeta) \tag{A8}$$

where we chose, for example, $p = 2s + O(\log \zeta^{-1})$ and $s = O(\log n)$, and thus make this first source of error due to imprecision smaller than any constant $c$.

Error 2: local gate approximation of $e^{-iH\tau_j}$. The second source of error is due to the fact that we need to implement the circuit $U_{\rm PE}$ using only 1- or 2-qubit gates, in order to ensure the corresponding circuit-Hamiltonian is local. The only non-local gates in the $p$-bit precise phase estimation algorithm that we need to address are the controlled applications of Hamiltonian evolution, $|0\rangle\langle 0| \otimes \mathbb{1} + |1\rangle\langle 1| \otimes u_j$, where $u_j = e^{-iH\tau_j}$, $\tau_j = 2^{j-1}\tau$ and $j = 1, 2, \ldots, p$.
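As a numerical sanity check of the finite-precision analysis in Error 1, the following minimal sketch evaluates the amplitudes of Eq. (A4) and compares the tail probability of Eq. (A5) with the bound $1/(2(e-1))$ for the choice $e = 2^{p-s} - 1$. The function names and the values of `p`, `s`, and `phi` are illustrative assumptions, not taken from the construction; distances are measured cyclically modulo $2^p$, which is also an assumption of this sketch.

```python
import cmath
import math


def qpe_amplitudes(phi, p):
    """Amplitudes alpha_l of Eq. (A4) for a phase phi in [0, 1) and p ancilla bits."""
    size = 2 ** p
    amps = []
    for l in range(size):
        delta = phi - l / size
        denom = 1 - cmath.exp(2j * math.pi * delta)
        if abs(denom) < 1e-12:
            # phi equals l / 2^p exactly: the geometric sum is 2^p ones
            amps.append(1.0 + 0.0j)
        else:
            numer = 1 - cmath.exp(2j * math.pi * size * delta)
            amps.append(numer / (size * denom))
    return amps


def tail_probability(phi, p, e):
    """Probability of outcomes more than e away (cyclically) from b = floor(phi * 2^p)."""
    size = 2 ** p
    b = math.floor(phi * size)
    total = 0.0
    for l, amp in enumerate(qpe_amplitudes(phi, p)):
        dist = min((l - b) % size, (b - l) % size)
        if dist > e:
            total += abs(amp) ** 2
    return total


p, s = 8, 4            # illustrative precision parameters (hypothetical values)
e = 2 ** (p - s) - 1   # the choice of e made in the text
phi = 0.3141           # arbitrary phase that is not exactly representable in p bits
tail = tail_probability(phi, p, e)
bound = 1 / (2 * (e - 1))
print(f"tail probability = {tail:.4f}, bound from (A5) = {bound:.4f}")
```

Increasing `p` at fixed `s` enlarges `e` and shrinks both the bound and the observed tail, which is the mechanism behind the choice of $p$ in Eq. (A8).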
These controlled evolutions can be implemented with local gates via Trotter decomposition [21].

Specifically, we write $H = \sum_{a=1}^{M} H_a$, where $H_a$ is a $k$-local term, and $M = O(\mathrm{poly}(n))$ is the number of terms. We can implement $\tilde{u}_j = \big( \prod_{a=1}^{M} e^{-iH_a \tau_j / r_j} \big)^{r_j}$ for some integer $r_j$, so that $\|\tilde{u}_j - u_j\| \le O(\tau_j^2 / r_j)$. Since $p = O(\log n + \log \zeta^{-1})$, we have $\tau_j = O(2^p / \|H\|) = O(\mathrm{poly}(n, \zeta^{-1}))$. We can then choose $r_j = O(\tau_j^2\, \mathrm{poly}(\zeta^{-1})) = O(\mathrm{poly}(n, \zeta^{-1}))$ to ensure each such error is polynomially small. The error from Trotter decomposition is bounded by

$$\| U^{\rm Trot}_{\rm PE} - U_{\rm PE} \| \le \sum_{j=1}^{p} O(\tau_j^2 / r_j) \le O(\zeta). \tag{A9}$$

The total number of local gates in $U^{\rm Trot}_{\rm PE}$ is $R_{\rm Trot} = O(M \sum_j r_j) = O(\mathrm{poly}(n, \zeta^{-1}))$, and the locality of each gate is at most $k + 1$.

We still need a circuit with only 1- or 2-qubit gates drawn from a universal set of gates. To that end, we can apply the Solovay-Kitaev algorithm to approximate each $(k+1)$-local gate with a sequence of 2-local gates. It is known [22] that to approximate any gate in $SU(d)$ with 2-local gates to $\epsilon$-precision, we need at most $O(d^2\, \mathrm{poly}\log \epsilon^{-1})$ 2-qubit gates from some universal gate set of finite size. Note $d = 2^{k+1}$ when we are approximating $(k+1)$-local gates with 2-qubit gates. As there are at most $R_{\rm Trot}$ such gates, to keep the overall error below $O(\zeta)$, we only need $\epsilon \le O(\zeta / R_{\rm Trot}) = O(1/\mathrm{poly}(n, \zeta^{-1}))$. This means we need to approximate each $(k+1)$-local gate with $R_{\rm SK} = O(4^k\, \mathrm{poly}(\log n, \log \zeta^{-1}))$ 2-qubit gates. The final circuit $U^{\rm local}_{\rm PE}$ consists of only $R_{\rm local} = O(R_{\rm Trot} R_{\rm SK}) = O(\mathrm{poly}(n, \zeta^{-1}))$ 1- or 2-qubit gates, for $k = O(1)$.

We can then ensure that this circuit $U^{\rm local}_{\rm PE}$ only consists of nearest-neighbor gates by the following procedure:

1. Place all $n$ qubits on a line with any pre-determined ordering.

2. Iterate over each gate $U_t$, $t = 1, \ldots, R_{\rm local}$.
If $U_t$ acts on qubits that are not neighbors on the line, add a sequence $S_t$ of swap gates on nearest neighbors in the circuit before $U_t$ so that $U_t$ acts on neighbors. Then add the same swap gates in reversed order in the circuit after $U_t$ so that the qubits return to their original order on the line. At the end of this step, the new circuit is of the form $U^{\rm NN}_{\rm PE} = \prod_{t=1}^{R_{\rm local}} (S_t^\dagger U_t S_t)$, with $R = O(R_{\rm local} N)$ gates, each acting only on a group of neighboring qubits. See Fig. 2(a) for an example.

Putting everything together, we have $U^{\rm NN}_{\rm PE} = U^{\rm local}_{\rm PE} \approx U^{\rm Trot}_{\rm PE} \approx U_{\rm PE} \approx U^{\rm ideal}_{\rm PE}$. In conclusion, for any $\zeta > 0$, we can construct a phase estimation circuit $U^{\rm NN}_{\rm PE}$ comprised of only $O(\mathrm{poly}(n, \zeta^{-1}))$ 1-qubit or 2-qubit nearest-neighbor gates on a line, such that its action is $\zeta$-close to $U^{\rm ideal}_{\rm PE}$ on any valid input state $|\psi\rangle |0^m\rangle$ with $|\psi\rangle = \sum_\mu c_\mu |\psi_\mu\rangle$:

$$\| (U^{\rm NN}_{\rm PE} - U^{\rm ideal}_{\rm PE}) |\psi\rangle |0^m\rangle \| \le \zeta. \tag{A10}$$

Appendix B: Proof that spin models on the 2D lattice are strongly universal

In this section, we give the details of the construction of strongly universal Hamiltonians on the 2D square lattice.

Theorem 2.

Any $\mathcal{S}$-Hamiltonian on the 2D square lattice is strongly universal, as long as $\mathcal{S}$ is non-2SLD. In particular, it is sufficient for $\mathcal{S}$ to contain only a single interaction (such as the Heisenberg or XY interaction), implying that there are semi-translation-invariant Hamiltonians in 2D that are strongly universal.

1. Efficient, spatially sparse Hamiltonian simulator

Before proving the strong universality of 2D spin-lattice models, we first prove the following result, where we show that any local Hamiltonian can be simulated by a spatially sparse Hamiltonian.

Proposition 2.

Given any $O(1)$-local $n$-qudit Hamiltonian $H$ with $\|H\| = O(\mathrm{poly}(n))$, one can construct a spatially sparse 5-local Hamiltonian $H_{\rm circuit}$ that efficiently simulates $H$ to precision $(\Delta, \eta, \epsilon)$, with $\Delta = O(\epsilon^{-1}\|H\|^2 + \eta^{-1}\|H\|)$. $H_{\rm circuit}$ has $O(\mathrm{poly}(n, \epsilon^{-1}))$ terms and qubits, and interaction energy at most $O(\mathrm{poly}(n, \eta^{-1}, \epsilon^{-1}))$.

To prove the above Proposition, we first prove two smaller Lemmas 1 and 2 about different aspects of using the Feynman-Kitaev circuit-to-Hamiltonian construction [16] for Hamiltonian simulation. The following concept of history states will be useful in the discussion:

Definition 7 (history states). Let $U = U_T \cdots U_2 U_1$ be a quantum circuit acting on $n + m$ qudits. Then for any input state $|\psi_\mu\rangle \in \mathbb{C}^{d^n}$, the history state with respect to $U$ and $|\psi_\mu\rangle$ is the following

$$|\eta_\mu\rangle = \frac{1}{\sqrt{T+1}} \sum_{t=0}^{T} \Big( U_t \cdots U_2 U_1 |\psi_\mu\rangle |0^m\rangle_{\rm anc} \Big) |1^t 0^{T-t}\rangle_{\rm clock}. \tag{B1}$$

We now state the first of the two Lemmas, which describes a circuit-to-Hamiltonian transformation that can be used for analog Hamiltonian simulation, assuming an appropriate energy penalty Hamiltonian $H_{\rm out}$ can be constructed.

Lemma 1 (Circuit-Hamiltonian simulation). Consider an orthonormal basis of states $\{|\psi_\mu\rangle\}_{\mu=1}^{d^n}$ on $n$ qudits. Let $U = \prod_{t=1}^{T} U_t$ be a quantum circuit where each gate $U_t$ is at most $k$-local. Let $\mathcal{L} = \mathrm{span}\{|\eta_\mu\rangle\}_{\mu=1}^{d^n}$ be the subspace of history states with respect to $U$ and $\{|\psi_\mu\rangle\}$, and let $H$ be any Hamiltonian. Suppose there exists a Hamiltonian $H_{\rm out}$ such that

$$\| V H V^\dagger - H_{\rm out}|_{\mathcal{L}} \| \le \epsilon / 2, \tag{B2}$$

where $\mathcal{E}(H) = V H V^\dagger$ is a local encoding, and $O|_{\mathcal{L}}$ means operator $O$ restricted to subspace $\mathcal{L}$. Then for any $\eta > 0$, we can construct a Hamiltonian $H_{\rm circuit}$ from the description of $U$ such that $H_{\rm circuit}$ is a $(\Delta, \eta, \epsilon)$-simulation of $H$ with local encoding $\mathcal{E}$, where $\Delta \ge O(\epsilon^{-1}\|H_{\rm out}\|^2 + \eta^{-1}\|H_{\rm out}\|)$, per Def. 2. The constructed $H_{\rm circuit}$ is $(k+3)$-local, has $O(T)$ terms and particles, and uses $O(\mathrm{poly}(n, T, \Delta))$ interaction energy. Furthermore, $H_{\rm circuit}$ is spatially sparse if the circuit $U$ is spatially sparse.

Lemma 2 (Idling to enhance simulation precision). Consider an uncomputed quantum circuit $U_D \cdots U_1 = \mathbb{1}$. Suppose we add $L$ identity gates to the end of the circuit, so that we obtain a new circuit $U = \mathbb{1}^{L} U_D \cdots U_1$ with length $T = D + L$. Let $|\eta_\mu\rangle$ be the history state with respect to $U$ and $|\psi_\mu\rangle$. Suppose $H = \sum_\mu E_\mu |\psi_\mu\rangle\langle\psi_\mu|$ and $H_{\rm eff} = \sum_\mu E_\mu |\eta_\mu\rangle\langle\eta_\mu|$. For any $\epsilon > 0$, if we choose $L = O(D \|H\| / \epsilon)$, then there is an ancilla state $|\alpha\rangle$ such that $\| H \otimes |\alpha\rangle\langle\alpha| - H_{\rm eff} \| \le \epsilon$.

We are now ready to prove our main Proposition 2:

Proof of Proposition 2. Given any $O(1)$-local $n$-qudit Hamiltonian where $d$ is a constant, we can easily convert it to an $O(1)$-local $O(n)$-qubit Hamiltonian by simply encoding each qudit in the subspace of a group of $\lceil \log_2 d \rceil$ qubits. We can separate the extra states in this redundant encoding (when $d$ is not a power of 2) from the relevant part of the spectrum by adding to the Hamiltonian a local energy penalty term acting on each group with $\|H\| = O(\mathrm{poly}(n))$ magnitude. Hence, we will call $H$ the $O(1)$-local $n$-qubit Hamiltonian containing $O(\mathrm{poly}(n))$-strength interactions obtained after this conversion.

Let us denote the normalized eigenstates of $H$ as $|\psi_\mu\rangle$, with corresponding eigenvalues $E_\mu$. We assume they are ordered such that $E_1 \le E_2 \le E_3 \le \cdots \le E_{2^n}$.

From Proposition 1, for any $s = O(\log n)$, we can construct a $\zeta$-approximate, $s$-bit precise phase estimation circuit $U^{\rm NN}_{\rm PE}$ such that it acts on a line of $N = O(\mathrm{poly}(n))$ qubits with $R = O(\mathrm{poly}(n, \zeta^{-1}))$ nearest-neighbor gates. We want to replace it with a spatially sparse circuit $U^{\rm sparse}_{\rm PE}$ with $O(RN)$ qubits and gates (see Definition 6). This can be done with polynomial overhead in the same way as in Refs. [6, 17]. We begin by placing the $N$ original qubits on the first column of an $N \times R$ grid of qubits. For column $i = 1, 2, \ldots, R$, we execute only the $i$-th gate from the circuit $U^{\rm NN}_{\rm PE}$, and the other, uninvolved qubits are acted on by identity gates. After each column, we swap the state of the qubits of column $i$ to column $i + 1$. We order the execution of all the gates such that the gates from $U^{\rm NN}_{\rm PE}$ and identity gates are executed top-to-bottom, and the swap gates between columns are executed bottom-to-top [see Fig. 2(b)]. It is thus easy to see that in this new circuit $U^{\rm sparse}_{\rm PE}$, each qubit participates in at most 3 gates (up to two swap gates and a non-trivial gate), and the gates are executed in a spatially local sequence.
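The column-by-column construction above can be sketched in a few lines. This is a simplified illustration under stated assumptions, not the construction used in the proof: grid sites are labeled `(row, column)`, each input gate is a pair of neighboring line positions, identity gates are omitted, and the names `sparse_schedule` and `max_participation` are hypothetical. The sketch only checks the participation count claimed in the text.

```python
from collections import Counter


def sparse_schedule(n_qubits, gates):
    """Lay out a 1D nearest-neighbor circuit on an n_qubits x len(gates) grid.

    gates: list of (q, q + 1) pairs giving the line positions each gate acts on.
    Returns a list of operations ("gate" or "swap", (site_a, site_b)).
    Identity gates on uninvolved qubits are omitted for brevity.
    """
    ops = []
    n_cols = len(gates)
    for col, (a, b) in enumerate(gates):
        if b != a + 1 or a < 0 or b >= n_qubits:
            raise ValueError("each gate must act on neighboring line positions")
        # the single non-trivial gate of this column
        ops.append(("gate", ((a, col), (b, col))))
        if col + 1 < n_cols:
            # move every qubit to the next column, bottom-to-top
            for row in reversed(range(n_qubits)):
                ops.append(("swap", ((row, col), (row, col + 1))))
    return ops


def max_participation(ops):
    """Largest number of non-identity operations touching any single grid site."""
    counts = Counter()
    for _, sites in ops:
        for site in sites:
            counts[site] += 1
    return max(counts.values())


example_gates = [(0, 1), (1, 2), (0, 1), (2, 3)]  # illustrative circuit
ops = sparse_schedule(4, example_gates)
print(len(ops), max_participation(ops))
```

In this sketch a grid site is touched by at most one incoming swap, one non-trivial gate, and one outgoing swap, which matches the bound of 3 gates per qubit stated above.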
Note the action of $U^{\rm sparse}_{\rm PE}$ is equivalent to $U^{\rm NN}_{\rm PE}$ up to re-ordering of the qubits, since we have only added swap gates. Thus, Proposition 1 gives us

$$\Big\| U^{\rm sparse}_{\rm PE} \sum_\mu c_\mu |\psi_\mu\rangle |0^m\rangle - \sum_\mu c_\mu |\psi_\mu\rangle |\tilde{E}_\mu\rangle \Big\| \le \zeta \tag{B3}$$

where $|\tilde{E}_\mu\rangle = |\varphi_{\mu,1} \varphi_{\mu,2} \varphi_{\mu,3} \ldots \varphi_{\mu,s}\rangle \otimes |{\rm rest}_\mu\rangle$ contains the $s$-bit truncated representation of $\varphi_\mu = E_\mu / E_{\max} = 0.\varphi_{\mu,1}\varphi_{\mu,2}\varphi_{\mu,3}\cdots$, and $E_{\max}$ is the upper bound on the maximum energy of the target Hamiltonian $H$ used in the construction of $U^{\rm NN}_{\rm PE}$. Let

$$\tilde{E}_\mu = E_{\max} \times (0.\varphi_{\mu,1}\varphi_{\mu,2}\varphi_{\mu,3}\ldots\varphi_{\mu,s}) = E_\mu + O(E_{\max} 2^{-s}) \tag{B4}$$

be the truncated approximation to the energy eigenvalue $E_\mu$.

The new spatially sparse circuit $U^{\rm sparse}_{\rm PE}$ now has $t_0 = O(RN) = O(\mathrm{poly}(n, \zeta^{-1}))$ gates. From this, we construct the following uncomputed, spatially sparse circuit

$$U_{\rm circuit} = (\mathbb{1})^{L}\, U^{\rm sparse\dagger}_{\rm PE}\, (\mathbb{1})^{s}\, U^{\rm sparse}_{\rm PE}, \tag{B5}$$

which we will transform into our spatially sparse Hamiltonian. Note we add $U^{\rm sparse\dagger}_{\rm PE}$ for uncomputing and $s + L$ idling identity gates, making the entire circuit gate count $T = 2t_0 + s + L$. The $s$ identity gates are used for local measurements of energy to $s$-bit precision, and $L = O((2t_0 + s)\|H\| / \epsilon) = O(\mathrm{poly}(n, \zeta^{-1}) / \epsilon)$ identity gates are used to ensure $O(\epsilon)$ simulation precision as in Lemma 2. The history states with respect to eigenstate $|\psi_\mu\rangle$ of $H$ and this circuit are

$$|\eta_\mu\rangle = \frac{1}{\sqrt{T+1}} \sum_{t=0}^{T} \Big( U_t \cdots U_2 U_1 |\psi_\mu\rangle |0^m\rangle \Big) |1^t 0^{T-t}\rangle. \tag{B6}$$

We can convert the circuit to a Hamiltonian $H_{\rm circuit}$ using the method described in Lemma 1, where $H_{\rm out}$ is chosen to be

$$H_{\rm out} = (T+1)\, E_{\max} \sum_{b=1}^{s} 2^{-b}\, |1\rangle\langle 1|^{\rm anc}_{b} \otimes P_{\rm clock}(t = t_0 + b). \tag{B7}$$

We also denote $P_{\rm clock}(t) = |110\rangle\langle 110|^{\rm clock}_{t-1,t,t+1}$, which projects onto legal clock states corresponding to time step $t$.

To show that $H_{\rm circuit}$ simulates the original Hamiltonian $H$, we first show that $H_{\rm out}$ restricted to the subspace of history states $\mathcal{L} = \mathrm{span}\{|\eta_\mu\rangle : 1 \le \mu \le 2^n\}$ can be approximated by the following effective Hamiltonian

$$H_{\rm eff} = \sum_\mu E_\mu |\eta_\mu\rangle\langle\eta_\mu|. \tag{B8}$$

Consider arbitrary states $|\eta\rangle \in \mathcal{L}$. We write $|\eta\rangle = \sum_\mu a_\mu |\eta_\mu\rangle$, and observe

$$\langle\eta| H_{\rm out} |\eta\rangle = E_{\max} \sum_{b=1}^{s} 2^{-b} \Big[ \sum_\nu a_\nu^* \langle\psi_\nu| \langle 0^m| \Big] U^{\rm sparse\dagger}_{\rm PE}\, |1\rangle\langle 1|_b\, U^{\rm sparse}_{\rm PE} \Big[ \sum_\mu a_\mu |\psi_\mu\rangle |0^m\rangle \Big]. \tag{B9}$$

Then using (B3), we have

$$\langle\eta| H_{\rm out} |\eta\rangle = E_{\max} \sum_{b=1}^{s} 2^{-b} \Big[ \sum_\nu a_\nu^* \langle\psi_\nu| \langle\tilde{E}_\nu| + \langle\zeta| \Big] |1\rangle\langle 1|_b \Big[ \sum_\mu a_\mu |\psi_\mu\rangle |\tilde{E}_\mu\rangle + |\zeta\rangle \Big] \tag{B10}$$

where $|\zeta\rangle$ is some residual state vector with $\| |\zeta\rangle \| \le \zeta$. Hence

$$|\langle\eta| H_{\rm out} - H_{\rm eff} |\eta\rangle| \le \sum_\mu |a_\mu|^2 |\tilde{E}_\mu - E_\mu| + 2 s \zeta E_{\max} \le \max_\mu |\tilde{E}_\mu - E_\mu| + 2 s \zeta E_{\max} \le (2^{-s} + 2 s \zeta) E_{\max}. \tag{B11}$$

We can ensure this is always at most $\epsilon/4$ by choosing

$$s = \log_2(8 E_{\max} / \epsilon) = O(\log n + \log \epsilon^{-1}) \quad \text{and} \quad \zeta = \epsilon / (16 s E_{\max}) = O(1/\mathrm{poly}(n, \epsilon^{-1})), \tag{B12}$$

since each of the two terms in (B11) then equals $\epsilon/8$. Hence,

$$|\langle\eta| H_{\rm out} - H_{\rm eff} |\eta\rangle| \le \epsilon/4 \quad \forall\, |\eta\rangle \in \mathcal{L} \implies \| H_{\rm eff} - H_{\rm out}|_{\mathcal{L}} \| \le \epsilon/4. \tag{B13}$$

If we choose $L$ idling gates such that $\| H \otimes |\alpha\rangle\langle\alpha| - H_{\rm eff} \| \le \epsilon/4$ for some ancilla state $|\alpha\rangle$ by Lemma 2, then together with (B13) we have

$$\| H \otimes |\alpha\rangle\langle\alpha| - H_{\rm out}|_{\mathcal{L}} \| \le \epsilon/2. \tag{B14}$$

Observe that we can rewrite $H \otimes |\alpha\rangle\langle\alpha| = V H V^\dagger$, where $V |\psi\rangle = |\psi\rangle |\alpha\rangle\ \forall\, |\psi\rangle \in \mathbb{C}^{2^n}$ is an isometry. Hence, by Lemma 1, for any $\eta > 0$, the constructed $H_{\rm circuit}$ simulates $H$ to precision $(\Delta, \eta, \epsilon)$, where $\Delta = O(\epsilon^{-1}\|H_{\rm out}\|^2 + \eta^{-1}\|H_{\rm out}\|) = O(\epsilon^{-1}\|H\|^2 + \eta^{-1}\|H\|)$. Note that $H_{\rm circuit}$ is spatially sparse since $U^{\rm sparse}_{\rm PE}$ is spatially sparse. Since $U^{\rm sparse}_{\rm PE}$ contains at most 2-local gates, $H_{\rm circuit}$ is at most 5-local. Furthermore, $H_{\rm circuit}$ contains $O(T) = O(\mathrm{poly}(n, \zeta^{-1}) / \epsilon) = O(\mathrm{poly}(n, \epsilon^{-1}))$ terms (and qubits), with $O(\mathrm{poly}(n, T, \epsilon^{-1}, \eta^{-1}, \|H_{\rm out}\|)) = O(\mathrm{poly}(n, \eta^{-1}, \epsilon^{-1}))$ interaction energy.

To finish the proof, we just need to prove Lemmas 1 and 2. We start with the proof of Lemma 1.

Proof of Lemma 1. For a given circuit $U = U_T \cdots U_2 U_1$, the corresponding circuit-Hamiltonian is

$$H_{\rm circuit} = H_0 + H_{\rm out} \tag{B15}$$

where

$$H_0 = J_{\rm clock} H_{\rm clock} + J_{\rm prop} H_{\rm prop} + J_{\rm in} H_{\rm in}. \tag{B16}$$

The role of $H_0$ is to isolate $\mathcal{L} = \mathrm{span}\{|\eta_\mu\rangle\}$ as its zero-energy groundspace, separated by a large spectral gap $2\Delta$ from the rest of the eigenstates. Then $H_{\rm out}$ is used to recover the eigenvalue structure of $H$ in the subspace $\mathcal{L}$, allowing $H_{\rm circuit}$ to simulate $H$.

Now we give the explicit form of the circuit-Hamiltonian. The first part of $H_0$ is

$$H_{\rm clock} = \sum_{t=1}^{T-1} |01\rangle\langle 01|^{\rm clock}_{t,t+1}, \tag{B17}$$

which sets the legal state configurations in the clock register to be of the form $|t\rangle_{\rm clock} \equiv |1^t 0^{T-t}\rangle_{\rm clock}$.
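To make the role of the clock penalty in Eq. (B17) concrete, the following minimal sketch treats $H_{\rm clock}$ as a classical energy function on clock bit-strings (it is diagonal in the computational basis) and checks by brute force that its zero-energy configurations are exactly the $T + 1$ unary strings $1^t 0^{T-t}$. The function names and the value `T = 5` are illustrative assumptions.

```python
from itertools import product


def clock_energy(bits):
    """Energy under H_clock: number of adjacent pairs (t, t+1) in the pattern 01."""
    return sum(
        1
        for t in range(len(bits) - 1)
        if bits[t] == 0 and bits[t + 1] == 1
    )


def legal_clock_states(T):
    """Unary clock strings 1^t 0^(T-t) for t = 0, ..., T."""
    return [tuple([1] * t + [0] * (T - t)) for t in range(T + 1)]


def zero_energy_states(T):
    """All T-bit strings that incur no H_clock penalty, found by enumeration."""
    return [bits for bits in product((0, 1), repeat=T) if clock_energy(bits) == 0]


T = 5  # illustrative clock length
legal = set(legal_clock_states(T))
ground = set(zero_energy_states(T))
illegal_energies = [
    clock_energy(bits) for bits in product((0, 1), repeat=T) if bits not in legal
]
print(len(ground), ground == legal, min(illegal_energies))
```

Every illegal clock string pays an energy of at least 1 in this enumeration, so scaling $H_{\rm clock}$ by $J_{\rm clock}$ separates the legal clock subspace from the rest by at least $J_{\rm clock}$.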
Then, we simulate the state propagation under the circuit using

$$H_{\rm prop} = \sum_{t=1}^{T} H_{{\rm prop},t}, \tag{B18}$$

where

$$H_{{\rm prop},t} = \mathbb{1} \otimes |100\rangle\langle 100|^{\rm clock}_{t-1,t,t+1} - U_t \otimes |110\rangle\langle 100|^{\rm clock}_{t-1,t,t+1} - U_t^\dagger \otimes |100\rangle\langle 110|^{\rm clock}_{t-1,t,t+1} + \mathbb{1} \otimes |110\rangle\langle 110|^{\rm clock}_{t-1,t,t+1} \quad \text{for } 1 < t < T,$$

$$H_{{\rm prop},1} = \mathbb{1} \otimes |00\rangle\langle 00|^{\rm clock}_{12} - U_1 \otimes |10\rangle\langle 00|^{\rm clock}_{12} - U_1^\dagger \otimes |00\rangle\langle 10|^{\rm clock}_{12} + \mathbb{1} \otimes |10\rangle\langle 10|^{\rm clock}_{12},$$

and

$$H_{{\rm prop},T} = \mathbb{1} \otimes |10\rangle\langle 10|^{\rm clock}_{T-1,T} - U_T \otimes |11\rangle\langle 10|^{\rm clock}_{T-1,T} - U_T^\dagger \otimes |10\rangle\langle 11|^{\rm clock}_{T-1,T} + \mathbb{1} \otimes |11\rangle\langle 11|^{\rm clock}_{T-1,T}.$$

These terms check that the propagation of states from time $t - 1$ to $t$ is correct. Now, we also need to ensure that the input states are valid, i.e., the ancilla qudits are in the state $|0^m\rangle_{\rm anc}$ when $t = 0$ (i.e., the clock register is $|0^T\rangle_{\rm clock}$). This can be done using

$$H_{\rm in} = \sum_{i=1}^{m} (\mathbb{1} - |0\rangle\langle 0|)^{\rm anc}_{i} \otimes |0\rangle\langle 0|^{\rm clock}_{t_{\min}(i)}, \tag{B19}$$

where $t_{\min}(i) = \min\{t : U_t \text{ acts nontrivially on ancilla qudit } i\}$. In other words, for each ancilla qudit $i$, $H_{\rm in}$ penalizes the ancilla if it is not in the state $|0\rangle$ before it is first used by the $t_{\min}(i)$-th gate. Note that $H_{\rm circuit}$ has $O(T)$ terms, each of which is at most $(k+3)$-local when the $U_t$ are $k$-local. If $U$ is spatially sparse, then it is easy to see that $H_{\rm circuit}$ is also spatially sparse.

Note that $H_0 \mathcal{L} = 0$. We then need to lower bound the spectral gap of $H_0$, i.e., $\lambda(H_0|_{\mathcal{L}^\perp})$, where $\lambda(H)$ denotes the lowest eigenvalue of $H$. To that end, let us denote the following subspaces:

$$S_{\rm clock} = \mathrm{span}\{ |\psi\rangle |y\rangle |1^t 0^{T-t}\rangle : |\psi\rangle \in \mathbb{C}^{d^n} \text{ and } |y\rangle \in \mathbb{C}^{d^m},\ 0 \le t \le T \}, \tag{B20}$$

$$S_{\rm prop} = \mathrm{span}\Big\{ |\eta_\mu, y\rangle \equiv \frac{1}{\sqrt{T+1}} \sum_{t=0}^{T} \Big( U_t \cdots U_2 U_1 |\psi_\mu\rangle |y\rangle \Big) |1^t 0^{T-t}\rangle : 1 \le \mu \le d^n,\ 0 \le y \le d^m - 1 \Big\}. \tag{B21}$$

Note that $\mathcal{L} \subset S_{\rm prop} \subset S_{\rm clock}$. Let us denote $\tilde{A} = A \cap \mathcal{L}^\perp$ for any subspace $A$. Note $H_{\rm clock} S_{\rm clock} = 0$, $H_{\rm prop} S_{\rm prop} = 0$, $H_{\rm in} \mathcal{L} = 0$. We will use the following Projection Lemma 3:

Lemma 3 (Projection Lemma, adapted from [26]). Let $H = H_1 + H_2$ be the sum of two Hamiltonians operating on some Hilbert space $\mathcal{H} = S \oplus S^\perp$. Assuming that $H_2$ has a zero-energy eigenspace $S \subseteq \mathcal{H}$ so that $H_2 S = 0$, and that the minimum eigenvalue $\lambda(H_2|_{S^\perp}) \ge J > 2\|H_1\|$, then

$$\lambda(H_1|_S) - \frac{\|H_1\|^2}{J - 2\|H_1\|} \le \lambda(H) \le \lambda(H_1|_S). \tag{B22}$$

In particular, if $J \ge K\|H_1\|^2 + 2\|H_1\| = O(K\|H_1\|^2)$, we have $\lambda(H_1|_S) - \frac{1}{K} \le \lambda(H) \le \lambda(H_1|_S)$.

Applying Lemma 3 to $H_0$, we obtain

$$\lambda(H_0|_{\mathcal{L}^\perp}) \ge \lambda\big[ (J_{\rm prop} H_{\rm prop} + J_{\rm in} H_{\rm in})|_{\tilde{S}_{\rm clock}} \big] - \frac{1}{K} \quad \text{if } J_{\rm clock} = O(K \|J_{\rm prop} H_{\rm prop} + J_{\rm in} H_{\rm in}\|^2) \tag{B23}$$

$$\ge \lambda\big[ (J_{\rm in} H_{\rm in})|_{\tilde{S}_{\rm prop}} \big] - \frac{2}{K} \quad \text{if } J_{\rm prop} / T^2 = O(K \|J_{\rm in} H_{\rm in}\|^2) \tag{B24}$$

where we used the fact that $\lambda(H_{\rm clock}|_{S^\perp_{\rm clock}}) \ge 1$, and $\lambda(H_{\rm prop}|_{S^\perp_{\rm prop}}) \ge c/T^2$ for some constant $c$. We now lower bound (B24). Let us denote $\hat{n} = \mathbb{1} - |0\rangle\langle 0|$. Then within $S_{\rm clock}$, we can rewrite

$$H_{\rm in}|_{S_{\rm clock}} = \sum_{i=1}^{m} \hat{n}^{\rm anc}_i \otimes \sum_{0 \le t \le t_{\min}(i)} |t\rangle\langle t|_{\rm clock} = \sum_{t=0}^{\max_i t_{\min}(i)} H_{{\rm in},t} \tag{B25}$$

where $H_{{\rm in},t} = \sum_{\{i : t \le t_{\min}(i)\}} \hat{n}^{\rm anc}_i \otimes |t\rangle\langle t|_{\rm clock}$. In particular, $H_{{\rm in},t=0} = \sum_{i=1}^{m} \hat{n}^{\rm anc}_i \otimes |t = 0\rangle\langle t = 0|$. Thus, for any $|\eta_\mu, y\rangle, |\eta_\nu, y'\rangle \in \tilde{S}_{\rm prop}$, where necessarily $y, y' > 0$, we have

$$\langle\eta_\nu, y'| H_{{\rm in},t=0} |\eta_\mu, y\rangle = \frac{1}{T+1} \langle\psi_\nu| \langle y'| H_{{\rm in},t=0} |\psi_\mu\rangle |y\rangle = \frac{1}{T+1} \delta_{\mu\nu} \langle y'| \sum_{i=1}^{m} \hat{n}^{\rm anc}_i |y\rangle = \frac{1}{T+1} \delta_{\mu\nu} \delta_{y,y'} \times w(y), \tag{B26}$$

where $w(y)$ is the Hamming weight of $y$ in $d$-ary representation, which is at least 1 for any $y > 0$. Hence, the minimum eigenvalue of $H_{{\rm in},t=0}|_{\tilde{S}_{\rm prop}}$ is $1/(T+1)$. Since $H_{\rm in}$ consists of only positive semi-definite terms, we have

$$\lambda(H_{\rm in}|_{\tilde{S}_{\rm prop}}) \ge \lambda(H_{{\rm in},t=0}|_{\tilde{S}_{\rm prop}}) \ge 1/(T+1). \tag{B27}$$

Thus, to ensure that $H_0$ has spectral gap $\lambda(H_0|_{\mathcal{L}^\perp}) \ge 2\Delta$, we can choose $J_{\rm in} = O(\Delta(T+1))$, $J_{\rm prop} = O(K T^2 J_{\rm in}^2 m^2)$, and $J_{\rm clock} = O(K J_{\rm prop}^2 T^2) = O(\mathrm{poly}(n, T, \Delta))$.

Now that we have shown $H_0$ has $\mathcal{L}$ as its groundspace with spectral gap $2\Delta$, we are ready to show that $H_{\rm circuit}$ $(\Delta, \eta, \epsilon)$-simulates $H$ with only polynomial overhead in energy. To this end, we use the following result regarding perturbative reductions, adapted from Lemma 4 of [23] (also Lemma 35 of [10]):

Lemma 4 (First-order reduction, adapted from [23]). Suppose $\tilde{H} = H_0 + H_{\rm out}$, defined on Hilbert space $\tilde{\mathcal{H}} = \mathcal{L} \oplus \mathcal{L}^\perp$ such that $H_0 \mathcal{L} = 0$ and $\lambda(H_0|_{\mathcal{L}^\perp}) \ge 2\Delta$. Suppose $H$ is a Hermitian operator and $V$ is an isometry such that $\| V H V^\dagger - H_{\rm out}|_{\mathcal{L}} \| \le \epsilon/2$, then $\tilde{H}$ is a $(\Delta, \eta, \epsilon)$-simulation of $H$, as long as $\Delta \ge O(\epsilon^{-1}\|H_{\rm out}\|^2 + \eta^{-1}\|H_{\rm out}\|)$, per Def. 2. In other words, $\| \tilde{H}_{\le \Delta} - \tilde{V} H \tilde{V}^\dagger \| \le \epsilon$ for some isometry $\tilde{V}$ where $\| \tilde{V} - V \| \le \eta$.

Observe we are given in the premise of this Lemma 1 that ‖

V HV † − H out | L (cid:107) ≤ (cid:15)/ . (B28)Hence, H circuit = H + H out simulates H to precision (∆ , η, (cid:15) ) where ∆ ≥ O ( (cid:15) − (cid:107) H out (cid:107) + η − (cid:107) H out (cid:107) ). The maximuminteraction energy in H circuit is J clock = O (poly( n, T, ∆)) = O (poly( n, T, (cid:15) − , η − , (cid:107) H out (cid:107) )). This concludes the proofof Lemma 1.We now prove the second Lemma, which shows that in order to ensure the circuit-Hamiltonian simulates the originalHamiltonian with good precision with trivial encoding, we only need to add O (poly( n, (cid:15) − )) “idling” identity gatesto the end of a polynomial-sized circuit before transforming the circuit back to a Hamiltonian. Proof of Lemma 2 . Note that we can write | η µ (cid:105) = (cid:112) − χ | ψ µ (cid:105) ⊗ | α (cid:105) + χ | β µ (cid:105) (B29)5where | α (cid:105) = 1 √ L + 1 | m (cid:105) anc ⊗ D + L (cid:88) t = D | t T − t (cid:105) clock , (B30) | β µ (cid:105) = 1 √ D D − (cid:88) t =0 (cid:16) U t · · · U U | ψ µ (cid:105) | m (cid:105) anc (cid:17) | t T − t (cid:105) clock , (B31)and χ = (cid:112) D/ ( D + L + 1) . (B32)Observe that (cid:104) β µ | ( | ψ ν (cid:105) | α (cid:105) ) = 0 since the clock register are at diﬀerent times, and (cid:104) β µ | β ν (cid:105) = 1 D D − (cid:88) t =0 (cid:104) ψ µ | ψ ν (cid:105) = δ µν . (B33)Let P anc = | α (cid:105)(cid:104) α | . Then H eﬀ − H ⊗ P anc = (cid:88) µ (cid:104) E µ | η µ (cid:105)(cid:104) η µ | − E µ | ψ µ (cid:105)(cid:104) ψ µ | ⊗ | α (cid:105)(cid:104) α | (cid:105) = (cid:77) µ (cid:32) − E µ χ E µ χ (cid:112) − χ E µ χ (cid:112) − χ E µ χ (cid:33) (B34)And so (cid:107) H eﬀ − H ⊗ P anc (cid:107) ≤ χ max µ E µ ≤ χ (cid:107) H (cid:107) . (B35)To ensure (cid:107) H eﬀ − H ⊗ P anc (cid:107) ≤ (cid:15) , it’s suﬃcient to choose L so that χ (cid:107) H (cid:107) = (cid:15) . Plugging in χ = (cid:112) D/ ( D + L + 1), weﬁnd that it is suﬃcient to choose L = O ( D (cid:107) H (cid:107) (cid:15) ).
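As a numerical sanity check of the last step of the proof of Lemma 2, the following minimal Python sketch computes $\chi$ from (B32), picks an idling length $L$ with $\chi\|H\| \le \epsilon$, and confirms that the spectral norm of one $2 \times 2$ block of (B34) equals $\chi E_\mu$. The function names `chi`, `idle_steps`, and `block_norm`, and the numbers used, are illustrative assumptions and not part of the original construction.

```python
import math

def chi(D, L):
    """Weight of the computing branch, chi = sqrt(D / (D + L + 1)), as in (B32)."""
    return math.sqrt(D / (D + L + 1))

def idle_steps(D, norm_H, eps):
    """An integer L >= 0 with chi(D, L) * norm_H <= eps.

    Solving D / (D + L + 1) = (eps / norm_H)^2 for L gives
    L = D * (norm_H / eps)^2 - D - 1, i.e. L = O(D * norm_H^2 / eps^2).
    """
    return max(0, math.ceil(D * (norm_H / eps) ** 2 - D - 1))

def block_norm(E, x):
    """Spectral norm of E * [[-x^2, x*sqrt(1-x^2)], [x*sqrt(1-x^2), x^2]]."""
    a, d = -E * x * x, E * x * x
    b = E * x * math.sqrt(1.0 - x * x)
    # Eigenvalues of a real symmetric 2x2 matrix [[a, b], [b, d]].
    mean = (a + d) / 2.0
    radius = math.hypot((a - d) / 2.0, b)
    return max(abs(mean + radius), abs(mean - radius))

# Illustrative (assumed) numbers: circuit depth D, norm of H, target error eps.
D, norm_H, eps = 10, 2.0, 0.5
L = idle_steps(D, norm_H, eps)
x = chi(D, L)
print(L, x, block_norm(norm_H, x))
```

With these assumed inputs the block norm coincides with $\chi \|H\|$, which is the quantity bounded in (B35).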

2. Proof of Theorem 2 – Strongly Universal Hamiltonian on 2D Square Lattice

In this subsection, we show how to transform the spatially sparse Hamiltonian constructed previously into a Hamiltonian from a universal family of 2D spin-lattice models, with only polynomial overhead, proving our main Theorem 2. This follows from our Proposition 2 and the following result from Ref. [10]:

Lemma 5 (Essentially Ref. [10]). Given any $k$-local $n$-qudit Hamiltonian $H$ that is spatially sparse, we can construct $H'$ from a family of $S$-Hamiltonians on the 2D square lattice that efficiently $(\Delta, \eta, \epsilon)$-simulates $H$, as long as $S$ is non-2SLD. Here, $\Delta = O(\mathrm{poly}(\|H\|, \eta^{-1}, \epsilon^{-1}))$, and $H'$ has $O(\mathrm{poly}(n, \epsilon^{-1}))$ qubits and interaction energy $O(\mathrm{poly}(\Delta))$.

Then our main Theorem is a simple consequence of the above Lemma and our Proposition 2:

Proof of Theorem 2. As we showed in Proposition 2, any $O(1)$-local qudit Hamiltonian $H$ with $\|H\| = O(\mathrm{poly}(n))$ can be simulated by a spatially sparse 5-local Hamiltonian $H_{\text{circuit}}$ with $O(\mathrm{poly}(n)/\epsilon)$ terms and qubits, and interaction energy at most $O(\mathrm{poly}(n, \eta^{-1}, \epsilon^{-1}))$. By Lemma 5, we can simulate $H_{\text{circuit}}$ by an $S$-Hamiltonian on the 2D square lattice with polynomial overhead, as long as $S$ is non-2SLD.

Although it was essentially shown in Ref. [10], we provide here a sketch of the proof of Lemma 5 for completeness.

Proof Sketch of Lemma 5. To show this, we use a sequence of reductions originally described in Refs. [10, 17], which together perform the desired transformation. These reductions are enumerated in the following list of Lemmas:

Lemma 6 (Lemma 21 of [10]). Given any $k$-local Hamiltonian $H$ on $n$ qudits, we can construct a $k\lceil \log_2 d \rceil$-local Hamiltonian $H'$ on $n\lceil \log_2 d \rceil$ qubits that $(\Delta, 0, 0)$-simulates $H$, for $\Delta \ge \|H\|$. For $d = O(1)$, the construction preserves spatial sparsity, and uses terms of interaction energy $O(\Delta)$.

The construction encodes each qudit in $\lceil \log_2 d \rceil$ qubits, and uses local terms of strength $\Delta$ to penalize any redundant states among the qubits. Specifically, consider any isometry $W: \mathbb{C}^d \to (\mathbb{C}^2)^{\otimes \lceil \log_2 d \rceil}$. The construction maps $H$ to $H' = W^{\otimes n} H W^{\dagger \otimes n} + \Delta' \sum_{i=1}^{n} P_i$, for any $\Delta' > \Delta$, where $P = \mathbb{1} - W W^\dagger$. It is easy to see that $H'$ is spatially sparse if $H$ is spatially sparse and $d = O(1)$.

Lemma 7 (Lemma 22 of [10]). Given any $k$-local $n$-qubit Hamiltonian $H$, we can construct a real-valued $2k$-local $2n$-qubit Hamiltonian $H'$ that $(\Delta, 0, 0)$-simulates $H$, for any $\Delta \ge \|H\|$. The construction preserves spatial sparsity, and uses terms of interaction energy $O(\Delta)$.

The construction adds one additional qubit per original qubit, and maps the individual Pauli operators in the following way:

$$ \mathbb{1} \mapsto \mathbb{1} \otimes \mathbb{1}, \quad \sigma_{x,z} \mapsto \mathbb{1} \otimes \sigma_{x,z}, \quad \sigma_y \mapsto \sigma_y \otimes \sigma_y. \tag{B36} $$

For each new pair of qubits $(i, n+i)$, an additional local term $\Delta'(Y_i Y_{n+i} + \mathbb{1})$ is added, where $\Delta' > \Delta$. Note that the new Hamiltonian is real-valued, and spatially sparse if $H$ is spatially sparse.

Lemma 8 (Lemma 39 of [10]). A real-valued $k$-local qubit Hamiltonian $H$ with $M$ terms can be $(\Delta, \eta, \epsilon)$-simulated by a real $(k+1)$-local Hamiltonian with $O(M+n)$ qubits and terms, whose Pauli-decomposition contains no $Y$ terms. The construction preserves spatial sparsity, and uses interaction energy at most $\Delta = O(\mathrm{poly}(\|H\|, \eta^{-1}, \epsilon^{-1}))$.
The construction here takes any term in $H$, which necessarily contains an even number of $Y$'s, and recreates it with a perturbative gadget involving only $X$, $Z$ terms and an additional mediator qubit $a$. Specifically, the gadget performs the following mapping:

$$ Y^{\otimes 2m} \otimes A \mapsto \Delta h_0 + \sqrt{\Delta}\, h_1, \tag{B37} $$

where

$$ h_0 = (\mathbb{1} + Z_a)/2 = |0\rangle\langle 0|_a, \qquad h_1 = X_a \big( X^{\otimes 2m} \otimes \mathbb{1} + (-1)^{m+1} Z^{\otimes 2m} \otimes A \big). \tag{B38} $$

Every term is mapped in parallel with an independent mediator qubit gadget. Since there are $M$ terms in $H$, we have at most $O(M+n)$ qubits in the end. The large interaction energy $\Delta = O(\mathrm{poly}(\|H\|, \eta^{-1}, \epsilon^{-1}))$ is required to ensure small errors from perturbation. It is easy to see that if $H$ is spatially sparse, so is the new Hamiltonian.

Lemma 9 (Theorem 40 of [10]). Suppose $H$ is any $k$-local qubit Hamiltonian with $M$ terms whose Pauli-decomposition contains no $Y$ terms. Then $H$ can be simulated to precision $(\Delta, \eta, \epsilon)$ by a 2-local qubit Hamiltonian with $O(M+n)$ terms and qubits, whose Pauli-decomposition contains no $Y$ terms. The construction preserves spatial sparsity, and uses interaction energy at most $\Theta(\Delta) = O(\mathrm{poly}(\|H\|, \eta^{-1}, \epsilon^{-1}))$, assuming $k = O(1)$.

This construction makes use of the subdivision and 3-to-2 local gadgets, which are described in detail in Ref. [17]. Each $k$-local term of the form $A \otimes B$ can be mapped to $(\lceil k/2 \rceil + 1)$-local terms of the form $A \otimes X_w + X_w \otimes B$ using a subdivision gadget that introduces an extra ancilla qubit $w$. Thus, $O(\log k)$ applications of the subdivision gadget are sufficient to reduce the locality to 3-local. This is then reduced to 2-local terms using the 3-to-2 local gadget, which converts a term of the form $A \otimes B \otimes C$ to 2-local terms such as $(A - B) X_w$, $C |1\rangle\langle 1|_w$, $AB$, and $(A^2 + B^2) C$. In other words, the construction maps each $k$-local term to $O(k)$ 2-local terms mediated by $O(k)$ ancilla qubits. This mapping clearly preserves spatial sparsity.
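To illustrate the $O(\log k)$ count of subdivision rounds mentioned above, here is a minimal Python sketch that iterates the locality map $k \mapsto \lceil k/2 \rceil + 1$ until the locality reaches 3. The function name `subdivision_rounds` is a hypothetical helper introduced only for this illustration; it tracks locality alone and does not model the gadget Hamiltonians or their interaction energies.

```python
def subdivision_rounds(k):
    """Count applications of k -> ceil(k/2) + 1 needed to reach locality <= 3."""
    rounds = 0
    while k > 3:
        k = (k + 1) // 2 + 1  # integer form of ceil(k/2) + 1
        rounds += 1
    return rounds

# Locality 3 is a fixed point of the map, so the loop stops there.
for k in (3, 4, 8, 16, 64):
    print(k, subdivision_rounds(k))
```

The number of rounds grows by roughly one each time $k$ doubles, consistent with the logarithmic scaling stated in the text.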
The required interaction energy blows up exponentially in $k$; however, since $k = O(1)$, the interaction energy required is at most $O(\mathrm{poly}(\|H\|, \eta^{-1}, \epsilon^{-1}))$.

Lemma 10 (Theorem 41 of [10]). Suppose $H$ is a 2-local $n$-qubit Hamiltonian whose Pauli-decomposition contains no $Y$ terms. Then it can be simulated by a $4n$-qubit $S$-Hamiltonian to precision $(\Delta, \eta, \epsilon)$, where $S = \{XX + YY + ZZ\}$ or $\{XX + YY\}$. The construction preserves spatial sparsity, and uses interaction energy at most $\Theta(\Delta) = O(\mathrm{poly}(\|H\|, \eta^{-1}, \epsilon^{-1}))$.

Here, the construction uses a perturbative gadget that maps every logical qubit in $H$ to a group of 4 physical qubits that interact only via terms from $S$. The 1-local and 2-local interactions on any two logical qubits can be implemented using two-body terms from $S$ coupling different pairs of physical qubits from the two groups. Hence, spatial sparsity is preserved by this construction. The required interaction energy scales as $\Theta(\Delta) = O(\mathrm{poly}(\|H\|, \eta^{-1}, \epsilon^{-1}))$.

Finally, we restate the following two results from Ref. [17]:

Lemma 11 (Lemma 46 of [10]). Let $S$ be either $\{XX + YY + ZZ\}$ or $\{XX + YY\}$. Any spatially sparse $S$-Hamiltonian on $n$ qubits, whose largest interaction energy is $\Lambda$, can be simulated by an $S$-Hamiltonian on a 2D square lattice of $\mathrm{poly}(n)$ qubits using interaction energy at most $J_{ij} = O(\mathrm{poly}(n \Lambda (1/\epsilon + 1/\eta)))$.

Lemma 12 (Theorem 42 of [10]). Let $S$ be a set of 2-qubit interactions that is non-2SLD. Then given an $\{XX + YY + ZZ\}$- or $\{XX + YY\}$-Hamiltonian on the 2D square lattice, we can simulate it with an $S$-Hamiltonian on the 2D square lattice.

Denote by $S_0$ either $\{XX + YY + ZZ\}$ or $\{XX + YY\}$. For any $S$ that is non-2SLD, we map $H_{\text{circuit}}$ to an $S$-Hamiltonian on the 2D square lattice in the following sequence:

1. By Lemma 6, we can simulate $H_{\text{circuit}}$ with $H_1$ that is spatially sparse and $O(1)$-local, on $O(\mathrm{poly}(n)/\epsilon)$ qubits and with interaction energy at most $O(\|H_{\text{circuit}}\|) = O(\mathrm{poly}(n, \eta^{-1}, \epsilon^{-1}))$.
2. By Lemma 7, we can simulate $H_1$ with $H_2$ that is spatially sparse, real-valued, and $O(1)$-local, with only polynomial overhead in qubit number and interaction energy.
3. By Lemma 8, we can simulate $H_2$ with $H_3$ that is spatially sparse and $O(1)$-local, and contains no $Y$ terms in its Pauli-decomposition, with only polynomial overhead in qubit number and interaction energy.
4. By Lemma 9, we can simulate $H_3$ with $H_4$ that is spatially sparse and 2-local, and contains no $Y$ terms in its Pauli-decomposition, with polynomial overhead.
5. By Lemma 10, we can simulate $H_4$ with $H_5$ that is a spatially sparse $S_0$-Hamiltonian, with polynomial overhead.
6. By Lemma 11, we can simulate $H_5$ by $H_6$, an $S_0$-Hamiltonian on a 2D square lattice, with polynomial overhead.
7. By Lemma 12, we can simulate $H_6$ by the broader class of $S$-Hamiltonians on the 2D square lattice, for any $S$ that is non-2SLD, with polynomial overhead.

Since every step of the above sequence of reductions only incurs a polynomial overhead in the number of qubits and the strength of interactions, we have shown that any $O(1)$-local, polynomial-sized qudit Hamiltonian can be efficiently simulated by an $S$-Hamiltonian with polynomially many qubits and polynomial interaction energy, assuming $S$ is non-2SLD.

Appendix C: Proof that 1D nearest-neighbor Hamiltonian is strongly universal

Here we give our proof of Theorem 3, whose statement we reproduce below for convenience:

Theorem 3.

There is a strongly universal family of 1D Hamiltonians consisting of nearest-neighbor interactions acting on a line of particles with internal dimension 8.

The proof is based heavily on the framework in Ref. [19]. We will only try to provide a succinct and somewhat self-contained description of the most important elements of the construction here. For the full technical details of the construction, we encourage the reader to also examine Sections 3 and 4 of Ref. [19].

1. Preliminaries

We first describe how the computation is encoded within a line of 8-dimensional particles.

Deﬁnition 8 (1D-encoded L -idling history state) . Consider any quantum circuit U consisting of R = O (poly( n )) rounds of 1-qubit or nearest-neighbor 2-qubit gates on a line of n qubits. This can be further converted to an encodedcircuit ˜ U with nearest-neighbor gates acting on a line nR + L qu d its ( d = 8 ), implicitly arranged in R blocks of n L particles for idling. The Hilbert space of each particle is H = (cid:13) ⊕ (cid:0) (cid:13) ⊕◦(cid:13) ⊕ ×(cid:13) ⊕ ⊕ (cid:73) , where and (cid:73) are 2-dimensional subspaces designed to hold a qubit state, and the rest are1-dimensional subspaces. For a given input state of the form | ψ µ (cid:105) | m (cid:105) ∈ C n on the original n qubits, this is encodedas | γ µ (cid:105) = R blocks (cid:122) (cid:125)(cid:124) (cid:123) (cid:73) ◦(cid:13) ◦(cid:13) · · · ◦(cid:13) (cid:13) (cid:124) (cid:123)(cid:122) (cid:125) the ﬁrst block of length 2 n (cid:13) (cid:13) (cid:13) (cid:13) · · · · · · (cid:3)(cid:3)(cid:67)(cid:67) (cid:13) (cid:13) (cid:13) · · · (cid:13) (cid:124) (cid:123)(cid:122) (cid:125) L idling qudits (C1) where the odd qudits in the ﬁrst block of length n encodes the input state | ψ µ (cid:105) | m (cid:105) . Here, the symbols , and (cid:3)(cid:3)(cid:67)(cid:67) aresimply boundary markers in space that help us identify the role of each particle and do not indicate anything aboutthe internal state of particles. In particular, the symbol (cid:3)(cid:3)(cid:67)(cid:67) marks a special boundary that separating the computationalpart of the line and the idling part.The encoded circuit ˜ U acts on | γ µ (cid:105) with R rounds of computation, each corresponding to applying a round of gatesfrom U (cid:48) to the currently active block of qudits, and then moving the block n positions to the right. This entails a total of K = ( R − n + 2 n −

1) + 2 n steps of nearest-neighbor gates that map conﬁguration to conﬁguration, according tothe transition rule outlined in Table 1 of Ref. [19]. This is then followed by L steps of “idling” where the L rightmostqudits perform a trivial counting operation. To facilitate the idling, we add the following transition rules.( α ) (cid:73) (cid:3)(cid:3)(cid:67)(cid:67) (cid:13) ←→ (cid:3)(cid:3)(cid:67)(cid:67) ×(cid:13) unmarks the active qubit (cid:73) once it hits the special boundary (cid:3)(cid:3)(cid:67)(cid:67) . The (cid:13) changes to ×(cid:13) soas to signal that the idling is supposed to the start.( β ) ×(cid:13) (cid:13) ←→ ×(cid:13) ×(cid:13) for any location to the right of the special boundary (cid:3)(cid:3)(cid:67)(cid:67) .This results in a history of K + L + 1 conﬁgurations on the nR qudits {| γ µt (cid:105)} K + Lt =0 which is pairwise orthogonal: (cid:104) γ µt | γ µt (cid:48) (cid:105) = δ tt (cid:48) . The L -idling history state with respect to U and | ψ µ (cid:105) | m (cid:105) is the following superposition: | η µ (cid:105) = 1 √ K + L + 1 K + L (cid:88) t =0 | γ µt (cid:105) . (C2)To help visualize the computational history, note the conﬁguration after all R rounds of computation comes to haltis | γ µK (cid:105) = ×(cid:13) n ( R − ×(cid:13) ◦(cid:13) · · · ◦(cid:13) ◦(cid:13) (cid:73) (cid:3)(cid:3)(cid:67)(cid:67) (cid:13) ⊗ L . (C3)All ensuing conﬁgurations are of the form (for 1 ≤ (cid:96) ≤ L ): | γ µK + (cid:96) (cid:105) = ×(cid:13) n ( R − ×(cid:13) ◦(cid:13) · · · ◦(cid:13) ◦(cid:13) (cid:3)(cid:3)(cid:67)(cid:67) ×(cid:13) ⊗ (cid:96) (cid:13) ⊗ ( L − (cid:96) ) . (C4) Lemma 13 (1D circuit-Hamiltonian [19]) . Given a circuit U consisting of poly( n )

1- or 2-qubit gates on n qubits.We can construct a Hamiltonian on O (poly( n )) + L qu d its, d = 8 , with only nearest-neighbor interaction of the form H hist = J in H in + J prop H prop + J pen H pen (C5) such that the 1D-encoded L -idling history states with respect to U and | ψ µ (cid:105) | m (cid:105) are of zero-energy, and all otherstates have energy ≥ . The interaction energy of nearest-neighbor terms in H hist are J in , J prop , J pen = O (poly( n )) .Proof. The construction is essentially the same as described in Section 4 of Ref. [19], except for a few small changes: a. Changes to legal conﬁgurations and penalty Hamiltonian — To the right of the special boundary (cid:3)(cid:3)(cid:67)(cid:67) , only ×(cid:13) ×(cid:13) , ×(cid:13) (cid:13) and (cid:13) (cid:13) are allowed conﬁgurations in these locations. This can be addressed by tweaking the penaltyHamiltonian H pen to penalize all conﬁgurations using the term | XY (cid:105)(cid:104) XY | i,i +1 where XY ∈ H ⊗ \{ ×(cid:13) ×(cid:13) , ×(cid:13) (cid:13) , (cid:13) (cid:13) } for these locations. b. Changes to propagation Hamiltonian — We need to incorporate the two new rules ( α ) and ( β ) added above.The Rule ( α ) is similar to Rule 4a (cid:73) (cid:13) ←→ (cid:0) (cid:13) from Table 1 of Ref. [19]. 
Since there’s only a unique locationwhere Rule ( α ) applies, at the special boundary, we can simply use the following propagation Hamiltonian for thatpair of sites H ( α )prop ,i = | (cid:73) (cid:3)(cid:3)(cid:67)(cid:67) (cid:13) (cid:105)(cid:104) (cid:73) (cid:3)(cid:3)(cid:67)(cid:67) (cid:13) | i,i +1 + | (cid:3)(cid:3)(cid:67)(cid:67) ×(cid:13) (cid:105)(cid:104) (cid:3)(cid:3)(cid:67)(cid:67) ×(cid:13) | i,i +1 − | (cid:3)(cid:3)(cid:67)(cid:67) ×(cid:13) (cid:105)(cid:104) (cid:73) (cid:3)(cid:3)(cid:67)(cid:67) (cid:13) | i,i +1 − | (cid:73) (cid:3)(cid:3)(cid:67)(cid:67) (cid:13) (cid:105)(cid:104) (cid:3)(cid:3)(cid:67)(cid:67) ×(cid:13) | i,i +1 (C6)where i = 2˜ nR for this special pair of sites.Now for Rule ( β ). Note this is the only propagation rule that is applicable in the region to the right of the specialboundary (cid:3)(cid:3)(cid:67)(cid:67) . Thus, we only need the following propagation Hamiltonian for i > nR . H ( β )prop ,i = | ×(cid:13) (cid:13) (cid:105)(cid:104) ×(cid:13) (cid:13) | i,i +1 + | ×(cid:13) ×(cid:13) (cid:105)(cid:104) ×(cid:13) ×(cid:13) | i,i +1 − | ×(cid:13) ×(cid:13) (cid:105)(cid:104) ×(cid:13) (cid:13) | i,i +1 − | ×(cid:13) (cid:13) (cid:105)(cid:104) ×(cid:13) ×(cid:13) | i,i +1 (C7)We note there can be mis-timed transitions from H ( α )prop ,i , e.g. ◦(cid:13) (cid:3)(cid:3)(cid:67)(cid:67) ×(cid:13) ×(cid:13) −→ − ◦(cid:13) (cid:73) (cid:3)(cid:3)(cid:67)(cid:67) (cid:13) ×(cid:13) (C8)and H ( α )prop ,i , e.g., ×(cid:13) ×(cid:13) ×(cid:13) −→ − ×(cid:13) (cid:13) ×(cid:13) (C9)However, they will all result in energy penalty from H pen , because they have illegal conﬁguration (cid:13) ×(cid:13) that is locallydetectable.9 c. Proof that the Hamiltonian has the 1D-encoded history states as the only ground states — This is essentiallygiven in Section 5 and 6 of [19].
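As a rough illustration of why a uniform superposition such as (C2) has zero energy under propagation terms of the form (C6) and (C7), note that on a chain of pairwise orthogonal configurations each such term acts as $|t\rangle\langle t| + |t+1\rangle\langle t+1| - |t\rangle\langle t+1| - |t+1\rangle\langle t|$, so their sum is the Laplacian of a path graph. The Python sketch below builds this matrix for $K + L + 1$ abstract configurations and checks that the uniform superposition is annihilated. It also checks the standard path-graph eigenvector with eigenvalue $2 - 2\cos(\pi/N)$, which scales as $1/N^2$. All names and the values of `K` and `L` are assumptions made for illustration; the configurations are treated as abstract basis labels (gates are absorbed into the labeling), and this is not the full 8-state construction of Ref. [19].

```python
import math

def path_laplacian(n_configs):
    """Sum over t of |t><t| + |t+1><t+1| - |t><t+1| - |t+1><t| on a chain."""
    H = [[0.0] * n_configs for _ in range(n_configs)]
    for t in range(n_configs - 1):
        H[t][t] += 1.0
        H[t + 1][t + 1] += 1.0
        H[t][t + 1] -= 1.0
        H[t + 1][t] -= 1.0
    return H

def apply(H, v):
    """Matrix-vector product for a dense list-of-lists matrix."""
    return [sum(h * x for h, x in zip(row, v)) for row in H]

def history_state(n_configs):
    """Uniform superposition over all configurations, as in (C2)."""
    amp = 1.0 / math.sqrt(n_configs)
    return [amp] * n_configs

K, L = 6, 4            # assumed numbers of computation and idling steps
N = K + L + 1          # number of configurations in the history
H = path_laplacian(N)

# The history state is annihilated by the propagation terms.
zero_residual = max(abs(x) for x in apply(H, history_state(N)))

# First excited eigenvector of a path graph: v_j = cos(pi * (j + 1/2) / N).
gap = 2.0 - 2.0 * math.cos(math.pi / N)
v = [math.cos(math.pi * (j + 0.5) / N) for j in range(N)]
gap_residual = max(abs(hv - gap * x) for hv, x in zip(apply(H, v), v))

print(zero_residual, gap, gap_residual)
```

Increasing `L` lengthens the chain, which lowers this eigenvalue; this is the same inverse-quadratic scaling in the number of steps that appears in the bound $\lambda(H_{\text{prop}}|_{S_{\text{prop}}^\perp}) \ge c/T^2$ used in Appendix B.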

2. Proof of Theorem 3

Proof of Theorem 3 . As in the Proof of Theorem 2, we can always convert any input qudit Hamiltonian to a qubitHamiltonian by encoding each qu d it in a group of (cid:100) log d (cid:101) -qudits with polynomial overhead (assuming d = O (1)).Hence, we will take the input H as an O (1)-local n -qubit Hamiltonian. We write H = (cid:80) µ E µ | ψ µ (cid:105)(cid:104) ψ µ | in its eigenbasis,with 0 ≤ E µ ≤ E max . By Proposition 1, we can construct a circuit U local P E consisting on O (poly( n, ζ − )) 1- or 2-qubitnearest-neighbor gates on n + m qubits, m = O (poly( n )), such that its action on any normalized state (cid:80) µ c µ | ψ µ (cid:105) can be described as (cid:13)(cid:13)(cid:13)(cid:13)(cid:13) U NNPE (cid:88) µ c µ | ψ µ (cid:105) | m (cid:105) − (cid:88) µ c µ | ψ µ (cid:105) | ˜ E µ (cid:105) | rest µ (cid:105) (cid:13)(cid:13)(cid:13)(cid:13)(cid:13) ≤ ζ (C10)where | ˜ E µ (cid:105) = | ϕ µ, ϕ µ, ϕ µ, . . . ϕ µ,s (cid:105) is the s -bit string representation of ϕ µ = E µ /E max = 0 .ϕ µ, ϕ µ, ϕ µ, · · · . Fromnow on we will denote ˜ n = n + m as the number of qubits in the phase estimation circuit.We then consider the identity circuit U = ( U NNPE ) † n + m U NNPE = that corresponds to running U NNPE , applyingidentity to every qubit, and then uncomputing. Let R PE be the number of rounds of nearest-neighbor gates in U PE ,and R = 2 R PE + 1 be the total number of rounds. Let K = ( R − n + 2˜ n −

1) + 2˜ n be the total number of stepswhen applying U . By Lemma 13, we can construct a 1D nearest-neighbor Hamiltonian H hist on poly( n ) + L particleswith poly( n ) interaction energy such that all the 1D-encoded L -idling history states with respect to U and | ψ µ (cid:105) | m (cid:105) are the only zero-energy states, and all other states have energy at least ∆.We claim that the following 1D Hamiltonian (∆ , η, (cid:15) )-simulates H ˜ H = 2∆ H hist + H out . (C11)To describe H out , we start by considering the conﬁguration of all 2˜ nR + L particles after the R PE -th round of gatescorresponding to the end of applying U NNPE . This looks like · · · ×(cid:13) ×(cid:13) (cid:73) ◦(cid:13) ◦(cid:13) · · · ◦(cid:13) (cid:13) (cid:13) (cid:13) · · · (cid:13) (cid:3)(cid:3)(cid:67)(cid:67) (cid:13) · · · (C12)Note the next round of gates corresponds to n + m , i.e., applying the identity gate to every qubit. Without lossof generality, we assume the ﬁrst s active particles corresponds ( ζ -approximately) to the ancilla with | ˜ E µ (cid:105) on them.Notice that every active qudit will at one point transition into their appropriate state in the (cid:73) subspace, with identitygate applied. Then, the appropriate H out is H out = ( K + L + 1) E max s (cid:88) b =1 − b | (cid:73) (1) (cid:105)(cid:104) (cid:73) (1) | nR PE +2 b − (C13)where | (cid:73) (1) (cid:105) corresponds to the | (cid:105) qubit state in the (cid:73) subspace.To prove that ˜ H simulates H to the desired precision, we ﬁrst show that H out restricted to the set of 1D-encodedhistory states L = span {| η µ (cid:105)} can be approximated by the following eﬀective Hamiltonian: H eﬀ = (cid:88) µ E µ | η µ (cid:105)(cid:104) η µ | (C14)Consider arbitrary states | η (cid:105) ∈ L . 
We write | η (cid:105) = (cid:80) µ a µ | η µ (cid:105) , and observe (cid:104) η | H out | η (cid:105) = ( K + L + 1) E max s (cid:88) b =1 − b (cid:88) ν,µ a ∗ ν a µ (cid:104) η ν | (cid:16) | (cid:73) (1) (cid:105)(cid:104) (cid:73) (1) | nR PE +2 b − (cid:17) | η µ (cid:105) = E max s (cid:88) b =1 − b (cid:88) ν,µ K (cid:88) t,t (cid:48) =0 a ∗ ν a µ (cid:104) γ νt | (cid:16) | (cid:73) (1) (cid:105)(cid:104) (cid:73) (1) | nR PE +2 b − (cid:17) | γ µt (cid:48) (cid:105) (C15)0Note the 2˜ nR PE + 2 b − (cid:73) subspace at one point in time among the K + 1 steps, regardlessof possible input state. Table 2 of [19] provides a clear illustration for the reason. Let’s call that time t b , and write (cid:104) η | H out | η (cid:105) = E max s (cid:88) b =1 − b (cid:88) ν,µ a ∗ ν a µ (cid:104) γ νt b | (cid:16) | (cid:73) (1) (cid:105)(cid:104) (cid:73) (1) | nR PE +2 b − (cid:17) | γ µt b (cid:105) (C16)Note the conﬁguration (cid:80) µ a µ | γ µt b (cid:105) is the encoded version of U NNPE (cid:80) µ a µ | ψ µ (cid:105) | m (cid:105) , which is ζ -close to the encodedversion of (cid:80) µ a µ | ψ µ (cid:105) | ˜ E µ (cid:105) | rest µ (cid:105) . Hence (cid:104) η | H out | η (cid:105) = E max s (cid:88) b =1 − b (cid:88) ν,µ a ∗ ν a µ ϕ µ,b (cid:104) ψ ν | ψ µ (cid:105) (cid:104) rest ν | rest µ (cid:105) + Θ( ζE max )= (cid:88) µ | a µ | s (cid:88) b =1 − b ϕ µ,b E max + Θ( ζE max )= (cid:88) µ | a µ | ˜ E µ + Θ( ζE max ) . (C17)where we have denoted ˜ E µ = E max ˜ ϕ µ = E max (0 .ϕ µ, ϕ µ, · · · ϕ µ,s ) as the s -bit representation of energy eigenvalue E µ of H . Then |(cid:104) η | H out − H eﬀ | η (cid:105)| ≤ (cid:88) µ | a µ | | ˜ E µ − E µ | + Θ( sζE max ) ≤ [2 − s + Θ( ζ )] E max (C18)By choosing s = Θ(log E max /(cid:15) ) = Θ(log n + log (cid:15) − ) and ζ = Θ( (cid:15)/E max ) = Θ(1 / poly( n, (cid:15) − )), just like we did inEq. 
(B12), we can ensure |(cid:104) η | H out − H eﬀ | η (cid:105)| ≤ (cid:15)/ ∀ | η (cid:105) ∈ L = ⇒ (cid:107) H eﬀ − H out | L (cid:107) ≤ (cid:15)/ L -idling history state more carefully, it looks like | η µ (cid:105) = 1 √ K + L + 1 K (cid:88) t =0 | γ µt (cid:105) = (cid:112) − χ | α µ (cid:105) + χ | β µ (cid:105) (C20)where χ = (cid:114) K + 1 K + L + 1 (C21)and | β µ (cid:105) = 1 √ K + 1 K (cid:88) t =0 | γ µt (cid:105) (C22) | α µ (cid:105) = 1 √ L L (cid:88) t =1 | γ µK + t (cid:105) = ×(cid:13) n ( R − ×(cid:13) ◦(cid:13) · · · ◦(cid:13) ◦(cid:13) (cid:3)(cid:3)(cid:67)(cid:67) √ L L (cid:88) (cid:96) =1 ×(cid:13) ⊗ (cid:96) (cid:13) ⊗ ( L − (cid:96) ) = ( V | ψ µ (cid:105) ) ⊗ | α (cid:105) (C23)where V | ψ µ (cid:105) is simply the input state | ψ µ (cid:105) stored in the qubit subspace of a subset of the ˜ n qudits marked within the R -th block, | α (cid:105) is the state of the remaining ancilla. Therefore, | η µ (cid:105) = (cid:112) − χ ( V | ψ µ (cid:105) ) ⊗ | α (cid:105) + χ | β µ (cid:105) . (C24)By choosing L = O ( K/(cid:15) ), we can ensure that (cid:107)| η µ (cid:105)(cid:104) η µ | − V | ψ µ (cid:105)(cid:104) ψ µ | V † ⊗ | α (cid:105)(cid:104) α |(cid:107) ≤ O ( (cid:15) ). With the same argumentin Lemma 2, we can show that (cid:107) H eﬀ − V HV † ⊗ | α (cid:105)(cid:104) α |(cid:107) ≤ (cid:15)/ (cid:13)(cid:13) V HV † ⊗ | α (cid:105)(cid:104) α | − H out | L (cid:13)(cid:13) ≤ (cid:15)/ Lemma 4 (First-order reduction, adapted from [23]) . Suppose ˜ H = H + H , deﬁned on Hilbert space ˜ H = L ⊕ L ⊥ such that H L = 0 and λ ( H | L ⊥ ) ≥ . Suppose H is a Hermitian operator and V is an isometry such that (cid:107) V HV † − H | L (cid:107) ≤ (cid:15)/ , then ˜ H (∆ , η, (cid:15) ) -simulates H , as long as ∆ ≥ O ( (cid:15) − (cid:107) H (cid:107) + η − (cid:107) H (cid:107) ) , per Def. 2. In otherwords, (cid:107) ˜ H ≤ ∆ − ˜ V H ˜ V † (cid:107) ≤ (cid:15) for some isometry ˜ V where (cid:107) ˜ V − V (cid:107) ≤ η . 
We apply the above Lemma with H = 2∆ H hist and H = H out . Thus, ˜ H = H + H simulates H to precision(∆ , η, (cid:15) ) by choosing ∆ ≥ O (( (cid:15) − + η − ) poly( n, (cid:107) H (cid:107) )). Note that ˜ H contains O (poly( n, (cid:15) − )) nearest-neighborterms, with interaction energy O (poly( n, η − , (cid:15) − ,,