Reconstruction of non-ℵ0-categorical theories

Abstract

We generalise the correspondence between ℵ0 -categorical theories and their automorphism groups to arbitrary complete theories in classical logic, and to some theories (including, in particular, all ℵ0 -categorical ones) in continuous logic.


ITAÏ BEN YAACOV

arXiv [math.LO]

CONTENTS

Introduction
1. Topological groupoids
2. The groupoid associated to a classical theory
3. Reconstructing a classical theory
4. Universal Skolem sorts
5. The groupoid associated to a theory with a universal Skolem sort
6. Further questions
References

INTRODUCTION

To every $\aleph_0$-categorical theory $T$ (all theories under consideration are in a countable language) one can associate the automorphism group $G(T)$ of its unique countable model, equipped with the Polish group topology of simple convergence. It is by now a classical result that $G(T)$ is a classifying invariant for the bi-interpretation class of $T$ (Ahlbrandt and Ziegler [AZ86], but due to Coquand). In more explicit terms, if $T$ and $T'$ are $\aleph_0$-categorical, then $G(T) \cong G(T')$ as topological groups if and only if there exists a bi-interpretation between $T$ and $T'$. The same was later extended by Kaïchouh and the author in [BK16] to $\aleph_0$-categorical theories in continuous logic, where $G(T)$ is the automorphism group of the unique separable model. These correspondences opened the door to many interactions between model theory and topological dynamics, with model-theoretic properties of $T$ corresponding to well-studied dynamical properties of $G(T)$; see for example [BT16, Iba16, Iba17, BIT18].

In the present paper, we propose to extend the correspondence between bi-interpretation classes of theories and topological groups (or group-like objects) beyond the $\aleph_0$-categorical realm. One motivation for doing this comes from a desire to imitate the elegance of the original correspondence result. Other motivations arise from applications of the correspondence between model-theoretic properties of $T$ and dynamical properties of $G(T)$. When $T$ is not $\aleph_0$-categorical, one can no longer speak of “the” automorphism group of $T$. Model-theoretic properties of $T$ still correspond to dynamical properties of all actions of automorphism groups of countable/separable models of $T$ on formulas, but the resulting criteria are far from being as elegant, or as useful (depending on context), as in the $\aleph_0$-categorical case.
Specifically, one needs to consider automorphism groups of all models of $T$ (or of sufficiently rich ones), and to know which functions on the group(s) correspond to formulas.

Let us make this a little more concrete using our favourite motivating example. It is proved in [Ben09] that the randomisation of a NIP theory is again NIP. The proof is much more analytic than model-theoretic, and there have since been several attempts to replace it with a different argument. The only successful one, as far as we are aware, is by Ibarlucía [Iba17]. It only applies to $\aleph_0$-categorical $T$, and is based on the characterisation of NIP in terms of the representability in Rosenthal Banach spaces of a dynamical system associated with $G(T)$. When $T$ is not $\aleph_0$-categorical, then, as in the previous paragraph, there still is a correspondence between NIP and Rosenthal representability of all actions of automorphism groups on formulas. However, the one-by-one consideration of countable/separable models of $T$ does not pass well to the randomisation (we can construct some separable models of $T^R$, but not all of them, as randomisations of models of $T$), so the criterion does not seem to be applicable. The approach of the present paper allows us to consider all countable/separable models of $T$ jointly (rather than severally). Recent results of Jorge Muñoz assert that this does commute quite well with randomisation, allowing us to hope to extend Ibarlucía’s results.

Key words and phrases. complete theory, groupoid, reconstruction.
Author supported by ANR projects GruPoLoCo (ANR-11-JS01-008) and AGRUME (ANR-17-CE40-0026).

We achieve the desired correspondence by replacing topological groups with topological groupoids, which are briefly discussed in Section 1.

We treat theories in classical and in continuous logic separately. For a complete classical theory $T$ we construct in Section 2 a topological groupoid $\mathbf{G}(T)$ over the Cantor space. Roughly speaking, points in the base (in the Cantor space) are types of (codes for) models, and groupoid elements code isomorphisms between models of the source and target types. We prove that $\mathbf{G}(T)$ only depends on $T$ up to bi-interpretation, and conversely, in Section 3 we reconstruct $T$ up to bi-interpretation from $\mathbf{G}(T)$. If $T$ is $\aleph_0$-categorical, then $\mathbf{G}(T) \cong 2^{\mathbf{N}} \times G(T) \times 2^{\mathbf{N}}$, so our correspondence is a generalisation of the $\aleph_0$-categorical case.

The treatment of the continuous case is not quite as satisfactory. In Section 4 we identify a sort of “codes for models” as a universal Skolem sort. While we do not know that one exists in full generality, we do know that:
• If it exists, then it is unique (up to a definable bijection).
• All classical theories admit such a sort (constructed in Section 2, motivating the general definition).
• All $\aleph_0$-categorical theories (classical or continuous) admit such a sort.
• If $T$ admits such a sort, then so does its randomisation $T^R$ (this is due to Jorge Muñoz, and is not proved here).

In Section 5, assuming $T$ admits a universal Skolem sort, we construct $\mathbf{G}(T)$ and reconstruct $T$ up to bi-interpretation. This leaves quite a few open questions, which we present in Section 6.

We should point out that in the context of categorical logic there exist results which also code a theory with a topological groupoid, in a very different fashion. These include explicit constructions, such as Awodey and Forssell [AF13], as well as general “there exists a groupoid that codes a topos that codes something” arguments. Awodey and Forssell consider models over subsets of a fixed uncountable set, so their groupoid is non-separable $T_0$ (but not $T_1$, since the closure of a singleton representing one model consists of all its sub-models). Our construction, in contrast, yields a Polish groupoid (separable and completely metrisable as a topological space), and while we do not discuss this here, elementary embeddings of models of $T$ arise very differently, as the left-completion of the said groupoid. To the extremely limited extent that we understand the more general constructions (the reader will forgive the author for his terrible lack of familiarity with categorical logic), similar differences apply there as well.

1. TOPOLOGICAL GROUPOIDS

Let us recall the definition of a groupoid. The definition is essentially equivalent to the one found in, say, Mackenzie [Mac87], except that we consider the base as a subset of the groupoid rather than as a separate space.

Definition 1.1. A groupoid is a set $\mathbf{G}$ equipped with a partial composition law $\cdot : \mathbf{G}^2 \dashrightarrow \mathbf{G}$ and an inversion map ${}^{-1} : \mathbf{G} \to \mathbf{G}$, such that for all $f, g, h \in \mathbf{G}$:
(i) Composition is associative: $(fg)h = f(gh)$, as soon as one of the two sides is defined (which means that then the other is defined as well).
(ii) The compositions $g^{-1}g$ and $gg^{-1}$ are always defined.
(iii) If $fg$ is defined, then $fgg^{-1} = f$ and $f^{-1}fg = g$.
We call $s_g = g^{-1}g$ the source of $g$ and $t_g = gg^{-1}$ its target. We call $e \in \mathbf{G}$ neutral if $e^2 = e$. The set of neutral elements of $\mathbf{G}$ will be denoted $B$ or $B(\mathbf{G})$. We call $B$ the base set of $\mathbf{G}$, and say that $\mathbf{G}$ is a groupoid over $B$.

Let us make a few observations:
(i) Both $s_g$ and $t_g$ are neutral for all $g \in \mathbf{G}$, defining maps $s, t : \mathbf{G} \to B$.
(ii) The composition $fg$ is defined if and only if $s_f = t_g$. In particular, $s_{fg} = t_{g^{-1}} = s_g$ and $t_{fg} = s_{f^{-1}} = t_f$.
(iii) If $e$ is neutral, then $e = e^{-1}e^2 = e^{-1}e = s_e$, and similarly $e = t_e$. In particular, $eg$ ($ge$) is defined, necessarily equal to $g$, if and only if $e = t_g$ ($e = s_g$).
(iv) If $fg$ is neutral, then $f = fgg^{-1} = g^{-1}$ and similarly $g = f^{-1}$. In particular, $(g^{-1})^{-1} = g$ and $(fg)^{-1} = g^{-1}f^{-1}$.

From these observations follows the equivalence with the (possibly more familiar) definition of a groupoid as a category all of whose morphisms are invertible, in which case $B$ is the object set.

Notice that for $A \subseteq \mathbf{G}$ we have $s(A) = A^{-1}A \cap B = A^{-1}\mathbf{G} \cap B = \mathbf{G}A \cap B$ and $t(A) = AA^{-1} \cap B = A\mathbf{G} \cap B = \mathbf{G}A^{-1} \cap B$. Similarly, for $A \subseteq B$ we have $s^{-1}(A) = \mathbf{G}A$ and $t^{-1}(A) = A\mathbf{G}$.

The advantage of the algebraic definition is that it is easier to cast a topology on top of it.

Definition 1.2.
A topological groupoid is a groupoid $\mathbf{G}$ such that $\mathbf{G}$ is a Hausdorff topological space and composition and inversion are continuous (where defined). A topological groupoid $\mathbf{G}$ with base $B$ is open if the source map $s : \mathbf{G} \to B$ is open.

We could also state the definition of a groupoid in a categorical language: an object equipped with arrows for source, inverse and product, say. This would make the definition meaningful in any category with fibred products (and not only in the category of sets). In the category of topological spaces and continuous maps, it would agree with our definition of a topological groupoid.

Since the source and target maps are total, the domain of composition, defined by the condition $t(g) = s(f)$, is closed in $\mathbf{G}^2$. It follows that the condition $g^2 = g$ is closed, so the base set $B$ is a closed subset of $\mathbf{G}$.

Clearly, a topological groupoid is open if and only if its target map is open. Every topological group, viewed as a groupoid over a point, is open.

Definition 1.3. A topological space over $B$ is a topological space $X$ equipped with a continuous map $\pi : X \to B$. The fibred product of two spaces over $B$ is
$X \times_B Y = \{(x, y) \in X \times Y : \pi_X x = \pi_Y y\}$.
When $X = \mathbf{G}$ we take $\pi_X = s$, and when $Y = \mathbf{G}$ we take $\pi_Y = t$. In particular, the domain of composition in $\mathbf{G}$ is $\mathbf{G} \times_B \mathbf{G} = \{(g, h) \in \mathbf{G}^2 : s_g = t_h\}$.

Definition 1.4.

Let $\mathbf{G}$ be a groupoid over $B$, and $X$ a space over $B$. A continuous (left) action of $\mathbf{G}$ on $X$, denoted $\mathbf{G} \curvearrowright X$, is a continuous map $\mathbf{G} \times_B X \to X$, sending $(g, x) \mapsto gx$, such that $(gh)x = g(hx)$ whenever either is defined (so $\pi(gx) = t(g)$). A continuous right action $X \curvearrowleft \mathbf{G}$ is defined analogously as a map $X \times_B \mathbf{G} \to X$.

In particular, the product map $\mathbf{G} \times_B \mathbf{G} \to \mathbf{G}$ is both a left and a right continuous action of $\mathbf{G}$ on itself. On $B$, viewed as a space over itself, $\mathbf{G}$ admits a unique action $(g, s_g) \mapsto t_g$ (and similarly a unique right action).

Fact 1.5.

The following are equivalent for a topological groupoid $\mathbf{G}$ over a base $B$:
(i) The groupoid $\mathbf{G}$ is open.
(ii) For any topological space $X$ over $B$, the projection $\mathbf{G} \times_B X \to X$ is open.
(iii) For any continuous action $\mathbf{G} \curvearrowright X$, the action law $\mathbf{G} \times_B X \to X$ is open.
(iv) The groupoid law $\mathbf{G} \times_B \mathbf{G} \to \mathbf{G}$ is open.

Proof. (i) $\Longrightarrow$ (ii). A basic open set of $\mathbf{G} \times_B X$ is of the form $U \times_B V$, where $U \subseteq \mathbf{G}$ and $V \subseteq X$ are open. Since $\mathbf{G}$ is open, the sets $W = s(U) \subseteq B$ and $\pi^{-1}(W) \subseteq X$ are open, and the image of $U \times_B V$ in $X$ is the open set $V \cap \pi^{-1}(W)$.
(ii) $\Longrightarrow$ (iii). Compose with the homeomorphism $(g, x) \mapsto (g^{-1}, gx)$.
(iii) $\Longrightarrow$ (iv). This is a special case.
(iv) $\Longrightarrow$ (i). If $U \subseteq \mathbf{G}$ is open, then $U^{-1}U \subseteq \mathbf{G}$ is open, and therefore $s(U) = U^{-1}U \cap B$ is open in $B$. ∎
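As a sanity check on Definition 1.1 and the observations following it, one can verify the axioms by hand on the pair groupoid of a set. The set $X$ and the pair notation below are illustrative assumptions that do not appear in the text; this is only a sketch of how source, target and neutral elements come out in a concrete case.

```latex
% Pair groupoid on an arbitrary set X (illustrative example, not from the paper).
\begin{align*}
\mathbf{G} &= X \times X, \\
(x,y)\cdot(y',z) &= (x,z) \quad \text{(defined if and only if } y = y'\text{)}, \\
(x,y)^{-1} &= (y,x), \\
s_{(x,y)} &= (x,y)^{-1}(x,y) = (y,x)(x,y) = (y,y), \\
t_{(x,y)} &= (x,y)(x,y)^{-1} = (x,y)(y,x) = (x,x).
\end{align*}
% Neutral elements: (x,y)^2 is defined only when x = y, and then equals (x,x),
% so B = \{(x,x) : x \in X\}.
% With f = (x,y) and g = (y',z), the product fg is defined iff
% s_f = (y,y) = (y',y') = t_g, in agreement with observation (ii).
% Axiom (iii): f g g^{-1} = (x,z)(z,y) = (x,y) = f.
```

The same bookkeeping, with a group element inserted in the middle coordinate, gives the groupoids of the form $B \times G \times B$ that appear in Section 2.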

2. THE GROUPOID ASSOCIATED TO A CLASSICAL THEORY

In this section, let $T$ denote a complete theory, in the sense of classical (i.e., not continuous) first order logic, in a countable language $L$. We consider that by definition of the logic, all structures (so all models of $T$) are not empty. In order to avoid borderline cases, let us also assume that no model of $T$ is a singleton (or, if $T$ is multi-sorted, that in no model are all sorts singletons). By definable we mean without parameters.

Definition 2.1.

Let $T$ be a classical first-order theory in a countable language. Let $\mathbf{G}_0(T) \subseteq S_{2 \times \mathbf{N}}(T)$ consist of all possible types of a pair of enumerations of a model of $T$ (i.e., any two enumerations of any single countable model). Members of $\mathbf{G}_0(T)$ will be denoted $g$, $h$, and so on, or possibly as types $g(x, y)$ where $x$ and $y$ stand for countable tuples of variables. Let $B_0(T) \subseteq \mathbf{G}_0(T)$ be the subset defined by the condition $x = y$. We may identify $\mathrm{tp}(a, a) \in B_0(T)$ with $\mathrm{tp}(a)$, thus identifying $B_0(T)$ with the subset of $S_{\mathbf{N}}(T)$ consisting of types of enumerations of models.

If $g = \mathrm{tp}(a, b)$ and $h = \mathrm{tp}(b', c')$, where $b \equiv b'$, then we might as well assume that $b = b'$, in which case $g^{-1} = \mathrm{tp}(b, a)$ and $gh = \mathrm{tp}(a, c')$ depend only on $g$ and $h$, and belong to $\mathbf{G}_0(T)$.
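The source and target maps of the groupoid of Definition 2.1 can be computed directly from the composition and inversion rules just given. The short derivation below is a sketch that uses only those two rules together with the identification of $\mathrm{tp}(a,a)$ with $\mathrm{tp}(a)$.

```latex
% For g = tp(a,b) as in Definition 2.1:
\begin{align*}
g^{-1} g &= \mathrm{tp}(b,a)\cdot\mathrm{tp}(a,b) = \mathrm{tp}(b,b), \\
g g^{-1} &= \mathrm{tp}(a,b)\cdot\mathrm{tp}(b,a) = \mathrm{tp}(a,a).
\end{align*}
% Identifying tp(a,a) with tp(a), this reads
%   s_g = tp(b),   t_g = tp(a).
% A product gh with h = tp(b',c') is defined iff s_g = t_h,
% that is, iff tp(b) = tp(b'), i.e. b \equiv b'.
```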

Lemma 2.2.

As defined above, $\mathbf{G}_0(T)$ is a Polish open topological groupoid with base $B_0(T)$. If $g = \mathrm{tp}(a, b) \in \mathbf{G}_0(T)$, then $t_g = \mathrm{tp}(a)$ and $s_g = \mathrm{tp}(b)$.

Proof. It is easy to check that $\mathbf{G}_0(T)$ is indeed a topological groupoid. Let us prove that $\mathbf{G}_0(T)$ is open, i.e., that the map $s : \mathrm{tp}(a, b) \mapsto \mathrm{tp}(b)$ is open. A basic open set $U \subseteq \mathbf{G}_0(T)$ is defined by a formula $\varphi(x, y)$ (in which only finitely many variables actually appear). We claim that $s(U)$ is defined by $\exists x\, \varphi(x, y)$ (quantifying only over those $x_i$ that appear in $\varphi$). Indeed, let $\mathrm{tp}(b) \in B_0(T)$, so $b$ enumerates some $M \vDash T$. If $g = \mathrm{tp}(a, b) \in U$, then $a$ also enumerates $M$, $s(g) = \mathrm{tp}(b)$, and $\vDash \varphi(a, b)$ implies $\vDash \exists x\, \varphi(x, b)$. Conversely, if $\vDash \exists x\, \varphi(x, b)$, then there exists a tuple $a$ in $M$ such that $\vDash \varphi(a, b)$. Since only finitely many variables actually appear in $\varphi$, we may replace a tail of $a$ with an enumeration of $M$, so still $\vDash \varphi(a, b)$, and now $g = \mathrm{tp}(a, b) \in U$. Thus $s(U)$ is indeed defined by $\exists x\, \varphi(x, y)$. ∎

Our goal is to associate to each theory $T$ a groupoid $\mathbf{G}(T)$ such that for any theory $T'$ we have $\mathbf{G}(T) \cong \mathbf{G}(T')$ as topological groupoids if and only if $T$ and $T'$ are bi-interpretable. In fact, we desire a seemingly stronger version of the left-to-right implication, to which we refer as reconstruction: a procedure by which we obtain, from $\mathbf{G}(T)$, a theory bi-interpretable with $T$, in a (reasonably) constructive fashion. While $\mathbf{G}_0(T)$ may seem natural, neither implication seems to hold for it, nor, a fortiori, reconstruction. Indeed, naïve attempts at reconstruction quickly run into obstacles that seem to arise from the fact that the base $B_0(T)$ is not compact.

The following definition was originally an attempt to remedy this, i.e., to make the base compact. Somewhat surprisingly, it solves all other issues at the same time, including that of presenting the present work as a generalisation of the $\aleph_0$-categorical case. An explanation of sorts as to why (rather than how) that happens is given in Section 4 (see Proposition 4.16).

We assume throughout that $T$ is in a single-sorted language. The definitions and arguments adapt in an obvious manner to the multi-sorted case, with additional bookkeeping that we prefer to avoid.

Definition 2.3.

Assume that we work in a language in a single sort. A sequence $\Phi = \bigl(\varphi_n(x_{<n}, y) : n \in \mathbf{N}\bigr)$, where $y$ is a single variable, will be called rich if every formula $\varphi(x_{<k}, y)$ appears (with dummy variables) as $\varphi_n$ for some $n \geq k$.

When there are many sorts, we fix a sort $S_i$ for each $x_i$, and require that in $\varphi_n(x_{<n}, y)$, the variable $y$ belong to $S_n$.

Clearly, a rich $\Phi$ exists, provided, in the many-sorted case, that each sort is repeated infinitely often.

Definition 2.4.

For a rich $\Phi = (\varphi_n : n \in \mathbf{N})$ we define
$D_{\Phi,n}(x_{<n}) = \bigwedge_{k<n} \forall y\, \bigl[\varphi_k(x_{<k}, y) \to \varphi_k(x_{\leq k})\bigr]$,
$D_\Phi(x) = \bigwedge_{n \in \mathbf{N}} D_{\Phi,n}(x_{<n})$.
This just says that if there exists a witness for $\varphi_k$, then $x_k$ must be one.

We shall view each $D_{\Phi,n}$ as a formula or as a definable set of $n$-tuples, as convenient. Similarly, $D_\Phi$ is a partial type or a type-definable set of infinite tuples.

Lemma 2.5.

Any member of $D_{\Phi,n}$ in a countable model $M \vDash T$ can be extended to a member of $D_\Phi$ that moreover enumerates $M$. In particular, $D_\Phi$ is never empty.

Moreover, let $\psi(x, y)$ be a formula, where $x$ is in the sort of $D_\Phi$ and $y$ is arbitrary (of course, only finitely many variables from the infinite tuple $x$ actually appear in $\psi$). Then the property $(\exists x \in D_\Phi)\, \psi(x, y)$ is expressible as a formula in the variables $y$.

Proof. The main assertion is immediate from the definition, and implies the moreover part. ∎
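To see what Definition 2.4 asks for in practice, it may help to unfold the first few formulas $D_{\Phi,n}$. The expansion below is a sketch that only rewrites the definition for $n = 0, 1, 2$; the closing remark about enumerating $M$ is our reading of why the extension in Lemma 2.5 is possible, not a quotation from the proof.

```latex
\begin{align*}
D_{\Phi,0} &= \top \quad \text{(empty conjunction)}, \\
D_{\Phi,1}(x_0) &= \forall y\,\bigl[\varphi_0(y) \to \varphi_0(x_0)\bigr], \\
D_{\Phi,2}(x_0,x_1) &= D_{\Phi,1}(x_0) \wedge
    \forall y\,\bigl[\varphi_1(x_0,y) \to \varphi_1(x_0,x_1)\bigr].
\end{align*}
% Each conjunct is equivalent to
%   (\exists y\, \varphi_k(x_{<k},y)) \to \varphi_k(x_{\le k}),
% so x_k is constrained only when \varphi_k(x_{<k}, y) has a witness.
% Since \Phi is rich, a formula such as y = y occurs as \varphi_n for
% infinitely many n; at those indices any element of M may be chosen as x_n,
% which leaves room to list every element of a countable model M.
```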

We can now associate to $T$ a groupoid through restriction of $\mathbf{G}_0(T)$ to $D_\Phi$.

Definition 2.6.

Assume $T$ is a theory in classical logic, and let $\Phi$ be a rich sequence. We define $S^m_{D_\Phi}(T) \subseteq S_{m \times \mathbf{N}}(T)$ to be the (compact) set of possible types of members of the type-definable set $D_\Phi^m$. We define $B_\Phi(T) = S_{D_\Phi}(T)$, so $B_\Phi(T) \subseteq B_0(T)$ (any member of $D_\Phi$ must satisfy the Tarski-Vaught test), and
$\mathbf{G}_\Phi(T) = \bigl\{ g \in \mathbf{G}_0(T) : s_g, t_g \in B_\Phi(T) \bigr\}$.

In other words, $\mathbf{G}_\Phi(T) = \mathbf{G}_0(T) \cap S^2_{D_\Phi}(T)$ consists of all $\mathrm{tp}(a, b)$ where $a, b \in D_\Phi$ enumerate the same set (in fact, each is necessarily a sub-sequence of the other, repeating each element infinitely often). The following is immediate from the definitions, but deserves nonetheless to be stated explicitly:

Lemma 2.7.

As defined above, $\mathbf{G}_\Phi(T)$ is Polish, open, and its base $B_\Phi(T)$ is the Cantor set.

Proof. The groupoid $\mathbf{G}_\Phi(T)$ is Polish as a closed subset of a Polish space. The base $B_\Phi(T)$ is totally disconnected, compact and second-countable by construction. We have agreed to assume that no model of $T$ is a singleton, so there is no sentence that implies, modulo $T$, that the model is a singleton. This excludes the possibility of isolated points in $B_\Phi(T)$, which is therefore the Cantor set. Let $U \subseteq \mathbf{G}_\Phi(T)$ be a basic open set, say defined by a formula $\chi(x, y)$. We claim that $s(U)$ is defined by $(\exists x \in D_\Phi)\, \chi(x, y)$ (following Lemma 2.5). Indeed, one inclusion is as for Lemma 2.2, while the other follows from Lemma 2.5. ∎

Now things seem to be even worse: the groupoid depends not only on $T$, but also on $\Phi$. Let us show that this is not truly a problem.

Let us fix some terminology. The sorts of the language of $T$ will be called the basic sort(s). More generally, an interpretable sort, or, from now on, merely a sort, will be any definable subset of a definable quotient of a product of the basic sorts: $S \subseteq (S_0 \times \cdots \times S_{n-1})/E$.

Say that a family of sorts is sufficient if any sort (equivalently, any basic sort) is in a definable bijection with such a subset of a quotient, with the $S_i$ in the given family (so this family can be taken as an alternate family of basic sorts). Of course, the easiest way to get a sufficient family of sorts is to take all basic sorts, together with some additional ones.

We gave Definition 2.4 with respect to the basic sorts, but we can just as well define it with respect to any (sufficient) family of sorts. So let us fix two rich sequences $\Phi$ and $\Psi$, with respect to two sufficient families of sorts (and we may reduce the general case to the one where one family is a superset of the other).

Let $x = (x_n)$ denote a variable in $D_\Phi$ and $y$ a variable in $D_\Psi$. In what follows, $\exists x$ should be understood as $\exists x \in D_\Phi$, in the sense of Lemma 2.5, and similarly for $\forall x$, $\exists \tilde{x}$, and so on. Similarly, we quantify on $y$ or $\tilde{y}$ over $D_\Psi$.

Definition 2.8.
An approximate bijection between $D_\Phi$ and $D_\Psi$ is a formula $\varphi(x, y)$ such that $\forall x \exists y\, \varphi$ and $\forall y \exists x\, \varphi$ are valid (i.e., consequences of $T$).

Lemma 2.9.

Let $\psi(x_{<n}, y_{<m})$ be a formula, and assume that $(\exists y_{<m})\, \psi$ is equivalent to $D_{\Phi,n}(x_{<n})$. Then there exist indices $n \leq i_0 < \ldots < i_{m-1}$ such that, letting $i = (i_j : j < m)$, the formula $\psi$ is equivalent to:
$(\exists z \in D_\Phi)\, \bigl( (z_{<n} = x_{<n}) \wedge (z_i = y_{<m}) \bigr)$.

Proof. We choose $i_j$ by induction on $j < m$, such that $\varphi_{i_j}(x_{<i_j}, z)$ is
$(\exists y_{<m})\, \bigl[ \psi(x_{<n}, y_{<m}) \wedge (y_j = z) \wedge (y_{<j} = x_{i_{<j}}) \bigr]$.
With this choice, our assertion is easy to check. ∎

Lemma 2.10.

For any approximate bijection $\varphi$ between $D_\Phi$ and $D_\Psi$, and for any $j$, there exists a definable map $f : D_\Phi \to S_{y_j}$, where $S_{y_j}$ denotes the sort of $y_j$, such that $\varphi(x, y) \wedge \bigl(y_j = f(x)\bigr)$ is again an approximate bijection.

Proof. We may express the sort of $y_j$ as a definable subset of something of the form $(S_0 \times \cdots \times S_{m-1})/E$ for some basic sorts of $\Phi$ and definable equivalence relation $E$. Let $n$ be larger than any $i$ such that $x_i$ appears in $\varphi$. Let $\psi(x_{<n}, \bar{z})$ be the formula
$D_{\Phi,n}(x_{<n}) \wedge \exists y\, \bigl( \varphi(x, y) \wedge y_j = [\bar{z}]_E \bigr)$.
Since $\varphi$ is assumed to be an approximate bijection, the formula $D_{\Phi,n}$ is equivalent to $(\exists \bar{z})\, \psi$. In other words, $\psi$ satisfies the hypothesis of Lemma 2.9, so let $i$ be as in the conclusion. We claim that $\varphi(x, y) \wedge \bigl(y_j = [x_i]_E\bigr)$ is an approximate bijection. Indeed, if $x \in D_\Phi$, then $\psi(x_{<n}, x_i)$ holds, so $y \in D_\Psi$ as desired exists. Conversely, if $y \in D_\Psi$, then a tuple $x_{<n}$ exists such that $D_{\Phi,n}(x_{<n}) \wedge \varphi(x_{<n}, y)$ holds, and a tuple $\bar{z}$ exists such that $y_j = [\bar{z}]_E$. Therefore $\psi(x_{<n}, \bar{z})$ holds, whence the existence of $x \in D_\Phi$ such that $x_i = \bar{z}$, and $y_j = [x_i]_E$. ∎

Proposition 2.11.

For any two sufficient families of sorts, and any two rich sequences $\Phi$ and $\Psi$ in these families, respectively, there exists a definable bijection $\sigma : D_\Phi \cong D_\Psi$.

Proof. Apply a back-and-forth construction using Lemma 2.10. More precisely, start with $\varphi_0(x, y) = \top$ (True). Then, given $\varphi_n$, apply Lemma 2.10 twice to find definable $f_n$ and $g_n$ such that
$\varphi_{n+1}(x, y) = \varphi_n(x, y) \wedge x_n = f_n(y) \wedge y_n = g_n(x)$
is an approximate bijection. Together, these yield the desired definable bijection. ∎
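The last sentence of the proof can be unpacked as follows. This is a sketch of one way to read "together, these yield", under the assumption that the maps $f_n$ and $g_n$ are exactly those produced at each stage of the back-and-forth; the displayed formulas are our notation for the limit object.

```latex
\begin{align*}
\sigma(x) &= \bigl( g_n(x) : n \in \mathbf{N} \bigr), \\
\sigma^{-1}(y) &= \bigl( f_n(y) : n \in \mathbf{N} \bigr), \\
\operatorname{graph}(\sigma) &=
   \Bigl\{ (x,y) \in D_\Phi \times D_\Psi : \bigwedge_{n \in \mathbf{N}} \varphi_n(x,y) \Bigr\}.
\end{align*}
% Each coordinate y_n of \sigma(x) is given by the definable map g_n, and each
% coordinate x_n of \sigma^{-1}(y) by f_n, so \sigma is definable coordinate by
% coordinate. That every x has a partner y (and conversely) at the limit uses
% that each finite stage \varphi_n is an approximate bijection.
```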

One usually defines a bi-interpretation between $T$ and $T'$ as a pair of interpretation schemes of one in the other, such that, when composed to yield an interpretation of $T$ or of $T'$ in itself, the models are uniformly definably isomorphic to their interpreted copies. It is however fairly easy to check that this is equivalent to the property that the theory obtained by adjoining to $T$ the sort of $T'$ (without forgetting anything), and the one that is obtained by adjoining to $T'$ the sort of $T$, are the same up to a change of language. This, together with Proposition 2.11, yields:

Theorem 2.12.

Let $T$ and $T'$ be bi-interpretable, and let $\Phi$ and $\Psi$ be rich sequences for the languages of $T$ and of $T'$, respectively. Then $\mathbf{G}_\Phi(T)$ and $\mathbf{G}_\Psi(T')$ are isomorphic as topological groupoids.

In other words, up to isomorphism of topological groupoids, $\mathbf{G}_\Phi(T)$ does not depend on $\Phi$, and only depends on $T$ up to bi-interpretation. From now on we may denote $\mathbf{G}_\Phi(T)$ by $\mathbf{G}(T)$, omitting $\Phi$.

When $T$ is $\aleph_0$-categorical (so, in particular, complete), we have already associated to $T$ a different object, the topological group $G(T) = \mathrm{Aut}(M)$, where $M$ is any countable model of $T$. Viewing $G(T)$ as a topological groupoid, it is distinct from $\mathbf{G}(T)$, since the base of $G(T)$ is a singleton (its identity). Our next result says that this is the only difference between the two.

Let $G$ be a topological group and $B$ a topological space. The set $B \times G \times B$ is naturally a groupoid based over $B$, with composition law $(x, g, y)(y, f, z) = (x, gf, z)$.

Definition 2.13.

Say that a topological groupoid $\mathbf{G}$ is trivially based if it is isomorphic, as a topological groupoid, to a groupoid of the form $B \times G \times B$ (where $B$ is necessarily the base of $\mathbf{G}$). A trivialising section for $\mathbf{G}$ is a continuous map $g : B \to \mathbf{G}$ such that $t \circ g = \mathrm{id}_B$ and $s \circ g$ is constant.

Fact 2.14.

A topological groupoid $\mathbf{G}$ over $B$ is trivially based if and only if it admits a trivialising section. In this case $\mathbf{G} \cong B \times G \times B$, where $G \cong \mathbf{G}_e = \{ g \in \mathbf{G} : s_g = t_g = e \}$ for any $e \in B$.

Proof. Assume first that $\mathbf{G}$ is trivially based, say $\mathbf{G} = B \times G \times B$. Let $e \in B$. Then $G \cong \mathbf{G}_e$, and $g(e') = (e', 1, e)$ is a trivialising section. Conversely, assume that $g$ is a trivialising section, say $s \circ g \equiv e$, and let $G = \mathbf{G}_e$. Then $f^g = g(t_f)^{-1} f g(s_f) \in G$ for all $f \in \mathbf{G}$, and $f \mapsto \bigl(t_f, f^g, s_f\bigr)$ is the desired isomorphism $\mathbf{G} \cong B \times G \times B$. ∎

Proposition 2.15.

Let $T$ be $\aleph_0$-categorical, and let $G(T)$ be the automorphism group of its countable model. Then $\mathbf{G}(T) \cong 2^{\mathbf{N}} \times G(T) \times 2^{\mathbf{N}}$.

Proof. Let $\Phi$ be rich and let $q(x) \in B_\Phi(T)$. We have already observed that $B_\Phi(T) \cong 2^{\mathbf{N}}$. Let $q_n(x_{<n})$ be the restriction of $q$ to $x_{<n}$, and for $\ell \geq n$, let $q_{n,\ell}(x_{<n}, x_\ell)$ be the restriction of $q$ to $x_{<n}, x_\ell$. We define $A_n \in \mathbf{N}$ such that if $b \vDash q$, then any 1-type over $b_{<n}$ is realised by $b_i$ for some $n \leq i < A_n$. We then define $B_0 = 0$, $B_{k+1} = A_{B_k}$. Then, if $B_k \leq n < B_{k+1}$, we choose $m(n) > m(n-1)$ such that $\varphi_{m(n)}(x_{<m(n)}, y)$ is the formula saying that
• if $q_{n+1}(x_{m(0),\ldots,m(n-1)}, x_k)$ holds, then $y = x_k$,
• and otherwise, if $n < \ell < A_n$ is least such that $q_{n,\ell}(x_{m(0),\ldots,m(n-1)}, x_k)$ holds (such $\ell$ must exist), then $q_{n+1,\ell}(x_{m(0),\ldots,m(n-1)}, y, x_k)$.

Let $a \in D_\Phi$ and $b = m^*(a) = (a_{m(i)} : i \in \mathbf{N})$. One proves by induction on $n$ that $q_n(b_{<n})$ must hold. Indeed, in the second case such a minimal $\ell$ must exist by choice of $A_n$, and in either case a $y$ as desired must exist, so $\varphi_{m(n)}(a_{<m(n)}, b_n)$ holds, and implies $q_{n+1}(b_{\leq n})$.

We also claim that $a$ and $b$ enumerate the same set, and more precisely, that $a_k = b_\ell$ for some $B_k \leq \ell < B_{k+1}$. Indeed, assume that $\mathrm{tp}(b_{<B_k}, a_k) = q_{B_k,\ell}$, where $B_k \leq \ell < B_{k+1}$ is least. Then by induction on $B_k \leq n \leq \ell$ we have $\mathrm{tp}(b_{<n}, a_k) = q_{n,\ell}$. In particular we have $\mathrm{tp}(b_{<\ell}, a_k) = q_{\ell,\ell} = q_{\ell+1}$, so $b_\ell = a_k$.

Therefore, if $p = \mathrm{tp}(a) \in B(T)$, then $g(p) = \mathrm{tp}\bigl(a, m^*(a)\bigr) \in \mathbf{G}(T)$, and $g : B(T) \to \mathbf{G}(T)$ is a trivialising section. It is easy to check that $\mathbf{G}(T)_q \cong G(T)$, concluding the proof. ∎
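The algebra behind Fact 2.14, on which the proof above relies, can be checked by hand in a groupoid of the form $B \times G \times B$. The computation below is a sketch using only the composition law $(x, g, y)(y, f, z) = (x, gf, z)$; to avoid a clash with the section $g$, the group element is written $h$, which is our notational choice.

```latex
% Trivial groupoid B x G x B with (x,g,y)(y,f,z) = (x,gf,z).
\begin{align*}
(x,h,y)^{-1} &= (y,h^{-1},x), \\
s_{(x,h,y)} &= (y,h^{-1},x)(x,h,y) = (y,1,y), \\
t_{(x,h,y)} &= (x,h,y)(y,h^{-1},x) = (x,1,x).
\end{align*}
% The section g(e') = (e',1,e) has target (e',1,e') and constant source (e,1,e).
% For f = (x,h,y), so that t_f = (x,1,x) and s_f = (y,1,y):
\begin{align*}
f^g = g(t_f)^{-1}\, f\, g(s_f) &= (e,1,x)\,(x,h,y)\,(y,1,e) = (e,h,e).
\end{align*}
% Identifying (e,h,e) with h and (x,1,x) with x, the map
% f \mapsto (t_f, f^g, s_f) sends (x,h,y) back to (x,h,y).
```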

3. RECONSTRUCTING A CLASSICAL THEORY

We turn to reconstruction, namely, recovering $T$, up to bi-interpretation, from the topological groupoid $\mathbf{G} = \mathbf{G}(T) = \mathbf{G}_\Phi(T)$, for some (any) choice of $\Phi$. Members of $\mathbf{G}$ represent 2-types in $D_\Phi$, and we are soon going to see that we can recover formulas in two (imaginary sort) variables as subsets of $\mathbf{G}$, most importantly definable equivalence relations. If we want to recover formulas in $k$ variables, we need an analogue of $\mathbf{G}$ for $k$-types in $D_\Phi$. This can be constructed directly from $\mathbf{G}$ (that is to say, without knowing that it is of the form $\mathbf{G}_\Phi(T)$), as follows.

Definition 3.1.

Let $\mathbf{G}$ be a topological groupoid, and let $k \in \mathbf{N}$. We define $\mathbf{G}^{k/t}$ as the $k$-fold $t$-fibred power (the first $e$ is not really necessary unless $k = 0$):
$\mathbf{G}^{k/t} = \{ (e, g) \in B \times \mathbf{G}^k : e = t_{g_0} = t_{g_1} = \cdots \}$.
It is equipped with natural maps
$t : \mathbf{G}^{k/t} \to B$, $(e, g) \mapsto e$, and $s : \mathbf{G}^{k/t} \to B^k$, $(e, g) \mapsto (s_{g_0}, \ldots, s_{g_{k-1}})$,
and with corresponding groupoid actions $\mathbf{G} \curvearrowright \mathbf{G}^{k/t} \curvearrowleft \mathbf{G}^k$.

When $k \geq 1$, we define
$\mathbf{G}^{[k]} = \mathbf{G} \backslash \mathbf{G}^{k/t} = \{ \mathbf{G} g : g \in \mathbf{G}^{k/t} \}$,
equipped with the quotient topology and the induced action $\mathbf{G}^{[k]} \curvearrowleft \mathbf{G}^k$.

We have a natural homeomorphism $\theta : \mathbf{G}^{k/t} \cong \mathbf{G}^{[k+1]}$:
$\theta : (e, h) \mapsto \mathbf{G}(e, e, h)$, $\theta^{-1} : \mathbf{G}(e, g) \mapsto (s_{g_0}, g_0^{-1}g_1, \ldots, g_0^{-1}g_k)$.
This homeomorphism sends the actions $\mathbf{G} \curvearrowright \mathbf{G}^{k/t} \curvearrowleft \mathbf{G}^k$ to $\mathbf{G}^{[k+1]} \curvearrowleft \mathbf{G}^{k+1}$:
$\theta(g \cdot p \cdot h) = \theta(p) \cdot (g^{-1}, h)$.
In particular, $B \cong \mathbf{G}^{[1]}$ and $\mathbf{G} \cong \mathbf{G}^{[2]}$, replacing the double action $\mathbf{G} \curvearrowright \mathbf{G} \curvearrowleft \mathbf{G}$ with $\mathbf{G} \curvearrowleft \mathbf{G}^2$ ($g \cdot (f, h) = f^{-1}gh$).

When $\mathbf{G} = \mathbf{G}_\Phi(T)$, it follows that $\mathbf{G}^{[k]}$ can be identified with the space of types $p = \mathrm{tp}(a, b, c, \ldots) \in S^k_{D_\Phi}(T)$ such that $a$, $b$, $c$, and so on all enumerate the same model. Indeed, we may identify such $p$ with $(e, g) \in \mathbf{G}^{(k-1)/t}$, where $e = \mathrm{tp}(a)$, $g_0 = \mathrm{tp}(a, b)$, $g_1 = \mathrm{tp}(a, c)$, and so on (it is easy to check that this identification is homeomorphic), and therefore with $\mathbf{G}(e, e, g) \in \mathbf{G}^{[k]}$. From now on we shall just pretend that $\mathbf{G}^{[k]}$ is given in this fashion as a subspace of $S^k_{D_\Phi}(T)$, so $\mathbf{G} = \mathbf{G}^{[2]}$. The action $\mathbf{G}^{[k]} \curvearrowleft \mathbf{G}^k$ is then easy to describe: if $p = \mathrm{tp}(a, b, \ldots) \in \mathbf{G}^{[k]}$, $g = \mathrm{tp}(a, a')$, $h = \mathrm{tp}(b, b')$ and so on, then $p \cdot (g, h, \ldots) = \mathrm{tp}(a', b', \ldots)$.

Let also $\varphi(x^i : i < k)$ be a formula with $x^i \in D_\Phi$. We then define
$[\varphi] = \bigl\{ p \in S^k_{D_\Phi}(T) : \varphi \in p \bigr\}$, $[\varphi]_{\mathbf{G}} = [\varphi] \cap \mathbf{G}^{[k]} \subseteq \mathbf{G}^{[k]}$.
When $k = 2$, we may identify $\mathbf{G}^{[2]}$ with $\mathbf{G}$, and $\bigl[\varphi(x, y)\bigr]_{\mathbf{G}}$ with a subset of $\mathbf{G}$, accordingly. Let us understand how the various actions above relate to this interpretation of formulas.

We say that $\varphi$ only uses $n$ variables if of each $x^i$, which is an infinite tuple of variables, only the first $n$ ones, denoted $x^i_{<n}$, actually occur freely in $\varphi$.

Lemma 3.2.

Let $\varphi(x, y)$ and $\psi(y, z)$ be formulas with variables in $D_\Phi$, and let $\chi(x, z)$ be the formula $(\exists y)(\varphi \wedge \psi)$ (which is indeed a formula, as per Lemma 2.5). Then $[\varphi]_{\mathbf{G}} [\psi]_{\mathbf{G}} = [\chi]_{\mathbf{G}}$, where all are viewed as subsets of $\mathbf{G}$.

More generally, let $\varphi(x^i : i < k)$ be a formula, and for each $i < k$, let $\psi_i(x^i, y^i)$ be a formula, with all variables in $D_\Phi$. Let $\chi(y^i : i < k)$ be the formula $(\exists x^0, x^1, \ldots)\bigl(\varphi \wedge \psi_0 \wedge \psi_1 \wedge \cdots\bigr)$. Then
$[\varphi]_{\mathbf{G}} \cdot \bigl( [\psi_0]_{\mathbf{G}} \times [\psi_1]_{\mathbf{G}} \times \cdots \bigr) = [\chi]_{\mathbf{G}}$,
where $[\varphi]_{\mathbf{G}}$ and $[\chi]_{\mathbf{G}}$ are subsets of $\mathbf{G}^{[k]}$, each $[\psi_i]_{\mathbf{G}}$ is viewed as a subset of $\mathbf{G}$, and the dot represents the action $\mathbf{G}^{[k]} \curvearrowleft \mathbf{G}^k$.

Proof. For the first identity, the inclusion $[\varphi]_{\mathbf{G}} [\psi]_{\mathbf{G}} \subseteq [\chi]_{\mathbf{G}}$ is clear. For the opposite inclusion assume that $\mathrm{tp}(a, c) \in [\chi]_{\mathbf{G}}$. Then $a$ and $c$ both enumerate the same model $M$. Assuming that $\varphi$ and $\psi$ only use $n$ variables, there exists a tuple $b_{<n} \in D_{\Phi,n}(M)$ such that $\varphi(a_{<n}, b_{<n})$ and $\psi(b_{<n}, c_{<n})$ hold. By Lemma 2.5, we may extend $b_{<n}$ to a sequence $b \in D_\Phi$ that enumerates $M$. Then $\mathrm{tp}(a, b) \in [\varphi]_{\mathbf{G}}$, $\mathrm{tp}(b, c) \in [\psi]_{\mathbf{G}}$, and their product is $\mathrm{tp}(a, c)$.

The proof of the second, superficially more complex, case is essentially identical. ∎

Let $S^k_{D_{\Phi,n}}(T)$ denote the space of $k$-types in $D_{\Phi,n}$. We let $\pi = \pi_{k,n} : S^k_{D_\Phi}(T) \to S^k_{D_{\Phi,n}}(T)$ denote the natural projection $\mathrm{tp}(a, b, \ldots) \mapsto \mathrm{tp}(a_{<n}, b_{<n}, \ldots)$, and let $\pi_{\mathbf{G}} = \pi_{k,n,\mathbf{G}} : \mathbf{G}^{[k]} \to S^k_{D_{\Phi,n}}(T)$ denote its restriction to $\mathbf{G}^{[k]}$.

Lemma 3.3.

Let $k, n \in \mathbf{N}$, $k \geq 1$.
(i) The map $\pi : S^k_{D_\Phi}(T) \to S^k_{D_{\Phi,n}}(T)$ is continuous, closed, open and onto.
(ii) We have $\pi(U) = \pi(U \cap \mathbf{G}^{[k]})$ for every open $U \subseteq S^k_{D_\Phi}(T)$.
(iii) The restricted map $\pi_{\mathbf{G}} : \mathbf{G}^{[k]} \to S^k_{D_{\Phi,n}}(T)$ is open and onto as well.

Proof. Continuity of $\pi$ (and therefore of $\pi_{\mathbf{G}}$) is immediate, and together with compactness it implies that $\pi$ is closed. Openness of $\pi$ follows from the possibility to quantify (namely, Lemma 2.5): if $U = [\varphi] \subseteq S^2_{D_\Phi}(T)$ is a basic open set, then $\pi(U)$ is defined by the formula
$\psi(x_{<n}, y_{<n}) = (\exists z, w)\bigl( \varphi(z, w) \wedge (x_{<n} = z_{<n}) \wedge (y_{<n} = w_{<n}) \bigr)$.
Surjectivity follows from Lemma 2.5.

Let $U \subseteq S^k_{D_\Phi}(T)$ be open, and let $p = \mathrm{tp}(a^i : i < k) \in U$. Then there exists a formula $\varphi(x^i : i < k)$ such that $p \in [\varphi] \subseteq U$, and we may assume that $\varphi$ only uses $m$ variables for some $m \geq n$. Let $M$ be a countable model containing all the $a^i$. By Lemma 2.5, there exist $b^i \in D_\Phi(M)$ that enumerate $M$, such that $b^i_{<m} = a^i_{<m}$. Then $q = \mathrm{tp}(b^i : i < k) \in [\varphi]_{\mathbf{G}} \subseteq U \cap \mathbf{G}^{[k]}$ and $\pi(p) = \pi(q) \in \pi_{\mathbf{G}}(U \cap \mathbf{G}^{[k]})$.

It follows that $\pi_{\mathbf{G}}$ is open and onto as well. ∎

Let $E_n(x, y)$ be the definable equivalence relation $x_{<n} = y_{<n}$ (where $x, y \in D_\Phi$).

Lemma 3.4.

Let $n \geq 0$ and $k \geq 1$. Then the map $\varphi(x^i : i < k) \mapsto [\varphi]_{\mathbf{G}}$ defines a bijection between formulas in $D_\Phi$ that only use $n$ variables (up to logical equivalence modulo $T$) and clopen subsets $X \subseteq \mathbf{G}^{[k]}$ that are $[E_n]_{\mathbf{G}}$-invariant, i.e., such that $X = X \cdot [E_n]^k_{\mathbf{G}}$ (here $\cdot^k$ denotes Cartesian power).

Proof. Assume first that $\varphi(x^i : i < k)$ only uses $n$ variables, and let $X = [\varphi]_{\mathbf{G}}$. Then it is clearly clopen in $\mathbf{G}^{[k]}$, and it is $[E_n]_{\mathbf{G}}$-invariant by Lemma 3.2. It follows from Lemma 2.5 that $\mathbf{G}^{[k]}$ is dense in $S^k_{D_\Phi}(T)$. This implies in turn that if $[\varphi]_{\mathbf{G}} = [\varphi']_{\mathbf{G}}$, then $\varphi$ and $\varphi'$ must be equivalent modulo $T$.

To see that the map is onto, let $X \subseteq \mathbf{G}^{[k]}$ be clopen and $[E_n]_{\mathbf{G}}$-invariant. Consider the map $\pi_{\mathbf{G}} \colon \mathbf{G}^{[k]} \to S^k_{D_{\Phi,n}}(T)$, and let us prove that $\pi_{\mathbf{G}}(X) \cap \pi_{\mathbf{G}}(\mathbf{G}^{[k]} \smallsetminus X) = \emptyset$. Indeed, assume that $p \in X$ and $q \in \mathbf{G}^{[k]} \smallsetminus X$ have the same image $\pi_{\mathbf{G}}(p) = \pi_{\mathbf{G}}(q)$. We may write $p = \operatorname{tp}(a^i : i < k)$ and $q = \operatorname{tp}(b^i : i < k)$, where the $a^i$ enumerate some model, and the $b^i$ enumerate another. The hypothesis $\pi_{\mathbf{G}}(p) = \pi_{\mathbf{G}}(q)$ means that $(a^i_{<n} : i < k) \equiv (b^i_{<n} : i < k)$, and we may assume that equality holds: $a^i_{<n} = b^i_{<n}$ for all $i < k$. Let $M$ be a countable model containing everything.

Since $X$ is open, there exists a formula $\psi$ such that $p \in [\psi]_{\mathbf{G}} \subseteq X$. Similarly, $X$ is closed, so there exists a formula $\chi$ such that $q \in [\chi]_{\mathbf{G}} \subseteq \mathbf{G}^{[k]} \smallsetminus X$, and we may assume that both $\psi$ and $\chi$ only use $m$ variables for some $m \geq n$. By Lemma 2.5, as usual, we may find $c^i$ and $d^i$ that enumerate $M$, such that $c^i_{<m} = a^i_{<m}$ and $d^i_{<m} = b^i_{<m}$. Let
$$p' = \operatorname{tp}(c^i : i < k) \in [\psi]_{\mathbf{G}} \subseteq X, \qquad q' = \operatorname{tp}(d^i : i < k) \in [\chi]_{\mathbf{G}} \subseteq \mathbf{G}^{[k]} \smallsetminus X, \qquad g_i = \operatorname{tp}(c^i, d^i) \in [E_n]_{\mathbf{G}} \subseteq \mathbf{G}.$$
Then
$$q' = p' \cdot (g_i : i < k) \in X \cdot [E_n]^k_{\mathbf{G}} = X,$$
a contradiction.

Thus, we have indeed proved that $\pi_{\mathbf{G}}(X) \cap \pi_{\mathbf{G}}(\mathbf{G}^{[k]} \smallsetminus X) = \emptyset$. Since $\pi_{\mathbf{G}}$ is onto and open, it follows that $\pi_{\mathbf{G}}(X)$ is clopen in $S^k_{D_{\Phi,n}}(T)$. It is therefore defined by some formula $\varphi(x^i_{<n} : i < k)$. But then the same formula, with added dummy variables, defines $X$ in $\mathbf{G}^{[k]}$, concluding the proof. ∎

The last technical step is to get rid of the hypothesis involving $E_n$ in Lemma 3.4. Let $\mathcal{H}$ denote the collection of clopen sub-groupoids of $\mathbf{G}$ that contain $\mathbf{B}$:
$$\mathcal{H} = \bigl\{ H \subseteq \mathbf{G} \text{ clopen} : H = HH^{-1} \supseteq \mathbf{B} \bigr\}.$$

Lemma 3.5.

Every $H \in \mathcal{H}$ contains $[E_n]_{\mathbf{G}}$ for some $n$.

Proof. Let $H \in \mathcal{H}$. If $e \in \mathbf{B}$, then $e \in H$, so $H$ contains a basic neighbourhood of $e$, i.e., one of the form $[\varphi]_{\mathbf{G}}$. If $e = \operatorname{tp}(a)$, then $\varphi(a, a)$ must hold. If $\varphi$ only uses $n$ variables, then we may replace it with $\varphi(x, x) \wedge E_n(x, y)$. In other words, for each $e \in \mathbf{B}$ there exist a formula $\varphi_e(x)$ and $n_e \in \mathbf{N}$ such that $e \in U_e = [\varphi_e \wedge E_{n_e}]_{\mathbf{G}} \subseteq H$. By compactness, there is a finite family $e_i$ for $i < m$ such that $\mathbf{B} \subseteq \bigcup_{i<m} U_{e_i}$. Then $\mathbf{B} \subseteq \bigcup_{i<m} [\varphi_{e_i}]$, so
$$[E_n]_{\mathbf{G}} = \bigcup_{i<m} [\varphi_{e_i} \wedge E_n]_{\mathbf{G}} \subseteq \bigcup_{i<m} U_{e_i} \subseteq H,$$
where $n = \max n_{e_i}$. ∎

We can now reconstruct $T$ from $\mathbf{G}$. For this we need to recover
• the sorts of $T$, and
• the formulas (definable subsets) on each finite product of sorts.

By sort we mean any interpretable sort, as in the discussion following Lemma 2.7: indeed, we have no way to distinguish the basic sorts from the interpretable ones. It follows from Proposition 2.11 that any such sort is of the form (i.e., in definable bijection with) $D_{\Phi,n}/E$, for some $n$ and some definable equivalence relation $E$. With some abuse of notation, we may even write it as $D_\Phi/E$, where $E(x, y)$ is again a definable relation in which only $x_{<n}$ and $y_{<n}$ actually appear. The relation $E_n(x, y)$ which we defined earlier as $x_{<n} = y_{<n}$ is a definable equivalence relation, and any other definable equivalence relation on $D_\Phi$ coarsens $E_n$ for some $n$.

Lemma 3.6.

The map $E \mapsto [E]_{\mathbf{G}}$ defines a bijection between definable equivalence relations on $D_\Phi$ and $\mathcal{H}$. In addition, if $H = [E]_{\mathbf{G}}$, $a \in D_\Phi$ enumerates $M$ and $e = \operatorname{tp}(a)$, then the map $\operatorname{tp}(a, b) \mapsto [b]_E$ (the $E$-class of $b$) induces a bijection between the set $e\mathbf{G}/H = \{ gH : t_g = e \}$ and the sort $D_\Phi/E$ in $M$.

Proof. If $E$ is an equivalence relation on $D_\Phi$ and $H = [E]_{\mathbf{G}}$, then it is easy to check that $H \in \mathcal{H}$: in particular, $HH = H$ by Lemma 3.2. Conversely, if $H \in \mathcal{H}$, then by Lemma 3.5 and Lemma 3.4 it is of the form $[E]_{\mathbf{G}}$ for a unique formula $E(x, y)$. By the same reasoning, $E$ defines an equivalence relation: it is reflexive since $\mathbf{B} \subseteq H$; it is symmetric since $H = H^{-1}$; and it is transitive since $H = HH$, using Lemma 3.2.

For the second part, if $a \in D_\Phi$ is a fixed enumeration of $M$, then any $g = \operatorname{tp}(a, b) \in \mathbf{G}$ determines $b$, and $[b]_E \in D_\Phi/E$ in $M$. By Lemma 2.5, as usual, every member of $D_\Phi/E$ in $M$ is of this form. Finally, if $h = \operatorname{tp}(a, c) \in \mathbf{G}$, then:
$$[b]_E = [c]_E \iff E(b, c) \iff g^{-1}h = \operatorname{tp}(b, c) \in H \iff gH = hH.$$
Therefore the map $gH \mapsto [b]_E$ is injective, completing the proof. ∎

Now that we have recovered the sorts, we may recover formulas. Let $E_i$ be definable equivalence relations on $D_\Phi$ for $i < k$, and let $H_i = [E_i]_{\mathbf{G}}$.

Say that a formula $\varphi(x^i : i < k)$ with $x^i \in D_\Phi$ is $\bar{E}$-invariant if it is $E_i$-invariant in each $x^i$. Such a formula contains the exact same information as a formula $\tilde{\varphi}(\tilde{x}^i : i < k)$, with $\tilde{x}^i \in D_\Phi/E_i$, one being the pull-back of the other. In this case, the set $[\varphi]_{\mathbf{G}} \subseteq \mathbf{G}^{[k]}$ is clopen and $\bar{H}$-invariant, that is to say that $[\varphi]_{\mathbf{G}} = [\varphi]_{\mathbf{G}} \cdot (H_0 \times \cdots \times H_{k-1})$.

Lemma 3.7.

The map $\varphi \mapsto [\varphi]_{\mathbf{G}}$ defines a bijection between $\bar{E}$-invariant formulas (equivalently, formulas in the sorts $D_\Phi/E_0 \times \cdots \times D_\Phi/E_{k-1}$), up to logical equivalence modulo $T$, and $\bar{H}$-invariant clopen subsets of $\mathbf{G}^{[k]}$.

Moreover, assume that $a \in D_\Phi$ enumerates a model $M$, let $e = \operatorname{tp}(a)$, and let us identify $D_\Phi/E_i$ in $M$ with $e\mathbf{G}/H_i$ as per Lemma 3.6. In other words, a member $\tilde{b}^i$ of $D_\Phi/E_i$ is identified with $g_i H_i$, where $g_i = \operatorname{tp}(a, b^i)$ and $\tilde{b}^i = [b^i]_{E_i}$. Then $\operatorname{tp}(b^i : i < k) \in \mathbf{G}^{[k]}$, and
$$\varphi(\tilde{b}^i : i < k) \iff \operatorname{tp}(b^i : i < k) \in [\varphi]_{\mathbf{G}}.$$

Proof.

We have already observed that $[\varphi]_{\mathbf{G}}$ is a clopen $\bar{H}$-invariant set. For the converse direction, let $n$ be large enough that each $E_i$ only uses $n$ variables, and let $X \subseteq \mathbf{G}^{[k]}$ be clopen and $\bar{H}$-invariant. Then $X$ is also $[E_n]_{\mathbf{G}}$-invariant, and therefore of the form $[\varphi]_{\mathbf{G}}$ for a unique formula $\varphi(x^i : i < k)$, by Lemma 3.4. By Lemma 3.2, $\varphi$ must be $\bar{E}$-invariant. The moreover part is tautological. ∎

Together, Lemma 3.6 and Lemma 3.7 tell us how to recover sorts, formulas, and their interpretations in countable models.
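The correspondence of Lemma 3.6 has an elementary finite shadow that can be checked by machine. The sketch below is a toy illustration only, not the construction of the paper: it replaces the groupoid by the pair groupoid on a six-element set `X` (arrows are pairs, with $(a,b)(b,c) = (a,c)$), ignores the topology entirely, and uses hypothetical helper names (`compose`, `inverse`, `in_script_H`, `cosets`) introduced purely for this example. Under these assumptions, subsets $H$ with $H = HH^{-1}$ containing the base are exactly the equivalence relations, and the cosets $gH$ with $t_g = e$ are in bijection with the equivalence classes.

```python
from itertools import product


def compose(A, B):
    """Product of two subsets of the pair groupoid: (a, b)(b, c) = (a, c)."""
    return {(a, c) for (a, b) in A for (b2, c) in B if b == b2}


def inverse(A):
    """Inverse of a subset: (a, b)^{-1} = (b, a)."""
    return {(b, a) for (a, b) in A}


def in_script_H(H, X):
    """Check that H contains the base and satisfies H = H H^{-1}."""
    base = {(x, x) for x in X}
    return base <= H and compose(H, inverse(H)) == H


def cosets(e, H, X):
    """The set eG/H = {gH : t_g = e}; every g with t_g = e has the form (e, b)."""
    return {frozenset(compose({(e, b)}, H)) for b in X}


X = range(6)
# Toy equivalence relation "same residue modulo 3", viewed as a subset of X x X.
E = {(a, b) for a, b in product(X, X) if a % 3 == b % 3}
# A reflexive, symmetric relation that is not transitive.
R = {(x, x) for x in X} | {(0, 1), (1, 0), (1, 2), (2, 1)}

print(in_script_H(E, X))      # does E satisfy the defining condition?
print(in_script_H(R, X))      # does the non-transitive relation satisfy it?
print(len(cosets(0, E, X)))   # number of cosets gH with t_g = 0
```

In this toy setting the count of cosets can be compared directly with the number of residue classes, mirroring the identification of $e\mathbf{G}/H$ with the sort $D_\Phi/E$.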

Deﬁnition 3.8.

Let G be an open groupoid over B , and let H be the collection of clopen sub-groupoids of G that contain B . Deﬁne a language L ( G ) as follows: (i) It has one sort D H for each H ∈ H .(ii) It has a predicate symbol P X in the sorts D H = ( D H i : i < k ) for each sequence H = ( H i : i < k ) ofsuch sub-groupoids and clopen, H -invariant X ⊆ G [ k ] .For each e ∈ B we deﬁne an L ( G ) -structure M e . We interpret each sort D H as e G / H = { g H : t g = e } , andeach predicate symbol P X as (cid:8) ( g i H i : i < k ) : G ( e , g ) ∈ X (cid:9) .Finally, we deﬁne T ( G ) to be the L ( G ) -theory of the family { M e : e ∈ B } .The we have proven: Theorem 3.9.

Let $T$ be a classical theory and $\mathbf{G} = \mathbf{G}(T)$. Then $T(\mathbf{G})$ is bi-interpretable with $T$. Up to a change of language, its sorts consist of all interpretable sorts in $T$, with the full induced structure.
In particular, if $T'$ is another theory and $\mathbf{G}(T) \cong \mathbf{G}(T')$, then $T$ and $T'$ are bi-interpretable.

4. UNIVERSAL SKOLEM SORTS

So far we have only treated the case of a theory in classical logic, even though the correspondence between $\aleph_0$-categorical theories and their automorphism groups, which we seek to generalise, also applies in continuous logic (see [BK16]). Since we do not see how to generalise the construction of $D_\Phi$ to continuous logic, we shall follow here a different, more "axiomatic" path.

Throughout we work in the context of a complete theory $T$ in a countable language, in the sense of continuous logic. By definable (map, set, etc.) we always mean without parameters, unless explicitly said otherwise. A sort is any definable subset of an imaginary sort. More precisely, the family of all metric sorts is generated by closing the basic sort(s) (i.e., those named in the language) under the following operations:

• Infinite product: if $D_n$ is a sort for each $n$, then so is $\prod D_n$, equipped with any definable distance, say $d(x, y) = \sup_n 2^{-n} \wedge d(x_n, y_n)$. Formulas on an infinite product sort (i.e., with a variable in such a sort, and possibly other variables) are formulas on finite sub-products, as well as their uniform limits (so the proposed distance is indeed definable).

• Metric quotient: if $D$ is a sort and $d'$ is a definable pseudo-distance on $D$, then $D' = (D, d')$, obtained by dividing out the induced equivalence relation, is a sort as well. Notice that when applied to a structure that is not $\aleph_1$-saturated, one may also need to pass to the completion. Formulas on $D'$ are formulas on $D$ which are uniformly continuous with respect to $d'$. If $\varphi(x, y)$ is any formula with $x \in D$, then $\inf_{x'} \varphi(x', y) + N d'(x, x')$ ($N \in \mathbf{N}$) is uniformly continuous (even Lipschitz) with respect to $d'$, and formulas obtained in this fashion are dense among all formulas in $D'$.

• Subset: if $D$ is a sort and $E \subseteq D$ is a definable subset, then $E$ is a sort as well. Formulas on $E$ are restrictions of formulas on $D$. Recall that $E \subseteq D$ is a definable set if the distance to $E$ is definable in $D$, or equivalently, if for every formula $\varphi(x, y)$, where $x \in D$, the expression $\psi(y) = \inf_{x \in E} \varphi(x, y)$ is again a formula.

By an easy compactness argument, any two definable distances on a sort are uniformly equivalent. By the characterisation through quantifiers, the notion of a definable subset does not depend on the choice of a definable distance. In addition, if $D$ is a sort and $E \subseteq D$ is a definable subset, then any definable distance on $E$ extends to a definable pseudo-distance on $D$. It follows that up to a definable isometric bijection, any sort, equipped with any definable distance, is a definable subset of a metric quotient of a product of the generating sorts.

Let us be given a theory $T$ in a language $L$, together with a family of sorts as defined above. Let $L'$ extend $L$ with new basic sorts for the desired family of sorts, as well as new predicate symbols for formulas on any product of sorts (possibly restricting to a dense family of formulas). Then there exists a unique theory $T'$ extending $T$ which says that the new basic sorts and new symbols interpret the desired sorts and formulas on them. This adds no additional structure on the original sorts (i.e., every formula is equivalent modulo $T'$ to an $L$-formula), and each of the new basic sorts admits a canonical definable bijection with the corresponding subset-of-quotient-of-product.

Convention 4.1.

Throughout, inequalities are interpreted with a universal quantifier in the context of a given theory $T$, so for example, $\inf_y \varphi(x, y) \leq r$ means that the sentence $\sup_x \inf_y \varphi(x, y) \leq r$ is a consequence of $T$.

Definition 4.2.

Let $D$ and $E$ be sorts, $\varphi(x, y)$ be a formula in $D \times E$, and $\varepsilon > 0$. An $\varepsilon$-Skolem map for $\varphi$ is a definable map $\sigma \colon D \to E$ satisfying
$$\varphi(x, \sigma x) \leq \inf_y \varphi(x, y) + \varepsilon.$$

One of the obstacles in continuous logic is that in general, one cannot name new Skolem maps in the language: there is no natural continuity modulus for such a map, and one can even construct examples where any such map would have to be discontinuous.
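To see what Definition 4.2 says in the most familiar situation, consider the special case of a formula $\varphi$ taking only the values $0$ and $1$, with $0$ read as "true", as happens when a classical formula is viewed in continuous logic. This reading is an assumption made for the sake of the example. For any $0 < \varepsilon < 1$, an $\varepsilon$-Skolem map is then a Skolem function in the usual sense:

```latex
% Sketch: for a {0,1}-valued formula, an epsilon-Skolem map with epsilon < 1
% picks an actual witness whenever one exists.
\begin{align*}
&\varphi(x,\sigma x) \le \inf_y \varphi(x,y) + \varepsilon,
  \qquad \varphi \in \{0,1\},\quad 0 < \varepsilon < 1, \\
&\inf_y \varphi(a,y) = 0
  \;\implies\; \varphi(a,\sigma a) \le \varepsilon < 1
  \;\implies\; \varphi(a,\sigma a) = 0 .
\end{align*}
```

In other words, whenever some $y$ satisfies $\varphi(a, y)$, so does $\sigma a$. The genuinely new feature of the metric setting is that $\varepsilon$ cannot, in general, be taken to be $0$.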

Deﬁnition 4.3.

Let $D$ and $E$ be sorts.
(i) We say that $D$ is a Skolem sort for $E$ if every formula $\varphi(x, y)$ in $D \times E$ admits $\varepsilon$-Skolem maps for every $\varepsilon > 0$.
(ii) We say that $D$ is universal for $E$ if for every $\varepsilon > 0$ there exists a definable map $\sigma \colon D \to E$ such that the image of any $\varepsilon$-ball in $D$ is $\varepsilon$-dense in $E$.
We say that $D$ is a Skolem (universal) sort if it is for every sort $E$.

If $\varphi(x, y)$ is any formula in $D \times E$, then it has the same Skolem maps as $\varphi(x, y) - \inf_z \varphi(x, z)$. Therefore, we may restrict our attention to formulas satisfying $\inf_y \varphi = 0$. It is also sufficient to test for existence of Skolem maps for a dense family of formulas in $D \times E$. Combining the two observations, it suffices to test the existence of Skolem maps on a dense subset of $\{ \varphi : \inf_y \varphi = 0 \}$.

Since any two definable distances on $E$ are uniformly equivalent, universality does not depend on any choice of definable distance. A definable map has dense image in every model of $T$ if and only if it is surjective in any sufficiently saturated model.

Lemma 4.4.

Let $D$ and $(E_m : m \in \mathbf{N})$ be sorts. Let $F_k = \prod_{m<k} E_m$ and $F = \prod_m E_m$.
(i) If $D$ is Skolem for every $E_m$, then it is also for all $F_k$ and for $F$.
(ii) If $D$ is universal for every $F_k$, then it is also for $F$.

Proof. First of all, either hypothesis implies that there exist definable maps $D \to E_m$ for every $m$. Therefore, any definable map $D \to F_k$ can be lifted into a definable map $D \to F$.

Assume that $D$ is Skolem for $E$ and for $E'$ separately, and let $\varphi(x, y, y')$ be a formula in $D \times E \times E'$. Let $\sigma \colon D \to E$ be an $\varepsilon$-Skolem map for $\inf_{y'} \varphi(x, y, y')$, and let $\sigma' \colon D \to E'$ be an $\varepsilon$-Skolem map for $\varphi(x, \sigma x, y')$. Then $(\sigma, \sigma') \colon D \to E \times E'$ is a $2\varepsilon$-Skolem map for $\varphi$. It follows that if $D$ is Skolem for every $E_m$, then it is also Skolem for every $F_k$. Any formula in $D \times F$ can be approximated by a formula in $D \times F_k$, and an $\varepsilon$-Skolem map for the latter can be lifted to $F$ to give a, say, $2\varepsilon$-Skolem map for the former.

For universality, we may equip $F_k$ and $F$ with the distance $d(y, y') = \sup_m 2^{-m} \wedge d(y_m, y'_m)$. If $2^{-k} < \varepsilon$, $\sigma \colon D \to F_k$ is definable, and any $\varepsilon$-ball in $D$ has $\varepsilon$-dense $\sigma$-image in $F_k$, then the same holds for any lifting of $\sigma$ to $D \to F$. ∎

Lemma 4.5.

Let $D$ and $E$ be sorts. If $D$ is a universal (Skolem) sort for $E$, then it is also for any quotient sort $F$ of $E$.

Proof. For universality, this follows from the quotient map $\pi \colon E \to F$ being uniformly continuous with dense image. For the Skolem property, just replace $\varphi(x, z)$ with $\varphi(x, \pi y)$. ∎

Let us now combine the two properties (universality and Skolem), to obtain a Skolem map which gets all potential witnesses (more or less).
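The Skolem half of the proof of Lemma 4.5 is a one-line computation, sketched here under the notation of that proof: $\pi \colon E \to F$ is the quotient map, $\varphi(x, z)$ is a formula on $D \times F$, and $\sigma \colon D \to E$ is assumed to be an $\varepsilon$-Skolem map for the pulled-back formula $\varphi(x, \pi y)$ on $D \times E$. The composite $\pi \circ \sigma$ is our name for the resulting map into $F$.

```latex
% Sketch: pushing an epsilon-Skolem map through a quotient map \pi : E -> F.
\begin{align*}
\varphi\bigl(x, (\pi \circ \sigma) x\bigr)
  &\le \inf_{y \in E} \varphi(x, \pi y) + \varepsilon
     && \text{($\sigma$ is $\varepsilon$-Skolem for $\varphi(x, \pi y)$)} \\
  &= \inf_{z \in F} \varphi(x, z) + \varepsilon
     && \text{($\pi$ has dense image and $\varphi$ is continuous in $z$).}
\end{align*}
```

Hence $\pi \circ \sigma \colon D \to F$ is an $\varepsilon$-Skolem map for $\varphi$.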

Deﬁnition 4.6.

Let $\varphi(x, y)$ be a formula on $D \times E$ such that $\inf_y \varphi = 0$, and let $\sigma \colon D \to E$ be an $\varepsilon$-Skolem map for $\varphi$. We say that $\sigma$ is a combined $\varepsilon$-Skolem map for $\varphi$ if for every $(a, b) \in D \times E$, if $\varphi(a, b) = 0$, then $d\bigl(\sigma B(a, \varepsilon), b\bigr) < \varepsilon$. It is strong if under the same hypotheses, $b \in \sigma B(a, \varepsilon)$ in any model containing both $a$ and $b$ (and not merely in a saturated model).

Lemma 4.7.

Let $D$ and $E$ be sorts. Then $D$ is universal Skolem for $E$ if and only if, for every formula $\varphi(x, y)$ in $D \times E$ such that $\inf_y \varphi = 0$ and every $\varepsilon > 0$, there exists a combined $\varepsilon$-Skolem map $\sigma \colon D \to E$.

Proof. For right to left, the Skolem property is immediate, and for universality consider $\varphi = 0$. For the other direction, assume that $D$ is universal Skolem for $E$. Let $\delta > 0$ be such that $d(x, x'), d(y, y') < \delta$ imply $\bigl|\varphi(x, y) - \varphi(x', y')\bigr| < \varepsilon$, and by universality, let $\tau \colon D \to E$ be definable, such that the image of every $\delta$-ball is $\delta$-dense. Let $\psi(x, y)$ be the formula
$$\varphi(x, y) + \bigl(\varepsilon \mathbin{\dot{-}} \varphi(x, \tau x)\bigr) \wedge d(y, \tau x).$$
Considering the cases where $\varphi(x, \tau x) > \varepsilon$ and $\leq \varepsilon$ separately, we see that $\inf_y \psi \leq \varepsilon$ (in the first use the fact that $\inf_y \varphi = 0$, and in the second take $y = \tau x$). Let $\sigma$ be an $\varepsilon$-Skolem map for $\psi$. Then it is, in particular, a $3\varepsilon$-Skolem map for $\varphi$.

Assume now that $\varphi(a, b) = 0$. By hypothesis on $\tau$, there exists $a' \in B(a, \delta)$ such that $d(\tau a', b) < \delta$. It follows that $\varphi(a', \tau a') < \varepsilon$. Since $\psi(a', \sigma a') \leq \varepsilon$, we must have $d(\sigma a', \tau a') \leq \varepsilon$. We conclude that $d\bigl(\sigma B(a, \delta), b\bigr) < \varepsilon + \delta$, which is enough. ∎

Lemma 4.8.

Let D and E be sorts. Then the following are equivalent: (i)

For every formula ϕ ( x , y ) in D × E satisfying inf y ϕ = and every ε > there exists a strong ε -Skolem map σ : D → E. (ii) There exists a sort E ′ ⊇ E such that D is universal Skolem for E ′ .In particular, being universal Skolem for E passes to sub-sorts of E.Proof. In one direction, Skolem is immediate and a strong ε -Skolem map for the zero formula yields (a strongvariant of) universality. In the other direction, let ϕ ( x , y ) and ε > ϕ is always positive, wemay extend ϕ to a positive formula on D × E ′ , denoted ψ ( x , y ′ ) . In particular, inf y ′ ψ = η n = ( − − n − ) ε and let 0 < δ n < ε /2 n + be such that if d ( x , x ) + d ( y ′ , y ′ ) ≤ δ n , then (cid:12)(cid:12) ψ ( x , y ′ ) − ψ ( x , y ′ ) (cid:12)(cid:12) ≤ ε /2 n + .We construct a sequence of deﬁnable maps σ n : D → E ′ , such that d ( σ n x , E ) ≤ δ n and ψ ( x , σ n x ) ≤ η n . Wedeﬁne ψ ( x , y ′ ) = d ( y ′ , E ) + ψ ( x , y ′ ) , ψ n + ( x , y ′ ) = d ( y ′ , E ) + (cid:2) ψ ( x , y ′ ) − . ( η n + ε /2 n + ) (cid:3) + (cid:2) d ( σ n x , y ′ ) − . δ n (cid:3) .We have inf y ψ = σ n , for each a ∈ D there exists b ∈ E such that d ( σ n a , b ) ≤ δ n , so ψ ( a , b ) ≤ ψ ( a , σ n a ) + ε /2 n + ≤ η n + ε /2 n + and ψ n + ( a , b ) =

0. Therefore inf y ψ n + = ψ n admits a combined δ n -Skolem map σ n : D → E ′ . Then indeed d ( σ n x , E ) ≤ δ n . We alsohave ψ ( x , σ x ) ≤ δ < η and ψ ( x , σ n + x ) ≤ η n + ε /2 n + + δ n + < η n + , so the construction may proceed.We have d ( σ n , σ n + ) ≤ δ n + δ n + , so the sequence ( σ n ) converges uniformly to a deﬁnable map σ : D → E ′ .We have d ( σ x , E ) ≤ lim δ n =

0, so in fact σ : D → E , and ϕ ( x , σ x ) = ψ ( x , σ x ) ≤ lim η n = ε .Assume now that ϕ ( a , b ) ≤

0, and let us construct a sequence ( a n ) ⊆ D such that ψ n ( a n , b ) =

0. We startwith a = a (indeed, ψ ( a , b ) = σ n is combined δ n -Skolem for ψ n and ψ ( a n , b ) =

0, there exists a n + ∈ B ( a n , δ n ) such that d ( b , σ n a n + ) < δ n . We have ϕ ( a n , b ) ≤ η n − + ε /2 n + < η n , so ϕ ( a n + , b ) < η n + ε /2 n + .Therefore ψ n + ( a n + , b ) =

0, and the construction may proceed.The sequence ( a n ) converges to some a ′ ∈ D , where d ( a , a ′ ) < ∑ δ n < ε , and σ a ′ = b . If a ∈ D ( M ) and b ∈ E ( M ) for some M (cid:15) T , then the entire sequence can be constructed in D ( M ) , proving that σ is a strong ε -Skolem function for ϕ . (cid:4) Proposition 4.9.

A sort D is universal Skolem (for all sorts) if and only if it is Skolem for every basic sort, and universalfor any ﬁnite product of the basic sort(s).Proof.

One direction is immediate, and the other follows from Lemma 4.4, Lemma 4.5 and Lemma 4.8. (cid:4)

Let $D$ and $E$ be any two sorts. We equip the space of definable maps $\sigma \colon D \to E$ with the distance of uniform convergence
$$d(\sigma, \rho) = \sup_{x \in D} d(\sigma x, \rho x).$$
This renders the space of definable maps a complete separable metric space.

Theorem 4.10.

Let $D$ and $E$ be two sorts that are universal Skolem for each other. Then there exists a definable bijection $\sigma \colon D \cong E$.

Proof.

We construct surjective definable maps $\sigma_n \colon E \to D$ and $\rho_n \colon D \to E$ as follows. We start with $\sigma_0$, which exists by universality of $E$.

Assume now that $\sigma_n$ is known. Let $\varphi_n(x, y)$ be the formula $d(x, \sigma_n y)$. Then $\inf_x \varphi_n = 0$, and since $\sigma_n$ is surjective, $\inf_y \varphi_n = 0$. If $n = 0$, let $0 < \varepsilon_0 < 1$. If $n > 0$, since a definable map is uniformly continuous, choose $0 < \varepsilon_n < 2^{-n}$ such that
$$d(x, x') < \varepsilon_n \implies d(\rho_{n-1} x, \rho_{n-1} x') < 2^{-n}.$$
Then choose a strong $\varepsilon_n$-Skolem function $\rho_n \colon D \to E$ for $\varphi_n$. Since $\rho_n$ is Skolem, we have
$$d(x, \sigma_n \rho_n x) = \varphi_n(x, \rho_n x) < \varepsilon_n.$$
Since $\varphi_n(\sigma_n y, y) = 0$ and $\rho_n$ is strong, it is surjective.

Similarly, given $\rho_n \colon D \to E$ we construct a surjective definable $\sigma_{n+1} \colon E \to D$ such that
$$d(y, \rho_n \sigma_{n+1} y) < \delta_n < 2^{-n},$$
where
$$d(y, y') < \delta_n \implies d(\sigma_n y, \sigma_n y') < 2^{-n}.$$

Once the construction is complete, we have
$$d(\rho_n, \rho_{n+1}) \leq d(\rho_n, \rho_n \sigma_{n+1} \rho_{n+1}) + d(\rho_n \sigma_{n+1} \rho_{n+1}, \rho_{n+1}) < 2^{-n-1} + 2^{-n},$$
$$d(\sigma_n, \sigma_{n+1}) \leq d(\sigma_n, \sigma_n \rho_n \sigma_{n+1}) + d(\sigma_n \rho_n \sigma_{n+1}, \sigma_{n+1}) < 2^{-n} + 2^{-n}.$$
The sequences $(\sigma_n)$ and $(\rho_n)$ converge uniformly to definable maps $\sigma$ and $\rho$, and $\rho = \sigma^{-1}$. ∎

In particular, the universal Skolem sort, if it exists, is unique (up to a definable bijection). Let us point out a few general properties of universal Skolem sorts.
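The two estimates at the end of the proof of Theorem 4.10 can be unpacked pointwise. The sketch below assumes the constants are chosen as $\varepsilon_n, \delta_n < 2^{-n}$, with $\varepsilon_{n+1}$ a modulus of uniform continuity for $\rho_n$ at scale $2^{-n-1}$ and $\delta_n$ one for $\sigma_n$ at scale $2^{-n}$; this reading of the constants is our assumption about the intended choices.

```latex
% Sketch: pointwise version of the Cauchy estimates, for x in D and y in E.
\begin{align*}
d(x, \sigma_{n+1}\rho_{n+1}x) < \varepsilon_{n+1}
  &\implies d(\rho_n x, \rho_n\sigma_{n+1}\rho_{n+1}x) < 2^{-n-1}, \\
d(y', \rho_n\sigma_{n+1}y') < \delta_n < 2^{-n} \text{ with } y' = \rho_{n+1}x
  &\implies d(\rho_n\sigma_{n+1}\rho_{n+1}x, \rho_{n+1}x) < 2^{-n}, \\[4pt]
d(y, \rho_n\sigma_{n+1}y) < \delta_n
  &\implies d(\sigma_n y, \sigma_n\rho_n\sigma_{n+1}y) < 2^{-n}, \\
d(x', \sigma_n\rho_n x') < \varepsilon_n \le 2^{-n} \text{ with } x' = \sigma_{n+1}y
  &\implies d(\sigma_n\rho_n\sigma_{n+1}y, \sigma_{n+1}y) < 2^{-n}.
\end{align*}
```

Since $\sum_n 2^{-n} < \infty$, both sequences are Cauchy for the distance of uniform convergence. Passing to the limit in $d(x, \sigma_n \rho_n x) < \varepsilon_n$ and $d(y, \rho_n \sigma_{n+1} y) < \delta_n$ then gives $\sigma\rho = \mathrm{id}_D$ and $\rho\sigma = \mathrm{id}_E$.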

Lemma 4.11.

Let $D$ be a universal Skolem sort. The space of types in $D$, denoted $S_D(T)$, is homeomorphic to the Cantor space. Moreover, if $U \subseteq S_D(T)$ is clopen and non-empty, and $D_U \subseteq D$ consists of all realisations of types in $U$, then $D_U$ is definable in $D$, and is again a universal Skolem sort.

Proof. Assume that $p, q \in S_D(T)$ are distinct. Then there exists a formula $\varphi(x)$, say with values in $[0, 1]$, such that $\varphi(p) = 0$ and $\varphi(q) = 1$. Let $y$ be a variable in the sort $\{0, 1\}$, and define $\psi(x, y)$ to be $2\varphi(x) \mathbin{\dot{-}} 1$ if $y = 0$ and $1 \mathbin{\dot{-}} 2\varphi(x)$ if $y = 1$, so $\inf_y \psi = 0$. If $\sigma \colon D \to \{0, 1\}$ is $1/3$-Skolem, then it separates the type space into two clopen sets, one containing $p$ and the other $q$. This proves that $S_D(T)$ is totally disconnected.

Let $U, V \subseteq S_D(T)$ be non-empty, complementary clopen sets. By a compactness argument, $d(D_U, D_V) = r > 0$, so $D_U$ is a definable subset of $D$. Considering $r > \varepsilon > 0$, we see that $D_U$ is also universal, and it is clearly Skolem.

Since a universal sort must realise more than one type, this also shows that $S_D(T)$ has no isolated points. Being metrisable (since the language is countable), it is the Cantor set. ∎

Lemma 4.12.

If $D_0 \twoheadleftarrow D_1 \twoheadleftarrow \cdots$ is an inverse system of universal Skolem sorts with surjective definable maps, then its inverse limit is again a universal Skolem sort.

Proof. First of all, it is fairly easy to check that the inverse limit, call it $D$, is a definable subset of $\prod D_i$, so it is a sort. The maps $D \to D_i$ are definable and surjective, and since each $D_i$ is universal, any one of them can be used to show that $D$ is universal as well. For any sort $E$, any formula on $D \times E$ can be approximated arbitrarily well by a formula on $D_i \times E$ for some $i$ large enough, so $D$ is also Skolem. ∎

Lemma 4.13.

Let $D$ be a universal Skolem sort of $T$. Then $D \times 2$ and $D \times 2^{\mathbf{N}}$ are also universal Skolem sorts.

Proof. It follows from Lemma 4.11 and the uniqueness of the universal Skolem sort that $D$ admits a definable bijection with $D \times 2$. Now apply Lemma 4.12 to the inverse system consisting of $D \times 2^n$. ∎

Lemma 4.14.

Let $D$ be the universal Skolem sort of $T$. Then every $a \in D$ is interdefinable with a model, necessarily separable. Conversely, if $M \models T$ is a separable model, then the set of $a \in D(M)$ that are interdefinable with $M$ is dense in $D(M)$.

Proof. Assume that $M \models T$ and $a \in D(M)$. Let $N \subseteq M$ be the definable closure of $a$ in the basic sort(s). The existence of Skolem maps implies that $N \preceq M$ (by the Tarski-Vaught Criterion) and that $a$ and $N$ are interdefinable.

Now let us assume that $M$ is separable, and let $b$ be an enumeration of a dense countable sequence in $M$. Let $E$ denote the sort of $b$. By universality, for every $\varepsilon > 0$ there exists a definable map $\sigma \colon D \to E$ such that for all $a \in D(M)$ there exists $a' \in B(a, \varepsilon) \cap D(M)$ such that $\sigma a' = b$. Such $a'$ is necessarily interdefinable with $M$, proving density. ∎

Let us pass to the question of the existence of a universal Skolem sort. First of all, one need not always exist, as the following (admittedly pathological) example shows.

Example . Let L be a continuous signature, with bound one on the diameter, and a single unary 1-Lipschitz [

0, 1 ] -valued predicate symbol P . Let T be the theory saying that the distance is always either 0 or 1 and P hasdense image (i.e., the sentences sup x , y d ( x , y ) (cid:0) − d ( x , y ) (cid:1) and inf x | P ( x ) − r | vanish for every r ∈ [

0, 1 ] ). In anysufﬁciently saturated model of T , each r ∈ [

0, 1 ] is attained as P ( x ) for inﬁnitely many possible values of x ,and a back-and-forth argument between two such models shows that T eliminates quantiﬁers. In particular, T is complete, and S ( T ) is the interval [

0, 1 ] .Assume that T admits a Skolem sort D , and let E denote the home sort. Then there exists a map σ : D → E such that P ( σ x ) < D is a sort and σ is deﬁnable, ϕ ( x ) = d ( x , img σ ) is a formula (we may alsoexpress it as inf y ∈ D d ( x , σ y ) ). It is 0/1-valued, so it cuts S ( T ) = [

0, 1 ] into two non-trivial clopen sets, acontradiction.Therefore T cannot admit a Skolem sort. Our deﬁnition of a universal Skolem sort was motivated by D Φ of Section 2. Let us now justify this formally. Proposition 4.16.

Assume that $T$ is classical. Then viewed as a theory in continuous logic, the set $D_\Phi$, constructed in Definition 2.6, is a universal Skolem sort.

Proof. Assume that $T$ is single-sorted, for simplicity of the definition of $D_\Phi$ and of the argument presented here. That $D_\Phi$ is a definable set, i.e., a sort, follows immediately from the fact that for each $n$, the set of $n$-tuples which can be extended to a member of $D_\Phi$ is definable (by $D_{\Phi,n}$). The sort $D_\Phi$ is Skolem for the basic sort $E$ by construction.

For universality, we may assume that $D_\Phi$ is equipped with the distance $d(x, y) = \inf \bigl\{ 2^{-n} : x_{<n} = y_{<n} \bigr\}$. Given any $k$ and $\varepsilon > 0$, we may choose $m_0 < m_1 < \cdots < m_{k-1}$ such that each $\varphi_{m_i}$ is always true and $2^{-m_0} < \varepsilon$. Then the map $D_\Phi \to E^k$ that sends $x \mapsto (x_{m_i} : i < k)$ is surjective on any $\varepsilon$-ball, so $D_\Phi$ is universal for $E^k$. By Proposition 4.9, this is enough. ∎

When $T$ is $\aleph_0$-categorical, classical or continuous, we can give another construction of a universal Skolem sort. It generalises Proposition 2.15 to the continuous case (and, in a sense, explains it better).

Proposition 4.17.

Assume that T is ℵ -categorical. Let a enumerate a dense subset of a model M (cid:15) T, let D be thetype of a (a deﬁnable set, since T is ℵ -categorical). Then D = D × N is a universal Skolem sort.Proof. It will sufﬁce to show that D is universal Skolem for every sort E . Let a variable in D be denotedˆ x = ( x , ˜ x ) , where x ∈ D and ˜ x ∈ N .In order to show that D is a Skolem sort, let ϕ ( ˆ x , y ) be a formula on D × E such that inf y ϕ =

0. We mayassume that ϕ only depends on the ﬁrst k entries of ˜ x (by density of such formulas). In other words, we mayview ϕ as a formula on D × k × E , and write ϕ ( x , ℓ , y ) where ℓ < k . For each ℓ < k , choose b ℓ ∈ E ( M ) suchthat ϕ ( a , ℓ , b ℓ ) < ε . Let σ : D × k → E be the map which sends ( a , ℓ ) b ℓ (and ( a ′ , ℓ ) to the unique b ′ suchthat a ′ b ′ ≡ ab ℓ ). Then σ is deﬁnable, and we may view it as a map σ : D → E that only depends on the ﬁrst k bits. It is ε -Skolem by construction.In order to show that D is universal, let us ﬁx ε . By the Ryll-Nardzewski/Henson characterisation of ℵ -categoricity (see [BU07]), the type space S D × E ( T ) is metrically compact, so it contains a ﬁnite, ε -dense se-quence ( p ℓ : ℓ < k ) . Let p ℓ = tp ( a ℓ , b ℓ ) . We may choose a ′ ℓ ∈ D such that b λ ∈ dcl ( a ′ ℓ ) and such that d ( a λ , a ′ ℓ ) is arbitrarily small. We may therefore assume that b ℓ ∈ dcl ( a ℓ ) , and in fact that a ℓ = a and b ℓ ∈ E ( M ) for all ℓ .Deﬁne σ : D → E as in the previous paragraph. Now, for any b ∈ E ( M ) , there exist ℓ and a ′ , b ′ (cid:15) p ℓ , possiblyoutside M , such that d ( a ′ b ′ , ab ℓ ) < ε . In particular, σ ( a ′ , ℓ ) = b ′ , so inf x d ( x , a ) ∨ d (cid:0) σ ( x , ℓ ) , b (cid:1) < ε (we use ∨ asinﬁx notation for the maximum). This is almost good enough: if we code ℓ not in the ﬁrst k bits, but sufﬁcientlyfarther along the inﬁnite sequence that is ˜ x , we obtain, for any ˜ a ∈ N :inf x , ˜ x d ( x , a ) ∨ d ( ˜ x , ˜ a ) ∨ d (cid:0) σ ( x , ˜ x ) , b (cid:1) < ε ,concluding the proof. (cid:4) When a universal Skolem sort exists, it allows us to associate to T a canonical (or almost) bi-interpretabletheory. Deﬁnition 4.18.

Let T be a theory and D a universal Skolem sort. We deﬁne T D to be the theory of the sort D together with the induced structure.The full induced structure on D is given by naming all formulas with variables in D by predicate symbols.Since the language of T is assumed countable, the set of all n -ary formulas is separable for each n , and naminga countable dense subset is just as good. Lemma 4.19.

Let $T$ be a theory admitting a universal Skolem sort. Then $T_D$ is bi-interpretable with $T$. Conversely, up to choice of language, and in particular of distance (among all definable distances), the theory $T_D$ only depends on the bi-interpretation class of $T$, and in particular, does not depend on the choice of universal Skolem sort.

Proof. Consider the theory $T'$ consisting of $T$ with all its basic sorts, together with $D$ as an additional sort, and all the induced structure on the entire family of sorts. This is an interpretation expansion of both $T$ (since $D$ is a sort) and of $T_D$ (since all sorts are quotients of $D$), so $T$ and $T_D$ are bi-interpretable. Independence of the choice of $D$ follows from Theorem 4.10. ∎

5. THE GROUPOID ASSOCIATED TO A THEORY WITH A UNIVERSAL SKOLEM SORT

Before introducing any hypotheses, let us prove the following technical fact.

Lemma 5.1.

Let T be any theory in a countable language. (i)

Let A and B be any two sorts of T, and let X ⊆ S A , B ( T ) be the set of types tp ( a , b ) , where a ∈ A, b ∈ B, and bis deﬁnable from a. Then X is a G δ subset of S A , B ( T ) . (ii) Let C be an additional sort, and let Y ⊆ S B , C ( T ) be the set of types tp ( b , c ) , where b ∈ B, c ∈ C, and c isdeﬁnable from b. Let X × B Y consist of all pairs ( p , q ) that agree on the type of the member of B. Any suchpair can be written as (cid:0) tp ( a , b ) , tp ( b , c ) (cid:1) , in which case c is deﬁnable from a, and we may deﬁne a compositionp ◦ q = tp ( a , c ) . Then ◦ : X × B Y → S A , C ( T ) is continuous.Proof. For ε >

0, and formula ϕ ( x , y ) in A × B , let U ε , ϕ ⊆ S A , B ( T ) be the open set deﬁned by ϕ ( x , y ) ∨ sup z , z ′ (cid:0) d ( z , z ′ ) − ϕ ( x , z ) − ϕ ( x , z ′ ) (cid:1) < ε .Let V ε = [ ϕ U ε , ϕ , W = \ ε > V ε .Let p ( x , y ) = tp ( a , b ) ∈ X . Then d ( y , b ) is deﬁnable with parameter a , i.e., d ( y , b ) = ϕ ( a , y ) for some formula ϕ ( x , y ) , in which case p ∈ U ε , ϕ for all ε >

0, and therefore p ∈ W . Conversely, assume that p ∈ W , and let ε >

0. Then there exists a formula ϕ such that p ∈ U ε , ϕ . But then the diameter of the set of realisations of p ( a , y ) is at most 3 ε , and since ε was arbitrary, b is the unique realisation of p ( a , y ) , so p ∈ X . We conclude that W = X , and it is G δ by construction.Let again p ( x , y ) = tp ( a , b ) ∈ X , and let q ( y , z ) = tp ( b , c ) ∈ Y , so p ◦ q = tp ( a , c ) . A neighbourhood oftp ( a , c ) can be assumed to be deﬁned by a condition ϕ ( x , z ) <

1, where ϕ ( a , c ) =

0. Since c is deﬁnable from b ,we may express ϕ ( x , c ) as ψ ( x , b ) . Let χ ( y , z ) = sup x (cid:12)(cid:12) ϕ ( x , z ) − ψ ( x , y ) (cid:12)(cid:12) .Then ψ ( a , b ) = χ ( b , c ) =

0, and (cid:0) X ∩ [ ψ < ] (cid:1) ◦ (cid:0) Y ∩ [ χ < ] (cid:1) ⊆ [ ϕ < ] . (cid:4) From this point onward, assume that T is a complete theory in a countable continuous language, admittinga universal Skolem sort D . We let S mD ( T ) denote the space of types in m variables in the sort D (i.e., in D m ). Deﬁnition 5.2.

We define $\mathbf{G}(T)$ (or $\mathbf{G}_D(T)$, if we want to be explicit) as the set of all types $\operatorname{tp}(a, b) \in S^2_D(T)$ such that $\operatorname{dcl}(a) = \operatorname{dcl}(b)$. We shall implicitly identify a type $\operatorname{tp}(a, a) \in \mathbf{G}(T)$ with $\operatorname{tp}(a)$, and let $\mathbf{B}(T) = S_D(T)$ be the collection of all such types.

The groupoid structure is defined as in Definition 2.1:
$$\operatorname{tp}(a, b) \cdot \operatorname{tp}(b, c) = \operatorname{tp}(a, c), \qquad \operatorname{tp}(a, b)^{-1} = \operatorname{tp}(b, a).$$

Proposition 5.3.

As defined in Definition 5.2, $\mathbf{G}(T)$ is an open Polish topological groupoid. Its base is $\mathbf{B}(T)$, which is homeomorphic to the Cantor set, and the action $\mathbf{G}(T) \curvearrowright \mathbf{B}(T)$ is minimal (i.e., all orbits are dense). As a topological groupoid, $\mathbf{G}(T)$ only depends on the bi-interpretation class of $T$ (in particular, it does not depend on $D$).

Proof. It is easy to check that $\mathbf{G}(T)$ is a groupoid over $\mathbf{B}(T)$, with source and target maps given by
$$g = \operatorname{tp}(a, b) \implies t_g = \operatorname{tp}(a), \quad s_g = \operatorname{tp}(b).$$
It is a Polish topological groupoid by Lemma 5.1, and $\mathbf{B}(T)$ is homeomorphic to the Cantor space by Lemma 4.11. Since the universal Skolem sort is unique up to a definable bijection, $\mathbf{G}(T)$ only depends on the bi-interpretation class of $T$.

To see that $\mathbf{G}(T) \curvearrowright \mathbf{B}(T)$ is minimal, let $V = [\varphi(x) > 0] \subseteq \mathbf{B}(T)$ be a non-empty basic open set, and let $e \in \mathbf{B}(T)$. Then $e = \operatorname{tp}(a)$ for some $a \in D$, which codes a separable model $M$. Since $T$ is complete and $V \neq \emptyset$, $T$ must imply that $\sup_x \varphi > 0$, and so there exists $b \in D(M)$ such that $\varphi(b) > 0$. By the density clause in Lemma 4.14, there exists $c \in D(M)$ arbitrarily close to $b$ that codes $M$ as well. Taking $d(b, c)$ small enough we have $\varphi(c) > 0$, and $g = \operatorname{tp}(c, a) \in \mathbf{G}(T)$ sends $e$ into $V$.

To see that $\mathbf{G}(T)$ is open, let $U = [\varphi(x, y) > 0] \subseteq \mathbf{G}(T)$ be a basic open set, and let $V \subseteq \mathbf{B}(T)$ be defined by $\sup_x \varphi(x, y) > 0$. If $g = \operatorname{tp}(a, b) \in U$, then clearly $\operatorname{tp}(b) \in V$. Conversely, if $\operatorname{tp}(b) \in V$, and $M$ is the model coded by $b$, then $b \in D(M)$, so there exists $a \in D(M)$ such that $\varphi(a, b) > 0$. By Lemma 4.14, there exists $a' \in D(M)$ arbitrarily close to $a$ such that $\operatorname{dcl}(a') = M$, i.e., $g = \operatorname{tp}(a', b) \in \mathbf{G}(T)$. Taking $d(a', a)$ small enough we have $\varphi(a', b) > 0$, i.e., $g \in U$. In either case, $s_g = \operatorname{tp}(b)$, so $V = s(U)$ and $\mathbf{G}(T)$ is open. ∎

When $T$ is classical, the sort $D_\Phi$ is universal Skolem by Proposition 4.16, so our construction generalises that of Section 2. When $T$ is $\aleph_0$-categorical, if $G(T) = \operatorname{Aut}(M)$ for any separable $M \models T$, then $\mathbf{G}(T) = 2^{\mathbf{N}} \times G(T) \times 2^{\mathbf{N}}$, by Proposition 4.17, generalising Proposition 2.15.

We turn to the reconstruction of $T$, up to bi-interpretation, from the topological groupoid $\mathbf{G} = \mathbf{G}(T)$, relative to some fixed universal Skolem sort $D$. We shall attempt to keep this as close as possible to what was done in Section 3, despite some unavoidable differences. Our precise aim is to recover the theory $T_D$, in the single sort $D$ (and not in all the interpretable sorts, of which there are uncountably many). Similarly, aiming to recover a metric sort (rather than discrete ones), the role of clopen sub-groupoids will be taken over by compatible (semi-)norms.

Definition 5.4.

Let $X$ be a topological space. By a neighbourhood of a (usually compact) subset $K \subseteq X$ we mean any set containing an open set containing $K$. A basis of neighbourhoods for $K$ is a family of neighbourhoods that is cofinal among all neighbourhoods with respect to inverse inclusion.

Definition 5.5. A semi-norm on a groupoid $\mathbf{G}$ is a function $\rho \colon \mathbf{G} \to \mathbf{R}^+$ which vanishes on $\mathbf{B}$ and satisfies
$$\rho(g) = \rho(g^{-1}), \qquad \rho(fg) \le \rho(f) + \rho(g) \quad \text{when } fg \text{ is defined.}$$
It is a norm if it vanishes only on $\mathbf{B}$, and it is compatible (with the topology) if it is continuous and the sets $\{\rho < r\} = \{ g \in \mathbf{G} : \rho(g) < r \}$ form a basis of neighbourhoods for $\mathbf{B}$.

Clearly, any two compatible norms $\rho$ and $\rho'$ must be uniformly equivalent: for every $\varepsilon > 0$ there exists $\delta > 0$ such that $\{\rho < \delta\} \subseteq \{\rho' < \varepsilon\}$, and vice versa. However, a compatible norm on a topological groupoid does not suffice to recover the topology (while it does for a topological group), and assuming that the sets $\{\rho < r\}$ form a basis of neighbourhoods for $\mathbf{B}$ does not imply that $\rho$ is continuous. For our purposes it will suffice to keep in mind the analogy with Section 3: a 0/1-valued continuous semi-norm is the same thing as the 0-characteristic function of a clopen sub-groupoid $H \le \mathbf{G}$ that contains $\mathbf{B}$ (i.e., $\rho(g) = 0$ if $g \in H$ and $\rho(g) = 1$ otherwise).

We shall consider formulas in two variables in the sort $D$, since the generalisation to more variables is straightforward. Such a formula $\varphi(x,y)$ defines a continuous bounded function that will also be denoted $\varphi \colon S^2_D(T) \to \mathbf{R}$. Its restriction to $\mathbf{G}$ will be denoted $\varphi_{\mathbf{G}}$. We may also write
$$[\varphi < r] = \{ p \in S^2_D(T) : \varphi(p) < r \}, \qquad [\varphi < r]_{\mathbf{G}} = [\varphi < r] \cap \mathbf{G}.$$
Given any two bounded functions $\xi, \zeta \colon \mathbf{G} \to \mathbf{R}$, let us define
$$(\xi * \zeta)(f) = \inf \{ \xi(g) + \zeta(h) : f = gh \}.$$
In particular, any semi-norm satisfies $\rho * \rho = \rho$. The analogue of Lemma 3.2 is:

Lemma 5.6.

Let $\varphi(x,y)$ and $\psi(y,z)$ be formulas with variables in $D$, and let $\chi(x,z)$ be the formula $\inf_y (\varphi + \psi)$. Then $\varphi_{\mathbf{G}} * \psi_{\mathbf{G}} = \chi_{\mathbf{G}}$.

Proof. The inequality $\ge$ is clear. For the opposite inequality assume that $f = \mathrm{tp}(a,c) \in \mathbf{G}$ and $\chi_{\mathbf{G}}(f) = \chi(a,c) < r$. Then $a$ and $c$ both code the same separable model $M$, and there exists $b \in D(M)$ such that $\varphi(a,b) + \psi(b,c) < r$. By Lemma 4.14, we can find $b' \in D(M)$, arbitrarily close to $b$, that also codes $M$. This means that $g = \mathrm{tp}(a,b') \in \mathbf{G}$ and $h = \mathrm{tp}(b',c) \in \mathbf{G}$. Since formulas are always uniformly continuous, we may choose $b'$ close enough to $b$ that
$$\varphi_{\mathbf{G}}(g) + \psi_{\mathbf{G}}(h) = \varphi(a,b') + \psi(b',c) < r.$$
In addition, $f = gh$, so $(\varphi_{\mathbf{G}} * \psi_{\mathbf{G}})(f) < r$ as well. ∎

It follows that if $d$ is any definable distance on $D$ (and we might as well fix one now), then $d_{\mathbf{G}}$ is a continuous norm on $\mathbf{G}$. The following is the analogue of Lemma 3.5:

Lemma 5.7.

If $d$ is a definable distance on $D$, then $d_{\mathbf{G}}$ is a compatible norm on $\mathbf{G}$.

Proof. We still need to show that every neighbourhood $U$ of $\mathbf{B}$ contains a set of the form $\{d_{\mathbf{G}} < r\}$. If $e \in \mathbf{B}$, then $U$ contains a basic neighbourhood of $e$, namely of the form $[\varphi < 1]_{\mathbf{G}} = \{ g \in \mathbf{G} : \varphi(g) < 1 \}$ for some formula $\varphi(x,y)$ that vanishes at $e$. Since $\varphi$ is uniformly continuous, for $r > 0$ small enough we have
$$e \in [\varphi(x,x) < \tfrac12]_{\mathbf{G}} \cap [d(x,y) < r]_{\mathbf{G}} \subseteq [\varphi < 1]_{\mathbf{G}} \subseteq U.$$
From this point we proceed as in the proof of Lemma 3.5, using compactness of $\mathbf{B}$ to find a finite cover $\mathbf{B} \subseteq \bigcup_{i<k} [\varphi_i(x,x) < \tfrac12]_{\mathbf{G}}$, where $\varphi_i$ is the formula chosen for a point $e_i \in \mathbf{B}$, and $r > 0$ that works for all the $e_i$, so $[d < r]_{\mathbf{G}} = \{d_{\mathbf{G}} < r\} \subseteq U$. ∎

The analogy of the next steps is somewhat less clear: we work exclusively within the sort $D$, so the projection $\pi$ of Section 3 has no analogue. Still, in some twisted way, the following is at least related to Lemma 3.3.

Lemma 5.8.

Let $U \subseteq \mathbf{G}$ be open, $d$ be a definable distance on $D$, and $\delta > 0$. Define $V = (U)_{d<\delta} \subseteq S^2_D(T)$ to be the set of all $p = \mathrm{tp}(a,b) \in S^2_D(T)$ for which there exists $g = \mathrm{tp}(c,d) \in U$ with $d(a,c) \vee d(b,d) < \delta$. Then $V$ is open in $S^2_D(T)$.

Proof. Indeed, let $p \in V$, as witnessed by $g \in U$. Since $U$ is open, there exists a basic open set $U_0 = [\varphi < 1]_{\mathbf{G}}$ such that $g \in U_0 \subseteq U$. Let
$$\chi(x,y) = \inf_{u,v} \left[ \varphi(u,v) \vee \frac{d(x,u)}{\delta} \vee \frac{d(y,v)}{\delta} \right].$$
Clearly, $p \in [\chi < 1]$.

Assume now that $\chi(a',b') < 1$. This is witnessed by some $c', d'$ such that $\varphi(c',d') < 1$ and $d(a',c') \vee d(b',d') < \delta$. Since every formula is uniformly continuous, this remains true if we move $c'$ and $d'$ by a sufficiently small amount. In particular, by Lemma 4.14, we may assume that $c'$ and $d'$ both code some model $M$. Then $\mathrm{tp}(c',d') \in U_0 \subseteq U$, and it witnesses that $\mathrm{tp}(a',b') \in V$.

We have thus shown that $p \in [\chi < 1] \subseteq V$, so $V$ is indeed open. ∎

At any rate, the following is analogous to Lemma 3.4.

Definition 5.9.

Say that a function $\xi \colon \mathbf{G} \to \mathbf{R}$ is uniformly continuous and continuous, or UCC, if it is continuous, and in addition, for every $\varepsilon > 0$ there exists a neighbourhood $U$ of $\mathbf{B}$ such that $|\xi(g) - \xi(h)| < \varepsilon$ whenever $h \in UgU$.

Remark 5.10. If $\rho$ is any compatible norm, then $\xi$ is UCC if and only if it is continuous, and for every $\varepsilon > 0$ there exists $\delta > 0$ such that $|\xi(g) - \xi(fgh)| < \varepsilon$ whenever $fgh$ is defined and $\rho(f) \vee \rho(h) < \delta$.

Lemma 5.11. If $\varphi(x,y)$ is a formula, then $\varphi_{\mathbf{G}} \colon \mathbf{G} \to \mathbf{R}$ is UCC, and conversely, every UCC function on $\mathbf{G}$ is of this form, for a unique formula $\varphi$.

Proof. The first assertion follows from standard facts: every formula is a uniformly continuous function of its arguments, and a continuous function of their types. For the converse, it will suffice to prove that a UCC function $\xi \colon \mathbf{G} \to \mathbf{R}$ extends to a (necessarily unique) continuous function on $S^2_D(T)$.

For this, let $p = \mathrm{tp}(a,b) \in S^2_D(T)$ and $\varepsilon > 0$. Let $d$ be a definable distance on $D$, and fix $\delta > 0$ as provided by Remark 5.10 for $\varepsilon$ and $\rho = d_{\mathbf{G}}$. We may choose a separable model $M$ that contains both $a$ and $b$. By Lemma 4.14 we may choose $c$ and $d$ that code $M$, and in addition $d(a,c) \vee d(b,d) < \delta$. In particular, $g = \mathrm{tp}(c,d)$ belongs to $\mathbf{G}$.

Without loss of generality we may assume that $\xi(g) = 0$, and let $U = \{|\xi| < \varepsilon\}$, an open subset of $\mathbf{G}$. Let $V = (U)_{d<\delta} \subseteq S^2_D(T)$ as in Lemma 5.8. Then $V$ is open in $S^2_D(T)$ and $p \in V$ by construction. In order to finish the proof, it will suffice to show that $|\xi| \le 2\varepsilon$ on $V \cap \mathbf{G}$.

So let $h' = \mathrm{tp}(a',b') \in V \cap \mathbf{G}$, and assume toward a contradiction that $\xi(h') > 2\varepsilon$. Let $g' \in U$ witness that $h' \in V$, so $g' = \mathrm{tp}(c',d')$ and $d(a',c') \vee d(b',d') < \delta$.

We can find a basic open set $g' \in U_0 = [\psi < 1]_{\mathbf{G}} \subseteq U$. Since $\psi$ is uniformly continuous, if $g'' = \mathrm{tp}(c'',d'') \in \mathbf{G}$ and $c''$ and $d''$ are close enough to $c'$ and $d'$, then
$$\bigl| \psi(c',d') - \psi(c'',d'') \bigr| < 1 - |\psi(c',d')|,$$
so $g'' \in U_0 \subseteq U$ as well. A similar consideration applies for $h' \in W = \{\xi > 2\varepsilon\}$. We may now apply Lemma 4.14 to find $a''$, $b''$, $c''$ and $d''$ that code a common model $M$ and are sufficiently close to $a'$, $b'$, $c'$ and $d'$, respectively, that
$$g'' = \mathrm{tp}(c'',d'') \in U, \qquad h'' = \mathrm{tp}(a'',b'') \in W, \qquad d(a'',c'') \vee d(b'',d'') < \delta.$$
But now
$$g'' = \mathrm{tp}(c'',a'') \cdot h'' \cdot \mathrm{tp}(b'',d''),$$
so $|\xi(h'') - \xi(g'')| < \varepsilon$ by choice of $\delta$. Since $|\xi(g'')| < \varepsilon$ and $\xi(h'') > 2\varepsilon$, this is a contradiction.

To sum up, for every $p \in S^2_D(T)$ and $\varepsilon > 0$ there exists a neighbourhood $V$ of $p$ such that $\xi$ varies by no more than $4\varepsilon$ on $V \cap \mathbf{G}$. It follows that $\xi$ can be extended to a continuous function on $S^2_D(T)$, i.e., to a formula. ∎

The following is clearly analogous to Lemma 3.6. If $\rho$ is a (semi-)norm and $f, g \in \mathbf{G}$ have the same target, let $d^\rho_L(f,g) = \rho(f^{-1}g)$, which defines a (pseudo-)distance on $e\mathbf{G}$ for each $e \in \mathbf{B}$ (the $L$ stands for left-invariant: $d^\rho_L(f,g) = d^\rho_L(hf,hg)$ whenever $t_f = t_g = s_h$).

Lemma 5.12.

The map $d \mapsto d_{\mathbf{G}}$ defines a bijection between definable distances on $D$ and compatible norms on $\mathbf{G}$.

In addition, let $d$ be such a distance and $\rho = d_{\mathbf{G}}$, let $a \in D$ code $M$ and $e = \mathrm{tp}(a)$, and let $D_0 \subseteq D(M)$ be the set of $b \in D(M)$ that code $M$. Then $\mathrm{tp}(a,b) \mapsto b$ is an isometric bijection of $(e\mathbf{G}, d^\rho_L)$ with $(D_0, d)$, that extends to an isometric bijection $\widehat{(e\mathbf{G}, d^\rho_L)} \cong (D(M), d)$.

Proof. We have already observed in Lemma 5.7 that if $d$ is a definable distance on $D$, then $d_{\mathbf{G}}$ is a compatible norm. For the converse, let $\rho$ be any compatible norm. Then it is UCC (by Remark 5.10), so $\rho = d_{\mathbf{G}}$ for some formula $d(x,y)$, which is necessarily the unique continuous extension of $\rho$ to $S^2_D(T)$.

We have $d(x,x) = 0$ since $\rho$ vanishes on $\mathbf{B}$, and $d(x,y) = d(y,x)$ since $\rho(g) = \rho(g^{-1})$. In addition, we have $\rho * \rho = \rho$, so by uniqueness of the reconstructed formula and Lemma 5.6,
$$d(x,z) = \inf_y \bigl( d(x,y) + d(y,z) \bigr).$$
Therefore $d$ defines a distance, and $\rho = d_{\mathbf{G}}$.

The second part is essentially tautological. ∎

As in Section 3, in order to recover formulas in several variables, we need to replace $\mathbf{G} \cong \mathbf{G}^{[2]}$ with $\mathbf{G}^{[k]} = \mathbf{G} \backslash \mathbf{G}^k / t$ for arbitrary $k \ge 1$, which we identify with the set of $\mathrm{tp}(a_i : i < k) \in S^k_D(T)$ for which all the $a_i \in D$ code the same model. In particular, $\mathbf{G}^{[k]} \subseteq S^k_D(T)$, and is even dense there. If $\varphi(x_i : i < k)$ is a formula, then we identify it with the corresponding continuous function $\varphi \colon S^k_D(T) \to \mathbf{R}$, and let $\varphi_{\mathbf{G}}$ be its restriction to $\mathbf{G}^{[k]}$. For $U \subseteq \mathbf{G}^{[k]}$ we may define $(U)_{d<\delta} \subseteq S^k_D(T)$ to consist of all $\mathrm{tp}(b_i : i < k)$ such that there exists $\mathrm{tp}(a_i : i < k) \in U$ satisfying $d(a_i, b_i) < \delta$ for all $i$. We say that $\xi \colon \mathbf{G}^{[k]} \to \mathbf{R}$ is UCC if it is continuous, and for every $\varepsilon > 0$ there exists a neighbourhood $\mathbf{B} \subseteq U$ such that, if $q \in p \cdot U^k$ (with respect to the action $\mathbf{G}^{[k]} \curvearrowleft \mathbf{G}^k$), then $|\xi(p) - \xi(q)| < \varepsilon$.

Lemma 5.13.

Let $k \ge 1$ and let $d$ be a definable distance on $D$.

(i) If $U \subseteq \mathbf{G}^{[k]}$ is open and $\delta > 0$, then $(U)_{d<\delta}$ is open in $S^k_D(T)$.

(ii) Let $\rho$ be a compatible norm on $\mathbf{G}$. A continuous function $\xi \colon \mathbf{G}^{[k]} \to \mathbf{R}$ is UCC if and only if for every $\varepsilon > 0$ there exists $\delta > 0$ such that, if $q \in p \cdot \{\rho < \delta\}^k$, then $|\xi(p) - \xi(q)| < \varepsilon$.

(iii) If $\varphi(x_i : i < k)$ is a formula, then $\varphi_{\mathbf{G}} \colon \mathbf{G}^{[k]} \to \mathbf{R}$ is UCC, and conversely, every UCC function on $\mathbf{G}^{[k]}$ is of this form, for a unique formula $\varphi$.

Proof. As for Lemma 5.8 and Lemma 5.11, mutatis mutandis. ∎

Theorem 5.14.

Let $T$ and $T'$ be two complete theories with universal Skolem sorts. Then $\mathbf{G}(T) \cong \mathbf{G}(T')$ as topological groupoids if and only if $T$ and $T'$ are bi-interpretable. Moreover, given $\mathbf{G} = \mathbf{G}(T)$ as a topological groupoid, we can reconstruct the theory $T_D$, up to a change of language and choice of distance on $D$ (among all definable distances).

Proof. For the first assertion, one direction has already been observed ($\mathbf{G}(T)$ only depends on $T$ up to bi-interpretation), and the other direction follows from the moreover part.

For the reconstruction, we must first choose (arbitrarily) a compatible norm $\rho$ on $\mathbf{G}$, which, by Lemma 5.12, is the same thing as choosing a definable distance $d$ on the universal Skolem sort $D$ (so $\rho = d_{\mathbf{G}}$). We define $L_D$ to consist of a single metric sort, together with a $k$-ary predicate symbol $P_\xi$ for each UCC function $\xi$ on $\mathbf{G}^{[k]}$ (or for a countable uniformly dense family of such functions).

We need to specify a bound and a continuity modulus for each symbol: if $\xi$ is UCC, then Lemma 5.13(ii) (with the chosen $\rho$) provides us with a modulus of continuity. In addition, $\xi$ is the restriction of a formula, and therefore bounded. In particular, we use the bound on $\rho$ for a bound on the diameter in $L_D$.

Next, for each $e \in \mathbf{B}$ we define $M_e = \widehat{(e\mathbf{G}, d^\rho_L)}$ (where we recall that $d^\rho_L(f,g) = \rho(f^{-1}g)$). We interpret each predicate $P_\xi$ on tuples $\bar g \in (e\mathbf{G})^k$ as:
$$P_\xi(\bar g) = \xi(\mathbf{G}\bar g).$$
It satisfies the prescribed bound and continuity modulus, and in particular extends continuously to all of $M_e$.

If $e = \mathrm{tp}(a)$, where $a$ codes $M$, and if we identify $P_\xi$ with the formula $\varphi$ of $T$ such that $\xi = \varphi_{\mathbf{G}}$, then $M_e$ is isomorphic to $D(M)$. Then the theory of any $M_e$ (or of the family of all of them) is, up to a change of language, $T_D$. ∎
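The reconstruction in the proof above uses two elementary facts that the text takes for granted: the identity $\rho * \rho = \rho$ for a semi-norm, and the fact that $d^\rho_L$ is a left-invariant (pseudo-)distance on each fibre $e\mathbf{G}$. The following is a sketch of both verifications, assuming only the semi-norm axioms of Definition 5.5 and the conventions $t_g = \mathrm{tp}(a)$, $s_g = \mathrm{tp}(b)$ for $g = \mathrm{tp}(a,b)$. The letter $k$ for a third groupoid element and the fibre notation $e\mathbf{G} = \{g : t_g = e\}$ are conventions adopted here for the illustration.

```latex
% Assumptions: \rho is a semi-norm on \mathbf{G} (Definition 5.5);
% fg is defined exactly when s_f = t_g; f, g, k all have target e.

% (1) \rho * \rho = \rho.
\begin{align*}
(\rho * \rho)(f) &= \inf\{\rho(g) + \rho(h) : f = gh\} \ge \rho(f)
   && \text{since } \rho(gh) \le \rho(g) + \rho(h), \\
(\rho * \rho)(f) &\le \rho(f) + \rho(s_f) = \rho(f)
   && \text{taking } g = f,\ h = s_f \in \mathbf{B}.
\end{align*}

% (2) d^\rho_L(f,g) = \rho(f^{-1}g) is a left-invariant pseudo-distance.
\begin{align*}
d^\rho_L(f,f) &= \rho(f^{-1}f) = \rho(s_f) = 0, \\
d^\rho_L(g,f) &= \rho\bigl((f^{-1}g)^{-1}\bigr) = \rho(f^{-1}g) = d^\rho_L(f,g), \\
d^\rho_L(f,k) &= \rho\bigl((f^{-1}g)(g^{-1}k)\bigr)
               \le d^\rho_L(f,g) + d^\rho_L(g,k), \\
d^\rho_L(hf,hg) &= \rho(f^{-1}h^{-1}hg) = \rho(f^{-1}g) = d^\rho_L(f,g)
   && \text{whenever } t_f = t_g = s_h.
\end{align*}

% (3) With \rho = d_{\mathbf{G}}, e = tp(a), f = tp(a,b), g = tp(a,c):
\[
d^\rho_L(f,g) = d_{\mathbf{G}}\bigl(\mathrm{tp}(b,a) \cdot \mathrm{tp}(a,c)\bigr)
              = d_{\mathbf{G}}\bigl(\mathrm{tp}(b,c)\bigr) = d(b,c).
\]
```

Step (3) is the computation behind the second part of Lemma 5.12: under the map $\mathrm{tp}(a,b) \mapsto b$, the distance $d^\rho_L$ on $e\mathbf{G}$ is simply $d$ restricted to the codes of $M$, which is why $M_e$ can be identified with $D(M)$ after completion.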

6. Further questions

We have intentionally kept this paper relatively short, with the bare minimum of associating $\mathbf{G}(T)$ to $T$ and reconstructing $T$ from $\mathbf{G}(T)$. Let us point out some further topics for research. On some of them progress has already been made, and they may be treated in a subsequent paper, while others are wide open.

General groupoids.

It follows from our work that $\mathbf{G} = \mathbf{G}(T)$ admits compatible norms, and that UCC functions on $\mathbf{G}$, or even on $\mathbf{G}^{[k]}$, separate points and closed sets (i.e., determine the topology). For a general topological groupoid, even assuming that it is open, (completely) metrisable and that $\mathbf{B}$ is the Cantor space, the best we can show is that it admits a semi-compatible norm, namely one such that the sets $\{\rho < r\}$ form a basis of open neighbourhoods of the identity, so it is upper semi-continuous, but not necessarily continuous. We can also show that under reasonable hypotheses, the existence of a compatible norm and of sufficiently many UCC functions are equivalent (this will appear in a subsequent paper).

In groups, UC functions (uniformly continuous with respect to the Roelcke uniformity) are analogous to our UCC functions, and are closely related to the Roelcke completion and the Roelcke compactification. A Polish group $G$ is of the form $G(T)$, for $\aleph_0$-categorical $T$, if and only if it is Roelcke pre-compact (see [BT16]), i.e., if and only if the compactification and completion agree.

Question. State general hypotheses under which a topological groupoid admits a compatible norm / sufficiently many UCC functions. The conditions must hold when $\mathbf{G} = \mathbf{G}(T)$, and not refer to the $\mathbf{G}(T)$ construction explicitly.

Question. Construct analogues of the Roelcke compactification and the Roelcke completion of a groupoid (possibly under certain hypotheses). When $\mathbf{G} = \mathbf{G}(T)$, both should be $S^2_D(T)$.

Question. Characterise topological groupoids of the form $\mathbf{G}(T)$. Ideally, the characterisation should be: first, some general conditions hold, ensuring in particular that the Roelcke completion makes sense, and second, the Roelcke completion is compact.

Universal Skolem sorts and possible generalisations.

We have only constructed $\mathbf{G}(T)$ when $T$ admits a universal Skolem sort. We know that this is true when $T$ is classical or $\aleph_0$-categorical. In his Ph.D. dissertation (in progress), Jorge Muñoz shows that if $T$ admits a universal Skolem sort, then so does its randomisation $T^R$ (see [BK09, Ben13]), giving an explicit construction of one sort from the other. On the other hand, in Example 4.15 we showed that a Skolem sort need not always exist.

Question. Can the $\mathbf{G}(T)$ construction be extended, or generalised, to all theories?

By generalise we mean something similar to how our groupoid construction and reconstruction relate to the $\aleph_0$-categorical situation: the groupoid $\mathbf{G}(T)$ is not the same as the group $G(T)$, but one can be trivially recovered from the other.

The category of interpretations.

We have shown that isomorphisms of $\mathbf{G}(T)$ and $\mathbf{G}(T')$ correspond to bi-interpretations of $T$ and $T'$. When $T$ and $T'$ are $\aleph_0$-categorical, interpretations of $T'$ in $T$ correspond to continuous morphisms $G(T) \to G(T')$ such that the isometric action $G(T) \curvearrowright \widehat{G(T')}_L$ has compactly many orbit closures (see [BK16]). More precisely, the category of interpretations of $\aleph_0$-categorical theories (modulo a reasonable equivalence relation) is equivalent to the category of Roelcke pre-compact Polish groups with such morphisms.

Question. Provide a correspondence between interpretations of $T'$ in $T$ and (special) morphisms of groupoids $\mathbf{G}(T) \to \mathbf{G}(T')$. The category of interpretations of complete countable theories should be equivalent to the category of groupoids $\mathbf{G}(T)$ with some condition on the morphisms.

Model theoretic properties.

One of the motivations for the present work lies in the fact that for an $\aleph_0$-categorical theory $T$, model-theoretic properties (in particular, stability and NIP) correspond to dynamical properties of the system $G \curvearrowright R$, where $G = G(T)$ and $R$ is its Roelcke completion/compactification (see [BT16, Iba16]).

Question. Extend the above to the case where $\mathbf{G} = \mathbf{G}(T)$, so the corresponding dynamical system should be $\mathbf{G}(T) \curvearrowright S^2_D(T)$.

Question. Together with work of Muñoz alluded to above, extend the preservation arguments of [Iba17] from $\aleph_0$-categorical theories to arbitrary ones (admitting a universal Skolem sort).

References

[AF13] Steve Awodey and Henrik Forssell, First-order logical duality, Annals of Pure and Applied Logic (2013), no. 3, 319–348, doi:10.1016/j.apal.2012.10.016.

[AZ86] Gisela Ahlbrandt and Martin Ziegler, Quasi-finitely axiomatizable totally categorical theories, Annals of Pure and Applied Logic (1986), no. 1, 63–82, Stability in model theory (Trento, 1984), doi:10.1016/0168-0072(86)90037-0.

[Ben09] Itaï Ben Yaacov, Continuous and random Vapnik-Chervonenkis classes, Israel Journal of Mathematics (2009), 309–333, doi:10.1007/s11856-009-0094-x, arXiv:0802.0068.

[Ben13] Itaï Ben Yaacov, On theories of random variables, Israel Journal of Mathematics (2013), no. 2, 957–1012, doi:10.1007/s11856-012-0155-4, arXiv:0901.1584.

[BIT18] Itaï Ben Yaacov, Tomás Ibarlucía, and Todor Tsankov, Eberlein oligomorphic groups, Transactions of the American Mathematical Society (2018), no. 3, 2181–2209, doi:10.1090/tran/7227, arXiv:1602.05097.

[BK09] Itaï Ben Yaacov and H. Jerome Keisler, Randomizations of models as metric structures, Confluentes Mathematici (2009), no. 2, 197–223, doi:10.1142/S1793744209000080, arXiv:0901.1583.

[BK16] Itaï Ben Yaacov and Adriane Kaïchouh, Reconstruction of separably categorical metric structures, Journal of Symbolic Logic (2016), no. 1, 216–224, doi:10.1017/jsl.2014.80, arXiv:1405.4177.

[BT16] Itaï Ben Yaacov and Todor Tsankov, Weakly almost periodic functions, model-theoretic stability, and minimality of topological groups, Transactions of the American Mathematical Society (2016), no. 11, 8267–8294, doi:10.1090/tran/6883, arXiv:1312.7757.

[BU07] Itaï Ben Yaacov and Alexander Usvyatsov, On d-finiteness in continuous structures, Fundamenta Mathematicae (2007), 67–88, doi:10.4064/fm194-1-4.

[Iba16] Tomás Ibarlucía, The dynamical hierarchy for Roelcke precompact Polish groups, Israel Journal of Mathematics (2016), no. 2, 965–1009, doi:10.1007/s11856-016-1399-1.

[Iba17] Tomás Ibarlucía, Automorphism groups of randomized structures, The Journal of Symbolic Logic (2017), no. 3, 1150–1179, doi:10.1017/jsl.2017.2, arXiv:1605.00473.

[Mac87] K. Mackenzie, Lie groupoids and Lie algebroids in differential geometry, London Mathematical Society Lecture Note Series, vol. 124, Cambridge University Press, Cambridge, 1987.

Itaï Ben Yaacov, Université Claude Bernard – Lyon 1, Institut Camille Jordan, CNRS UMR 5208, 43 boulevard du 11 novembre 1918, 69622 Villeurbanne Cedex, France