Ties between Parametrically Polymorphic Type Systems and Finite Control Automata

Abstract

We present a correspondence and bisimulation between variants of parametrically polymorphic type systems and variants of finite control automata, such as FSA, PDA, tree automata and Turing machine. Within this correspondence we show that two recent celebrated results on automatic generation of fluent API are optimal in certain senses, present new results on the studied type systems, formulate open problems, and present potential software engineering applications, other than fluent API generation, which may benefit from judicious use of type theory.


Extended Abstract

JOSEPH (YOSSI) GIL and ORI ROTH,

The Technion

CCS Concepts: • Software and its engineering → General programming languages; API languages; Polymorphism.

Additional Key Words and Phrases: type systems, automata, computational complexity, fluent API

Computational complexity of type checking is a key aspect of any type system. Several classical results characterize this complexity in type systems where the main type constructor is function application: Type checking in the Simply Typed Lambda Calculus (STLC), in which function application is the sole type constructor, is carried out in linear time. In the Hindley-Milner (HM) type system [Damas and Milner 1982; Hindley 1969; Milner 1978], obtained by augmenting the STLC with parametric polymorphism with unconstrained type parameters, type checking is harder, and was found to be deterministic exponential (DEXP) time complete [Kfoury et al. 1990]. However, type checking in the Girard–Reynolds type system [Girard 1971, 1972; Reynolds 1974] (System-F), which generalizes HM, is undecidable [Wells 1999].

In contrast, our work focuses on type systems where the main type constructor is pair (or tuple), i.e., no higher order functions. This type constructor models object based programming, including concepts such as records, classes and methods, but not inheritance. In particular, we investigate the computational complexity of such systems in the presence of parametric polymorphism, also called genericity, allowing generic classes and generic functions.

We acknowledge significant past work on more general systems modeling the combination of genericity with the object oriented programming paradigm, i.e., classes with single and even multiple inheritance. Type checking in these is particularly challenging, since inheritance may be used to place sub-type and super-type constraints on the parameters to generics. In fact, Kennedy and Pierce [2007] showed that, in general, such type systems are undecidable. Their work carefully analyzed the factors that may lead to undecidability, and identified three decidable fragments, but without analyzing their complexity.
In fact, the presumed decidability of C# […] 𝔗 of type systems, and variants of finite control automata, such as FSA, PDA, tree automata and Turing machines, organized in another conceptual lattice 𝔄.

With this correspondence we determine the exact computational complexity class of type checking of many, but not all, type systems in 𝔗; for other type systems, we provide upper and lower bounds, leaving the precise characterizations as open problems. We also show that two celebrated results on the fluent API problem are optimal in certain senses. The research also has practical applications for language design, e.g., Thm. 5.1 below shows that introducing functions whose return type is declared by keyword auto to C# […]

Recall that a fluent API generator transforms ℓ, the formal language that specifies the API, into 𝐿 = 𝐿(ℓ), a library of type definitions in some target programming language, e.g., Java, Haskell, or C++. Library 𝐿(ℓ) is the fluent API library of ℓ if an expression 𝑒 type checks (in the target language) against 𝐿(ℓ) if and only if word 𝑤 = 𝑤(𝑒) is in the API language ℓ,

𝑤(𝑒) ∈ ℓ ⇔ 𝑒 type checks against 𝐿(ℓ).

It is required that expression 𝑒 is in the form of a chain of method invocations. The word 𝑤 = 𝑤(𝑒) is obtained by enumerating, in order, the names of methods in the chain of 𝑒, e.g., a fluent API generator for C++ receives a language ℓ over alphabet Σ = {𝑎, 𝑏}, i.e., ℓ ⊆ {𝑎, 𝑏}∗, and generates as output a set of C++ definitions in which an expression such as

new Begin().a().b().a().a().end() (1.1)

type checks if, and only if, word 𝑎𝑏𝑎𝑎 belongs to language ℓ.

The most recent such generator is TypelevelLR due to Yamazaki, Nakamaru, Ichikawa and Chiba [2019].

TypelevelLR compiles an LR language ℓ into a fluent API library 𝐿(ℓ) in either Scala, C++, or Haskell (augmented with “four GHC extensions: MultiParamTypeClasses, FunctionalDependencies, FlexibleInstances, and UndecidableInstances”), but neither Java nor C#.

TypelevelLR makes it possible to switch between different front-ends to translate a context free grammar specification of ℓ into an intermediate representation. Different such front-ends are SLR, LALR, and LR(1) grammar processors. Similarly to traditional multi-language compilers, the front-ends compile the input specification into a library in Fluent, an intermediate language invented for this purpose; the back-ends of TypelevelLR translate 𝐿_Fluent into an equivalent library 𝐿 = 𝐿(𝐿_Fluent) in the target languages.

TypelevelLR strikes a sweet spot in terms of front-ends: It is a common belief that most programming languages are LR, so there is no reason for a fluent API generator to support any wider class of formal languages for the purpose of the mini programming language of an API. On the other hand, TypelevelLR's client may tune down the generality of TypelevelLR, by selecting the most efficient front-end for the grammar of the particular language of the fluent API.

We show that in terms of computational complexity, TypelevelLR strikes another sweet spot in selecting Fluent, specifically, that Fluent = LR (Thm. 5.3). Equality here is understood in terms of classes of computational complexity, i.e., for every set of definitions 𝐿 in Fluent, there exists an equivalent formal language ℓ = ℓ(𝐿) ∈ LR, and for every ℓ ∈ LR there exists an equivalent library 𝐿 = 𝐿(ℓ). Also, the term Fluent in the equality refers to the computational complexity class defined by the Fluent language. This abuse of notation is freely used henceforth.

Is there a similar sweet spot in the back-ends of TypelevelLR? Why didn’t TypelevelLR include either Fluent-into-Java or Fluent-into-C# […]

It follows from Thm. 5.3 that Fluent can be compiled into a type system 𝑇 only if 𝑇 is (computationally wise) sufficiently expressive, i.e., LR ⊆ 𝑇. But which are the features of 𝑇 that make it this expressive? Motivated by questions such as these, we offer in Sect. 3 a taxonomy, reminiscent of the 𝜆-cube [Barendregt 1991], for the classification of parametrically polymorphic type systems. The difference is that the 𝜆-cube is concerned with parametric polymorphism where the main type constructor is function application; our taxonomy classifies type systems built around the pairing type constructor, as found in traditional imperative and object oriented languages.

(Sect. B gives more precise definitions and motivation for the fluent API problem. LR stands for Left-to-right, Rightmost derivation [Knuth 1965].)

The taxonomy is a partially ordered set, specifically a lattice, 𝔗 of points spanned by six, mostly orthogonal characteristics. (See Table 3.1 below.) A point 𝑇 ∈ 𝔗 is a combination of features (values of a characteristic) that specify a type system, e.g., Fluent is defined by a combination of three non-default features, monadic-polymorphism, deep-argument-type, and rudimentary-typeof, of three characteristics; features of the three other characteristics take their default (lowest) value, linear-patterns, unary-functions, and one-type.

We say that 𝑇₁ is less potent than 𝑇₂ if 𝑇₁ is strictly smaller in the lattice order than 𝑇₂, i.e., any program (including type definitions and optionally an expression to type check) of 𝑇₁ is also a program of 𝑇₂. In writing 𝑇₁ = 𝑇₂ (𝑇₁ ⊊ 𝑇₂) we mean that the computational complexity class of 𝑇₁ is the same as (strictly contained in) that of 𝑇₂.

The Pp p Type System.

We employ 𝔗 to analyze Fling, yet another API generator [Gil and Roth 2019] (henceforth G&R), capable of producing output for Java and C# […] Fluent, type definitions produced by Fling belong to a very distinct fragment of type systems of Java and C# […] “unbounded unspecialized parametric polymorphism”, and we call Pp p here.

In plain words, Pp p refers to a type system in which genericity occurs only in no-parameters methods occurring in generic classes (or interfaces) that take one or more unconstrained type arguments, as in, e.g., List. 1.1. In terms of lattice 𝔗, type system Pp p is defined by feature polyadic-parametric-polymorphism of the “number of type arguments” characteristic (and default, least-potent feature value of all other characteristics).

Listing 1.1. An example non-sense program in type system Pp p: a Java class Program with three generic interface definitions named by symbols 𝛾, followed by an initializer with two expressions to check, one that type checks and one that yields a type check error.

We prove that Pp p = DCFL (Thm. 4.1), i.e., the computational complexity of Pp p is the same as that of Fluent. Further, we will see that type systems less potent than Pp p have lower computational complexity. Other results (e.g., Thm. 4.2 and Thm. 4.3) show that making it more potent would have increased its computational complexity.

Combining Theory and Practice: Fling+TypelevelLR architecture.

As Yamazaki et al. noticed, translating Fluent into a mainstream programming language is not immediate. Curiously, the type systems of all target languages of TypelevelLR are undecidable. However, it follows from Thm. 5.3 that the target language, from the theoretical complexity perspective, is only required to be at least as expressive as DCFL, as is the case in languages such as Java, ML, (vanilla) Haskell, and C#.

(On the name Pp p: read “plain parametric polymorphism”, or, “polyadic parametric polymorphism” here. On List. 1.1: although not intended to be executable, Java (and C++) code in these examples can be copied and pasted as is (including Unicode characters such as 𝛾) into, and then compiled on, contemporary IDEs. Exceptions are expressions included for demonstrating type checking failure.)

To bring theory into practice, notice that all these languages contain the Pp p type system. We envision a software artifact whose architecture combines TypelevelLR and Fling, making it possible to compile the variety of LR grammar processors into any programming language which supports code such as List. 1.1. Front ends of this “ultimate fluent API generator” are the same as those of TypelevelLR. However, instead of directly translating Fluent, we introduce a (rather straightforward) implementation, e.g., in Java, of the algorithm behind the proof of Thm. 5.3, plugging it as a back end of TypelevelLR. Concretely, the artifact compiles Fluent into a specification of a DPDA (deterministic pushdown automaton) as in the said proof. We then invoke (a component of) Fling to translate the DPDA specification into a set of Pp p type definitions. The back-ends of Fling are then employed to translate these type definitions to the desired target programming language.

Outline.

Sect. 2 presents the lattice 𝔄 of finite control automata on strings and trees, ranging from FSAs to Turing machines, and reminds the reader of the computational complexity classes of automata in this lattice, e.g., in terms of families of formal languages in the Chomsky hierarchy. The lattice of parametrically polymorphic type systems 𝔗 is then presented in Sect. 3. The presentation makes clear the bisimulation between the runs of certain automata and type checking in certain type systems, thereby obtaining complexity results of these type systems.

Sect. 4, concentrating on parallels between real-time automata and type systems, derives further complexity results. In particular, this section shows that Fling is optimal in the sense that no wider class of formal languages is used by Pp p, the type system it uses. Non real-time automata, and their relations to type systems which admit the typeof keyword, are the subject of Sect. 5. In particular, this section sets the computational complexity of Fluent and several variants. Sect. 6 then turns to discussing the ties between non-deterministic automata and type systems that allow an expression to have multiple types.

Sect. 7 concludes with a discussion, open problems and directions for future research.

While reading this paper, readers should notice extensive overloading of notation, made in an attempt to highlight the ties between automata and type systems. The list of symbols in Sect. A should help in obtaining a full grasp of this overloading. Appendices also include some of the more technical proofs and other supplementary material.

This section presents a unifying framework of finite control automata and formal languages, intended to establish common terminology and foundation for the description in the forthcoming Sect. 3 of parametrically polymorphic type systems and their correspondence to automata. Definitions here are largely self contained, but the discussion is brief; it is tacitly assumed that the reader is familiar with fundamental concepts of automata and formal languages, which we only re-present here.

We think of automata of finite control as organized in a conceptual lattice 𝔄. The lattice (strictly speaking, a Boolean algebra) is spanned by seven (mostly) orthogonal characteristics, such as the kind of input that an automaton expects, the kind of auxiliary storage it may use, etc. Overall, lattice 𝔄 includes automata ranging from finite state automata to Turing machines, going through most automata studied in the classics of automata and formal languages (see, e.g., Hopcroft, Motwani and Ullman [2007]).

Concretely, Table 2.1 defines lattice 𝔄, by row-wise enumeration of the said characteristics and the values that each may take. We call these values properties of the lattice.

Table 2.1. Seven characteristics and 18 properties spanning lattice 𝔄 of finite control automata. Columns: Characteristic; Values in increasing potence. Rows: No. states (Def. 2.2); Aux. storage (Sect. 2.3); Recognizer kind (Def. 2.1, Def. 2.9); 𝜀-transitions (Def. 2.5); Determinism (Def. 2.6); Rewrite multiplicity (Def. 2.7); Rewrite depth (Def. 2.8).

Values of a certain characteristic are mutually exclusive: For example, the first row in the table states that the first characteristic, number of states, can be either stateless (the finite control of the automaton does not include any internal states) or stateful (finite control may depend on and update an internal state). An automaton cannot be both stateful and stateless.

An automaton in 𝔄 is specified by selecting a value for each of the characteristics.

The table enumerates properties in each characteristic in increasing potence order. For example, in the “number of states” characteristic, stateful automata are more potent than stateless automata, in the sense that any computation carried out by 𝐴, a certain stateless automaton in 𝔄, can be carried out by automaton 𝐴′ ∈ 𝔄, where the only difference between 𝐴 and 𝐴′ is that 𝐴′ is stateful.

Each automaton in the lattice might be fully specified as a set of seven properties ⟨𝑝₁, …, 𝑝₇⟩. In the abbreviated notation we use, a property of a certain characteristic is mentioned only if it is not the weakest (least potent) in this characteristic. For example, the notation “⟨⟩” is short for the automaton with the least-potent property of all characteristics,

𝔄⊥ = ⟨⟩ = ⟨stateless, no-store, language, real-time, deterministic, shallow, linear⟩, (2.1)

the bottom of lattice 𝔄. Table 2.2 offers additional shorthand using acronyms of familiar kinds of automata and their mapping to lattice points. For example, the second table row maps FSAs to lattice point ⟨stateful, non-deterministic⟩.
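The abbreviated notation for lattice points can be illustrated with a minimal Python sketch. It is not part of the paper: the names `CHARACTERISTICS`, `expand`, and `at_most_as_potent` are hypothetical, the storage characteristic is deliberately truncated to three values whose relative order is an assumption, and the value name `non-linear` is assumed as well.

```python
# Each characteristic lists its values with the least potent first.
# Assumption: the storage list is truncated and its order is a guess;
# "non-linear" is an assumed name for the more potent multiplicity value.
CHARACTERISTICS = [
    ("states", ["stateless", "stateful"]),
    ("storage", ["no-store", "pushdown", "tree-store"]),
    ("recognizer", ["language", "forest"]),
    ("epsilon", ["real-time", "epsilon-transitions"]),
    ("determinism", ["deterministic", "non-deterministic"]),
    ("depth", ["shallow", "deep"]),
    ("multiplicity", ["linear", "non-linear"]),
]


def expand(*properties):
    """Expand an abbreviated lattice point into a full tuple of seven properties."""
    remaining = set(properties)
    point = []
    for _, values in CHARACTERISTICS:
        chosen = [v for v in values if v in remaining]
        if len(chosen) > 1:
            raise ValueError(f"mutually exclusive properties: {chosen}")
        # An unmentioned characteristic takes its least potent value.
        point.append(chosen[0] if chosen else values[0])
        remaining -= set(chosen)
    if remaining:
        raise ValueError(f"unknown properties: {sorted(remaining)}")
    return tuple(point)


def at_most_as_potent(a, b):
    """True if point a is below or equal to point b in every characteristic."""
    return all(values.index(x) <= values.index(y)
               for (_, values), x, y in zip(CHARACTERISTICS, a, b))


BOTTOM = expand()                            # the point written as <>
DFSA = expand("stateful")
FSA = expand("stateful", "non-deterministic")
```

Under these assumptions, `expand()` reproduces the bottom point of Eq. (2.1), and `at_most_as_potent(DFSA, FSA)` holds while the converse does not.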
Acronym | Common name | Lattice point | Complexity
DFSA | Deterministic Finite State Automaton | ⟨stateful⟩ | REG
FSA | Finite State Automaton | non-deterministic-DFSA = ⟨stateful, non-deterministic⟩ | REG
SRDPDA | Stateless Real-time Deterministic PushDown Automaton | ⟨pushdown, stateless, real-time, deterministic, shallow, linear⟩ | ⊊ DCFL
RDPDA | Real-time Deterministic PushDown Automaton | stateful-SRDPDA = ⟨pushdown, stateful, real-time, deterministic, shallow, linear⟩ | ⊊ DCFL
DPDA | Deterministic PushDown Automaton | 𝜀-transitions-RDPDA = ⟨pushdown, stateful, 𝜀-transitions, deterministic, shallow, linear⟩ | DCFL
TA | Tree Automaton | ⟨tree-store, stateless, real-time, shallow, linear⟩ | DCFL
PDA | PushDown Automaton | non-deterministic-DPDA = ⟨pushdown, stateful, 𝜀-transitions, non-deterministic, shallow, linear⟩ | CFL
RTM | Real-time Turing Machine | ⟨linearly-bounded-tape, stateful, real-time, deterministic, shallow, linear⟩ | ⊊ CSL
LBA | Linear Bounded Automaton | ⟨linearly-bounded-tape, shallow, linear⟩ ∨ FSA = ⟨linearly-bounded-tape, deterministic, shallow, linear⟩ | CSL
TM | Turing Machine | unbounded-tape-LBA = ⟨unbounded-tape, stateful, deterministic, shallow, linear⟩ | RE

REG ⊊ DCFL ⊊ CFL ⊊ CSL ⊊ PR ⊊ R ⊊ RE
⟨𝜀-transitions, non-deterministic⟩ ∨ FSA = FSA; [Autebert et al. 1997, Example 5.3]; deep-DPDA = DPDA; deep-PDA = PDA; non-deterministic-TM = TM

Table 2.2. Selected automata in the lattice 𝔄 and their computational complexity classes

Observe that just as the term pushdown automaton refers to an automaton that employs a pushdown auxiliary storage, we use the term tree automaton for an automaton that employs a tree auxiliary storage. Some authors use the term for automata that receive a hierarchical tree rather than a string as input. In our vocabulary, the distinction is found in the language-recognizer vs. forest-recognizer properties of the “recognizer kind” characteristic.

The final column of Table 2.2 also specifies the computational complexity class of the automaton defined in the respective row. In certain cases, this class is a set of formal languages found in the Chomsky hierarchy. From the first two rows of the table we learn that even though DFSAs are less potent than FSAs, they are able to recognize exactly the same set of formal languages, namely the set of regular languages denoted REG. By writing, e.g., DFSA = FSA = REG, we employ the convention of identifying an automaton in the lattice by its computational complexity class. We notice that a less potent automaton is not necessarily computationally weaker.
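The equality DFSA = FSA rests on the classical subset construction, which the text does not spell out. The following Python sketch, with a hypothetical automaton and hypothetical names (`nfa_accepts`, `DELTA`), simulates a non-deterministic, real-time FSA deterministically by tracking the set of states reachable so far; it is an illustration of the standard technique, not code from the paper.

```python
def nfa_accepts(delta, start, accepting, word):
    """Simulate a non-deterministic FSA by tracking the set of reachable states.

    delta maps (state, letter) to a set of successor states; a missing entry
    means that the corresponding run hangs.
    """
    current = {start}
    for letter in word:
        current = {q2 for q in current for q2 in delta.get((q, letter), ())}
    # The word is accepted if at least one run ends in an accepting state.
    return bool(current & accepting)


# Hypothetical FSA over {a, b}: accepts words whose second-to-last letter is 'a'.
DELTA = {
    ("s", "a"): {"s", "p"},   # guess that this 'a' is the second-to-last letter
    ("s", "b"): {"s"},
    ("p", "a"): {"f"},
    ("p", "b"): {"f"},
}
```

Each reachable set plays the role of one state of an equivalent DFSA, which is why the extra potence of non-determinism does not enlarge the class of recognized languages in this case.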

As usual, let Σ be a finite alphabet, and let Σ∗ denote the set of all strings (also called words) over Σ, including 𝜀, the empty string. A (formal) language ℓ is a (typically infinite) set of such strings, i.e., ℓ ⊆ Σ∗.

Definition 2.1. A recognizer of language ℓ ⊆ Σ∗ is a device that takes as input a word 𝑤 ∈ Σ∗ and determines whether 𝑤 ∈ ℓ.

Let 𝐴 be a finite control automaton for language recognition. (Automata for recognizing forests are discussed below in Sect. 2.4.) Then, 𝐴 is specified by four finitely described components: states, storage, consuming transition function, and 𝜀-transition function:

(1) States. The specification of these includes (i) a finite set 𝑄 of internal states (or states), (ii) a designated initial state 𝑞₀ ∈ 𝑄, and, (iii) a set 𝐹 ⊆ 𝑄 of accepting states.

Definition 2.2. 𝐴 is stateful if |𝑄| > 1; it is stateless if |𝑄| = 1, in which case 𝐹 = 𝑄 = {𝑞₀}.

(2) Storage. Unlike internal state, the amount of data in auxiliary storage is input dependent, hence unbounded. The pieces of information that can be stored are specified as a finite alphabet Γ of storage symbols, which is not necessarily disjoint from Σ.

The organization of these symbols depends on the auxiliary-storage characteristic of 𝐴: In pushdown-store automata, Γ is known as the set of stack symbols, and the storage layout is sequential. In tree-store automata, the organization is hierarchical and Γ is a ranked alphabet. In tape automata, Γ is called the set of tape symbols, and they are laid out sequentially on a uni-directional tape.

Let 𝚪 denote the set of possible contents of the auxiliary storage. In pushdown automata 𝚪 = Γ∗; in tape automata, the storage contents include the position of the head: Specifically, in unbounded-tape-store (employed by Turing machines), 𝚪 = ℕ × Γ∗. We set 𝚪 = ℕ × Γ∗ also for the case of linearly bounded automata. For tree-store automata, 𝚪 = Γ△, where Γ△ is defined below as the set of trees whose internal nodes are drawn from Γ.

Definition 2.3. An Instantaneous Description (ID, often denoted 𝜄) of 𝐴 running on input word 𝑤 ∈ Σ∗ includes three components: (i) a string 𝑢 ∈ Σ∗, where 𝑢 is a suffix of 𝑤, specifying the remainder of input to read; (ii) the current state 𝑞 ∈ 𝑄, and, (iii) 𝜸 ∈ 𝚪, the current contents of the auxiliary storage.

The auxiliary storage is initialized by a designated value 𝜸₀ ∈ 𝚪. Any run of 𝐴 on input 𝑤 ∈ Σ∗ begins with ID 𝜄₀ = ⟨𝑤, 𝑞₀, 𝜸₀⟩, and then proceeds as dictated by the transition functions.

Definition 2.4. 𝐴 is no-store if |Γ| = 0, in which case 𝚪 is degenerate, 𝚪 = {𝜸₀}.

(3) Consuming transition function.

Denoted by 𝛿, this partial, possibly multi-valued function defines how 𝐴 proceeds from a certain ID to a subsequent ID in response to the consumption of a single input letter.
• Function 𝛿 depends on (i) 𝜎 ∈ Σ, the current input symbol, being the first letter of 𝑢, i.e., 𝑢 = 𝜎𝑢′, 𝑢′ ∈ Σ∗, (ii) 𝑞 ∈ 𝑄, the current state, and, (iii) 𝜸 ∈ 𝚪, the current contents of the auxiliary storage.
• Given these, 𝛿 returns a new internal state 𝑞′ ∈ 𝑄 and the new storage contents 𝜸′ for the subsequent ID. The “remaining input” component of the subsequent ID is set to 𝑢′.

(4) 𝜀-transition function. A partial, multi-valued function 𝜉 specifies how 𝐴 moves from a certain ID to a subsequent ID, without consuming any input. Function 𝜉 depends on the current state 𝑞 ∈ 𝑄 and 𝜸, the storage’s contents, but not on the current input symbol. Just like 𝛿, function 𝜉 returns a new internal state 𝑞′ ∈ 𝑄 and storage contents 𝜸′ for the subsequent ID. However, the remaining input component of IDs is unchanged by 𝜉.

Automaton 𝐴 accepts 𝑤 if there exists a run 𝜄₀, 𝜄₁, …, 𝜄ₘ that begins with the initial ID 𝜄₀ = ⟨𝑤, 𝑞₀, 𝜸₀⟩ and ends with an ID 𝜄ₘ = ⟨𝜀, 𝑞, 𝛼⟩ in which all the input was consumed, the internal state 𝑞 is accepting, i.e., 𝑞 ∈ 𝐹, and no further 𝜀-transitions are possible, i.e., 𝜉(𝜄ₘ) is not defined.

On each input letter, automaton 𝐴 carries one transition defined by 𝛿, followed by any number of 𝜀-transitions defined by 𝜉, including none at all. A real-time automaton is one which carries precisely one transition for each input symbol.

Definition 2.5. 𝐴 is real-time if there is no ID 𝜄 for which 𝜉(𝜄) is defined.

Real-time and non-real-time automata are, by the above definitions, non-deterministic. Since both 𝛿 and 𝜉 are multi-valued, an ID does not uniquely determine the subsequent ID.

Definition 2.6. 𝐴 is deterministic if (i) partial functions 𝜉 and 𝛿 are single valued, and, (ii) there is no ID 𝜄 for which both 𝜉(𝜄) and 𝛿(𝜄) are defined.

Both deterministic and non-deterministic automata may hang, i.e., they might reach an ID 𝜄 for which neither 𝜉(𝜄) nor 𝛿(𝜄) is defined. If all runs of a non-deterministic automaton 𝐴 on a given input 𝑤 either hang or reach a non-accepting state, 𝐴 rejects 𝑤. Alternatively, if the only run of a deterministic automaton 𝐴 on 𝑤 hangs, automaton 𝐴 rejects 𝑤. Hanging is the only way a stateless automaton can reject. A stateful automaton rejects also in the case it reaches a non-accepting state 𝑞 ∈ 𝑄 \ 𝐹 after consuming all input.

Since functions 𝛿 and 𝜉 are finitely described, they are specified as two finite sets, Δ and Ξ, of input-to-output items, e.g., the requirement in Def. 2.5 can be written as |Ξ| = 0. Since the transformation of auxiliary storage 𝜸 to 𝜸′ by these functions must be finitely described, only a bounded portion of 𝜸 can be examined by 𝐴. The transformation 𝜸 to 𝜸′, which we call rewrite of auxiliary storage, must be finitely described in terms of this portion.

Tape initialization, rewrite, and head overflow.

The literature often defines tape automata with no consuming transitions, by making the assumption that they receive their input on the tape store which allows bidirectional movements. Our lattice 𝔄 specifies that the input word 𝑤 is consumed one letter at a time. No generality is lost, since with the following definitions tape automaton 𝐴 ∈ 𝔄 may begin its run by consuming the input while copying it to the tape, and only then process it with as many 𝜀-transitions as necessary.

The contents 𝜸 of tape auxiliary storage is a pair (ℎ, 𝛾₀𝛾₁⋯𝛾ₘ₋₁), where integer ℎ ≥ 0 is the position of the head and 𝛾₀𝛾₁⋯𝛾ₘ₋₁ ∈ Γ∗ is the tape’s content. Let 𝜸₀ = (0, 𝜀), i.e., the tape is initially empty and the head is at location 0. Rewrites of tape are the standard read and replace of symbol-under-head, along with the move-left and move-right instructions to the head: Tape rewrite 𝛾 → 𝛾′+ (respectively, tape rewrite 𝛾 → 𝛾′−) means that if 𝛾ₕ = 𝛾 then replace it with the, not necessarily distinct, symbol 𝛾′ ∈ Γ and increment (respectively, decrement) ℎ. A third kind of rewrite is ⊥ → 𝛾, which means that if the current cell is undefined, i.e., ℎ ∉ {0, …, 𝑚 − 1}, replace it with 𝛾 ∈ Γ.

The automaton hangs if ℎ becomes negative, or if ℎ exceeds 𝑛, the input’s length, in the case of a linear bounded automaton.

Rewrites of a pushdown. Rewrites of a pushdown auxiliary storage are the usual push and pop operations; we will see that these can be regarded as tree rewrites.
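As an illustration of the definitions of Sect. 2.2 specialized to pushdown storage, here is a minimal Python sketch of a run of a stateful, real-time, deterministic automaton whose rewrites inspect only the top stack symbol. The function `run_rdpda`, the transition table `DELTA`, and the example language {aⁿbⁿ : n ≥ 1} are hypothetical choices made for this sketch and do not come from the paper.

```python
def run_rdpda(delta, start, accepting, word):
    """Run a real-time deterministic pushdown automaton on word.

    delta maps (letter, state, top) to (new_state, replacement), where top is
    the top stack symbol (None for an empty stack) and replacement is the list
    of symbols that replaces it, listed bottom first.
    """
    state, stack = start, []          # the stack top is the last list element
    for letter in word:
        top = stack[-1] if stack else None
        item = delta.get((letter, state, top))
        if item is None:
            return False              # no applicable item: the automaton hangs
        state, replacement = item
        if top is not None:
            stack.pop()               # pop the inspected symbol ...
        stack.extend(replacement)     # ... and push its replacement
    # All input consumed: accept if and only if the state is accepting.
    return state in accepting


# Hypothetical automaton for { a^n b^n : n >= 1 }. Symbol Z marks the first
# 'a', so that the matching last 'b' can be recognized without epsilon-moves.
DELTA = {
    ("a", "q0", None): ("q0", ["Z"]),
    ("a", "q0", "Z"): ("q0", ["Z", "A"]),
    ("a", "q0", "A"): ("q0", ["A", "A"]),
    ("b", "q0", "A"): ("q1", []),
    ("b", "q0", "Z"): ("qf", []),
    ("b", "q1", "A"): ("q1", []),
    ("b", "q1", "Z"): ("qf", []),
}
```

Rejection happens in the two ways described in the text: the run hangs when no item applies (as on input `abb`), or it ends in a non-accepting state (as on input `aab`).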

Trees. A finite alphabet Γ is a signature if each 𝛾 ∈ Γ is associated with an integer 𝑟 = 𝑟(𝛾) ≥ 1 (its arity, or rank). A tree over Γ is either a leaf, denoted by the symbol 𝜺, or a (finite) structure in the form 𝛾(𝒕), where 𝛾 ∈ Γ of arity 𝑟 is the root and 𝒕 = 𝑡₁, …, 𝑡ᵣ is a multi-tree, i.e., a sequence of 𝑟 (inductively constructed) trees over Γ. Let Γ△ denote the set of trees over Γ.

Let Depth(𝑡) be the depth of tree 𝑡 (for leaves let Depth(𝜺) = 0). We employ a monadic tree abbreviation by which tree 𝛾₁(𝛾₂(𝜺), 𝛾₃(𝛾₄(𝜺))) is written as 𝛾₁(𝛾₂, 𝛾₃𝛾₄), and tree 𝛾₁(𝛾₂(⋯(𝛾ₙ(𝜺)))) is written as 𝛾₁𝛾₂⋯𝛾ₙ. If the rank of all 𝛾 ∈ Γ is 1, then Γ△ is essentially the set Γ∗, and every tree 𝑡 ∈ Γ△ can be viewed as a stack whose top is the root of 𝑡 and whose depth is Depth(𝑡).

In this perspective, a pushdown automaton is a tree automaton in which the auxiliary tree is monadic. We set 𝜸₀ in tree automata to the leaf 𝜺 ∈ Γ△, i.e., in the special case of a pushdown automaton, the run starts with an empty stack.

Terms. Let 𝑋 = {𝑥₁, 𝑥₂, …} be an unbounded set of variables disjoint from all alphabets. Then, a pattern (also called term) over Γ is either some variable 𝑥 ∈ 𝑋 or a structure in the form 𝛾(𝜏₁, …, 𝜏ᵣ), where the arity of 𝛾 ∈ Γ is 𝑟 and each of 𝜏₁, …, 𝜏ᵣ is, recursively, a term over Γ. The set of terms over Γ strictly contains Γ△, i.e., all trees are terms. Trees are also called grounded terms; ungrounded terms are the terms that are not trees. A term is linear if no 𝑥 ∈ 𝑋 occurs in it more than once, e.g., 𝛾(𝑥, 𝑥) is not linear while 𝛾(𝑥₁, 𝛾(𝑥₂, 𝑥₃)) is linear.

Terms match trees. Atomic term 𝑥 matches all trees in Γ△; a compound linear term 𝜏 = 𝛾(𝜏₁, …, 𝜏ᵣ) matches tree 𝑡 = 𝛾(𝑡₁, …, 𝑡ᵣ) if for all 𝑖 = 1, …, 𝑟, 𝜏ᵢ recursively matches 𝑡ᵢ, e.g., 𝛾(𝑥₁, 𝛾(𝑥₂, 𝑥₃)) matches 𝛾(𝜺, 𝛾(𝜺, 𝛾(𝜺, 𝜺))). To define matching of non-linear terms, define a tree substitution 𝑠 (substitution for short) as a mapping of variables to terms, 𝑠 = {𝑥₁ → 𝜏₁, …, 𝑥ᵣ → 𝜏ᵣ}. Substitution 𝑠 is grounded if all terms 𝜏₁, …, 𝜏ᵣ are grounded. An application of substitution 𝑠 to term 𝜏, denoted 𝜏/𝑠, replaces each variable 𝑥ᵢ with term 𝜏ᵢ if and only if 𝑥ᵢ → 𝜏ᵢ ∈ 𝑠. The notation 𝜏′ ⊑ 𝜏 is to say that term 𝜏 matches a term 𝜏′, which happens if there exists a substitution 𝑠 such that 𝜏′ = 𝜏/𝑠.

Tree rewrites. A tree rewrite rule 𝜌 (rewrite for short) is a pair of two terms written as 𝜌 = 𝜏₁ → 𝜏₂. Rewrite 𝜌 is applicable to (typically grounded) term 𝜏′ if 𝜏′ = 𝜏₁/𝑠 for some substitution 𝑠. If rewrite 𝜌 matches term 𝜏′ then 𝜏′/𝜌, the application of 𝜌 to 𝜏′ (also written 𝜏′/𝜏₁ → 𝜏₂), yields the term 𝜏₂/𝑠.

The definition of rewrites does not exclude a rewrite such as 𝛾₁(𝑥₁) → 𝛾₂(𝑥₁, 𝛾₃(𝑥₂)), whose right-hand-side term introduces variables that do not occur in the left-hand-side term. Applying such a rewrite to a tree will always convert it to an ungrounded term. Since the primary intention of rewrites is the manipulation of trees, we tacitly assume here and henceforth that this is never the case; a rewrite 𝜏₁ → 𝜏₂ is valid only if Vars(𝜏₂) ⊆ Vars(𝜏₁).

Manipulation of tree and pushdown auxiliary storage is defined with rewrites. For example, the rewrite 𝛾₁(𝛾₂(𝑥)) → 𝛾₃(𝑥), or in abbreviated form 𝛾₁𝛾₂𝑥 → 𝛾₃𝑥, is, in terms of stack operations: if the top of the stack is symbol 𝛾₁ followed by symbol 𝛾₂, then pop these two symbols and then push symbol 𝛾₃ onto the stack.
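Matching, substitution, and rewrite application as defined above can be sketched in a few lines of Python. The encoding is an assumption made for this sketch only: a variable is a Python string, a compound term is a tuple `(symbol, child, ...)`, and the leaf 𝜺 is the one-element tuple `LEAF`; the function names `match`, `substitute`, and `rewrite` are hypothetical.

```python
LEAF = ("eps",)   # assumed encoding of the leaf; variables are plain strings


def match(term, tree, subst=None):
    """Return a substitution s such that term/s == tree, or None if none exists."""
    subst = {} if subst is None else subst
    if isinstance(term, str):                        # a variable
        if term in subst and subst[term] != tree:
            return None                              # non-linear term, unequal subtrees
        subst[term] = tree
        return subst
    if isinstance(tree, str) or term[0] != tree[0] or len(term) != len(tree):
        return None                                  # different root symbol or arity
    for sub_term, sub_tree in zip(term[1:], tree[1:]):
        if match(sub_term, sub_tree, subst) is None:
            return None
    return subst


def substitute(term, subst):
    """Apply substitution subst to term, i.e., compute term/subst."""
    if isinstance(term, str):
        return subst[term]    # a valid rewrite never has an unbound variable here
    return (term[0],) + tuple(substitute(child, subst) for child in term[1:])


def rewrite(rule, tree):
    """Apply rule = (lhs, rhs) at the root of tree; None if it is not applicable."""
    lhs, rhs = rule
    subst = match(lhs, tree)
    return None if subst is None else substitute(rhs, subst)


# The stack rewrite g1 g2 x -> g3 x: pop g1 and g2, then push g3.
POP2_PUSH1 = (("g1", ("g2", "x")), ("g3", "x"))
```

For instance, applying `POP2_PUSH1` to the stack `("g1", ("g2", ("g4", LEAF)))` yields `("g3", ("g4", LEAF))`, while the non-linear term `("g", "x", "x")` matches only trees whose two subtrees are equal.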
With these definitions:
• Each member of set Δ is in the form ⟨𝜎, 𝑞, 𝜌, 𝑞′⟩ meaning: if the current input symbol is 𝜎, the current state is 𝑞 and auxiliary storage 𝑡 matches 𝜌, then, consume 𝜎, move to state 𝑞′ and set the storage to 𝑡/𝜌.
• Each member of set Ξ is in the form ⟨𝑞, 𝜌, 𝑞′⟩ meaning: if the current state is 𝑞 and auxiliary storage 𝑡 matches 𝜌, then, move to state 𝑞′ and set the storage to 𝑡/𝜌.

A tree rewrite 𝜌 = 𝜏₁ → 𝜏₂ is linear if 𝜏₁ is linear, e.g., rewrites 𝛾(𝑥) → 𝛾′(𝑥, 𝑥, 𝑥) and 𝛾₁(𝑥₁, 𝑥₂) → 𝛾₂(𝛾₃(𝑥₁, 𝑥₂), 𝜺) are linear, but 𝛾(𝑥, 𝑥) → 𝜺 is not. Notice that rewrites of tape and pushdown auxiliary storage are linear: the transition functions of these never depend on the equality of two tape or pushdown symbols.

Definition 2.7. 𝐴 is linear-rewrite if all rewrites in Ξ and Δ are linear.

Let Depth(𝜌), 𝜌 = 𝜏₁ → 𝜏₂, be Depth(𝜏₁), where the depth of terms is defined like tree depth, a variable 𝑥 ∈ 𝑋 being considered a leaf. A term (rewrite) is shallow if its depth is at most one, e.g., 𝑥, 𝛾(𝑥), and 𝛾(𝑥, 𝑥) are shallow, while 𝛾(𝛾(𝑥)) is not. Rewrites of tape storage are shallow by definition, since only the symbol under the head is inspected.

Definition 2.8. 𝐴 is shallow-rewrite if all rewrites in Ξ and Δ are shallow.

In the case that the set of input symbols Σ is a signature rather than a plain alphabet, the input to a finite control automaton is a tree 𝑡 ∈ Σ△ rather than a plain word. We use the term forest for what some call tree language, i.e., a (typically infinite) set of trees. Generalizing Def. 2.1 we define:

Definition 2.9. A recognizer of forest £ ⊆ Σ△ is a device that takes as input a tree 𝑡 ∈ Σ△ and determines whether 𝑡 ∈ £.

As explained in Sect. 2.2 a language-recognizer automaton scans the input left-to-right. However, this order is not mandatory, and there is no essential difference between left-to-right and right-to-left automata.
This symmetry does not necessarily apply to a forest-recognizer automaton: there is much research work on comparing and differentiating bottom-up and top-down traversal strategies of finite control automata (e.g., Coquidé et al. [1994] focus on bottom-up automata, Guessarian [1983] on top-down, while Comon et al. [2007] present several cases in which the two traversal strategies are equivalent).

Our interest in parametrically polymorphic type systems sets the focus here on the bottom-up traversal strategy only. Most of the description of language-recognizer automata above in Sect. 2.2 remains unchanged. The state and storage specifications are the same in the two kinds of recognizers, as are the definitions of deterministic and real-time automata. Even the specification of 𝜉, the 𝜀-transition function, is the same, since the automaton does not change its position on the input tree during an 𝜀-transition.

However, input consumption in forest recognizers is different from that in language recognizers, and can be thought of as visitation. A bottom-up forest recognizer consumes an input tree node labeled 𝜎 of rank 𝑟 by visiting it after its 𝑟 children were visited. Let 𝑞₁, 𝑞₂, …, 𝑞ᵣ be the states of the automaton in the visits to these children, and let 𝒒 be the multi-state of the 𝑟 children, i.e., 𝒒 = 𝑞₁, 𝑞₂, …, 𝑞ᵣ. Then, the definition of 𝛿 is modified by letting it depend on multi-state 𝒒 ∈ 𝑄ʳ rather than on a single state 𝑞 ∈ 𝑄. More precisely, each input-to-output item in Δ takes the form ⟨𝜎, 𝒒, 𝜌, 𝑞′⟩, meaning: if (i) the automaton is in a node labeled 𝜎, (ii) it has reached states 𝑞₁, 𝑞₂, …, 𝑞ᵣ in the 𝑟 children of this node, and (iii) storage rewrite rule 𝜌 is applicable, then select state 𝑞′ for the current node and apply rewrite 𝜌.

Consider 𝜌, the rewrite component of an input-output item. As it turns out, only tree auxiliary storage makes sense for bottom-up forest recognizers (in top-down forest recognizers pushdown auxiliary storage is also admissible). Let 𝑡₁, …, 𝑡ᵣ be the trees representing the contents of auxiliary storage in the 𝑟 children of the current node. Rewrite rule 𝜌 should produce a new tree 𝑡 of unbounded size from a finite inspection of the 𝑟 trees, whose size is also unbounded. We say that 𝜌 is a many-input tree rewrite rule (for short, rewrite when the context is clear) if it is of the form 𝜌 = 𝜏₁, …, 𝜏ᵣ → 𝜏′. Rule 𝜌 = 𝜏₁, …, 𝜏ᵣ → 𝜏′ is applied to all children, with the straightforward generalization of the notions of matching and applicability of a single-input rewrite: a multi-term 𝝉 is a sequence of terms 𝝉 = 𝜏₁, …, 𝜏ᵣ, and a multi-tree 𝒕 is a sequence of trees, 𝒕 = 𝑡₁, …, 𝑡ᵣ. Then, rule 𝜌 = 𝝉 → 𝜏′ applies to (also, matches) 𝒕 if there is a single substitution 𝑠 such that 𝜏ᵢ/𝑠 = 𝑡ᵢ for all 𝑖 = 1, …, 𝑟. The application of 𝜌 to 𝒕 is 𝜏′/𝑠.

This section offers a unifying framework for parametrically polymorphic type systems. Definitions reuse notations and symbols introduced in Sect. 2 in the definition of automata, but with different meaning. For example, the Greek letter 𝜎 above denoted an input letter, but will be used here to denote the name of a function defined in a certain type system. This, and all other cases of overloading of notation, are intentional, with the purpose of highlighting the correspondence between the two unifying frameworks.

Characteristic                       Described in
𝐶₁  Number of type arguments         Sect. 3.1.2
𝐶₂  Type pattern depth               Sect. 3.1.3
𝐶₃  Type pattern multiplicity        Sect. 3.1.4
𝐶₄  Arity of functions               Sect. 3.1.5
𝐶₅  Type capturing                   Sect. 3.1.6
𝐶₆  Overloading                      Sect. 3.1.7

Table 3.1. Six characteristics and 17 properties spanning lattice 𝔗 of parametrically polymorphic type systems.

Examine Table 3.1 describing 𝔗, the lattice (Boolean algebra) of parametrically polymorphic type systems. This table is the equivalent of Table 2.1 depicting 𝔄, the lattice of finitely controlled automata. We use the terms potence, characteristics, and properties as before, and likewise the conventions for writing lattice points and the use of abbreviations.

Table 3.1 gives the means for economic specification of different variations of parametrically polymorphic type systems. For example, inspecting Yamazaki et al.'s work we see that the type system of the Fluent intermediate language is

Fluent = ⟨ monadic-parametric-polymorphism,deep-type-pattern,rudimentary ⟩ , (3.1)i.e., (i) it allows only one parameter generics, e.g., interface 𝛾 𝛾 𝛾 (ii) it allows generic functions to be defined for deeply nested generic parameter type, such as static 𝛾 𝛾 𝛾 𝛾 𝛾 and, (iii) it allows in the definition of function return type, a typeof clause, but restricted to use onlyone function invocation, e.g., static typeof(f(e)) g( 𝛾 In contrast, the type system used by, e.g., G&R, is simplyPp p = ⟨ polyadic-parametric-polymorphism ⟩ . (3.2)The remainder of this section describes in detail the characteristics in Table 3.1. Type system ⟨⟩ , the bottom of 𝔗 , also denoted 𝔗 ⊥ models objectbased programming paradigm, i.e., a paradigm with objects and classes, but without inheritance norparametric polymorphism. A good approximation of the paradigm is found in the Go programminglanguage [Donovan and Kernighan 2015]. The essence of 𝔗 ⊥ is demonstrated in this (pseudo Javasyntax) example: interface A { B a(); void b(); } interface B { B b(); A a(); } new A().a().b().b().a().b(); For concreteness we exemplify abstract syntax with the concrete syntax of Java or C++. Code rendered in distinctive color as in abuses the host language syntax for the purpose of illustration.10 ies between Type Systems and Automata

The example shows (i) definitions of two classes, A and B, (ii) methods in different classes that have the same name but different return types, and (iii) an expression whose type correctness depends on these definitions.

Fig. 3.1 presents the abstract syntax, notational conventions and typing rules of 𝔗⊥. The subsequent description of type systems in 𝔗 is by additions and modifications to the figure.

Fig. 3.1

The bottom of lattice 𝔗: the type system ⟨⟩ modeling the object-based paradigm

(a) Abstract syntax
  𝑃 ::= Δ 𝑒
  Δ ::= 𝛿*
  𝛿 ::= 𝜎 : 𝛾 → 𝛾′
    ::= 𝜎 : 𝜺 → 𝛾′
  𝑒 ::= 𝜀 | 𝑒.𝜎 | 𝜎(𝑒)

(b) Typing rules
  Function Application:
    𝑒 : 𝑡      𝜎 : 𝑡 → 𝑡′
    ----------------------
    𝑒.𝜎 : 𝑡′

  One Type Only:
    𝑒 : 𝑡₁     𝑒 : 𝑡₂     𝑡₁ ≠ 𝑡₂
    ------------------------------
    𝑒 : ⊥

(c) Variables and notations
  𝑃              Program
  𝑒              Expression
  Δ              Set of function definitions
  𝛿              A function definition
  𝜎              Function name, drawn from alphabet Σ
  𝛾              Class names, drawn from alphabet Γ disjoint to Σ
  𝑡, 𝑡′, 𝑡₁, 𝑡₂   Grounded (non-generic) types
  𝜺              The unit type
  𝜀              The single value of the unit type
  ⊥              The error type
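The two typing rules of Fig. 3.1 can be sketched as a table lookup, which also hints at why 𝔗⊥ behaves like a finite state automaton whose states are class names. The following Java sketch is our own illustration: the class `BottomChecker`, the strings `"unit"` and `"error"`, the function name `newA`, and the treatment of `void` as an ordinary class name are all assumptions made for the example, not part of the paper.

```java
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/** A sketch of type checking in the bottom type system: one table lookup per call. */
final class BottomChecker {
    static final String UNIT = "unit";   // stands for the unit type
    static final String ERROR = "error"; // stands for the error type

    // Maps "argumentType.functionName" to the return type, i.e., sigma : gamma -> gamma'.
    private final Map<String, String> delta = new HashMap<>();

    /** Adds sigma : from -> to; a second, different return type is recorded as an error. */
    BottomChecker define(String sigma, String from, String to) {
        String old = delta.put(from + "." + sigma, to);
        if (old != null && !old.equals(to)) delta.put(from + "." + sigma, ERROR);
        return this;
    }

    /** Types the fluent chain epsilon.s1.s2...sn by repeated Function Application. */
    String typeOf(List<String> calls) {
        String t = UNIT;
        for (String sigma : calls) {
            t = delta.getOrDefault(t + "." + sigma, ERROR);
            if (t.equals(ERROR)) return ERROR;
        }
        return t;
    }
}

final class BottomDemo {
    public static void main(String[] args) {
        // Models: interface A { B a(); void b(); } interface B { B b(); A a(); }
        BottomChecker c = new BottomChecker()
            .define("newA", BottomChecker.UNIT, "A")
            .define("a", "A", "B").define("b", "A", "void")
            .define("b", "B", "B").define("a", "B", "A");
        // new A().a().b().b().a().b()
        System.out.println(c.typeOf(List.of("newA", "a", "b", "b", "a", "b")));
        // new A().b().a() fails: no function a is defined on void
        System.out.println(c.typeOf(List.of("newA", "b", "a")));
    }
}
```

The current type plays the role of the automaton state, and each function name plays the role of an input letter.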

A type in 𝔗⊥ is either drawn from Γ, or is the designated bottom type 𝜺. The atomic expression, bootstrapping expression 𝑒, is denoted by 𝜀, and its type is 𝜺.

The figure defines program 𝑃 in ⟨⟩ as a set Δ of function definitions 𝛿 followed by an expression 𝑒 to type check. For 𝜎 drawn from set Σ of function names, and type names 𝛾₁, 𝛾₂ drawn from set Γ of class names, we can think of a function definition of the form 𝜎 : 𝛾₁ → 𝛾₂ as either
• a method named 𝜎 in class 𝛾₁ taking no parameters and returning class 𝛾₂, or,
• an external function taking a single parameter of type 𝛾₁, and returning a value of type 𝛾₂.
With the first perspective, the recursive description of expressions is the Polish convention, 𝑒 ::= 𝑒.𝜎, best suited for making APIs fluent. With the latter perspective, this recursive definition should be made in prefix notation, i.e., 𝑒 ::= 𝜎(𝑒). Fig. 3.1 uses both variants, and we will use these interchangeably. Indeed, the distinction between methods and functions is in our perspective only a syntactical matter. (We also ignore the somewhat idiosyncratic distinction between classes and interfaces.)

The special case of a function taking the unit type as argument, 𝜎 : 𝜺 → 𝛾, can be thought of as an instantiation of the return type, new 𝛾. The function name, 𝜎, is not essential in this case, but is kept for consistency. Also in the figure is the standard Function Application typing rule. Overloading on the parameter type is intentionally allowed, i.e., methods defined in different classes may use the same name. The One Type Only rule excludes overloading based on the return type.

Let Pp p be short for lattice point ⟨polyadic-parametric-polymorphism⟩, as demonstrated in List. 1.1 above. Pp p is the type system behind LINQ (https://docs.microsoft.com/en-us/dotnet/api/system.linq), the first theoretical treatise of fluent API [Gil and Levy 2016], Fling and other fluent API generators, e.g., of [Xu 2010] and [Nakamaru et al. 2017].

The definition of Pp p relies on the definitions of trees, terms and rewrites in Sect. 2.3. Notice that in 𝔗⊥, types were drawn from set Γ. In allowing generic types the type repertoire is extended to Γ△, the set of trees over signature Γ. A type 𝛾 ∈ Γ of rank 𝑟 ≥ 1 is a generic type with 𝑟 type parameters; the only leaf, of rank 0, is the unit type 𝜺. Pp p also admits "terms", i.e., trees including formal variables drawn from the set Γ △ . We refer to terms of Pp p as "ungrounded types"; an ungrounded type is also viewed in Pp p as a type pattern that typically matches "grounded types" (trees in Γ△), but can also be used for matching over ungrounded types.

Fig. 3.2 summarizes the changes in Pp p's definitions with respect to those of 𝔗⊥ in Fig. 3.1.

Fig. 3.2

The type system Pp p

(a) Abstract syntax (same as Fig. 3.1 (a) and...)
  𝛿 ::= 𝜎 : 𝛾(𝒙) → 𝜏      [term 𝛾(𝒙) is linear]
    ::= 𝜎 : 𝜺 → 𝑡
  𝒙 ::= 𝑥₁, …, 𝑥ᵣ
  𝜏 ::= 𝛾(𝝉) | 𝑥 | 𝑡
  𝝉 ::= 𝜏₁, …, 𝜏ᵣ
  𝑡 ::= 𝛾(𝒕) | 𝜺
  𝒕 ::= 𝑡₁, …, 𝑡ᵣ

(b) Typing rules (same as Fig. 3.1 (b) and...)
  Generic Function Application:
    𝑒 : 𝑡      𝜎 : 𝜏 → 𝜏′      𝑡 = 𝜏/𝑠
    ----------------------------------
    𝑒.𝜎 : 𝜏′/𝑠

(c) Variables and notations (same as Fig. 3.1 (c) and...)
  𝜏, 𝜏′   Type patterns, drawn from Γ △
  𝝉       Multi-pattern, i.e., a sequence of type patterns 𝜏
  𝑥       Type variables, drawn from set 𝑋 disjoint to all alphabets
  𝒙       Multi-variable, i.e., a sequence of type variables
  𝑠       Tree substitution

The main addition of Pp p to 𝔗⊥ is allowing function definition 𝛿 to take also the form 𝜎 : 𝛾(𝒙) → 𝜏, where 𝒙 = 𝑥₁, …, 𝑥ᵣ is a sequence of 𝑟 distinct type variables:
• The single parameter to functions is a multi-variable, yet shallow and linear, type pattern 𝛾(𝒙). This requirement models the definition of methods in List. 1.1, i.e., in generic classes with 𝑟 independent type variables. The structure of this pattern implicitly models the Java/C
• Also, as demonstrated by List. 1.1, 𝜏, the return type of a function in this form, is a type pattern of any depth constructed from the variables that occur in 𝒙 but also from any other types in Γ.
The figure also shows how the Function Application typing rule is generalized by employing the notions of matching and tree substitution from Sect. 2.3.

The definition of a dyadic-parametric-polymorphism type system adds to Fig. 3.2 the requirement that 𝑟(𝛾) ≤ 2. In monadic-parametric-polymorphism, used for fluent API generation by Nakamaru et al. [2017] and Yamazaki et al. [2019], the requirement becomes 𝑟(𝛾) = 1, with 𝑡 ::= 𝛾(𝑡) instead of 𝑡 ::= 𝛾(𝒕), 𝜏 ::= 𝛾(𝜏) instead of 𝜏 ::= 𝛾(𝝉), and 𝛿 ::= 𝜎 : 𝛾(𝑥) → 𝜏 instead of 𝛿 ::= 𝜎 : 𝛾(𝒙) → 𝜏.

Java, C f defined by static 𝛾 𝛾 𝛾 𝛾 is applicable only if the type of its single argument matches the deep type pattern 𝛾(𝑥₁, 𝛾(𝑥₂, 𝑥₃)). The corresponding lattice property is obtained by adding to Fig. 3.2 the derivation rule

  𝛿 ::= 𝜎 : 𝜏 → 𝜏′      [term 𝜏 is linear]      (3.3)

along with the requirement that 𝜏 is linear.

As we shall see, the deep-type-pattern property increases the expressive power of Pp p. However, the syntax of invoking generic, non-method functions in contemporary languages breaks the elegance of fluent API: using functions instead of methods, (1.1) takes the more awkward form

  end(a(a(b(a(new Begin())))))      (3.4)

The syntactic overhead of the above "reverse fluent API" can be lessened with a change to the host language; the case for making the change can be made by sorting out the expressive power added by the deep property.

Recall the abstract syntax rule of 𝛿 in type system Pp p (Fig. 3.2),

  𝛿 ::= 𝜎 : 𝛾(𝒙) → 𝜏      [term 𝛾(𝒙) is linear]      (3.5)

The deep-type-pattern property generalized this abstract syntax rule by allowing functions whose argument type is not restricted to the flat form 𝛾(𝒙). Another, orthogonal dimension in which (3.5) can be generalized is by removing the constraint that "term 𝛾(𝒙) is linear", i.e., allowing non-linear type patterns. Such patterns make it possible to define function 𝜎 : 𝛾(𝑥, 𝑥) → 𝑥 that type checks with expression parameter 𝑒 : 𝛾(𝑡₁, 𝑡₂) if and only if 𝑡₁ = 𝑡₂. Noticing that 𝑡₁ and 𝑡₂ are trees whose size is unbounded, and may even be exponential in the size of the program, we understand why the term non-linear was coined.
Non-linear type patterns may coerce the type-checker into doing a non-linear amount of work, e.g., the little Java program in List. 3.1 brings the Eclipse IDE and its command line compiler ecj to their knees.

Listing 3.1

Java program in type system 𝑆 = ⟨n-ary,deep,non-linear⟩ requiring over five minutes of compilation time by ecj executing on contemporary hardware

  class S2 {
    interface 𝜖 {}
    interface C<x1, x2> { C<C<x1, x2>, C<x1, x2>> f(); }
    C<𝜖, 𝜖> f() { return null; }
    <x> void 𝛾(x e1, x e2) {}
    {
      𝛾(f().f().f().f().f().f().f().f().f().f().f().f().f().f().f().f().f().f().f().f().f().f().f().f().f().f().f().f().f().f().f().f(),
        f().f().f().f().f().f().f().f().f().f().f().f().f().f().f().f().f().f().f().f().f().f().f().f().f().f().f().f().f().f().f().f());
    }
  }

Type system non-linear-Pp p is defined by replacing (3.5) by its relaxed version,

  𝛿 ::= 𝜎 : 𝛾(𝒙) → 𝜏      [term 𝛾(𝒙) may be non-linear]      (3.6)

Likewise, type system ⟨deep,non-linear⟩ ∨ Pp p is obtained by replacing (3.3) by the relaxed version,

  𝛿 ::= 𝜎 : 𝜏 → 𝜏′      [term 𝜏 may be non-linear]      (3.7)

Yet a third orthogonal dimension of generalizing (3.5) is the number of arguments; so far, 𝜎 was thought of as a unary function, i.e., either as a nullary method that takes no explicit parameters, or a generic unary, non-method function. The n-ary-functions property of polymorphic type systems allows binary, ternary, and in general 𝑛-ary functions, 𝑛 ≥ 1. The details are in Fig. 3.3.

Fig. 3.3

The type system ⟨n-ary-functions,deep⟩

(a) Abstract syntax (same as Fig. 3.2 (a) and...)
  𝛿 ::= 𝜎 : 𝝉 → 𝜏      [Vars(𝜏) ⊆ Vars(𝝉)]
  𝝉 ::= 𝜏₁ × 𝜏₂ × ⋯ × 𝜏ᵣ
  𝑒 ::= 𝜀 | 𝒆.𝜎 | 𝜎(𝒆)
  𝒆 ::= 𝑒₁, 𝑒₂, …, 𝑒ᵣ

(b) Typing rules (same as Fig. 3.2 (b) and...)
  Multiple Arguments:
    𝑒₁ : 𝑡₁, 𝑒₂ : 𝑡₂, …, 𝑒ᵣ : 𝑡ᵣ
    𝜎 : 𝜏₁ × 𝜏₂ × ⋯ × 𝜏ᵣ → 𝜏
    𝑡₁ = 𝜏₁/𝑠   𝑡₂ = 𝜏₂/𝑠   …   𝑡ᵣ = 𝜏ᵣ/𝑠
    ---------------------------------
    𝑒₁, 𝑒₂, …, 𝑒ᵣ.𝜎 : 𝜏/𝑠

(c) Variables and notations (same as Fig. 3.2 (c) and...)
  𝑒₁, 𝑒₂, …, 𝑒ᵣ   Expressions
  𝑡₁, 𝑡₂, …, 𝑡ᵣ   Grounded types
  𝜏₁, 𝜏₂, …, 𝜏ᵣ   Generic types
  𝒆               Multi-expression, i.e., a sequence of expressions 𝑒₁, …, 𝑒ᵣ

Comparing the figure to Fig. 3.2 above we notice the introduction of notation 𝒆 for a sequence of expressions. With this notation, a call to an 𝑛-ary function can be written 𝒆.𝜎 (Polish, fluent API like, convention) or as 𝜎(𝒆) (traditional convention). As might be expected, the figure also extends the function application typing rule to non-unary functions.

Note that languages embedded in n-ary-Pp p are no longer languages of words, but rather forests, i.e., languages of trees. Indeed, an expression in n-ary-Pp p is a tree of method calls, and the set Δ in an n-ary-Pp p program defines the set of tree-like expressions that type-check against it.

A primary motivation for introducing keyword decltype to C++ was streamlining the definition of wrapper functions, i.e., functions whose return type is the same as that of the wrapped function, e.g.,

  template<typename x> auto wrap(x e) -> decltype(wrapee(e)) { /* ··· */ auto $ = wrapee(e); /* ··· */ return $; }

As it turns out, keyword decltype dramatically changes the type system, by bringing about the undesired effect that type checking is undecidable. The predicament is due to the idiom of using the type of one function to declare the return type of another.
Alternative, seemingly weakertechniques for piecemeal definition of the return type, e.g., allowing typedef s in classes do notalleviate the problem. Likewise, the idiom is possible even with the seemingly weaker feature, ofallowing functions whose return type is auto , as in template auto wrap(x e){return wrappee(e);}

Note that neither Java nor C auto functions; it appears that the designers of the languagesmade a specific effort to block loopholes that permit piecemeal definition of functions’ return type.Fig. 3.4 presents abstract modeling of C++’s decltype ; for readability we use the more familiar typeof keyword. The figure describes n-ary-functions ; for unary-functions let 𝑛 = Fig. 3.4

Type system ⟨ full-typeof,deep,n-ary-functions ⟩ (same as Fig. 3.3 (a) and...) (same as Fig. 3.3 (b) and...) (same as Fig. 3.3 (c) and...) 𝑃 :: = Δ Ξ 𝑒 Ξ :: = 𝜉 ∗ 𝜉 :: = 𝜑 : 𝝉 → typeof 𝜗 :: = 𝜑 : 𝝉 → 𝜏 Vars ( 𝜗 )⊆ Vars ( 𝝉 ) 𝛿 :: = 𝜎 : 𝝉 → typeof 𝜗 Vars ( 𝜗 )⊆ Vars ( 𝝉 ) 𝜗 :: = 𝝑 .𝜑 | 𝝑 .𝛿 | 𝜏 𝝑 :: = 𝜗 , . . . , 𝜗 𝑟 (cid:18) TypeofExpression (cid:19) 𝑓 = 𝜎 or 𝑓 = 𝜑𝑓 : 𝜏 × · · · × 𝜏 𝑟 → typeof 𝜗𝑒 : 𝑡 · · · 𝑒 𝑟 : 𝑡 𝑟 𝑡 = 𝜏 / 𝑠 · · · 𝑡 𝑟 = 𝜏 𝑟 / 𝑠𝜗 / 𝑠 : 𝑡 𝑒 , . . . , 𝑒 𝑟 .𝑓 : 𝑡 The Multiple Arguments typing rule of Fig. 3.3 isalso generalized for auxiliary functions ( 𝜑 ). Ξ Set of auxiliary function def-initions, used only in typeof clause 𝜉 An auxiliary function definition 𝜑 Auxiliary function names,drawn from alphabet Φ disjointto Σ 𝜗 Pseudo expression, an ex-pression whose type is notgrounded 𝝑 Sequence of pseudo-expressions (a) Abstract syntax (b) Typing rules (c) Variables and notations

The figure uses two syntactical categories for defining functions: 𝛿 ∈ Δ , which as before,defines a function named 𝜎 ∈ Σ that may occur in expression 𝑒 (more generally 𝒆 ); the similarlystructured 𝜉 ∈ Ξ uses distinct namespace 𝜑 ∈ Φ is for functions that may occur in a typeof clause. Pseudo-expressions.

Compare 𝝉 → typeof 𝜗 (the format of a definition of function named 𝜎 in thefigure) with 𝝉 → 𝜏 (the format of this definition in n-ary-function type system (Fig. 3.3)). Withouttype capturing, 𝜎 ’s return type is determined by a tree rewrite of the argument type(s). With typecapturing, the return type is determined by subjecting type 𝜏 to other function(s). To see this,expand the recursive abstract syntax definition of 𝜗 , assuming for simplicity that 𝑛 = 𝛿 :: = 𝜎 : 𝝉 → typeof 𝜏 .𝜑 . · · · .𝜑 𝑟 , (3.8)i.e., the pseudo-expression 𝜗 in this case is 𝜗 = 𝜏 .𝜑 . · · · .𝜑 𝑟 . If 𝑛 > typeof is specified by hierarchical structure 𝜗 , for which the figure coins the term pseudo-expression . Notice that a plain expression is a tree whose leaves (type instantiations) aredrawn from Γ and internal nodes (function calls) are drawn from Σ . Pseudo expressions are moregeneral in allowing type variables in their leaves. As emphasized in the figure, these variables mustbe drawn from 𝝉 , the multi-pattern defining the types of arguments to 𝜎 .A full-typeof type system allows any number of function calls in pseudo-expression 𝜗 , as in(3.8). In contrast, a rudimentary-typeof type system allows at most one function symbol in pseudo-expressions. This restriction is obtained by replacing the abstract syntax rule for 𝜗 in Fig. 3.4 witha simpler, non-recursive variant, 𝜗 :: = 𝝉 .𝜎 | 𝜏 .To describe the semantics of typeof , we need to extend the notion of tree substitution to pseudo-expressions as well. The application of function 𝜎 of (3.8) to a multi-expression 𝒆 with multi-type 𝒕 ies between Type Systems and Automata requires first that 𝒕 ⊑ 𝝉 , where the matching uses a grounded substitution 𝑠 . 
Then, 𝜗 / 𝑠 , theapplication of 𝑠 to pseudo-expression 𝜗 is the plain-expression obtained by replacing the typevariables in 𝜗 with the ground types defined by 𝑠 .Typeof Expression typing rule employs this notion as follows: typing expression 𝒆 .𝜎 withfunction 𝜎 : 𝝉 → typeof 𝜗 and arguments 𝒆 : 𝒕 , we (i) match the argument types with the parametertypes, 𝒕 = 𝝉 / 𝑠 , deducing substitution 𝑠 , (ii) type 𝜗 / 𝑠 : 𝑡 (using an appropriate typing rule), andfinally (iii) type 𝒆 .𝜎 : 𝑡 . As an application of the Type of Expression rule requires an additionaltyping, of 𝜗 , its definition is recursive. The one-type property means that expressions must have exactly one type (asdefined in Fig. 3.1). With the more potent, multi-type property, expressions are allowed multipletypes, by disposing the One Type Only type inference rule of Fig. 3.1. With multi-type-overloading ,expressions are allowed multiple types. With eventually-one-type , the semantics of the Ada pro-gramming language [Persch et al. 1980] apply: Sub-expressions are allowed to have multiple types.However, upper level expressions are still required to be singly typed. For example, while the upperlevel expression 𝑒 = 𝜎 ( 𝜎 ( 𝜎 ())) can be assigned at most one type, both 𝜎 () and 𝜎 ( 𝜎 ()) may havemany types. The notation used in this section highlight ties between tree automata and type systems, e.g., atree 𝑡 = 𝛾 ( 𝛾 ( 𝛾 ) , 𝛾 ) can be understood as an instantiated generic type, 𝛾 𝛾 𝛾 𝛾 , to use Javasyntax. Likewise the tree rewrite 𝜌 = 𝛾 ( 𝛾 ( 𝑥 ) , 𝑥 ) → 𝛾 ( 𝑥 ) can be interpreted as a Java function static 𝛾 𝛾 𝛾 . Applying 𝜌 to 𝑡 yields the tree 𝛾 ( 𝛾 ) , while the returntype of the invocation foo(new 𝛾 𝛾 𝛾 𝛾 is 𝛾 𝛾 .In fact, with the above definitions of type systems and finite control automata, we can now easilypair certain automata with type systems.Observation 1. (1) 𝔗 ⊥ = FSA(2) Pp p = TA (3) deep- Pp p = deep- TA (4) non-linear- Pp p = non-linear- TA (5) ⟨ monadic ⟩ = SRDPDA

To be convinced, notice the natural bisimulation of automata and type systems, obtained by a one-to-one correspondence between, e.g.,
• a run of an automaton, and the type checking process as dictated by the type checking rules,
• the hanging of an automaton, and failure of type checking,
• the input word or tree, and the type-checked expression,
• input-output items in Δ_𝐴, and function definitions in Δ_𝑇,
• the contents of auxiliary storage, and the type of the checked expression.
Observe however that states of an automaton do not easily find their parallel in the typing world (except for 𝔗⊥ = FSA, in which classes correspond to states). Luckily, the expressive power of many of the automata we deal with does not depend on the presence of states, e.g., it is easy to see that deep-TA = ⟨deep,stateful⟩ ∨ TA.

The following result employs the type-automata correspondence to characterize the complexityclass of type system Pp p .Theorem 4.1. Pp p = TA = DCFL

Recalling the equivalence Pp p = TA (Obs. 1), the gist of the theorem is the claim TA = DCFL. Towards the proof we draw attention to G&R's "tree encoding", which is essentially a reduction by which every DPDA is converted to an equivalent tree automaton. Their work then proceeds to show how this tree automaton is emulated in the Pp p type system they use (and that the emulation does not incur exponential space (and time) overhead). Hence, by G&R [2019],

  DCFL = DPDA ⊆ TA = Pp p.      (4.1)

A similar result is described by Guessarian [1983]. In fact, we note that Guessarian's contribution is more general; specifically, she achieves the result that augmenting tree automata with 𝜀-transitions and multiple states does not increase their computational class.

Fact 4.1 ([Guessarian 1983, Corollary 1.(i)]). ⟨𝜀-transitions, stateful⟩ ∨ TA = TA

Fact 4.1 generalizes (4.1), since DPDAs are instances of ⟨𝜀-transitions, stateful⟩ ∨ TA, where the tree store is linear. The proof of Thm. 4.1 is completed by showing the inverse of (4.1).

Lemma 4.1. TA ⊆ DPDA.

Proof. The proof is constructed by employing Theorem 3 of Guessarian [1983]. (Notice that she uses the term "pushdown tree automaton" (PDTA) for top-down tree automata. However, for the purpose of the reduction, we concentrate on input trees that are in the form of a string, i.e., the tree traversal order is immaterial.) □

Observe that Lem. 4.1 means that G&R's result is the best possible in the following sense: it is impossible to extend Fling to support any wider family of fluent API languages within the limits of the fragment of the Java type system that Fling uses. Moreover, as shown by Grigore [2017], allowing the fluent API generator a larger type system fragment makes type-checking undecidable if the larger fragment includes the common Java idiom of employing super in signatures, as in, e.g., method boolean removeIf(Predicate<? super E> filter) found in the standard java.util.Collection class.

Combining Obs. 1 (5), known results (Table 2.2) and Thm. 4.1, we have

  ⟨monadic⟩ = SRDPDA ⊊ DCFL = Pp p = ⟨polyadic⟩,      (4.2)

i.e., had Pp p been weakened to allow only monadic generics, its expressive power would have been reduced. Conversely, we would like to check the changes to complexity when Pp p is made more potent. Consider now allowing generic functions (on top of methods of generic classes) by adding the deep-type-pattern feature to Pp p.

Theorem 4.2. DCFL ⊊ deep-TA = deep-Pp p

Again, recall the equivalence deep-Pp p = deep-TA from Obs. 1. The set containment DCFL ⊆ deep-TA follows from (4.1). It remains to show that this containment is proper.

The proof of Thm. 4.2 in Sect. C.1 is by encoding the context sensitive language 𝑎ⁿ𝑏ⁿ𝑐ⁿ ⊆ {𝑎, 𝑏, 𝑐}* in type system deep-Pp p, and relying on the following definition: for an integer 𝑘 ≥ 0, let 𝑈ₖ, the unary type encoding of 𝑘, be a grounded type in Pp p, with 𝑈₀ = Zero and 𝑈ₖ = Succ<𝑈ₖ₋₁>, with types (in Java syntax) interface Zero{} and interface Succ<T>{} (assumed implicitly henceforth). Thus, 𝑈₀ = Zero, 𝑈₁ = Succ<Zero>, 𝑈₂ = Succ<Succ<Zero>>, etc.

Note that in type system Pp p it is possible to increment and decrement integers,

  static Zero zero() { return null; }
  interface Zero { Succ<Zero> inc(); }
  interface Succ<T> { Succ<Succ<T>> inc(); T dec(); }

We have, e.g., that the type of expression zero().inc().inc().inc().inc().dec().inc() is 𝑈₄.

We now show that just like deep patterns, non-linear patterns, i.e., patterns in which the same type variable occurs more than once, increase the computational power of Pp p. This increase is attributed here to the ability of non-linear patterns to compare nested types, in particular types that are unary encodings of the integers. For example, the Java generic function static <x> void equal(x e1, x e2){} (in type system ⟨non-linear-patterns,n-ary-functions⟩ ∨ Pp p) type-checks if the types of its arguments are (say) 𝑈ₖ and 𝑈ₖ, and does not type check if these are (say) 𝑈ₖ and 𝑈ₖ₊₁. More importantly, type comparison is also possible if all functions are unary. Consider, e.g., the two argument generic type 𝛾, interface 𝛾<x1, x2>{}, and the generic unary function equal, static <x> void equal(𝛾<x, x> e){}, in type system non-linear-Pp p. Then, function equal type-checks if the type of its single argument is (say) 𝛾<𝑈ₖ, 𝑈ₖ>, and does not type-check if this type is (say) 𝛾<𝑈ₖ, 𝑈ₖ₊₁>. With this observation, we can state:

Theorem 4.3. DCFL ⊊ non-linear-TA = non-linear-Pp p

The proof of Thm. 4.3 in Sect. C.2 is again by encoding the context sensitive language 𝑎ⁿ𝑏ⁿ𝑐ⁿ ⊆ {𝑎, 𝑏, 𝑐}*, but this time in type system non-linear-Pp p. The ability of this type system to compare integers encoded as types is the gist of the proof.

Recall that a type system is dyadic if no generic takes more than two type parameters.
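The unary type encoding and the unary, non-linear comparison discussed above can be tried out in plain Java. The sketch below is ours and makes several assumptions: the names `Pair`, `Unary`, `succ` and `value` are hypothetical, and, unlike the paper's `return null` stubs, the methods return real objects so that the program also runs. The comparison relies on Java generics being invariant, so `Pair<U2, U1>` is not an instance of the pattern `Pair<X, X>`.

```java
/** Unary type encoding: U0 = Zero, Uk = Succ<U(k-1)>. */
interface Zero { Succ<Zero> inc(); }
interface Succ<T> { Succ<Succ<T>> inc(); T dec(); }

/** Two-parameter generic used to compare two encoded integers (plays the role of gamma). */
final class Pair<A, B> {
    final A first; final B second;
    Pair(A first, B second) { this.first = first; this.second = second; }
}

final class Unary {
    /** Run-time witness of Zero (hypothetical helper). */
    static Zero zero() {
        return new Zero() {
            public Succ<Zero> inc() { return succ(this); }
        };
    }
    /** Run-time witness of Succ<T> that remembers its predecessor (hypothetical helper). */
    static <T> Succ<T> succ(T pred) {
        return new Succ<T>() {
            public Succ<Succ<T>> inc() { return succ(this); }
            public T dec() { return pred; }
        };
    }
    /** Non-linear pattern Pair<X, X>: accepted only when both components have the same type. */
    static <X> void equal(Pair<X, X> e) {}

    /** Counts Succ layers at run time, making the encoded integer observable. */
    static int value(Object u) {
        int k = 0;
        while (u instanceof Succ) { u = ((Succ<?>) u).dec(); k++; }
        return k;
    }
}

final class UnaryDemo {
    public static void main(String[] args) {
        // The static type of the chain is Succ<Succ<Succ<Succ<Zero>>>>, i.e., U4.
        Succ<Succ<Succ<Succ<Zero>>>> four = Unary.zero().inc().inc().inc().inc().dec().inc();
        Succ<Succ<Zero>> two = Unary.zero().inc().inc();
        Pair<Succ<Succ<Zero>>, Succ<Succ<Zero>>> same = new Pair<>(two, two);
        Unary.equal(same); // accepted: both components are U2
        // Rejected by the compiler, since Pair<U2, U1> does not match Pair<X, X>:
        // Unary.equal(new Pair<Succ<Succ<Zero>>, Succ<Zero>>(two, two.dec()));
        System.out.println(Unary.value(four));
    }
}
```

Note that the two-argument form `<x> void equal(x e1, x e2)` would not reject unequal types in actual Java, because inference may pick a common supertype for `x`; the single-argument form above avoids this.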
Considering the shallow case, we claim no more than placing dyadic between monadic and polyadic in (4.2),

  ⟨monadic⟩ ⊆ ⟨dyadic⟩ ⊆ ⟨polyadic⟩,      (4.3)

although we conjecture that ⟨monadic⟩ ⊊ ⟨dyadic⟩ can be shown relatively easily. In contrast, in deep type systems, the expressive power does not increase by allowing more than two generic parameters.

Theorem 4.4. ⟨deep,polyadic⟩ = ⟨deep,dyadic⟩

Proof. (sketch) Relying on the automata-type correspondence, we construct for every deep-TA automaton 𝐴 an equivalent binary deep-TA 𝐴′. Let 𝛾 be a tree node in 𝐴 of rank 𝑘 > 2: replace 𝛾 with nodes 𝛾₁, 𝛾₂, …, 𝛾ₖ₋₁ of rank two, and 𝛾ₖ of rank one. Tree nodes appear in both sides of tree rewrite rules, and in the initial auxiliary storage tree: replace every occurrence of 𝛾 in 𝐴, 𝛾(𝜏₁, 𝜏₂, …, 𝜏ₖ), with 𝛾₁(𝜏₁, 𝛾₂(𝜏₂, … 𝛾ₖ₋₁(𝜏ₖ₋₁, 𝛾ₖ(𝜏ₖ)) …)). □

𝜀-TRANSITIONS

In the previous section we showed that the addition of the deep-type-pattern property, as found in generic, non-method functions of (say) Java, to the Pp p type system increases its computational complexity, but does not render it undecidable. We now prove that the addition of even rudimentary typeof to Pp p makes it undecidable.

Theorem 5.1. ⟨deep,rudimentary-typeof⟩ ∨ Pp p = RE.
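The replacement step in the proof sketch of Thm. 4.4 can be illustrated by a small Java routine that rewrites a tree so that no node has rank above two. This is a sketch under our own assumptions: the class `Node` and the naming scheme `g_1, …, g_k` for the fresh symbols are hypothetical, and we assume these names do not clash with existing symbols.

```java
import java.util.ArrayList;
import java.util.List;

/** A tree node: a symbol with an ordered list of children. */
final class Node {
    final String symbol;
    final List<Node> kids;

    Node(String symbol, List<Node> kids) { this.symbol = symbol; this.kids = kids; }
    static Node of(String symbol, Node... kids) { return new Node(symbol, List.of(kids)); }

    /** Replaces every node g of rank k > 2 by g_1(t1, g_2(t2, ... g_{k-1}(t_{k-1}, g_k(t_k)))). */
    Node binarize() {
        List<Node> bin = new ArrayList<>();
        for (Node child : kids) bin.add(child.binarize());
        int k = bin.size();
        if (k <= 2) return new Node(symbol, bin);
        // Innermost node: g_k of rank one, holding the last child.
        Node chain = new Node(symbol + "_" + k, List.of(bin.get(k - 1)));
        // Wrap outwards: g_i(t_i, chain) for i = k-1, ..., 1, each of rank two.
        for (int i = k - 1; i >= 1; i--)
            chain = new Node(symbol + "_" + i, List.of(bin.get(i - 1), chain));
        return chain;
    }

    @Override public String toString() {
        if (kids.isEmpty()) return symbol;
        StringBuilder b = new StringBuilder(symbol).append('(');
        for (int i = 0; i < kids.size(); i++) b.append(i > 0 ? "," : "").append(kids.get(i));
        return b.append(')').toString();
    }
}

final class BinarizeDemo {
    public static void main(String[] args) {
        Node t = Node.of("g", Node.of("a"), Node.of("b"), Node.of("c"), Node.of("d"));
        System.out.println(t.binarize());
    }
}
```

In the proof the same replacement is applied to both sides of every rewrite rule and to the initial storage tree; the sketch shows it for a single tree only.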

The following reduction is pertinent to the proof of Thm. 5.1.Lemma 5.1.

A Turing machine 𝑀 can be simulated by a deep-rewrite, stateful tree automaton 𝐴 which is allowed 𝜀-transitions.

Proof. As explained in Sect. 2, we can assume that 𝑀 accepts its input on the tape with the head on the first letter, and then engages in 𝜀-transitions only. Also, w.l.o.g., 𝑀's tape is extended infinitely in both directions by infinite sequences of a designated blank symbol ♭.

Fig. 5.1

Turing machine accepting the language 𝑎ⁿ𝑏ⁿ (state diagram; transition labels: 𝑎 → ♭+, ♭ → ♭−, 𝑏 → ♭−, ♭ → ♭+, ♭ → ♭+, 𝑎 → 𝑎+, 𝑏 → 𝑏+, 𝑎 → 𝑎−, 𝑏 → 𝑏−)

Fig. 5.1 is an example of such a machine 𝑀, with a finite set of internal states, a single accepting state, and tape alphabet Γ = {𝑎, 𝑏, ♭}. The machine terminates in an accepting state if and only if the tape is initialized with a word 𝑎ⁿ𝑏ⁿ. It operates by repeatedly replacing a letter 𝑎 from the beginning of the word and its counterpart letter 𝑏 from the word's end by ♭, until no more 𝑎's or 𝑏's are left. The convention of depicting transitions over edges in the graph of states is standard, e.g., the arrow and label rendered in purple (going from state 𝑞₁ to state 𝑞₂) is the 𝜀-transition item

  ⟨𝑞₁, 𝑎 → ♭+, 𝑞₂⟩,      (5.1)

which states that if the Turing machine is in internal state 𝑞₁, and the symbol under the head is 𝑎, then (i) replace 𝑎 by ♭, (ii) increment ℎ, and (iii) change the internal state to 𝑞₂.

The encoding of 𝑀 in 𝐴 includes the following components:
(1) Adopting the set of states 𝑄, the set of accepting states 𝐹, and the initial state of 𝑀.
(2) A rank-1 tree symbol for each of the tape symbols, including ♭.
(3) Employing the designated leaf symbol 𝜺 ∉ Γ to encode the infinite sequences of ♭ at the ends of the tape.
(4) Introducing a rank-3 tree symbol ◦ for encoding the tape itself. The center child of a node labeled ◦ encodes the cell under the head; its left (resp. right) child encodes the tape to the left (resp. to the right) of the head.
For example, the tape contents ··· ♭♭♭ 𝑏𝑎𝑎𝑏𝑏 ♭♭♭ ··· is encoded by a certain tree 𝑡 = ◦(𝑏(𝜺), 𝑎(𝜺), 𝑎(𝑏(𝑏(𝜺)))). For the sake of readability we write ◦ nodes in infix notation, e.g., 𝑡 = 𝑏(𝜺)/𝑎(𝜺)/𝑎(𝑏(𝑏(𝜺))), or even more concisely 𝑡 = 𝑏/𝑎/𝑎𝑏𝑏.
(5) Setting 𝜸 = 𝜺/𝜎₁/𝜎₂⋯𝜎ₙ, i.e., letting the initial state of auxiliary storage encode the input word 𝜎₁𝜎₂⋯𝜎ₙ.
(6) Introducing several transitions in 𝐴 for each of 𝑀's transitions: a single transition for dealing with the 𝜺 leaf denoting an infinite sequence of blanks, and a transition for each tape symbol. In demonstration, transition ⟨𝑞₁, 𝑎 → ♭+, 𝑞₂⟩ of (5.1) is encoded in four 𝜀-transitions of 𝐴 which differ only in their tree rewrite rule:

  ⟨𝑞₁, 𝑥₁/𝑎/𝑎𝑥₂ → ♭𝑥₁/𝑎/𝑥₂, 𝑞₂⟩      ⟨𝑞₁, 𝑥₁/𝑎/𝑏𝑥₂ → ♭𝑥₁/𝑏/𝑥₂, 𝑞₂⟩
  ⟨𝑞₁, 𝑥₁/𝑎/♭𝑥₂ → ♭𝑥₁/♭/𝑥₂, 𝑞₂⟩      ⟨𝑞₁, 𝑥/𝑎/𝜺 → ♭𝑥/♭/𝜺, 𝑞₂⟩      (5.2)

The rules above distinguish between the values of the right child of node ◦, i.e., the symbol to the right of the head. For example, the first rule, 𝑥₁/𝑎/𝑎𝑥₂ → ♭𝑥₁/𝑎/𝑥₂, deals with the case that this child is 𝑎 followed by some tape suffix captured in variable 𝑥₂. The rule rewrites the node, making 𝑎 the center child.

Notice that with this encoding, the input to 𝐴 is encoded in its transition rules. □

Relying on Lem. 5.1, the proof of Thm. 5.1 is completed by encoding the automaton 𝐴 of the lemma in the appropriate type system.

Proof of Thm. 5.1. We encode automaton 𝐴 = 𝐴(𝑀) as a program 𝑃 = ΔΞ𝑒 in type system ⟨deep,rudimentary⟩ ∨ Pp p. In this encoding, set Δ is empty, and there is a function 𝜉 ∈ Ξ for every 𝜀-transition item in set Ξ of 𝐴. Expression 𝑒 type checks against Ξ if, and only if, machine 𝑀 (automaton 𝐴) halts.

In the encoding, the tree vocabulary of 𝐴 incarnates as generic types: a three-parameter generic type ◦, and a generic one-parameter type 𝛾 for each tape symbol, including ♭.
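To make the tape encoding of Lem. 5.1 and the four rewrite rules of (5.2) more tangible, here is a small Java sketch of the left/head/right representation and of a "write, then move right" step. It is an illustration under our own assumptions: the class `Tape` and its methods are hypothetical, the blank symbol is rendered as `_`, the empty string stands for the leaf that encodes an infinite run of blanks, and the case split on the right neighbour is done by an `if` rather than by one rule per symbol.

```java
/** Tape encoded as left / head / right, mirroring the rank-3 tape symbol of the proof. */
final class Tape {
    static final char BLANK = '_';   // stands for the designated blank symbol

    final String left;   // cells left of the head, nearest first; "" stands for the leaf
    final char head;     // the cell under the head
    final String right;  // cells right of the head, nearest first; "" stands for the leaf

    Tape(String left, char head, String right) {
        this.left = left; this.head = head; this.right = right;
    }

    /** Initial storage: the input word with the head on its first letter. */
    static Tape ofWord(String w) {
        return w.isEmpty() ? new Tape("", BLANK, "") : new Tape("", w.charAt(0), w.substring(1));
    }

    /** Transition "expect -> write +": null when the head does not hold expect. */
    Tape writeRight(char expect, char write) {
        if (head != expect) return null;
        String newLeft = write + left;
        // One case per possible right neighbour in (5.2); the leaf case yields a blank.
        return right.isEmpty()
            ? new Tape(newLeft, BLANK, "")
            : new Tape(newLeft, right.charAt(0), right.substring(1));
    }

    /** Transition "expect -> write -": the mirror image of writeRight. */
    Tape writeLeft(char expect, char write) {
        if (head != expect) return null;
        String newRight = write + right;
        return left.isEmpty()
            ? new Tape("", BLANK, newRight)
            : new Tape(left.substring(1), left.charAt(0), newRight);
    }

    private static String show(String s) { return s.isEmpty() ? "e" : s; }

    @Override public String toString() { return show(left) + "/" + head + "/" + show(right); }
}

final class TapeDemo {
    public static void main(String[] args) {
        Tape t = Tape.ofWord("aabb");
        System.out.println(t);                       // initial storage
        System.out.println(t.writeRight('a', '_'));  // the step of transition (5.1)
    }
}
```

In the type-system encoding each branch of these conditionals becomes a separate overloaded function definition, since patterns can only inspect a bounded part of the type.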
Also, the argument to every function 𝜉 ∈ Ξ is a deep pattern over possible instantiations of ◦.
Further, introduce a function symbol 𝜑_𝑞 for every 𝑞 ∈ 𝑄, and let every transition ⟨𝑞, 𝜏 → 𝜏′, 𝑞′⟩ of 𝐴 add an overloaded definition 𝜑_𝑞 : 𝜏 → typeof 𝜏′.𝜑_𝑞′ to this symbol. Thus, function 𝜑_𝑞 emulates 𝐴 in state 𝑞 with tape 𝜏: it applies the rewrite 𝜏 → 𝜏′ to the type, and employs the resolution of typeof to continue the computation in function 𝜑_𝑞′, which corresponds to the destination state 𝑞′.
For example, the Turing machine transition shown in (5.1), encoded by the tree automaton transitions of (5.2), is embedded in C++ using decltype, as depicted in List. 5.1.

Listing 5.1

Definitions in type system ⟨rudimentary-typeof, deep⟩ ∨ Pp p (using C++ syntax) encoding the tree automata transitions of (5.2)

    template typeof(q2(O, B>())) q1(O, B, xR>) {}
    template typeof(q2(O, B>())) q1(O, xR>) {}
    template typeof(q2(O, B>())) q1(O, B, xR>) {}
    template typeof(q2(O, B>())) q1(O, B, xR>) {}

Further, to encode the input word, set 𝑒 = ◦(𝜺, 𝜎₁, 𝜎₂(⋯𝜎ₙ(𝜺)⋯)).𝜑_𝑞, or, in monadic abbreviation form, 𝑒 = ◦(𝜺, 𝜎₁, 𝜎₂⋯𝜎ₙ).𝜑_𝑞.
To terminate the typing process, further overload 𝜑_𝑞 with definition 𝜑_𝑞 : ◦(𝑥₁, 𝛾(𝜺), 𝑥₂) → 𝜺 for every accepting state 𝑞 ∈ 𝐹 and cell symbol 𝛾 ∈ Γ for which a Turing machine transition is not defined. These definitions correspond to the situation of 𝐴 reaching an accepting state: type checking succeeds if and only if typeof resolution reaches such a definition.
The full C++ encoding of the Turing machine of Fig. 5.1 is shown in List. D.1 in the appendices. □

Having examined the contribution of deep by itself, and the combination of deep and rudimentary to the computational complexity of Pp p, it is time to consider the contribution of rudimentary by itself to complexity. The following shows that there is no such contribution.

Theorem 5.2. Pp p = rudimentary-typeof-Pp p

Proof. The first direction Pp p ⊆ rudimentary-typeof-Pp p is immediate, as every Pp p program is also a rudimentary-typeof-Pp p program by definition. We prove rudimentary-typeof-Pp p ⊆ Pp p.
Given a program 𝑃 = ΔΞ𝑒 in rudimentary-Pp p we need to convert it into an equivalent program 𝑃′ in type system Pp p. By Thm. 4.1 it is sufficient to convert 𝑃 into a vanilla tree automaton, i.e., one with neither states nor 𝜀-transitions.
Instead, we convert 𝑃 into a more potent tree automaton 𝐴 which is allowed both 𝜀-transitions and states, and then employ Guessarian's observation ⟨𝜀-transitions, stateful⟩ ∨ TA = TA (see Fact 4.1 above) to complete the proof.
The set of internal states of 𝐴 includes an initial and accepting state 𝑞 and a state 𝑞_𝜑 for every auxiliary function name 𝜑 used in Ξ.
Consider a definition in 𝑃 = ΔΞ𝑒 of a (primary or auxiliary) function that employs a typeof clause 𝜏 → typeof 𝜗. With rudimentary typeof, pseudo-expression 𝜗 is either 𝜏′ or 𝜏′.𝜑. Therefore, every function definition is either in the direct form 𝜏 → 𝜏′ or in the forwarding form 𝜏 → typeof 𝜏′.𝜑.
There are four cases to consider:
(1) Primary function definitions, found in Δ, are encoded as consuming transitions of 𝐴:
  (a) Direct definition 𝜎 : 𝜏 → 𝜏′ is encoded as transition ⟨𝜎, 𝑞, 𝜏 → 𝜏′, 𝑞⟩.
  (b) Forwarding definition 𝜎 : 𝜏 → typeof 𝜏′.𝜑 is encoded as transition ⟨𝜎, 𝑞, 𝜏 → 𝜏′, 𝑞_𝜑⟩.
(2) Auxiliary function definitions, found in Ξ, are encoded as 𝜀-transitions of 𝐴:
  (a) Direct definition 𝜑 : 𝜏 → 𝜏′ is encoded as 𝜀-transition ⟨𝑞_𝜑, 𝜏 → 𝜏′, 𝑞⟩.
  (b) Forwarding definition 𝜑 : 𝜏 → typeof 𝜏′.𝜑′ is encoded as 𝜀-transition ⟨𝑞_𝜑, 𝜏 → 𝜏′, 𝑞_𝜑′⟩.
In all four cases, the change from input type to output type by a function is encoded as a rewrite of the tree auxiliary storage of 𝐴. Direct definitions are encoded by 𝐴 moving into state 𝑞. Forwarding to function 𝜑 is encoded by 𝐴 moving into state 𝑞_𝜑.
Notice that state 𝑞, the only accepting state, is the only state with outgoing consuming transitions, and it is also the only one without outgoing 𝜀-transitions. Therefore, the automaton consumes a letter in state 𝑞, and finishes conducting 𝜀-transitions back in 𝑞, or otherwise it rejects the input.
With the above construction, expression 𝑒 = 𝜀.𝜎₁.⋯.𝜎ₙ type-checks against Δ and Ξ if and only if 𝐴 accepts word 𝑤 = 𝜎₁⋯𝜎ₙ. □

Theorem 5.3. Fluent = DCFL

Proof. Yamazaki et al. [2019] showed that DCFL ⊆ Fluent, i.e., that any LR language, alternatively, any DCFL, can be encoded in a Fluent program. It remains to show the converse, Fluent ⊆ DCFL. We prove Fluent ⊆ deep-DPDA, noting the folklore equality
deep-DPDA = DPDA. (5.3)
The encoding of a Fluent program in a deep-DPDA is reminiscent of the encoding of a program in the rudimentary-Pp p type system in a vanilla tree automaton in the proof of Thm. 5.2 just above. The full proof of the current theorem is in Sect. C.3. □

Having seen that Fluent is not more expressive than it was intended to be, it is interesting to check whether its expressive power would increase if it allowed unrestricted typeof clauses.

Theorem 5.4. full-typeof-Fluent ⊋ DCFL

The proof is by showing that type system full-typeof-Fluent is expressive enough to encode the language 𝑤𝑤, known to be context sensitive. The full proof is in Sect. C.4.

Most previous work concentrated on recognition of deterministic languages [Gil and Levy 2016; Gil and Roth 2019; Grigore 2017; Nakamaru et al. 2017]. We show here that type systems with Ada-like overloading can encode non-deterministic context free languages as well. The proof relies on creating a direct correspondence between the type system and context free grammars (CFGs).

Theorem 6.1. UCFL ⊆ ⟨monadic, eventually-one-type⟩

Proof. Given an unambiguous context free grammar 𝐺, we encode it as Δ, a set of function definitions in ⟨monadic, eventually-one-type⟩ such that 𝐺 derives word 𝜎₁⋯𝜎ₙ if, and only if, expression 𝜀.𝜎₁.⋯.𝜎ₙ.$ ($ being a dedicated function symbol) type checks against Δ.
We redefine CFGs using a notation more consistent with this manuscript: context free grammar 𝐺 is a specification of a formal language over alphabet Σ in the form of a quadruple ⟨Σ, Γ, 𝜺, 𝑅⟩ where Σ is the set of 𝐺's terminals, Γ is the set of grammar variables, 𝜺 ∉ Γ is the start symbol, and 𝑅 is a set of derivation rules. Each derivation rule 𝜌 ∈ 𝑅 is either in the form 𝜺 → 𝜔, or in the form 𝛾 → 𝜔, where 𝛾 ∈ Γ and where 𝜔 is a possibly empty sequence of terminals and grammar variables, i.e., 𝜔 ∈ (Σ ∪ Γ)∗.
Recall that a grammar is in Greibach Normal Form (GNF) if every rule 𝜌 ∈ 𝑅 is in one of three forms: (i) the usual form, 𝜌 = 𝛾 → 𝜎𝜸, where 𝜎 ∈ Σ is a terminal and 𝜸 ∈ Γ∗ is a sequence of variables, (ii) the initialization form, 𝜌 = 𝜺 → 𝜎𝜸, or (iii) the 𝜀-form, 𝜌 = 𝜺 → 𝜀, present only if the grammar derives the empty word 𝜀 ∈ Σ∗.
For the encoding, first convert unambiguous grammar 𝐺 into an equivalent unambiguous grammar in GNF.
This is done using the algorithm of Nijholt [1979] (also presented in more accessible form by Salomaa and Soittola [1978]).
The type encoding of GNF grammar 𝐺 uses a monadic generic type 𝛾 for every symbol 𝛾 ∈ Γ, an additional monadic generic type $, and one non-generic type 𝜺, also known as the unit type.
For each derivation rule 𝜌 ∈ 𝑅, introduce a function 𝛿 ∈ Δ that uses these types:
• If 𝑅 includes the 𝜀-form rule 𝜺 → 𝜀 $, introduce (one overloaded) definition of function $ : 𝜺 → 𝜺. Then 𝜀.$, the expression corresponding to the empty word, type-checks to type 𝜺. (Recall that 𝜀 is the single value of the unit type 𝜺.)
• If 𝜌 is in the initialization form 𝜺 → 𝜎𝜸, then 𝛿 = 𝜎 : 𝜺 → 𝜸$. For such a rule introduce also function $ : $ 𝜺 → 𝜺.
• If 𝜌 is in the usual form 𝛾 → 𝜎𝜸, then 𝛿 = 𝜎 : 𝛾𝑥 → 𝜸𝑥.
We show by induction on 𝑖 = 0, …, 𝑛 the following claim on the partial expression 𝑒ᵢ = 𝜀.𝜎₁.⋯.𝜎ᵢ: the set of types assigned by the type checker to 𝑒ᵢ includes a type 𝜸$, 𝜸 ∈ Γ⁺, if and only if there exists a leftmost derivation (LMD) that yields the sentential form 𝜎₁⋯𝜎ᵢ𝜸.
For the inductive base observe that 𝑒₀ = 𝜀 and that the set of types of 𝜀 includes only the unit type 𝜺; indeed there is a (trivial) LMD of the degenerate sentential form 𝜀𝜺 = 𝜺.
Consider an LMD of 𝜎₁⋯𝜎ᵢ𝜎ᵢ₊₁𝜸′$, where 𝑖 < 𝑛, 𝜸′ ∈ Γ⁺ and 𝜎ᵢ₊₁ is the terminal 𝜍 ∈ Σ, 𝜍 ≠ $. We show that 𝜸′ is a type of 𝑒ᵢ₊₁ = 𝜍(𝑒ᵢ). The said LMD can only be obtained by applying a rule 𝜌 = 𝛾 → 𝜍𝜸″ to the sentential form 𝜎₁⋯𝜎ᵢ𝜸$, where 𝛾 is the first symbol of 𝜸.
By examining the kind of functions in Δ, one can similarly show that every type 𝜸′ of 𝑒ᵢ₊₁ is evidence of an LMD of a sentential form 𝜎₁⋯𝜎ᵢ𝜎ᵢ₊₁𝜸′.
The proof is completed by manually checking that a full expression, ending with the .$ invocation, can only type check to a single type, 𝜺, and this can happen only if the type of 𝜀.𝜎₁.⋯.𝜎ₙ is 𝜸, where 𝜸 occurs in an initialization rule 𝜺 → 𝜎ₙ𝜸. □

Sect. D.2 demonstrates the proof by presenting a fluent API of the non-deterministic context free language of even length palindromes.
If final expressions are also allowed to be multi-typed, then we can construct fluent APIs for all context free languages.

Theorem 6.2. ⟨monadic, multiple-type⟩ = CFL

Proof. The construction in the proof of Thm. 6.1 works here as well. Note that here the transition from a plain CFG to GNF does not have to preserve unambiguity. □

Perspective.

Revisiting Table 3.1, we see that in total it has |𝐶₁|·|𝐶₂|·|𝐶₃|·|𝐶₄|·|𝐶₅|·|𝐶₆| = 4·2·2·2·3·3 = 288 lattice points. Accounting for the fact that in a nyladic type system the values of 𝐶₂ (type pattern depth) and 𝐶₃ (type pattern multiplicity) are meaningless, we see that lattice 𝔗 spans |𝐶₄|·|𝐶₅|·|𝐶₆| = 2·3·3 = 18 monomorphic type systems (𝔗⊥ among them), and (|𝐶₁| − 1)·|𝐶₂|·|𝐶₃|·|𝐶₄|·|𝐶₅|·|𝐶₆| = 3·2·2·2·3·3 = 216 potential polymorphic type systems (Pp p and Fluent among them). To make the count more exact, account for 𝐶₃ being irrelevant in a monadic type system, obtaining |𝐶₂|·|𝐶₄|·|𝐶₅|·|𝐶₆| = 2·2·3·3 = 36 monadic, yet polymorphic, type systems, and (|𝐶₁| − 2)·|𝐶₂|·|𝐶₃|·|𝐶₄|·|𝐶₅|·|𝐶₆| = 2·2·2·2·3·3 = 144 non-monadic polymorphic type systems.
Beyond the implicit mention that the type-automata correspondence applies to monomorphic type systems, these were not considered here. Our study also invariably assumed unary-function, ignoring in characteristic 𝐶₄ the n-ary-functions type systems, which comprise half of the type systems of 𝔗. (The ignored n-ary-functions correspond to the forest-recognizer brand of automata; however, forest-recognizer automata were used in the construction, e.g., in Lem. 5.1.)
Even though most of this work was in characterizing the complexity classes of type systems, it could not have covered even the (36 + 216)/2 = 126 type systems remaining in scope. The study rather focused on the systems which we thought are more interesting: we gave an exact characterization of the complexity classes of two central type systems, Pp p (Thm. 4.1) and Fluent (Thm. 5.3), and investigated how this complexity changes if the type systems are made more or less potent along 𝔗's characteristics (with the exception of 𝐶₄, the function arity characteristic).
Comparing (3.1) with Table 3.1 we see that Fluent can be made more potent along 𝐶₁, 𝐶₅, or 𝐶₆, and, as follows from our results, its complexity class increases in all three cases:
(1) In 𝐶₁, Fluent ⊊ dyadic-Fluent = RE, by combining Thm. 4.4 and Thm. 5.1.
(2) In 𝐶₆, Fluent ⊊ eventually-one-type-Fluent (Thm. 6.1).
(3) In 𝐶₅, Fluent ⊊ full-typeof-Fluent (Thm. 5.4).
Conversely, Fluent can be made less potent along characteristics 𝐶₁, 𝐶₂ and 𝐶₅:
(1) In 𝐶₁ complexity decreases, Fluent − monadic = FSA ⊊ Fluent (Obs. 1).
(2) In 𝐶₂, (5.3) makes us believe that complexity does not change, Fluent − deep + shallow = Fluent.
(3) In 𝐶₅, by Obs. 1 and (5.3), Fluent − rudimentary = deep-RDPDA. We believe complexity decreases but are unsure.
Type system Pp p can be made more potent along characteristics 𝐶₂, 𝐶₃, 𝐶₅ and 𝐶₆:
(1) In 𝐶₂ complexity increases, Pp p ⊊ deep-Pp p (Thm. 4.2).
(2) In 𝐶₃ complexity increases, Pp p ⊊ non-linear-Pp p (Thm. 4.3).
(3) In 𝐶₅ complexity does not change, Pp p = rudimentary-typeof-Pp p (Thm. 5.2).
(4) In 𝐶₆ complexity increases, Pp p ⊊ eventually-one-type-Pp p (Thm. 6.1).
Type system Pp p can be made less potent only along characteristic 𝐶₁. From Obs. 1 and Thm. 4.1,
FSA = ⟨nyladic⟩ ⊊ SRDPDA = ⟨monadic⟩ ⊆ ⟨dyadic⟩ ⊆ ⟨polyadic⟩ = DCFL, with ⟨monadic⟩ ⊊ ⟨polyadic⟩, (7.1)
i.e., it is not known whether decreasing Pp p along 𝐶₁ to dyadic reduces its complexity, but decreasing it further to monadic certainly does.
This work should also be viewed as a study of the type-automata correspondence: (i) The results in Sect. 4 revolve around the correspondence between tree-store automata employing tree rewrites, and type systems in which the signature of functions employs type patterns to match its argument. (ii) Sect. 5 explored the correspondence between the typeof clause in the signature of functions and 𝜀-transitions of automata. (iii) Sect. 6 dealt with the correspondence between non-deterministic runs and allowing expressions to have multiple types, at least as a partial step during the resolution of overloading. Overall, our study confirmed that the type-automata correspondence is a significant aid in the characterization of complexity classes, either by a direct bisimulation between the two, or by employing and adapting (sometimes ancient) contributions in the decades-old research of automata.
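The correspondence between typeof clauses and 𝜀-transitions can be illustrated by a small C++ sketch. It is an illustration under stated assumptions, not a construction taken from the paper: the names `Z`, `S`, `pop`, `a`, `b`, and `can_b` are hypothetical. The primary functions `a` and `b` play the role of consuming transitions, and the auxiliary function `pop`, reached only through `decltype`, plays the role of an 𝜀-transition that rewrites the storage without consuming input.

```cpp
#include <type_traits>
#include <utility>

// Hypothetical names, chosen for this sketch only.
struct Z {};                      // unit type: empty auxiliary storage
template <class X> struct S {};   // monadic generic type: one stack symbol

// Auxiliary function (an epsilon-transition): rewrites S<x> to x.
template <class X> X pop(S<X>);

// Primary functions (consuming transitions), one per input letter.
template <class X> S<X> a(X);                          // direct form:     x -> S<x>
template <class X> auto b(X x) -> decltype(pop(x));    // forwarding form: x -> typeof x.pop

// The chain a, a, b, b type-checks and ends with empty storage.
static_assert(std::is_same<decltype(b(b(a(a(Z{}))))), Z>::value,
              "two pushes followed by two pops restore the unit type");

// Detects whether b type-checks on storage T (no overload of pop accepts Z).
template <class T, class = void> struct can_b : std::false_type {};
template <class T>
struct can_b<T, decltype(void(b(std::declval<T>())))> : std::true_type {};

static_assert(can_b<S<Z>>::value, "b is applicable to non-empty storage");
static_assert(!can_b<Z>::value, "b on empty storage does not type-check");
```

Since `b` forwards to a single auxiliary function whose result is used as is, the sketch stays within the rudimentary form of typeof discussed in Sect. 5.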

Open Problems.

Technically, we leave open the problem of characterizing the complexity class of each of the 126 type systems that were either not considered at all, or considered but not fully characterized. However, many of these can be trivially solved, e.g., since 𝑇₀ = ⟨deep, rudimentary, polyadic⟩ = RE (Thm. 5.1), 𝑇 = RE for all 𝑇 ∈ 𝔗, 𝑇 > 𝑇₀. We draw attention to four type systems for which we are able to set a lower and an upper bound, but still miss a precise characterization, e.g., in terms of familiar computational complexity classes.
(1) deep-Pp p, for which we have DCFL ⊊ deep-Pp p ⊆ CSL by Thm. 4.2.
(2) non-linear-Pp p, for which we also have DCFL ⊊ non-linear-Pp p ⊆ CSL by Thm. 4.3.
(3) ⟨deep, non-linear⟩ ∨ Pp p, for which we have again DCFL ⊊ ⟨deep, non-linear⟩ ∨ Pp p ⊆ CSL by Thms. 4.2 and 4.3.
(4) full-typeof-Fluent, for which we have DCFL ⊊ full-typeof-Fluent ⊆ RE by Thm. 5.4.
Also, we do not know yet how these relate to each other in terms of computational complexity, beyond what can be trivially inferred by 𝔗's partial order. Sect. D.3 may offer some insights.

Expression Trees vs. Expression Words.

Language recognizers, i.e., automata which take trees as inputs, were defined and used in the proofs. Still, this study does not offer much on n-ary-functions, the type counterpart of language recognizers. There is potential in exploring the theory of polymorphic types of tree shaped expressions. In particular, it is interesting to study type systems 𝑆₁ = ⟨n-ary, deep⟩ and 𝑆₂ = ⟨n-ary, deep, non-linear⟩, both modeling static generic multi-argument functions of C#. 𝑆₂ adds the power, and predicament (see List. 3.1), of non-linear type patterns. In the type-automata perspective 𝑆₁ and 𝑆₂ correspond to the forest-recognizer real-time tree-store brand of automata, which received little attention in the literature. We see two potential applications of type theory for which (say) Pp p is insufficient, and which could serve as motivation for resolving the open problems above and for the study of 𝑆₁ and 𝑆₂.

Types for linear algebra

The matrix product 𝐴 × 𝐵 is defined if matrix 𝐴 is 𝑚₁ × 𝑚₂ and matrix 𝐵 is 𝑚₂ × 𝑚₃, in which case the result is an 𝑚₁ × 𝑚₃ matrix. The matrix addition 𝐴 + 𝐵 is defined only if both 𝐴 and 𝐵 are 𝑚₁ × 𝑚₂, in which case the result is also 𝑚₁ × 𝑚₂. The unary encoding of integers and their comparison in one step in the proof of Thm. 4.3 seem to be sufficient for developing a decidable type system that enforces such constraints.
However, unlike type systems for checking fluent API, types for linear algebra implemented this way are impractical: matrices whose dimensions are in the range of thousands are common, e.g., in image processing. Programmers cannot be expected to encode integers this large in unary, not to mention the fact that such types tend to challenge compilers' stability. The problem is ameliorated in 𝑆, in which a decimal (say) representation of integers is feasible. A more precise design is left for future research.
A more difficult challenge is type system support for, and checking of, operations which involve integer arithmetic. A prime example is numpy's reshape operation which converts, e.g., an 𝑚₁ × 𝑚₂ matrix to an 𝑚₃ × 𝑚₄ matrix, where correctness is contingent on the equality 𝑚₁ · 𝑚₂ = 𝑚₃ · 𝑚₄. Indeed, we are not aware of any decidable type system that can do integer multiplication.

Dimensional types

A similar challenge is support for physical dimensions, i.e., a design of a type system allowing, e.g., the division of a distance quantity by a time quantity to obtain a speed quantity, and the addition and comparison of distance quantities, but forbidding, e.g., the addition and comparison of time and distance quantities. To do so, the type system should probably encode $\prod_{i=1}^{r} x_i^{m_i}$, $m_i \in \mathbb{Z}$, the general form of a physical dimension (in, say, MKS), as a tuple of 𝑟 signed integers.
To enforce the rules of addition and comparison of physical dimensions, the type system should be able to compare (typically very small) integers, as done in Thm. 4.3, although the implementation should be tweaked to support negative integers. For multiplying and dividing physical quantities, the type system should be able to add (small) integers. We do not know whether this is possible in 𝑆₁ or 𝑆₂.

Modeling type erasure.

Finally, we draw attention to the fact that Java's type erasure is not accurately modeled by our system. In particular, Java forbids function overloading if the types of the overloaded functions become identical after type erasure. We propose this type inference rule for type erasure

\[ (\text{TypeErasure}) \quad \frac{\sigma : \gamma(\boldsymbol{\tau}) \to \tau \qquad \sigma : \gamma(\boldsymbol{\tau}') \to \tau'}{\sigma : \bot} \tag{7.2} \]

and leave the problem of studying type systems with type erasure to future research.

REFERENCES

Nada Amin and Ross Tate. 2016. Java and Scala's type systems are unsound: The existential crisis of null pointers. In Proceedings of the 2016 ACM SIGPLAN International Conference on Object-Oriented Programming, Systems, Languages, and Applications (OOPSLA 2016). Association for Computing Machinery, New York, NY, USA, 838–848. https://doi.org/10.1145/2983990.2984004
Jean-Michel Autebert, Jean Berstel, and Luc Boasson. 1997. Context-free languages and pushdown automata. Springer, Berlin, Heidelberg, 111–174. https://doi.org/10.1007/978-3-642-59136-5_3
Henk Barendregt. 1991. Introduction to generalized type systems. J. Functional Programming
On the power of real-time Turing machines: 𝑘 tapes are more powerful than 𝑘 − 1 tapes. Theoretical Comp. Science
Luis Damas and Robin Milner. 1982. Principal type-schemes for functional programs. In Proceedings of the 9th ACM SIGPLAN-SIGACT Symposium on Principles of Programming Languages (POPL '82). Association for Computing Machinery, New York, NY, USA, 207–212. https://doi.org/10.1145/582153.582176
Alan A.A. Donovan and Brian W. Kernighan. 2015. The Go Programming Language. Addison-Wesley Professional, Boston, MA, USA.
Yossi Gil and Tomer Levy. 2016. Formal language recognition with the Java type checker. In 30th European Conference on Object-Oriented Programming (ECOOP 2016), Shriram Krishnamurthi and Benjamin S. Lerner (Eds.), Vol. 56. Schloss Dagstuhl–Leibniz-Zentrum fuer Informatik, Dagstuhl, Germany, 10:1–10:27. https://doi.org/10.4230/LIPIcs.ECOOP.2016.10
Yossi Gil and Ori Roth. 2019. Fling - a fluent API generator. In 33rd European Conference on Object-Oriented Programming (ECOOP 2019), Alastair F. Donaldson (Ed.), Vol. 134. Schloss Dagstuhl–Leibniz-Zentrum fuer Informatik, Dagstuhl, Germany, 13:1–13:25. https://doi.org/10.4230/LIPIcs.ECOOP.2019.13
Jean-Yves Girard. 1971. Une Extension De L'Interpretation De Gödel a L'Analyse, Et Son Application a L'Elimination Des Coupures Dans L'Analyse Et La Theorie Des Types. In Proceedings of the Second Scandinavian Logic Symposium, J.E. Fenstad (Ed.). Studies in Logic and the Foundations of Mathematics, Vol. 63. Elsevier, 63–92. https://doi.org/10.1016/S0049-237X(08)70843-7
Jean-Yves Girard. 1972. Interprétation fonctionnelle et élimination des coupures de l'arithmétique d'ordre supérieur. Ph.D. Dissertation. Université Paris.
Radu Grigore. 2017. Java generics are Turing complete. SIGPLAN Not. 52, 1 (Jan. 2017), 73–85. https://doi.org/10.1145/3093333.3009871
Irène Guessarian. 1983. Pushdown tree automata. Math. Syst. Theory 16, 1 (1983), 237–263.
R. Hindley. 1969. The principal type-scheme of an object in combinatory logic. Trans. Amer. Math. Soc.
Introduction to automata theory, languages, and computation (3rd ed.). Pearson Addison Wesley, Boston, MA.
Richard M. Karp. 1972. Reducibility among combinatorial problems. In Proc. Symp. Complex. Comp., Raymond E. Miller, James W. Thatcher, and Jean D. Bohlinger (Eds.). Springer, Yorktown Heights, NY, 85–103. https://doi.org/10.1007/978-1-4684-2001-2_9
Andrew Kennedy and Benjamin Pierce. 2007. On decidability of nominal subtyping with variance. In Int. Workshop Found. & Devel. OO Lang. (FOOL/WOOD'07). Nice, France. http://foolwood07.cs.uchicago.edu/program/kennedy-abstract.html
A. J. Kfoury, J. Tiuryn, and P. Urzyczyn. 1990. ML typability is DEXPTIME-complete. In CAAP '90, A. Arnold (Ed.). Springer, New York, 206–220.
Donald E. Knuth. 1965. On the translation of languages from left to right. Info & Comp. 8, 6 (1965), 607–639. https://doi.org/10.1016/S0019-9958(65)90426-2
Robin Milner. 1978. A theory of type polymorphism in programming. J. Comput. System Sci. 17, 3 (1978), 348–375. https://doi.org/10.1016/0022-0000(78)90014-4
Tomoki Nakamaru and Shigeru Chiba. 2020. Generating a generic fluent API in Java. The Art, Science, and Eng. of Prog. 4, 3 (Feb. 2020). https://doi.org/10.22152/programming-journal.org/2020/4/9
Tomoki Nakamaru, Kazuhiro Ichikawa, Tetsuro Yamazaki, and Shigeru Chiba. 2017. Silverchain: a fluent API generator. In Proc. 16th ACM SIGPLAN Int. Conf. Generative Prog. (GPCE'17). ACM, Vancouver, BC, Canada, 199–211.
Anton Nijholt. 1979. Grammar functors and covers: From non-left-recursive to Greibach normal form grammars. BIT Numerical Mathematics 19, 1 (01 March 1979), 73–78. https://doi.org/10.1007/BF01931223
Guido Persch, Georg Winterstein, Manfred Dausmann, and Sophia Drossopoulou. 1980. Overloading in Preliminary Ada. SIGPLAN Not. 15, 11 (Nov. 1980), 47–56. https://doi.org/10.1145/947783.948640
Michael O. Rabin. 1963. Real time computation. Israel J. Math.
John C. Reynolds. 1974. Towards a theory of type structure. In Programming Symposium, B. Robinet (Ed.). Springer Berlin Heidelberg, Berlin, Heidelberg, 408–425.
A. Salomaa and M. Soittola. 1978. Automata-Theoretic Aspects of Formal Power Series. Springer-Verlag, NY.
J.B. Wells. 1999. Typability and type checking in System F are equivalent and undecidable. Annals of Pure and Applied Logic 98, 1 (1999), 111–156. https://doi.org/10.1016/S0168-0072(98)00047-5
Hao Xu. 2010. EriLex: an embedded domain specific language generator. In Objects, Models, Components, Patterns, Jan Vitek (Ed.). Springer, Berlin, Heidelberg, 192–212.
Tetsuro Yamazaki, Tomoki Nakamaru, Kazuhiro Ichikawa, and Shigeru Chiba. 2019. Generating a fluent API with syntax checking from an LR grammar. Proc. ACM Program. Lang. 3, Article 134 (Oct. 2019), 24 pages. https://doi.org/10.1145/3360560

Footnote (numpy): https://numpy.org/

A ABBREVIATIONS, ACRONYMS, AND NOTATION

Acronyms

G&R: Gil and Roth [2019]
Pp p: plain parametric polymorphism, or, polyadic parametric polymorphism, (3.2) and Fig. 3.2
API: application programming interface
CFG: context free grammar
CFL: context free language
CSL: context sensitive language
DCFL: deterministic context free language
DEXP: deterministic exponential
DFSA: deterministic finite state automaton
FSA: finite state automaton
GHC: Glasgow Haskell Compiler
GNF: Greibach normal form (of CFG)
HM: Hindley-Milner (type system)
IDE: interactive development environment
LBA: linear bounded automaton
LMD: left-most derivation
LR: left-to-right, right-most derivation
MKS: meter-kilogram-second (system of physical units)
ML: the ML ("meta-language") programming language
PDA: pushdown automaton
PDTA: pushdown tree automaton
RDPDA: real-time deterministic pushdown automaton
REG: the set of regular languages
RTM: real-time Turing machine
SRDPDA: stateless real-time deterministic pushdown automaton
STLC: simply typed lambda calculus
TA: tree automaton, i.e., an automaton employing a tree store
UCFL: unambiguous context free language

List of Symbols

1. Latin Letters Like (upper case)
£: forest, or language of trees, £ ⊆ Σ△
𝐴: a finite control automaton
𝐴: a two-dimensional matrix
𝐵: a two-dimensional matrix
𝐶ᵢ: a characteristic of lattice 𝔗, see Table 3.1
𝐶₁: number of type arguments (characteristic of lattice 𝔗), see Table 3.1
𝐶₂: type pattern depth (characteristic of lattice 𝔗), see Table 3.1
𝐶₃: type pattern multiplicity (characteristic of lattice 𝔗), see Table 3.1
𝐶₄: arity of functions (characteristic of lattice 𝔗), see Table 3.1
𝐶₅: type capturing (characteristic of lattice 𝔗), see Table 3.1
𝐶₆: overloading (characteristic of lattice 𝔗), see Table 3.1
𝐹: the set of accepting states in a finite control automaton
𝐺: context free grammar
𝐿: library of type definitions in a programming language
𝑀: a Turing machine
𝑃: program (abstract syntax start symbol), see Fig. 3.1
𝑄: the set of internal states of a finite control automaton
𝑅: set of derivation rules of CFG
𝑆₁: ⟨n-ary, deep⟩ (type system in 𝔗)
𝑆₂: ⟨n-ary, deep, non-linear⟩ (type system in 𝔗), see List. 3.1
𝑇: a type system in lattice 𝔗
𝑈: unary type encoding of 0, base of the 𝑈ₖ recursion
𝑈ₖ: unary type encoding of integer 𝑘 ∈ N, defined recursively
𝑋: unbounded set of variables disjoint to all alphabets
ℓ: formal language
N: the set of non-negative integers {0, 1, 2, …}
Z: set of signed integers, {…, −2, −1, 0, 1, 2, …}
𝔄: lattice of finite control automata, see Table 2.1
𝔄⊥: bottom of lattice 𝔄, see (2.1)
𝔗: lattice of parametrically polymorphic type systems, see Table 3.1

2. Latin Letters Like (lower case)
𝑎: an example letter in alphabet
𝑏: an example letter in alphabet
𝑐: an example letter in alphabet
𝑒: expression (abstract syntax category), see Fig. 3.1
ℎ: position of the read/write head on tape auxiliary storage
𝑚: a dimension of a matrix
𝑚: exponent of certain physical unit in a physical dimension such as kilogram/meter-squared
𝑛: length of word input to finite control automaton
𝑝ᵢ: value of the lattice property 𝑖
𝑞: a state of a finite control automaton
𝑞: the initial internal state of a finite control automaton
𝑟: number of physical units in a system of physical units such as MKS
𝑟: rank/number of children in a node of a tree in Γ△
𝑟(𝛾): rank of symbol 𝛾 drawn from a signature
𝑠: tree substitution {𝑥₁ → 𝜏₁, …, 𝑥ᵣ → 𝜏ᵣ}
𝑡: a tree in Γ△
𝑡: grounded type (abstract syntax category), see Fig. 3.1
𝑢: the word denoting the remainder of input to a language recognizer
𝑤: the input word to language recognizer
𝑥: type variable (abstract syntax category), see Fig. 3.2
𝑥: variable used in a term
𝑥ᵢ: a physical unit such as centimeter, second, gram, and coulomb
ℓ: formal language of strings, ℓ ⊆ Σ∗
♭: designated blank symbol occupying uninitialized cells of tape auxiliary storage
𝒆: multi-expression (abstract syntax category), see Fig. 3.3
𝒒: multi-state 𝑞₁, 𝑞₂, …, 𝑞ᵣ, 𝑟 determined by context
𝒕: multi-tree 𝑡₁, …, 𝑡ᵣ, 𝑟 determined by context
𝒙: multi-variable (abstract syntax category), see Fig. 3.2

3. Greek Letters Like (upper case)
Γ: alphabet of symbols used in auxiliary storage
Γ: set of variables of CFG
Γ△: set of all trees over signature Γ
Γ△: set of all terms over signature Γ
Δ: set of input-output items of consuming transition function 𝛿
Δ: set of primary function definitions (abstract syntax category), see Fig. 3.1
Ξ: set of auxiliary function definitions (abstract syntax category), see Fig. 3.4
Ξ: set of input-output items of 𝜀-transition function 𝜉
Σ: finite alphabet of symbols
Σ: set of terminals of CFG
Σ∗: set of all strings (words) over Σ, including the empty string
Φ: set of auxiliary function names, disjoint to Σ, see Fig. 3.4
𝚪: set of possible contents of auxiliary storage

4. Greek Letters Like (lower case)
𝛾: variable (non-terminal) of CFG
𝛿: a definition of primary function (abstract syntax category), see Fig. 3.1
𝛿: the consuming transition function of a finite control automaton
𝜄: an instantaneous description of a finite control automaton, see Def. 2.3
𝜄: initial instantaneous description of a finite control automaton
𝜉: auxiliary function definition (abstract syntax category), see Fig. 3.4
𝜉: the 𝜀-transition function of a finite control automaton
𝜌: derivation rule of CFG
𝜌: tree rewrite rule
𝜍: a terminal of a CFG, or the special symbol $
𝜎: a letter in alphabet Σ
𝜎: class name (abstract syntax category), see Fig. 3.1
𝜎: name of primary function (abstract syntax category), see Fig. 3.1
𝜎: terminal of CFG
𝜏: term in set Γ△
𝜏: type pattern, i.e., ungrounded type (abstract syntax category), see Fig. 3.2
𝜔: sentential form, i.e., a sequence of terminals and variables of a CFG, 𝜔 ∈ (Σ ∪ Γ)∗
𝜗: pseudo expression, an expression whose type is ungrounded (abstract syntax category), see Fig. 3.4
𝜙: auxiliary function name, drawn from set Φ (abstract syntax category), see Fig. 3.4
𝜸: a string of symbols drawn from alphabet Γ
𝜸: entire contents of auxiliary storage
𝜸: sequence of CFG variables, 𝜸 ∈ Γ∗
𝜸: initial contents of auxiliary storage
𝝉: multi-term, 𝜏₁, …, 𝜏ᵣ, 𝑟 determined by context
𝝉: multi-type pattern, i.e., multi ungrounded type (abstract syntax category), see Fig. 3.2
𝜺: degenerate tree, also denoting a leaf in any tree in Γ△
𝜺: designated stack symbol denoting the bottom of the stack
𝜺: start symbol of CFG
𝜺: the unit type (terminal of abstract syntax), see Fig. 3.1
𝝑: multi-pseudo expression (abstract syntax category), see Fig. 3.4
𝜀: the empty string
𝜀: the single value of the unit type (terminal of abstract syntax), see Fig. 3.1

5. Other

Depth(𝑡) depth of tree 𝑡 ∈ Γ△, Depth(𝜺) = 0, page 8
Depth(𝜌) depth of pattern 𝜌 ∈ Γ△, page 9
Depth(𝜏) depth of term 𝑡 ∈ Γ△, Depth(𝑥) = 0, page 9
Vars(𝜌) set of variables in rewrite 𝜌, page 8
Vars(𝜏) set of variables in term 𝜏, page 8
⊥ the error type (terminal of abstract syntax), see Fig. 3.1, page 11
Fling a fluent API generator contributed by G&R, page 3
Fluent intermediate language used in the implementation of TypelevelLR, page 2

TypelevelLR a fluent API generator due to Yamazaki, Nakamaru, Ichikawa and Chiba [2019], page 2

B FLUENT API: FROM PRACTICE TO THEORY

An application programming interface (API) provides the means to interact with an application via a computer program. For example, using a file system API we can open, read, and close files from within C code:

    open();  // Open file
    read();  // Read line
    read();  // Read another line
    close(); // Close file

An API is accompanied by a protocol of use, defining rules for good API practice. A protocol is usually conveyed in internal and external documentation, leaving its enforcement to the programmer. For instance, a typical file system API protocol disallows read() to be called before open(), and close() to be called twice in a row. Although breaking the protocol may result in erroneous run time behavior, it nonetheless yields coherent, runnable programs.

With object oriented programming (OOP), functions (methods) are defined within classes. To invoke a method, it must be sent as a message to an object of the defining class. Methods of an OO fluent API yield objects that accept other API methods:

Listing B.1

Fluent file system API implemented in Java

    class ClosedFile {
      OpenedFile open() {. . . }
    }
    class OpenedFile {
      OpenedFile read() {. . . }
      ClosedFile close() {. . . }
    }

In this OO file system API there are two classes, ClosedFile and OpenedFile. Every API call returns either an object of class ClosedFile or an object of class OpenedFile, and thus may immediately be followed by a successive API call:

Listing B.2

Chain of fluent API method calls

    closedFile.open().read().read().close();

This expression conducts multiple API calls: Invoking open on a ClosedFile object yields an OpenedFile object. Calling read on the OpenedFile yields itself, but a close invocation returns a ClosedFile.

The main advantage of fluent APIs is their ability to enforce a protocol at compile time: The object returned from API call 𝜎ᵢ() is missing method 𝑓(), if calling 𝑓 at that location (𝜎ᵢ₊₁ ← 𝑓) breaks the protocol. Consider, for instance, finishing the method chain of List. B.2 with a second close() call, thereby breaking the file system protocol, which forbids double closing: This call fails at compile time, raising a compilation error, as the first close call returns a ClosedFile object, defined in List. B.1, which lacks a close method.

Fluent APIs gained popularity due to their application to domain specific languages (DSLs). In contrast to general purpose programming languages, such as Java and C++, DSLs employ syntax and semantics designed for a specific component. Structured Query Language (SQL), for example, is a DSL for writing database queries. To make use of an application in a general software library, its DSL has to be replaced by an API. Making the API fluent is then ideal: it makes it possible to embed DSL programs in code as chains of method calls that preserve and enforce the original syntax of the DSL. Additional details on DSLs and fluent APIs may be found in [Gil and Roth 2019].

Footnote: Strictly speaking, we need only “object based” programming, which admits classes and objects, but no class inheritance.
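To make the compile-time enforcement concrete, the two classes of List. B.1 can be given bodies so that the chain of List. B.2 runs. The following is only a sketch under assumptions of our own: the `log` field, the constructors, and the class `FluentFileDemo` are hypothetical names introduced for the illustration and do not appear in the listings.

```java
import java.util.ArrayList;
import java.util.List;

// Runnable variant of the two-class fluent file API; method bodies are illustrative.
final class ClosedFile {
    final List<String> log;                       // hypothetical call log
    ClosedFile() { this(new ArrayList<>()); }
    ClosedFile(List<String> log) { this.log = log; }
    OpenedFile open() { log.add("open"); return new OpenedFile(log); }
}

final class OpenedFile {
    final List<String> log;
    OpenedFile(List<String> log) { this.log = log; }
    OpenedFile read() { log.add("read"); return this; }
    ClosedFile close() { log.add("close"); return new ClosedFile(log); }
}

final class FluentFileDemo {
    // Follows the protocol (open · read* · close)*, so the chain type checks.
    static List<String> run() {
        ClosedFile f = new ClosedFile().open().read().read().close();
        // f.close();  // rejected by the compiler: ClosedFile declares no close()
        return f.log;
    }
}
```

Uncommenting the second `close()` call turns the protocol violation into a compilation error rather than a run time failure, which is the behavior described above.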

A protocol or a DSL may be described by a formal language ℓ: Then, the fluent API problem is to compile ℓ into a fluent API that enforces the protocol. The fluent API problem is parameterized by the complexity of the input language, and the capabilities of the host type system. The file system protocol, for instance, is described by a regular expression,

    (open · read∗ · close)∗,

and therefore defines a regular language. Given a class of formal languages L, we seek a minimal set of type system features required to embed L languages.

As many programming languages and DSLs are not regular, practical interest lies with stronger language classes. A popular approach is to use parametric polymorphism, yet another common OOP feature (Java generics, C++ templates, etc.). A fixed number of polymorphic classes define an infinite number of types (A, A<A>, A<A<A>>, . . .): Intuitively, these types can be used to simulate an unbounded storage, required to accept non-regular languages.

Consider, for example, the following Java definitions:

Listing B.3

Fluent stack API implemented in Java using (monadic) polymorphism

    class Empty {
      Stack<Empty> push() {. . . }
      Empty empty() {. . . }
    }
    class Stack<T> {
      Stack<Stack<T>> push() {. . . }
      T pop() {. . . }
    }

With these definitions, an expression of the form

    𝑒 = new Empty().𝜎₁().𝜎₂(). . . . .𝜎ₙ().empty(),    (B.1)

where 𝜎ᵢ ∈ {push, pop}, type checks if and only if 𝜎₁𝜎₂···𝜎ₙ belongs to the Dyck language of balanced parentheses with the homomorphism

    ℎ(𝜎) = push  if 𝜎 = ‘(’,
    ℎ(𝜎) = pop   if 𝜎 = ‘)’.

A pop from an empty stack (conversely, an unbalanced parenthesis) is signaled by a type error generated at compile time, e.g., in

    new Empty().push().pop().pop().empty();

the second call to pop() triggers a compile time error, to say that type Empty does not feature this method.

With the fluent API problem trivial for regular languages (a finite state machine can be encoded using simple OO classes; a Java fluent API generator for regular languages is available at https://github.com/verhas/fluflu), recent studies [Gil and Levy 2016; Gil and Roth 2019; Nakamaru and Chiba 2020; Nakamaru et al. 2017; Xu 2010; Yamazaki et al. 2019] introduced various methods for composing fluent APIs of more complex languages. Two promising results are those of G&R and Yamazaki et al. [2019]: Released at roughly the same time, both papers showed that any deterministic context free language (including the Dyck language) can be compiled into a fluent API.
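The stack encoding of List. B.3 can likewise be made runnable. The sketch below is an illustration under our own assumptions: the `below` and `depth` fields, the `depth()` helper, and the class `DyckDemo` are hypothetical additions that only make the type-level stack observable at run time.

```java
// Runnable variant of the monadic stack API; fields and depth() are hypothetical helpers.
final class Empty {
    Stack<Empty> push() { return new Stack<>(this, 1); }
    Empty empty() { return this; }                // only available on an empty stack
}

final class Stack<T> {
    private final T below;                        // the stack underneath this frame
    private final int depth;                      // run time mirror of the nesting of the type
    Stack(T below, int depth) { this.below = below; this.depth = depth; }
    Stack<Stack<T>> push() { return new Stack<>(this, depth + 1); }
    T pop() { return below; }
    int depth() { return depth; }
}

final class DyckDemo {
    // Encodes the balanced word "(()())": every pop has a matching push.
    static Empty balanced() {
        return new Empty().push().push().pop().push().pop().pop().empty();
    }

    // Three pushes and one pop leave a Stack<Stack<Empty>>, i.e., nesting depth 2.
    static int nested() {
        return new Empty().push().push().push().pop().depth();
    }

    // new Empty().push().pop().pop();  // rejected by the compiler: Empty declares no pop()
}
```

The static type of each sub-expression carries the whole stack, so the compiler plays the role of the automaton: an unbalanced chain has no applicable method at the offending call.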

C PROOFS

C.1 Proof of Thm. 4.2

Recall that 𝑎ⁿ𝑏ⁿ𝑐ⁿ ∈ CSL, and that DCFL ⊂ CSL. We show that 𝑎ⁿ𝑏ⁿ𝑐ⁿ ∈ deep-Ppp. The details are in List. C.1, which employs Java syntax to show a set of definitions that recognizes the language 𝑎ⁿ𝑏ⁿ𝑐ⁿ.

Listing C.1

Definitions in type system deep-Ppp (using Java syntax) for the language 𝑎ⁿ𝑏ⁿ𝑐ⁿ

    interface 𝛾 // Type after reading 𝑎ᵏ is 𝛾₁<𝑢ₖ, 𝑢ₖ>
    interface 𝛾 // Type after reading 𝑎ⁿ𝑏ᵏ is 𝛾₂<𝑢ₙ₋ₖ, 𝑢ₙ>
    interface 𝛾 // Type after reading 𝑎ⁿ𝑏ⁿ𝑐ᵏ is 𝛾₃<𝑢ₙ₋ₖ>
    static 𝛾 // chain start
    static 𝛾 𝛾 // Increment both arguments
    static 𝛾 𝛾 // 𝑏 after 𝑎ⁿ; decrement first argument
    static 𝛾 𝛾 // 𝑏 after 𝑎ⁿ𝑏ᵏ, 𝑘 > 0; decrement first argument
    static 𝛾 𝛾 // 𝑐 after 𝑎ⁿ𝑏ⁿ; decrement second argument
    static 𝛾 𝛾 // 𝑐 after 𝑎ⁿ𝑏ⁿ𝑐ᵏ, 𝑘 > 0; decrement argument
    static void end( 𝛾 // Accept after 𝑎ⁿ𝑏ⁿ𝑐ᵏ, 𝑘 = 𝑛
    static { // Test definitions in static initializer
      end(c(c(c(b(b(b(a(a(a(begin())))))))))); // Expression 𝑒 = 𝑒(𝑎³𝑏³𝑐³) type-checks
      end(c(c(c(b(b(a(a(a(begin()))))))))); // Expression 𝑒 = 𝑒(𝑎³𝑏²𝑐³) does not type-check
    }

The three generic types 𝛾₁, 𝛾₂, and 𝛾₃ correspond to the three phases of the recognition, in which 𝑎, 𝑏, and 𝑐 (calls to functions a(), b() and c()) are encountered in the input string:
(1) the type of expression a(···a(begin())···) (𝑘 occurrences of a) is 𝛾₁<𝑢ₖ, 𝑢ₖ>, where 𝑢ₖ is the type encoding of 𝑘;
(2) the type of b(···b(a(···a(begin())···))···) (𝑛 occurrences of a, 𝑘 of b) is 𝛾₂<𝑢ₙ₋ₖ, 𝑢ₙ>; and
(3) the type of expression c(···c(b(···b(a(···a(begin())···))···))···) (𝑛 occurrences of a and b; 𝑘 occurrences of c) is 𝛾₃<𝑢ₙ₋ₖ>.

For example, observe the (overloaded) definition of function b(·) in the listing that is commented “𝑏 after 𝑎ⁿ”. This version of b(·), intended for expressions of the form b(a(···a(begin())···)), converts 𝛾₁<𝑢ₙ, 𝑢ₙ>, the type of its argument, to 𝛾₂<𝑢ₙ₋₁, 𝑢ₙ>.

Consider the general case expression

    end(c(···c(b(···b(a(···a(begin())···))···))···))

Starting at the innermost invocation, begin(), whose type is 𝛾₁<𝑢₀, 𝑢₀>, and tracing, bottom up, the types of the successive nested expressions, we see that:
• First, a count of the 𝑎’s is recorded in both arguments of generic 𝛾₁. This count is incremented with each call to a().
• Once the first 𝑏 is seen, these arguments are passed to generic 𝛾₂. The first argument of 𝛾₂ is decremented for each 𝑏 encountered. The second argument, however, remains unchanged during these encounters.
• This second argument is then passed to generic 𝛾₃ when the first 𝑐 is encountered. It is then decremented for each 𝑐 encountered.
• Function end type-checks only if this argument is 𝑢₀.

C.2 Proof of Thm. 4.3

The Java definitions in List. C.2 realize the language 𝑎ⁿ𝑏ⁿ𝑐ⁿ ∈ CSL.

Listing C.2

Definitions in type system non-linear-Ppp (using Java syntax) for the language 𝑎ⁿ𝑏ⁿ𝑐ⁿ

    interface 𝛾 // Type after reading 𝑎ᵏ is 𝛾₁<𝑢ₖ, 𝑢₀, 𝑢₀>
      𝛾 // No phase change: increment the first type argument
      𝛾 // First 𝑏 seen: change phase, and increment second argument
    }
    interface 𝛾 // Type after reading 𝑎ⁿ𝑏ᵏ is 𝛾₂<𝑢ₙ, 𝑢ₖ, 𝑢₀>
      𝛾 // No phase change: increment the second type argument
      𝛾 // First 𝑐 seen: change phase, and increment third argument
    }
    interface 𝛾 // Type after reading 𝑎ⁿ𝑏ᵐ𝑐ᵏ is 𝛾₃<𝑢ₙ, 𝑢ₘ, 𝑢ₖ>
      𝛾 // No phase change: increment the third type argument
    }
    static 𝛾 // Start with type 𝛾₁<𝑢₀, 𝑢₀, 𝑢₀>
    static void end( 𝛾 // Accept only on type 𝛾₃<𝑢ₙ, 𝑢ₙ, 𝑢ₙ> for some 𝑛
    static { // Test definitions in static initializer
      end(begin().a().a().a().b().b().b().c().c().c()); // Expression 𝑒 = 𝑒(𝑎³𝑏³𝑐³) type-checks
      end(begin().a().a().a().b().b().c().c().c()); // Expression 𝑒 = 𝑒(𝑎³𝑏²𝑐³) does not type-check
    }

The fluent API records the number of 𝑎’s, 𝑏’s and 𝑐’s in three unary integer encodings. The recording is in generic types 𝛾₁, 𝛾₂, and 𝛾₃: type 𝛾₁ is used while 𝑎’s are encountered, type 𝛾₂ while 𝑏’s occur, and type 𝛾₃ while 𝑐’s show. When the entire input is read, the three counters are compared by function end(). This function relies on non-linearity to check that they are indeed equal.

C.3 Proof of Thm. 5.3

Given is a Fluent program 𝑃 = ΔΞ𝑒. We construct from the definitions Δ and Ξ a deep-DPDA automaton 𝐴. Let 𝑒 = 𝜀.𝜎₁.···.𝜎ₙ. Then, 𝐴 accepts 𝑤 = 𝜎₁···𝜎ₙ if and only if 𝑃 is type-correct.

The construction maintains the invariant that after 𝐴 consumes 𝜎ᵢ and conducts all (if any) subsequent 𝜀-transitions, its stack contents encode 𝑡ᵢ, the type of the partial expression 𝜀.𝜎₁.···.𝜎ᵢ. Concretely, since Fluent is a monadic type system, 𝑡ᵢ must be in the (full) form 𝛾₁(𝛾₂(···𝛾ₖ(𝜺)···)). The stack encoding of 𝑡ᵢ is 𝛾₁𝛾₂···𝛾ₖ𝜺, i.e., the monadic abbreviation of the full form augmented with a designated symbol 𝜺 denoting the stack’s bottom. For this reason, the set of stack symbols of 𝐴 includes a symbol 𝛾 for every type name used in Δ ∪ Ξ, and the extra symbol 𝜺.

The set of internal states of 𝐴 includes an initial and accepting state 𝑞₀. The automaton is in state 𝑞₀ initially, and then whenever it has exhausted all possible 𝜀-transitions after consuming a letter and is ready to consume the next input symbol. Also, 𝐴 has an internal (non-accepting) state 𝑞ᵩ for every auxiliary function name 𝜑 used in Ξ. These states are used while executing 𝜀-transitions, which emulate the resolution of the rudimentary typeof clauses allowed in Fluent.

As in the proof of Thm. 5.2, the rudimentary-typeof property of the type system makes it possible to classify any function definition in Δ ∪ Ξ as either direct, if its type signature is 𝜏 → 𝜏′, or as forwarding, in case it is 𝜏 → typeof 𝜏′.𝜑.

Every Fluent function is encoded in one (consuming or 𝜀-) transition of 𝐴. In this encoding, the function type signature uniquely determines the stack rewrite rule 𝜌, but unlike in the proof of Thm. 5.2, 𝜌 is not identical to the type signature.

To see why, recall first that since Fluent is monadic, we can write any term 𝜏 as 𝜸𝑥 where 𝜸 ∈ Γ∗ (in case 𝜏 is a proper term) or as 𝜸 (in case it is grounded). If a function’s type is 𝜸𝑥 → 𝜸′, then to maintain the invariant, 𝐴 needs to push the string 𝜸′𝜺 to the stack after emptying it, by popping first the fixed portion 𝜸, and then the variable portion 𝑥, which may be of unbounded length. Alas, this 𝑥 portion cannot be cleared with the single stack rewrite allowed in the single transition encoding a Fluent function.

For this reason, we use instead the stack rewrite 𝜌 = 𝜸𝑥 → 𝜸′𝜺𝑥 in this case, i.e., we emulate stack emptying by pushing another copy of 𝜺, the bottom-of-stack symbol. Automaton 𝐴 is oblivious to the trick, since none of the rewrites in its transitions removes an 𝜺 symbol off the stack.

With the definition of 𝜌(𝜏 → 𝜏′) by

    𝜌(𝜏 → 𝜏′) = 𝜸𝑥 → 𝜸′𝑥     if 𝜏 = 𝜸𝑥 and 𝜏′ = 𝜸′𝑥,
    𝜌(𝜏 → 𝜏′) = 𝜸𝜺 → 𝜸′𝜺     if 𝜏 = 𝜸 and 𝜏′ = 𝜸′,
    𝜌(𝜏 → 𝜏′) = 𝜸𝑥 → 𝜸′𝜺𝑥    if 𝜏 = 𝜸𝑥 and 𝜏′ = 𝜸′,    (C.1)

we can describe the transition encoding of each of the four kinds of functions that may occur in 𝑃.
(1) Primary function definitions, found in Δ, are encoded as consuming transitions of 𝐴:
    (a) Direct definition 𝜎 : 𝜏 → 𝜏′ as ⟨𝜎, 𝑞₀, 𝜌(𝜏 → 𝜏′), 𝑞₀⟩,
    (b) Forwarding definition 𝜎 : 𝜏 → typeof 𝜏′.𝜑 as ⟨𝜎, 𝑞₀, 𝜌(𝜏 → 𝜏′), 𝑞ᵩ⟩.
(2) Auxiliary function definitions, found in Ξ, are encoded as 𝜀-transitions of 𝐴:
    (a) Direct definition 𝜑 : 𝜏 → 𝜏′ as ⟨𝑞ᵩ, 𝜌(𝜏 → 𝜏′), 𝑞₀⟩,
    (b) Forwarding definition 𝜑 : 𝜏 → typeof 𝜏′.𝜑′ as ⟨𝑞ᵩ, 𝜌(𝜏 → 𝜏′), 𝑞ᵩ′⟩.

We can now verify that automaton 𝐴 iteratively computes the type of the word-encoded input expression: Consuming transitions correspond to type checking of primary function invocations, while 𝜀-transitions make the detour required to compute the type of functions defined by a typeof clause. If the input expression fails type checking, then automaton 𝐴 hangs (thereby rejecting the input), due to failure to find an appropriate transition for the current stack contents, internal state (and the current input symbol, when appropriate).
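The construction can be exercised on a small example. The following Java sketch is an illustration under stated assumptions, not the construction's reference implementation: the class `StackAutomatonSketch`, the string encoding of the stack (top at the left), the letter `Z` standing in for the bottom symbol 𝜺, and the extra `reset` function are all hypothetical. It builds the rewrites of (C.1) for the stack API of List. B.3, which has direct definitions only, and runs the resulting automaton; 𝜀-transitions are supported by the loop but not used by this example.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Objects;

final class StackAutomatonSketch {
    static final String BOTTOM = "Z"; // stands in for the bottom-of-stack symbol
    static final String Q0 = "q0";    // initial and accepting state

    // <input, state, lhs -> rhs, next>; a null input marks an epsilon-transition.
    static final class Transition {
        final String input, state, lhs, rhs, next;
        Transition(String input, String state, String lhs, String rhs, String next) {
            this.input = input; this.state = state; this.lhs = lhs; this.rhs = rhs; this.next = next;
        }
    }

    // Stack rewrite of (C.1); varIn/varOut tell whether each side ends with the variable x.
    static String[] rho(String gamma, boolean varIn, String gammaPrime, boolean varOut) {
        if (varIn && varOut) return new String[] {gamma, gammaPrime};
        if (!varIn && !varOut) return new String[] {gamma + BOTTOM, gammaPrime + BOTTOM};
        if (varIn) return new String[] {gamma, gammaPrime + BOTTOM}; // push a fresh bottom
        throw new IllegalArgumentException("result uses a variable the argument lacks");
    }

    // Direct primary definition: consuming transition from q0 back to q0.
    static Transition direct(String sigma, String[] rewrite) {
        return new Transition(sigma, Q0, rewrite[0], rewrite[1], Q0);
    }

    static Transition find(List<Transition> delta, String input, String state, String stack) {
        for (Transition t : delta)
            if (Objects.equals(t.input, input) && t.state.equals(state) && stack.startsWith(t.lhs))
                return t;
        return null; // no applicable transition: the automaton hangs
    }

    static boolean accepts(List<Transition> delta, List<String> word) {
        String stack = BOTTOM, state = Q0;
        for (String sigma : word) {
            Transition t = find(delta, sigma, state, stack);
            while (true) {
                if (t == null) return false;            // hang, i.e., a type error
                stack = t.rhs + stack.substring(t.lhs.length());
                state = t.next;
                if (state.equals(Q0)) break;            // ready for the next letter
                t = find(delta, null, state, stack);    // follow epsilon-transitions
            }
        }
        return state.equals(Q0);
    }

    // S encodes Stack<.>; the empty prefix followed by the bottom encodes Empty.
    static List<Transition> stackApi() {
        List<Transition> d = new ArrayList<>();
        d.add(direct("push", rho("", false, "S", false)));  // Empty -> Stack<Empty>
        d.add(direct("push", rho("S", true, "SS", true)));  // Stack<x> -> Stack<Stack<x>>
        d.add(direct("pop", rho("S", true, "", true)));     // Stack<x> -> x
        d.add(direct("empty", rho("", false, "", false)));  // Empty -> Empty
        d.add(direct("reset", rho("S", true, "", false)));  // hypothetical: Stack<x> -> Empty
        return d;
    }
}
```

The hypothetical `reset` function exercises the third case of (C.1): instead of clearing the unbounded remainder of the stack, the transition pushes a fresh bottom symbol on top of it, after which the automaton behaves as if the stack were empty.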
C.4 Proof of Thm. 5.4

We present a set of full-typeof-Fluent definitions that encodes the language 𝑤 𝑤 ∈ CSL.

Listing C.3

C++, full-typeof-Fluent program recognizing the CSL 𝑤 𝑤

    struct E {};                       // Bottom type
    template<typename T> struct A {};  // Generic type, stands for 𝑎
    template<typename T> struct B {};  // Generic type, stands for 𝑏
    template<typename T> struct S {};  // Generic type, stands for the separator
    A<E> a() {}                        // Begin expression with 𝑎
    B<E> b() {}                        // Begin expression with 𝑏
    template<typename T> A<T> a(T) {}  // Accumulate 𝑎 to the expression
    template<typename T> B<T> b(T) {}  // Accumulate 𝑏 to the expression
    template<typename T> S<T> s(T) {}  // Accumulate the separator to the expression
    template<typename T> auto $(A<T>) { return match_a($(T())); } // Expression has ended; eventually match 𝑎
    template<typename T> auto $(B<T>) { return match_b($(T())); } // Eventually match 𝑏
    template<typename T> auto $(S<T>) { return reverse(T()); }    // Separator encountered, reverse the second 𝑤
    template<typename T> auto reverse(A<T>) { return append2end_a(reverse(T())); } // Append 𝑎 to the end, reverse the rest
    template<typename T> auto reverse(B<T>) { return append2end_b(reverse(T())); } // Append 𝑏 to the end, reverse the rest
    E reverse(E) {}                    // Done reversing
    template<typename T> auto append2end_a(A<T>) { return append2start_a(append2end_a(T())); } // Reattach 𝑎
    template<typename T> auto append2end_a(B<T>) { return append2start_b(append2end_a(T())); } // Reattach 𝑏
    A<E> append2end_a(E) {}            // Append 𝑎 to the end
    template<typename T> auto append2end_b(A<T>) { return append2start_a(append2end_b(T())); } // Reattach 𝑎
    template<typename T> auto append2end_b(B<T>) { return append2start_b(append2end_b(T())); } // Reattach 𝑏
    B<E> append2end_b(E) {}            // Append 𝑏 to the end
    template<typename T> A<T> append2start_a(T) {} // Append 𝑎 to the word
    template<typename T> B<T> append2start_b(T) {} // Append 𝑏 to the word
    template<typename T> T match_a(A<T>) {}        // Match 𝑎
    template<typename T> T match_b(B<T>) {}        // Match 𝑏
    int main() {
      E w1=$(a(b(a(a(s(a(b(a(a()))))))))); // Expression encoding 𝑤 = 𝑎𝑏𝑎𝑎 𝑎𝑏𝑎𝑎 type-checks
      E w2=$(a(b(a(a(s(a(b(b(a()))))))))); // Expression encoding 𝑤 = 𝑎𝑏𝑎𝑎 𝑎𝑏𝑏𝑎 does not type-check
      E w3=$(b(a(a(s(a(b(a(a())))))))); // Expression encoding 𝑤 = 𝑏𝑎𝑎 𝑎𝑏𝑎𝑎 does not type-check
    }

The definitions first accumulate the input into a monadic type, e.g., expression a(b(s(a(b())))) is typed as A<B<S<A<B<E>>>>>, where type E is the bottom type and the C++ identifiers s and S stand for the separator between the two copies of 𝑤. The accumulated expression is then passed to function $, which terminates all expressions.

Function $ first traverses the first 𝑤 of 𝑤 𝑤, while replacing types A and B with calls to match_a and match_b respectively. Upon reaching type S, which encodes the separator, $ encodes the second 𝑤 as a type, and reverses it; then functions match_a and match_b proceed to match the words in the correct order. For example, expression $(a(b(s(a(b()···) changes first into match_a(match_b($(s(a(b()···), and then into match_a(match_b(B<A<E>>)); next the match functions match 𝑏 and then 𝑎, and return the bottom type E, successfully terminating the typing process. If the word before the separator differs from the word after it, some match function finds no applicable overload, and the expression does not type-check.

The reversal is carried out by function reverse. Function reverse appends the current type A (resp. B) to the end of the type, recursively, using function append2end_a (append2end_b). Function append2end_a examines its argument A (B), replaces it with a call to append2start_a (append2start_b) and continues recursively into T; type A (B) is reattached after the process has ended. Function append2end_b is implemented in a similar way.

D SUPPLEMENTARY MATERIAL

D.1 Full Encoding of a Turing Machine in ⟨deep, rudimentary⟩∨Ppp

Thm. 5.1 above showed that any Turing machine can be encoded by a program in type system 𝑇 = ⟨deep, rudimentary⟩∨Ppp. The proof of the theorem used Fig. 5.1, depicting an example of such a machine. For the sake of completeness, List. D.1 here presents the full encoding in 𝑇 of the Turing machine of Fig. 5.1.

Listing D.1

C++program encoding the Turing machine of Fig. 5.1 template struct B {}; struct E {}; template struct a {}; template struct b {}; template struct O {}; template E q4(O, xR>) {} template E q4(O, xR>) {} template E q4(O, xR>) {} template typeof(q4(O, B>())) q0(O, B, xR>) {} template typeof(q4(O, B>())) q0(O, xR>) {} template typeof(q4(O, B>())) q0(O, B, xR>) {} template typeof(q4(O, B>())) q0(O, B, xR>) {} template typeof(q1(O, B, xR>())) q0(O, B>) {} template typeof(q1(O, B, E>())) q0(O, E>) {} template typeof(q1(O, a, xR>())) q0(O, a>) {} template typeof(q1(O, b, xR>())) q0(O, b>) {} template typeof(q2(O, B>())) q1(O, B, xR>) {} template typeof(q2(O, B>())) q1(O, xR>) {} template typeof(q2(O, B>())) q1(O, B, xR>) {} template typeof(q2(O, B>())) q1(O, B, xR>) {} template typeof(q1(O, B, xR>())) q1(O, B>) {} template typeof(q1(O, B, E>())) q1(O, E>) {} template typeof(q1(O, a, xR>())) q1(O, a>) {} template typeof(q1(O, b, xR>())) q1(O, b>) {} template typeof(q1(O, B, xR>())) q1(O, B>) {} template typeof(q1(O, B, E>())) q1(O, E>) {} template typeof(q1(O, a, xR>())) q1(O, a>) {} template typeof(q1(O, b, xR>())) q1(O, b>) {} template typeof(q3(O, B>())) q2(O, b, xR>) {} template typeof(q3(O, B>())) q2(O, xR>) {} template typeof(q3(O, B>())) q2(O, b, xR>) {} template typeof(q3(O, B>())) q2(O, b, xR>) {} template typeof(q0(O, B, xR>())) q3(O, B>) {} template typeof(q0(O, B, E>())) q3(O, E>) {} template typeof(q0(O, a, xR>())) q3(O, a>) {} template typeof(q0(O, b, xR>())) q3(O, b>) {} template typeof(q3(O, a>())) q3(O, a, xR>) {} template typeof(q3(O, a>())) q3(O, xR>) {} template typeof(q3(O, a>())) q3(O, a, xR>) {} template typeof(q3(O, a>())) q3(O, a, xR>) {} template typeof(q3(O, b>())) q3(O, b, xR>) {} template typeof(q3(O, b>())) q3(O, xR>) {} template typeof(q3(O, b>())) q3(O, b, xR>) {} template typeof(q3(O, b>())) q3(O, b, xR>) {} int main() { E w1=q0(O, a>>>>>>>()); // compiles, 𝑤 = 𝑎 𝑏 ∈ 𝑎 𝑛 𝑏 𝑛 E w2=q0(O, a>>>>>>>()); // does not compile, 𝑤 = 𝑎 𝑏𝑎𝑏 ∉ 𝑎 𝑛 𝑏 𝑛 E 
w3=q0(O, a>>>>>>()); // does not compile, 𝑤 = 𝑎 𝑏 ∉ 𝑎 𝑛 𝑏 𝑛 }

D.2 Fluent API for the Language of Palindromes

Here we demonstrate Thm. 6.1 and its proof, by constructing a fluent API library for palindromes in an Ada-like type system, i.e., a type system with the eventually-one-type style of overloading resolution.

Consider the formal language of even length palindromes over alphabet {𝑎, 𝑏}, as defined by the following context free grammar

    𝜺 → 𝑎𝜺𝑎
    𝜺 → 𝑏𝜺𝑏
    𝜺 → 𝜀.    (D.1)

It is well known that the language (D.1) is non-deterministic yet unambiguous. Rewriting its grammar in Greibach normal form gives

    𝜺 → 𝑎𝛾 → 𝑏𝛾 𝛾 → 𝑎𝛾 𝛾 → 𝑏𝛾 𝛾 → 𝑎𝛾 → 𝑎𝛾 𝛾 → 𝑏𝛾 𝛾 → 𝑏𝛾 → 𝑎𝛾 → 𝑏.    (D.2)

Applying the construction in the proof of Thm. 6.1 to the grammar (D.2) gives the program in List. D.2, which realizes a fluent API for (D.1).

Listing D.2

Definitions in type system ⟨monadic, eventually-one-type⟩ (using Java-like syntax) encoding the language of even length palindromes

    interface 𝜀 { 𝛾 𝛾 }
    interface 𝛾 𝛾 𝛾 𝛾 𝛾 T a(); // Java error, overloaded functions cannot differ only by return type
    }
    interface 𝛾 𝛾 𝛾 𝛾 𝛾 T b(); // Java error, overloaded functions cannot differ only by return type
    }
    interface 𝛾 T a(); }
    interface 𝛾 T b(); }
    interface $ { void $(); }
    new 𝜀().a().a().b().b().a().a().$();

Note that even though the program in the listing uses Java syntax, it would not provide the desired result if compiled by a Java compiler. The reason is that Java does not permit multiple types for sub-expressions.

Expression new 𝜀().a().a().b().b().a().a() in List. D.2 spells the word 𝑎𝑎𝑏𝑏𝑎𝑎. With this prefix, the center of the word (denoted by ‘·’), separating 𝑤 from 𝑤ᴿ, can be in three places: 𝑎𝑎𝑏·𝑏𝑎𝑎, in case 𝑤 = 𝑎𝑎𝑏; 𝑎𝑎𝑏𝑏𝑎·𝑎, in case 𝑤 = 𝑎𝑎𝑏𝑏𝑎; or 𝑎𝑎𝑏𝑏𝑎𝑎·, in case 𝑤 = 𝑎𝑎𝑏𝑏𝑎𝑎𝜎∗. These three possibilities correspond to three types deduced for the expression. Yet, when reaching method $(), the type checker resolves the ambiguity in favor of the first option, as only after reading 𝑤𝑤ᴿ is type $ with method $() returned. As there is exactly one way to type the entire expression, type checking is successful.

D.3 On the Complexity of Deep Polyadic Parametric Polymorphism

We take particular interest in type system deep-Ppp, since it models generic non-method functions. Also, this type system might be applicable to the software engineering applications mentioned in Sect. 7.

We don’t know the exact complexity class of deep-Ppp, but here are a few comments and observations that might be useful towards characterizing it.
(1) A tree automaton with 𝜀-transitions is even more potent than a two-pushdown automaton, which is equivalent to a Turing machine. This equivalence does not hold for the tree automaton in question, which is real-time.
(2) A direct comparison of our real-time (and hence linear time) tree automata to real-time (or linear time) Turing machines is not possible, since an elementary operation of tree automata may involve transformations of trees whose size may be exponential.
(3) We can still describe an emulation of the computation of a real-time Turing machine (RTM, see Table 2.1 above) by a deep tree automaton, by breaking the machine’s tape into two stacks, and storing these stacks as branches of the same tree: RTM ⊆ deep-TA. Let RTMₙ be an RTM equipped with 𝑛 ≥ 1 tapes. Rabin separated RTM₂ from RTM₁, showing RTM₁ ⊊ RTM₂. Subsequently, Bruda and Akl [1999] generalized Rabin’s result for any number of tapes, showing that RTMₙ ⊊ RTMₙ₊₁, for all 𝑛 ≥ 1. Extending the tree automaton emulation of RTMs to run concurrently on any (fixed) number of tapes, we obtain that the entire non-collapsing hierarchy of RTMs is contained in deep-TA, i.e., that RTMₙ ⊆ deep-TA for all 𝑛 ≥ 1.
(4) deep-TA ⊊ CSL.
(5) A hint to the complexity of class deep-Ppp may be found in the fact that it is closed under finite intersection and finite union. (The proof is by merging the respective tree automata, running their rewrites in tandem on two distinct branches of the same tree. The merged automaton recognizes the intersection if both branches accept; it recognizes the union if at least one of the branches accepts.)
(6) On the other hand, we claim that deep-TA is not closed under complement (equivalently, set difference): Consider (yet again) the language 𝑎ⁿ𝑏ⁿ𝑐ⁿ ∈ deep-TA. If there were an automaton that recognizes the complement of the language, it would have to accept the word 𝑎𝑏𝑐𝑎, but reject its prefix 𝑎𝑏𝑐. Alas, a stateless automaton such as ours can only reject by reaching a configuration where there are no further legal transitions, and hence cannot recover from the rejection of this prefix.

We are however able to show that stateful-