[PDF] Classifying Pattern and Feature Properties to Get a Θ(n) Checker and Reformulation for Sliding Time-Series Constraints

Abstract

Given, a sequence X of n variables, a time-series constraint ctr using the Sum aggregator, and a sliding time-series constraint enforcing the constraint ctr on each sliding window of X of m consecutive variables, we describe a Θ(n) time complexity checker, as well as a Θ(n) space complexity reformulation for such sliding constraint.

Full PDF

aa r X i v : . [ c s . F L ] D ec Classifying Pattern and Feature Propertiesto Get a Θ ( n ) Checker and Reformulationfor Sliding Time-Series Constraints

N. Beldiceanu , M. Carlsson , C.-G. Quimper and M. I. Restrepo TASC (LS2N-CNRS), IMT Atlantique, FR – 44307 Nantes, France RISE SICS, Sweden Laval University, Québec, Canada

Abstract.

Given, a sequence X of n variables, a time-series constraint ctr using the Sum aggregator, and a sliding time-series constraint en-forcing the constraint ctr on each sliding window of X of m consecutivevariables, we describe a Θ ( n ) time complexity checker, as well as a Θ ( n ) space complexity reformulation for such sliding constraint. While sequence constraints on sliding windows were introduced a long time agofor counting and for sum constraints, e.g. see among_seq in [4,16,8] and slid-ing_sum in [5,13], no sliding automaton constraint was yet introduced, evenif automaton constraints were known since 2004 [7,14]. More recently in thecontext of planning problems, constraints on streams were introduced in [11,12]for comparing pointwise two stream variables or for stating constraints adaptedfrom Linear Temporal Logic. However, in the context of a long sequence or of adata stream [1], imposing a constraint on a full sequence does not make muchsense, as we rather want to focus on sliding windows. Compositional time-seriesconstraints combining a regular expression σ , a feature f , and an aggregator g were introduced in [6,2]. We ﬁrst provide an example of sliding time seriesconstraint. Example 1.

Given a sequence X = 3 1 3 3 2 1 1 2 2 2 4 4 3 1 2 2 , we want to computethe sum of subsequences of X corresponding to increasing sequences, i.e. tomaximal occurrences of the pattern ‘ < ( < | =) ∗ < | < ’, in every window of size of X . Such windows are shown in the ﬁgure on the right by a dotted line,where each solid line-segment indicatesan increasing sequence. The number tothe left of each window is the sum ofthe elements of the window belongingto an increasing sequence located insidethe window. Beyond this example wewant a generic approach to deal with avariety of patterns and features. > < = > > = < = = < = > > < = ontributions and methodology Our contributions are threefold. – By pursuing the compositional style for deﬁning time-series constraints [6],we introduce sliding time-series constraints, assuming g is the Sum aggrega-tor. This allows one to deﬁne a fair variety of sliding constraints in a genericway, in fact constraints in the time-series catalogue [2]. – It provides a Θ ( n ) linear time complexity checker for such constraints, whichis crucial when extracting patterns from long sequences in the context ofmodel acquisition [15,18]. – It describes a Θ ( n ) linear space complexity reformulation, which allows amemory eﬃcient reformulation.To obtain our contributions we use the following methodology. – We come up with three simple equations allowing one to compute the contri-bution of a window [ i, j ] (with i ≤ j ) wrt the results (a) on the full sequence X = x x . . . x n , (b) on the preﬁx x x . . . x j which ends at position j , and(c) on the suﬃx x i x i +1 . . . x n which starts at position i . – We study both the properties of regular expressions and features: • We systematically categorise regular expressions by partitioning theirwords into a restricted set of classes, so that each regular expression canbe compactly represented by a ﬁnite set of classes. • We identify key pattern and feature properties.For each pair of word classes and feature properties, we prove that a givenequation holds or provide some counterexample. – Finally, we show how equations can be directly turned into checkers andreformulations.The categorisation of a regular expression and the identiﬁcation of the proper-ties of a pattern are done mechanically by checking that some derived regularlanguages are empty or not.Section 2 provides the necessary background on words and time-series con-straints. Section 3 introduces a small number of pattern properties, while Sec-tion 4 (i) deﬁnes the sliding time-series constraints we consider, (ii) classiﬁesregular expressions in relation to sliding windows, (iii) shows how to computethe contribution of a sliding window based on pattern and feature properties, andﬁnally (iv) presents a Θ ( n ) time complexity checker and a Θ ( n ) space complexityreformulation for such sliding time-series constraints. Word

Consider a ﬁnite alphabet Σ . A word w over Σ is a sequence of letters w w . . . w ℓ of the alphabet Σ , and its length ℓ is denoted by | w | . The emptyword is denoted by ǫ . The reverse of w is the word w ℓ w ℓ − . . . w denoted w r .The concatenation of two words is denoted by putting them side by side. A word v is a factor of a word x if there exists two words u and w such that x = uvw ;when u = ǫ , v is a preﬁx of x , when w = ǫ , v is a suﬃx of x . If v is not emptyand diﬀerent from x , then v is a proper factor of x .2 ime-series constraints We assume the reader is familiar with regular expres-sions and automata [10]. A time-series constraint g _ f _ σ ( r, X ) is a constraintwhich restricts an integer result variable r to be the result of some computationsover a sequence of integer variables X . The components of a time-series constraintwe reuse from [6] are a pattern σ , a feature f , and an aggregator g . A pattern σ is described by a regular expression over the alphabet Σ = { ‘ < ’ , ‘ = ’ , ‘ > ’ } whose language L σ does not contain the empty word, and by two non-negativeintegers b σ and a σ , where b σ + a σ is smaller than or equal to the length of thesmallest word of L σ . A feature and an aggregator are functions over integer se-quences as illustrated in Table 1. Note that all functions f and g introducedin Table 1 are commutative. Let S = s s . . . s n − be the signature of a time se-ries X , which is deﬁned by constraints: ( x i < x i +1 ⇔ s i = ‘ < ’ ) ∧ ( x i = x i +1 ⇔ s i = ‘ = ’ ) ∧ ( x i > x i +1 ⇔ s i = ‘ > ’ ) for all i ∈ [1 , n − . If a sub-signature s i s i +1 . . . s j − is a maximal word matching σ in the signature of X , then thesubsequence x i + b σ x i + b σ +1 . . . x j − a σ is called a σ -pattern wrt X , and the sub-sequence x i x i +1 . . . x j is called an extended σ -pattern wrt X . The non-negativeintegers b σ and a σ trim the left and right borders of an extended σ -pattern toobtain a σ -pattern from which a feature value is computed. f value one width j − i − b σ − a σ + 1 surf j − aσ P k = i + bσ x k max max k ∈ [ i + bσ,j − aσ ] x k min min k ∈ [ i + bσ,j − aσ ] x k g value Sum c P k =1 f k σ L σ b σ a σ r n o e s Inflexion < ( < | =) ∗ > | > ( > | =) ∗ < n n y n n BumpOnDecSeq >><>> n n n n n

DipOnIncSeq <<><< n n n n n

Dec > y y n y y Inc < y y n y y Steady = 0 0 y y n y y

DecTerrace > = + > y y n n n IncTerrace < = + < y y n n n Plain > = ∗ < y n y n n Plateau < = ∗ > y n y n n ProperPlain > = + < y n y n n ProperPlateau < = + > y n y n n Gorge ( > ( > | =) ∗ ) ∗ >< (( < | =) ∗ < ) ∗ y n y n n Summit ( < ( < | =) ∗ ) ∗ <> (( > | =) ∗ > ) ∗ y n y n n Peak < ( < | =) ∗ ( > | =) ∗ > y n y n n Valley > ( > | =) ∗ ( < | =) ∗ < y n y n n DecSeq > ( > | =) ∗ > | > y y n y n IncSeq < ( < | =) ∗ < | < y y n y n SteadySeq = + y y n y n StrictlyDecSeq > + y y n y n StrictlyIncSeq < + y y n y n Zigzag ( <> ) + < ( > | ǫ ) | ( >< ) + > ( < | ǫ ) 1 1 y n n n n Table 1: Consider a sequence x x . . . x n . (Top left) features f with their valuescomputed from an extended σ -pattern x i x i +1 . . . x j ; (Bottom left) aggregator g = Sum , its value computed from a sequence of feature values f , f , . . . , f c ;(Right) patterns σ = hL σ , b σ , a σ i grouped by the properties they share, wherecolumns r , n , o , e , s respectively indicate whether a pattern has a reverse in thecatalogue [3], the no-inflexion , the one-inflexion , the exclude-out-in , orthe single letter properties. 3n the following x i,j denotes the integer subsequence x i x i +1 . . . x j when i ≤ j and x i x i − . . . x j otherwise. The term f σ ( x i,j ) denotes the sum of the valuesof the feature f from every extended σ -pattern in subsequence x i,j , i.e. thecontribution of the sliding window [ i, j ] . We introduce a limited number of pattern properties that will be used to pa-rameterise our proofs: we will assume that some of these properties hold to provethat a given equation is valid for calculating the contribution of a sliding window. Deﬁnition 1.

The mirror of a regular language L over Σ = { ‘ < ’ , ‘ = ’ , ‘ > ’ } ,denoted by L mir , consists of the mirrors of all the words in L , where the mirrorof a word w , denoted by w mir , has the reverse order of its letters and has alloccurrences of the letter ‘<’ ﬂipped into ‘>’ and vice versa. Deﬁnition 2.

Two patterns σ = hL σ , b σ , a σ i and σ r = hL σ r , b σ r , a σ r i are the reverse of each other iﬀ w ∈ L σ ⇔ w mir ∈ L σ r , a σ = b σ r and b σ = a σ r . As shown by column r of the pattern part of Table 1, out of the patternsof the time-series catalogue [3] have a reverse pattern deﬁned inside [3]. Example 2 (reverse).

On the one hand, the

Plateau = h ‘ < = ∗ > ’ , , i patternis the reverse of itself since, (1) all letters except the ﬁrst and last letters ofa plateau correspond to the letter ‘=’, (2) the ﬁrst letter ‘<’ is the mirror ofthe last letter ‘>’, and (3) a Plateau = b Plateau = 1 . On the other hand, the

Inflexion = h ‘ < ( < | =) ∗ > | > ( > | =) ∗ < ’ , , i pattern is not the reverseof itself: the mirror of the word ‘<<>’ ∈ L Inflexion , i.e. the word ‘<>>’, is notan inﬂexion since it ends with two occurrences of ‘>’ rather than one.

Deﬁnition 3.

A pattern σ has the convexity property if for any word w = s s . . . s n − in L σ and for any pair of factors u = s c s c +1 . . . s d and v = s e s e +1 . . . s f of w (with c, d, e, f ∈ [1 , n − ) such that, both u and v are wordsin L σ , the word s min( c,e ) s min( c,e )+1 . . . s max( d,f ) is also in L σ .Example 3 ( convexity property). All patterns of the time series catalogue [3]have the convexity property, but the pattern whose language is denoted by L < = > = | < = | = > has not, since the word ‘<=>=’ in L < = > = | < = | = > contains afactor ‘<=>’ that is not in L < = > = | < = | = > , for which both the preﬁx ‘<=’ andthe suﬃx ‘=>’ belong to L < = > = | < = | = > . Deﬁnition 4.

A pattern σ has the no-inflexion property if any word in itslanguage L σ does not simultaneously contain the letters ‘<’ and ‘>’ . Through an abuse of language and for reasons of brevity we say “pattern propertyof σ ” rather than “property of the language L σ of the pattern σ ”. eﬁnition 5. A pattern σ has the one-inflexion property if any word inits language L σ contains either one, but not both occurrences of ‘<=*>’ and ‘>=*<’ . Deﬁnition 6.

A pattern σ has the single letter property if all words of L σ have a length of one. Deﬁnition 7.

A pattern σ has the exclude-out-in property if for any word s s . . . s n − in L σ and for any window [ i, j ] (with ≤ i ≤ j ≤ n and i > ∨ j ’. • Nine out of the reversible patterns of [3] have the one-inflexion prop-erty. The pattern Plain has the one-inflexion property because it containsan occurrence of ‘<=*>’, but not an occurrence of ‘>=*<’. • Eight out of the reversible patterns of [3] have the exclude-out-in property. For instance, the pattern IncSeq has the exclude-out-in prop-erty because any subword w of an increasing sequence such that w / ∈ L IncSeq cannot be the start or the end of an increasing sequence, since w is of theform ‘==*’, i.e. does not start or end with a ‘<’. • Dec , Inc and

Steady have the single letter property.

We introduce the sliding time-series constraint we consider.

Deﬁnition 8.

Given a feature f , a regular expression σ , an integer m > , twovariables low and up , and a sequence of variables X = x x . . . x n with n ≥ m ,the slide_sum_ f _ σ ( m, low , up , X ) constraint holds iﬀ low = min i ∈ [1 ,n − m +1] r i , (1) up = max i ∈ [1 ,n − m +1] r i , (2) with sum_ f _ σ ( r i , x i,i + m − ) , where r i is called the contribution of thetime-series constraint sum_ f _ σ in the window [ i, i + m − . Cond. (1), (resp. (2)), of Def. 8 enforces low (resp. up ) to be the minimum(resp. maximum) of the sum of the feature values of feature f wrt all maximaloccurrences of σ in each subsequence of m consecutive variables of sequence X . Example 5 (Continuation of Example (1)).

Given the pattern

IncSeq and thefeature surf , slide_sum_surf_incseq (10 , , , is sat-isﬁed because the sum of the surfaces of the increasing sequences in the diﬀerentsliding windows of size is between and as shown in Example 1.5 .1 Computing the Contribution in a Window In this section, we consider the patterns σ and σ r which are the reverseof each other, a feature f , an integer sequence x x . . . x n , and all windows x i x i +1 . . . x i + m − of size m (with i ∈ [1 , n − m + 1] ). We investigate how toevaluate directly from an equation the sum of the feature values of feature f ofall pattern occurrences located in a window [ i, j = i + m − , assuming all theelements of the right-hand side of an equation have been previously calculated intime proportional to n . As we have several features and several patterns, we usethree equations all derived from the same simple idea, for which we ﬁrst presentthe intuition. Then we deﬁne suﬃcient properties of features and patterns thatensure the validity of each of the three equations (4), (5) and (6). At the endof this section, Table 4 provides an overview of the validity of each of the threeequations according to properties of the patterns and features. Intuition

Assume we want to deal with the following simpliﬁed problem: givenan integer sequence x x . . . x n , compute for all subsequences of m consecutivepositions the sum t i,j = Σ k ∈ [ i,j ] x k (with j = i + m − ) of the correspondingelements in time O ( n ) . This can be done by ﬁrst computing the partial sums Σ c ∈ [1 ,k ] x c (with k ∈ [1 , n ] ), and Σ c ∈ [ k,n ] x c (with k ∈ [1 , n ] ) and by using theidentity t i,j = Σ k ∈ [1 ,j ] x k + Σ k ∈ [ i,n ] x k − Σ k ∈ [1 ,n ] x k . (3)Equations (4), (5) and (6) present three alternative ways to compute f σ ( x i,j ) inspired by Equation (3). f σ ( x i,j ) = f σ ( x ,j ) + f σ r ( x n,i ) − f σ ( x ,n ) (4) f σ ( x i,j ) = max (0 , f σ ( x ,j ) + f σ r ( x n,i ) − f σ ( x ,n )) (5)if no σ -pattern in x i,j then f σ ( x i,j ) = 0 else f σ ( x i,j ) = f σ ( x ,j ) + f σ r ( x n,i ) − f σ ( x ,n ) (6)Depending on the properties of the pattern σ and of the feature f , we inves-tigate the cases when Equations (4), (5) and (6) are valid. Example 6.

Consider the

DecSeq pattern of Table 1, the sequence w = 2 1 1 1 0 ,the window size m = 2 , i.e. the four sliding windows , , and . • Equation (4) provides the incorrect surf feature value for two of the foursliding windows, namely values , − , − and rather than the expectedvalues , , and . For the second window, shown in grey on the ﬁgure onthe right, this is because there is a non-empty gap (shown in red) betweenthe leftmost and rightmost decreasing sequences in w . Equation (5)gives the correct value since it cancels out the contribution of the gap. − • While Equations (4) and (5) give the incorrect min feature value for two ofthe four sliding windows, namely values , , and rather than values , , and , Equation (6) provides the correct values.6 ase condition illustration (1) u < i i jℓ u (2) ℓ > j i j ℓ u (3) i ≤ ℓ ≤ u ≤ j i jℓ u (4) ℓ < i ≤ u ≤ j ∧ p ( s i,u − ) ∈ L σ i jℓ uα (5) ℓ < i ≤ u < j ∧ p ( s i,u − ) / ∈ L σ i jℓ u (6) i ≤ ℓ ≤ j < u ∧ s ( s ℓ,j − ) ∈ L σ i jℓ uβ (7) i < ℓ ≤ j < u ∧ s ( s ℓ,j − ) / ∈ L σ i jℓ u (8) ( ℓ ≤ i ≤ j ≤ u ) ∧ ( ℓ = i ∨ j = u ) i jℓ u Table 2: Positioning an occurrence of a pattern wrt a window; within cases (4)and (6) the non-empty words p ( s i,u − ) and s ( s ℓ,j − ) are shown in light grey. Case Analysis

Consider a sequence x x . . . x n , a window [ i, j ] , and a maximaloccurrence of pattern o whose signature is s ℓ s ℓ +1 . . . s u − (with ≤ ℓ ≤ u ≤ n ).Table 2 provides eight cases summarising all the possible positioning of x ℓ,u wrt [ i, j ] , where p ( s i,u − ) (resp. s ( s ℓ,j − ) ) denotes the longest preﬁx s i s i +1 . . . s α − of s i s i +1 . . . s u − (resp. the longest suﬃx s β s β +1 . . . s j − of s ℓ s ℓ +1 . . . s j − ) in L σ ifsuch word exists, the empty word otherwise. For cases (1–7) of Table 2, columns f σ ( x ,j ) (resp. f σ r ( x n,i ) ) and f σ ( x ,n ) of Table 3 provide the feature valueof the σ -pattern occurrence o (resp. σ r -pattern occurrence o r ) wrt x x . . . x j (resp. x n x n − . . . x i ) and x x . . . x n ; the last three columns give the contribu-tion of o in the right-hand side of Equations (4), (5), (6). These contributionsagree with the positioning of x ℓ,u wrt [ i, j ] , except for the three grey cells, whichonly work for non-negative feature values. Case (8) of Table 2 corresponds to amaximal occurrence of pattern o whose signature starts before i and ends after j . To study Case (8), the next section classiﬁes a pattern wrt a window. case f σ ( x ,j ) f σ r ( x n,i ) f σ ( x ,n ) Eq. (4) Eq. (5) Eq. (6) (1) f σ ( x ℓ,u ) 0 f σ ( x ℓ,u ) 0 0 0 (2) f σ r ( x u,ℓ ) f σ ( x ℓ,u ) 0 0 0 (3) f σ ( x ℓ,u ) f σ r ( x u,ℓ ) f σ ( x ℓ,u ) f σ r ( x u,ℓ ) max(0 , f σ r ( x u,ℓ )) f σ r ( x u,ℓ ) (4) f σ ( x ℓ,u ) f σ r ( x α,i ) f σ ( x ℓ,u ) f σ r ( x α,i ) max(0 , f σ r ( x α,i )) f σ r ( x α,i ) (5) f σ ( x ℓ,u ) 0 f σ ( x ℓ,u ) 0 0 0 (6) f σ ( x β,j ) f σ r ( x u,ℓ ) f σ ( x ℓ,u ) f σ ( x β,j ) max(0 , f σ ( x β,j )) f σ ( x β,j ) (7) f σ r ( x u,ℓ ) f σ ( x ℓ,u ) 0 0 0 Table 3: [ columns 2 to 4 ] values of f σ ( x ,j ) , f σ r ( x n,i ) and f σ ( x ,n ) wrtcases (1 −

7) of Table 2; [ columns 5 to 7 ] contribution of an occurrence of σ in a window wrt the right-hand side of Equations (4), (5) and (6).7 Systematic Classiﬁcation of Patterns wrt WindowsDeﬁnition 9. [type of a word wrt a pattern]

Given a pattern σ , the type of aproper factor w = w w . . . w k of a word in L σ wrt σ is deﬁned by ﬁve mutuallyincompatible conditions: • out if ∄ c, d : 1 ≤ c ≤ d ≤ k ∧ w c w c +1 . . . w d ∈ L σ • fac if  ∃ c, d : 1 ≤ c ≤ d ≤ k ∧ w c w c +1 . . . w d ∈ L σ ∄ d : 1 ≤ d ≤ k ∧ w w . . . w d ∈ L σ ∄ c : 1 ≤ c ≤ k ∧ w c w c +1 . . . w k ∈ L σ • pre if ( ∃ d : 1 ≤ d ≤ k ∧ w w . . . w d ∈ L σ ∄ c : 1 ≤ c ≤ k ∧ w c w c +1 . . . w k ∈ L σ • suf if ( ∃ c : 1 ≤ c ≤ k ∧ w c w c +1 . . . w k ∈ L σ ∄ d : 1 ≤ d ≤ k ∧ w w . . . w d ∈ L σ • in if ( ∃ d : 1 ≤ d ≤ k ∧ w w . . . w d ∈ L σ ∃ c : 1 ≤ c ≤ k ∧ w c w c +1 . . . w k ∈ L σ In Deﬁnition 9, “ fac ”, “ pre ” and “ suf ” convey the idea of “factor”, “preﬁx”and “suﬃx”. Note that a word with the “ in ” type wrt a convex pattern σ is in L σ . The languages associated with the ﬁve mutually incompatible conditions ofDeﬁnition 9 are deﬁned as L out = Σ + \ ( Σ ∗ L σ Σ ∗ ) , L fac = Σ + L σ Σ + ∩ Σ ∗ \ ( L σ Σ + ) ∩ Σ ∗ \ ( Σ + L σ ) ∩ Σ ∗ \ L σ , L pre = L σ Σ + ∩ Σ ∗ \ ( Σ + L σ ) ∩ Σ ∗ \ L σ , L suf = Σ + L σ ∩ Σ ∗ \ ( L σ Σ + ) ∩ Σ ∗ \L σ , and L in = L σ Σ ∗ ∩ Σ ∗ L σ . Note that becauseof our hypothesis that L σ does not contain the empty word, the languages L out , L fac , L pre , L suf and L in do not contain the empty word. Deﬁnition 10. [type and signature of a word wrt one of its proper factors andwrt a pattern]

Given a pattern σ , consider a word w = w w . . . w k of L σ , andone of its proper factors v = w i w i +1 . . . w j . The type of w wrt v and σ is deﬁnedby h t , t , t i where t , t and t are respectively the type of words w w . . . w j , w i w i +1 . . . w j and w i w i +1 . . . w k wrt pattern σ as deﬁned by Deﬁnition 9. The signature of w wrt v and σ is deﬁned by h sig , sig , sig i , where sig c = if t c =out then else (with c ∈ [1 , ). Theorem 1. [map of feasible types wrt any pattern]

Of the possible typesof Deﬁnition 10, only triples shown in Figure 1 are feasible.Proof. For each triple of Figure 1, Appendix A provides a witness pattern whichgenerates such triple. We now prove that the missing triples cannot be obtainedfrom any pattern. ① h out , = out , - i (resp. h - , = out , out i ) is not feasible since an “ out ” in v = w w . . . w j (resp. w i w i +1 . . . w k ) would imply an “ out ” in any subsequenceof v , namely in w i w i +1 . . . w j , a contradiction. ② h fac , suf , - i (resp. h - , pre , fac i ) is not feasible since a “ suf ” (resp. “ pre ”) in w i w i +1 . . . w j would imply a “ suf ” (resp. “ pre ”) or an “ in ” in w w . . . w j (resp. w i w i +1 . . . w k ), a contradiction.8 h fac , in , - i (resp. h - , in , fac i ) is not feasible since a “ in ” in w i w i +1 . . . w j wouldimply a “ suf ” (resp. “ pre ”) or an “ in ” in w w . . . w j (resp. w i w i +1 . . . w k ), acontradiction. ④ h pre , suf , - i (resp. h - , pre , suf i ) is not feasible since a “ suf ” (resp. “ pre ”)in w i w i +1 . . . w j and a “ pre ” (resp. “ suf ”) in v = w w . . . w j (resp. v = w i w i +1 . . . w k ) would imply an “ in ” in v , a contradiction. ⑤ h pre , in , - i (resp. h - , in , suf i ) is not feasible since an “ in ” in w i w i +1 . . . w j anda “ pre ” (resp. “ suf ”) in v = w w . . . w j (resp. v = w i w i +1 . . . w k ) wouldimply an “ in ” in v , a contradiction. ⊓⊔ (A) (B) (C)(D) ooi oosoop oofioo soopoo fooooo ioi iosiop soipoisoppop sosposposiof ioffoifoisof sofpof poffopfop fosfos fof poffop soffosfoi iofiiiiii siiiip isiipiispipp ssispisssississisfppp ppippi fpissfsppsipspp sspsipsspfppiﬀﬃifsifspﬁpﬁ sﬁsﬁiﬁiﬁifpifp iﬁ sﬁpﬁ ifsifp sfspfspfs sfppfpﬀsﬃﬀp sﬀiﬀpﬀﬀf pﬀ iﬀ sﬀ ﬀpﬃﬀs sipssp sppsss pppssf fppsfpsﬀsfs ﬀppfp Fig. 1: Map of the feasible types where Parts (A), (B), (C) and (D) resp.correspond to triples with three “ out ”, two “ out ”, one “ out ” and no “ out ”, where o , i , p , s , f resp. are abbreviations for “ out ”, “ in ”, “ pre ”, “ suf ”, “ fac ”; there is anarc from a triple t to a triple t iﬀ (1) t and t have the same signature, (2) t and t diﬀer exactly from one position, (3) t is lexicographically less than t assuming i ≺ p , i ≺ s , p ≺ f and s ≺ f ; ellipses denote the types of the DecSeq pattern as described in Example 7.

Notation

In the context of Deﬁnition 10, when w w . . . w j is of type pre or in , i.e. it contains a maximal occurrence of a word x in L σ starting at index , ψ denotes the index of the last letter of x . Similarly, when w i w i +1 . . . w k is oftype suf or in , i.e. it contains a maximal occurrence of a word y in L σ ending t index k , λ denotes the index of the ﬁrst letter of y . The number of triples being important, in our case, we reduce the numberof cases to be considered in our proofs, by introducing Deﬁnition 11 which groupsa certain number of triples in the same class representing the weakest hypothesisassociated with the diﬀerent triples in this class. Consider the (ﬁnite) set S oftriples associated with the words of L σ wrt their proper factors. We partition theset S into subsets where all triples of the same subset have the same signature.Then, we generalise all the triples that belong to the same subset to a uniquerepresentative using the following deﬁnition. Deﬁnition 11. [generalising a set of triples]

Given a pattern σ , consider the setof triples S consisting of all types of the words of L σ wrt their proper factorsand wrt σ that have the same signature. Let S c (with c ∈ [1 , ) denote the setof all the c th components of the triples of S . The set S is represented by a single representative triple R S = h r , r , r i where r c , with c ∈ [1 , , is deﬁned by • S c = { out } ⇒ r c = out • fac ∈ S c ⇒ r c = fac • pre ∈ S c ∧ suf / ∈ S c ∧ fac / ∈ S c ⇒ r c = pre • suf ∈ S c ∧ pre / ∈ S c ∧ fac / ∈ S c ⇒ r c = suf • pre ∈ S c ∧ suf ∈ S c ∧ fac / ∈ S c ⇒ r c = ps • in ∈ S c ∧ pre / ∈ S c ∧ suf / ∈ S c ∧ fac / ∈ S c ⇒ r c = in Deﬁnition 12. [pattern class]

Given a pattern σ , the set of representative triplesof σ is called the class of σ .Example 7. The set S of possible types associated with the DecSeq pattern h ‘ > ( > | =) ∗ > | > ’ , , i is equal to the union of two subsets S = {h pre , fac , suf i , h pre , pre , in i , h in , suf , suf i , h in , in , in i} and S = {h pre , out , suf i} , where eachsubset corresponds to triples for which all “ out ” are located in the same positions,see the ﬁve ellipses in Parts (D) and (C) of Figure 1. Part (A) of Figure 2 gives foreach element of S and S a corresponding example of a word and a proper factor.The sets S and S are respectively represented by the triples h pre , fac , suf i and h pre , out , suf i as shown in Part (B) of Figure 2. Finally, Figure 3 providesthe representative triples for all reversible and convex patterns of Table 1, whichdo not have the single letter property. Finding all the representatives of a pattern

To generate all the represen-tatives of a pattern σ , (i) we ﬁrst generate all potential word types wrt σ , and (ii) we then use Deﬁnition 11. For each of the potential word type h t , t , t i with t i ∈ { out , fac , pre , suf , in } depicted in Figure 1, we describe a systematicmethod to check whether there exists or not a word w = w w . . . w k of L σ whosetype is h t , t , t i . For this purpose we deﬁne the language of h t , t , t i wrt to σ and check whether it is empty or not. Since we need the preﬁx of w associatedwith t to overlap the suﬃx of w associated with t , we ﬁrst introduce the notionof shuﬄe language . 10 A)(B)(C) * pre , fac , suf + ① λ ψ> = >> = >> = >> = >> = >> = > h pre , fac , suf i z }| {* pre , pre , in + ψλ ② > = >> = >> = > * in , suf , suf + ③ ψλ> = >> = >> = > * in , in , in + ④ ψλ>>>>>> h pre , out , suf i z }| {* pre , out , suf + ⑤ > = >> = >> = >  ① > (= | > ) ∗ s (= + > + ) + = + s = ∗ > + (= + > + ) ∗ ② ( > (= | > ) ∗ | ǫ ) s ( > + = + ) + s = ∗ > + (= + > + ) + ③ > (= | > ) ∗ s (= + > + ) + s > ∗ (= + > + ) ∗ ④ s > + (= + > + ) ∗ s = ∗ > + (= + > + ) ∗ | > (= | > ) ∗ s > + (= + > + ) ∗ s > ∗ (= + > + ) ∗ ⑤ > (= | > ) ∗ s = + s = ∗ > + (= + > + ) ∗ Fig. 2: (A) Set of possible types of words wrt their proper factors of the

DecSeq pattern with the corresponding examples of word w and proper factor (in grey)where ψ (resp. λ ) denotes the end (resp. start) of a maximal word in L DecSeq starting at the ﬁrst position (resp. ending at the last position) of w , (B) corre-sponding set of representative triples, and (C) languages of the types of words ① , ② , ③ , ④ , ⑤ as computed from Equation (7) of Theorem 2. Deﬁnition 13. [shuﬄe language]

Given a regular language L over an inputalphabet Σ , and a possibly new input letter s , i.e. a letter that does not necessarilybelong to Σ , the shuﬄe language of L wrt s , denoted shuﬄe ( L , s ) , is deﬁned byall words w over the alphabet Σ ∪ { s } such thati) w contains at least one occurrence of the letter s ,ii) if we remove one single occurrence of the letter s from w then the resultingword belongs to L . Theorem 2. [language of a word type]

Given a pattern σ and one of its potentialword types h t , t , t i , the language associated with h t , t , t i is deﬁned by \  shuﬄe ( shuﬄe ( L σ , s ) , s ) shuﬄe ( L t , s ) sΣ ∗ Σ ∗ s L t sΣ + | Σ + s L t sΣ ∗ Σ ∗ s shuﬄe ( L t , s )  (7) Proof.

The four sub-expressions on the right-hand side of (7) respectively corre-spond to a word of L σ to which two occurrences of s are inserted, and in threeways of decomposing it wrt its preﬁx, to its window, and to its suﬃx. The let-ter s is used to “synchronise” these decompositions, i.e. to enforce a non-emptyintersection between the preﬁx and the suﬃx. Since L t does not contain theempty word, the two occurrences of s delimit a non-empty window. ⊓⊔ Example 8 (Continuation of Example (7)).

Part (C) of Figure 2 gives the lan-guages of the types of words h pre , fac , suf i , h pre , pre , in i , h in , suf , suf i , h in , in , in i ,11nd h pre , out , suf i for the DecSeq pattern, as deﬁned by Theorem 2. Note thatall other triples lead to the empty language.Evaluating whether the language associated with a regular expression isempty or not (e.g. Expression (7)) is done by (i) converting all its operatorinstances (e.g. union, intersection, concatenation, Kleene star, shuﬄe, . . . ) todeterministic ﬁnite automata, by (ii) evaluating the corresponding sequenceof operations on ﬁnite automata, and by (iii) checking whether the resultingminimised automaton has at least one accepting state or not. Following thismethodology, Appendix C gives the corresponding programs which compute therepresentatives and the properties of a pattern. We now show how to generatea ﬁnite automaton for the shuﬄe operator that we previously introduce. – [shuﬄe] From the deterministic and minimised automaton A L associatedwith L , one can build the automaton A shuﬄe ( L ,s ) associated with the lan-guage shuﬄe ( L , s ) by ( i ) duplicating all states of A L and make themnon-initial, ( ii ) make all states of A L non-accepting, ( iii ) add a transitionlabelled by s from each state to its duplicated state. Establishing the properties of a pattern

We now describe how to system-atically ﬁnd the properties of a pattern σ . We use L ssss σ (resp. L ss σ ) as a shortcutfor shuﬄe ( shuﬄe ( shuﬄe ( shuﬄe ( L σ , s ) , s ) , s ) , s ) (resp. shuﬄe ( shuﬄe ( L σ , s ) , s ) ). • A pattern σ has the convexity property iﬀ T L ss σ Σ ∗ s L σ Σ ∗ L σ s Σ ∗ Σ ∗ s ( Σ + \ L σ ) s Σ ∗  S T L ssss σ Σ ∗ s shuﬄe ( L σ , s ) s Σ + s Σ ∗ Σ ∗ s Σ + s shuﬄe ( L σ , s ) s Σ ∗ Σ ∗ s shuﬄe ( shuﬄe ( Σ + \ L σ , s ) , s ) s Σ ∗  = ∅ (8) h pre , out , suf ih pre , fac , suf ih pre , out , out ih out , out , suf ih out , out , out ih in , out , in ih in , in , in ih in , out , out ih out , out , in i IncSeqDecSeqGorge , SummitPeak , Valley SteadySeqStrictlyIncSeqStrictlyDecSeqDecTerraceIncTerracePlain , PlateauProperPlainProperPlateauZigzag

Fig. 3: Pattern classes, where each class corresponds to a set of representativetriples (an arrow from a triple ① to a triple ② means that ① generalises ② )12 A pattern σ has the no-inflexion property iﬀ ( L σ ∩ Σ ∗ < Σ ∗ > Σ ∗ ) ∪ ( L σ ∩ Σ ∗ > Σ ∗ < Σ ∗ ) = ∅ (9) • A pattern σ has the one-inflexion property iﬀ L σ \ L ( < | =) ∗ < = ∗ > ( > | =) ∗ | ( > | =) ∗ > = ∗ < ( < | =) ∗ = ∅ (10) • A pattern σ has the exclude-out-in property iﬀ T L ssss σ Σ ∗ sΣ + s shuﬄe ( Σ + \ ( Σ ∗ L σ Σ ∗ ) , s ) s Σ ∗ Σ ∗ s shuﬄe ( L σ , s ) s Σ ∗ s Σ ∗ Σ ∗ s Σ + s Σ + s Σ ∗ s Σ ∗  S T L ssss σ Σ ∗ s shuﬄe ( Σ + \ ( Σ ∗ L σ Σ ∗ ) , s ) s Σ + s Σ ∗ Σ ∗ s Σ ∗ s shuﬄe ( L σ , s ) s Σ ∗ Σ ∗ s Σ ∗ s Σ + s Σ + s Σ ∗  = ∅ (11) • A pattern σ has the single letter property iﬀ L σ \ L < | = | > = ∅ (12)As the constructions used in (8), (9), (10), (11) and (12) are similar to theone used in Theorem 2, they are not detailed. Proof of Equations Based on Pattern and Feature Properties.

In thissection we study the properties of patterns and features that ensure the validityof Equations (4), (5) and (6). While pattern properties were already introducedin Sections 3 and 4.1, we ﬁrst present some feature properties. Second, we focuson the validity domain of Equation (4), and ﬁnally, based on these results, wederive the properties of the patterns and features for Equations (5) and (6).From now on we focus on commutative features, as well as reversible and convexpatterns, which do not have the single letter property.

Feature Properties

All deﬁnitions of this section, i.e. Deﬁnitions 14 to 19, aswell as all theorems of this section, i.e. Theorems 3 to 9, consider (i) a reversibleand convex pattern σ = hL σ , b σ , a σ i , (ii) a sequence of variables X = x x . . . x n , (iii) an extended σ -pattern occurrence in [1 , n ] given by o = h x ℓ x ℓ +1 . . . x u i with ≤ ℓ ≤ u ≤ n , and (iv) a commutative feature f applied to o .We ﬁrst present four feature properties that only depend on the feature f .We then introduce two additional feature properties that depend on both thefeature f and the pattern σ . Finally, Part (A) of Figure 4 summarises the featureproperties of each of the features deﬁned in the time-series catalogue [3]. Deﬁnition 14.

A feature f has the sum decomposition property if f σ ( x ℓ,u ) can be expressed as u − a σ P t = ℓ + b σ h ( x t ) , where h ( x t ) is a function. E.g., when f = width , h ( x t ) = 1 and the value returned by the application of f to the extended σ -pattern occurrence o is u − ℓ − b σ − a σ + 1 .13 eﬁnition 15. A feature f has the same value property if f σ ( x ℓ,u ) = f σ ( x i,j ) for all i, j ( ℓ ≤ i ≤ j ≤ u ) such that the sequence x i,j alone is an extended σ -pattern occurrence. E.g., when f = one and x i,j is an extended σ -pattern, f σ ( x ℓ,u ) = f σ ( x i,j ) = 1 . Deﬁnition 16.

A feature f has the single position property if f σ ( x ℓ,u ) canbe expressed as h ( x t ) with x t ∈ { x ℓ + b σ , x ℓ + b σ +1 , ..., x u − a σ } . E.g., when f = max , h ( x t ) = x t and f σ ( x ℓ,u ) is the maximum of the variablesin x ℓ + b σ ,u − a σ . Deﬁnition 17.

A feature f has the positive property if f σ ( x i,j ) ≥ ∀ i, j ,such that ≤ i ≤ j ≤ n . Deﬁnition 18.

A feature f and a pattern σ have the single position no-in-flexion property if (i) f has the single position property, (ii) σ has the no-inflexion property and either (iii.a) for all extended σ -pattern occurrences x p,q wrt x p,q (with ℓ ≤ p ≤ q ≤ u ) f σ ( x p,q ) = h ( x p + b σ ) , or (iii.b) for all extended σ -pattern occurrences x p,q wrt x p,q (with ℓ ≤ p ≤ q ≤ u ) f σ ( x p,q ) = h ( x q − a σ ) . E.g., the pair σ = DecSeq , f = min , has the single position no-inflexion property since f σ ( x ℓ,u ) = x q − a σ , where q is the end of the extended σ -patternoccurrence in x ℓ,u . Deﬁnition 19.

A feature f and a pattern σ have the single position in-flexion property if (i) f has the single position property, (ii) σ has the one-inflexion property and (iii) f σ ( x ℓ,u ) is computed from the position of theonly inﬂexion of σ . E.g., the pair σ = Gorge , f = min has the single position inflexion propertysince the value of f σ ( x ℓ,u ) corresponds to the only inﬂexion of the extended σ -pattern occurrence. But the pair σ = Gorge , f = max , does not have the single position inflexion property since f σ ( x ℓ,u ) corresponds to one of thetwo extremities of the gorge.The next two sections deﬁne suﬃcient conditions where (i) Equation (4)and (ii)

Equations (5) and (6) can be used to compute the value of f σ ( x i,j ) wrt a pattern σ , depending on the representatives of a pattern. Part (B) ofFigure 4 summarises all the theorems introduced in these two sections wrt therepresentatives of Figure 3. Remark 1.

Wlog, while doing the proof of such conditions we proceed as follows: – When the representatives h pre , fac , suf i and h in , in , in i are both present,only h pre , fac , suf i is considered, since for h in , in , in i λ = i and ψ = j is aspecial case of h pre , fac , suf i . – Similarly, when the representatives h pre , out , out i and h in , out , out i (resp. h out , out , suf i and h out , out , in i ) both intervene in a proof, only h pre , out , out i (resp. h out , out , suf i ) is considered, as ψ = j (resp. λ = i ).14 feature properties one same value , positive width sum decomposition , positive surf sum decomposition max single position min single position h pre , fac , suf ih in , in , in ih pre , out , out ih out , out , suf ih in , out , out ih out , out , in ih out , out , out ih pre , out , suf ih in , out , in i Theo. 3,4 (Eq. 4)Theo. 5,6 (Eq. 4)Theo. 7, 8 (Eq. 5)Theo. 9 (Eq. 6) (A) (B)

Fig. 4: (A) Properties of the features deﬁned in Table 1 and used in [3], (B) the-orems coverage for the diﬀerent representatives of Figure 3. – When h in , out , out i and h out , out , in i (resp. h pre , out , out i and h out , out , suf i ) both intervene in a proof, only h in , out , out i (resp. h pre , out , out i ) is considered, as the representative h out , out , in i (resp. h out , out , suf i ) is symmetric. Suﬃcient Conditions for the Validity of Equation (4)

Theorem 3.

Consider a pattern σ whose class has a non-empty intersectionwith the set of representatives S = {h pre , fac , suf i , h in , in , in i} . Equation (4) can be used to obtain f σ ( x i,j ) for a sequence x ,n wrt a window [ i, j ] whose typeis in S , assuming that feature f has the sum decomposition property.Proof. From Remark 1, we just consider h pre , fac , suf i . We ﬁrst establish prop-erties between the maximum words associated with pre , fac and suf . – From the convexity property, the signature of the words x ℓ,j , x i,j and x i,u respectively contain at most one maximum word in L σ . Because of pre , fac and suf , the signature of the words x ℓ,j , x i,j and x i,u respectively containat least one word in L σ . Consequently, the signatures of the words x ℓ,j , x i,j and x i,u contain one single maximum word in L σ , respectively denoted by w pre , w fac and w suf . – Because the words w pre and w fac must not end after position j , and fromthe convexity property, w pre and w fac end in the same position ψ . – Because the words w suf and w fac must not start before position i , and fromthe convexity property, w suf and w fac start at the same position λ . – Because the word w fac starts at position λ and ends at position ψ we havethat λ ≤ ψ .Since f has the sum decomposition property, by using the function h ofDeﬁnition 14, Equation (4) can be rewritten as:15 ℓ i λ ψ j u n | {z } fσ ( x ,j )= fσ ( xℓ,j ) fσr ( xn,i )= fσr ( xu,i ) z }| { f σ ( x i,j ) = ψ − a σ X t = ℓ + b σ h ( x t ) | {z } f σ ( x ,j ) + u − b σr X t = λ + a σr h ( x t ) | {z } f σr ( x n,i ) − u − a σ X t = ℓ + b σ h ( x t ) | {z } f σ ( x ,n ) (13)By using the fact that the pattern σ is reversible (i.e. a σ r = b σ and b σ r = a σ )in the second term of Equation (13), by expanding the terms f σ r ( x n,i ) and f σ ( x ,n ) we obtain: ψ − a σ X t = ℓ + b σ h ( x t ) | {z } f σ ( x ,j ) + ψ − a σ X t = λ + b σ h ( x t ) + u − a σ X t = ψ − a σ +1 h ( x t ) | {z } f σr ( x n,i ) − ψ − a σ X t = ℓ + b σ h ( x t ) − u − a σ X t = ψ − a σ +1 h ( x t ) | {z } f σ ( x ,n ) = ψ − a σ X t = λ + b σ h ( x t ) = f σ ( x i,j ) . Hence, Equation (4) holds. ⊓⊔ Theorem 4.

Theorem 7.

Consider a pattern σ whose class has a non-emptyintersection with the set of representatives S = {h pre , fac , suf i , h in , in , in i , h pre , out , out i , h out , out , suf i , h in , out , out i , h out , out , in i , h out , out , out i} . Equation (5) can be used to obtain f σ ( x i,j ) for a sequence x ,n wrt window [ i, j ] whose type is in S , assuming that feature f has the sumdecomposition and the positive properties. If, in addition to the set S , wealso have the representative h pre , out , suf i then Equation (5) can still be used,provided that pattern σ has the exclude-out-in property.Proof. Because of Remark 1 we only consider the representatives h pre , fac , suf i , h pre , out , out i and h out , out , out i . • [ h pre , fac , suf i ] Since, from Theorem 3, Equation (4) is valid for this rep-resentative when f has the sum decomposition property, and since f hasthe positive property, the right-hand side of Equation (5) is the maximumbetween zero and a positive value; therefore Equation (5) is also valid for h pre , fac , suf i . • [ h pre , out , out i ] Since f has the positive and sum decomposition prop-erties, f σ ( x ,n ) ≥ and f σ ( x ,j ) ≤ f σ ( x ,n ) . Due to the third component“ out ” of the representative f σ r ( x n,i ) = 0 . When Equation (4) is used, weobtain f σ ( x i,j ) ≤ ; but with Equation (5) we get f σ ( x i,j ) = 0 , which is truedue to the second component “ out ” of the representative. Hence (5) is valid. • [ h out , out , out i ] Since f has the positive property, f σ ( x ,n ) ≥ . Dueto the ﬁrst and third “ out ” components of the representative, f σ ( x ,j ) = f σ r ( x n,i ) = 0 . When Equation (4) is used, we obtain f σ ( x i,j ) ≤ ; but withEquation (5) we get f σ ( x i,j ) = 0 , which is true due to the second component“ out ” of the representative. Hence, Equation (5) is valid. • [ h pre , out , suf i ]– From the convexity property, the signature of the words x ℓ,j and x i,u respectively contain at most one maximum word in L σ . Because of pre and suf , the signature of the words x ℓ,j and x i,u respectively contain atleast one word in L σ . Consequently, the signature of the words x ℓ,j and x i,u contain one single maximum word in L σ , respectively denoted by w pre and w suf . 18 Because of the out of h pre , out , suf i , the signature of the word x i,j does not contain any subword that belongs to L σ . In addition, sincethe pattern σ has the exclude-out-in property we have that w pre ends before position i , and w suf starts after position j , i.e. w pre and w suf do not overlap. Consequently, since in addition f has the positive and the sum decomposition properties, f σ ( x ,n ) ≥ and f σ ( x ,j ) + f σ r ( x n,i ) ≤ f σ ( x ,n ) . When Equation (4) is used, we obtain f σ ( x i,j ) ≤ ;but with Equation (5) we get f σ ( x i,j ) = 0 , which is true due to the secondcomponent “ out ” of the representative. Hence, Equation (5) is valid. ⊓⊔ Theorem 8.

Consider a pattern σ whose class has a non-empty intersectionwith the set of representatives S = {h pre , fac , suf i , h in , in , in i , h pre , out , suf i , h in , out , in i , h pre , out , out i , h in , out , out i , h out , out , suf i , h out , out , in i , h out , out , out i} . Equation (6) can be used to obtain f σ ( x i,j ) for a sequence x ,n wrt a window [ i, j ] whose type is in S if (a) either both h pre , fac , suf i and h in , in , in i are not representatives of the pattern σ , (b) or if one of the followingconditions holds:i) f has the sum decomposition property.ii) f has the same value property.iii) the pair f, σ has the single position no-inflexion or the single posi-tion inflexion properties. roof. [CASE 1] Consider the representatives that have an extended σ -patternoccurrence in [ i, j ] . In this case the only two representatives are h pre , fac , suf i and h in , in , in i . Since Equation (4) is valid for these representatives when: i) f has the sum decomposition property (see Theorem 3), ii) f has the same value property (see Theorem 5), iii) the pair f, σ has the single position no-inflexion property (see Theo-rem 4) or the single position inflexion property (see Theorem 6),Equation (6) is also valid.[CASE 2] Consider the representatives that do not have an extended σ -patternoccurrence in [ i, j ] . When using Equation (6), because of the check “if no σ -patternin x i,j ” in Equation (6), the value of f σ ( x i,j ) is zero. Hence, Equation (6) is valid. ⊓⊔ Synthesis

The classiﬁcation induced by theorems 3 to 9 is presented in Table 4:for each pattern class corresponding to the same set of representative triples (seeFigure 3) we select one pattern (see the columns of Table 4, e.g.

Plain ) andprovide for each feature property (see the rows of Table 4, e.g. SV ) and for eachfeature/pattern property (see the cells of Table 4, e.g. SPN ) the theorem provingthat an Equation is valid under such properties. Note that any missing Equationis due to a counterexample given in Appendix B, and not to the fact that weare missing a theorem. Coloured grey cells indicate a non-existing time-seriesconstraint in the time-series catalogue [3].Equation (6) can be used to compute the value of f σ ( x i,j ) , for all reversibleand convex patterns without the single letter property from [3], except for Zigzag with the max and min features (see the cells marked with “none” inTable 4), as

Zigzag uses the representative triple h in , in , in i without having the single position inflexion or the single position no-inflexion properties. Since checking each window of m consecutive positions of a sequence of size n independently gives a time complexity of O ( m · n ) , we now introduce a theoremleading to an optimal time complexity. Theorem 10.

The time complexity of evaluating Equations (4) and (5) on asequence X = x x . . . x n for all sliding windows of size m is Θ ( n ) . Moreover, as-suming one can check in constant time whether a sliding window of the sequence X contains or not a σ -pattern, the time complexity of evaluating Equation (6)for all sliding windows of size m of sequence X is also Θ ( n ) .Proof. Evaluating (4), (5) and (6) for all sliding windows [ i, j ] (with i ∈ [1 , n − m +1] and j = i + m − ) requires evaluating f σ ( x ,i + m − ) , f σ ( x i,n ) and f σ ( x ,n ) . – First, note that within Equation (6) all the tests “if no σ -pattern in x i,j ” onthe diﬀerent sliding windows (with i ∈ [1 , n − m + 1] and j = i + m − ) canbe done in O ( n ) because of our assumption.20 p , o , s i h p , f , s i h o , o , o i h i , o , o i h o , o , i i h i , o , i i h i , i , i ih p , f , s i h p , o , o i h o , o , s i h o , o , o i h i , i , i ih p , f , s i h p , o , o i h o , o , s i h o , o , o i h i , o , o i h o , o , i i representative triples f e a t u r ep r o pe r t i e s S V S D S PP N,E O O O N,E pattern properties f \ σ SPNSPN SPO SPO SPO SPN

DecSeq Gorge Valley Plain Zigzag SteadySeqone width surf max min

Table 4: Indicates, for existing combinations of feature f and pattern σ fromthe time-series catalogue, which of the Equations (4), (5) and (6) are valid, aswell as the corresponding justifying theorem, where: ( i ) within a representativetriple we use as a shortcut the ﬁrst letter of each component; ( ii ) p , sp , sd and sv resp. indicate whether the feature f has the positive , the single position ,the sum decomposition or the same value property; ( iii ) n , e , and o resp.indicate whether the pattern σ has the no-inflexion , the exclude-out-in or the one-inflexion property; ( iv ) spn , spo resp. indicate whether the pair f, σ has the single position no-inflexion property or the single positioninflexion property. – Second, evaluating f σ ( x ,i + m − ) for all i ∈ [1 , n − m + 1] as well as f σ ( x ,n ) can be done in O ( n ) by using a register automaton [6] for sum_ f _ σ ( r,x x . . . x n ) , which exposes all its intermediate register values [9]. – Third, since the pattern σ is reversible and since the feature f is commu-tative, f σ ( x i,n ) = f σ r ( x n,i ) . Evaluating f σ r ( x n,i ) for all i ∈ [1 , n − m + 1] can also be done in O ( n ) by using a register automaton for sum_ f _ σ r ( r,x n x n − . . . x ) , which exposes all its intermediate register values.Therefore, the time complexity of evaluating Equations (4), (5) and (6) is O ( n ) .Since each variable of x ,n needs to be scanned at least once to identify patternoccurrences, this time complexity is optimum. ⊓⊔ Pattern Properties for Checking in Linear Time the Occurrence ofPattern in Sliding Windows

We now introduce some additional patternproperties to check in time O ( n ) whether or not the diﬀerent sliding windowsof size m of a sequence X = x x . . . x n contain a pattern occurrence. As theseproperties cover all reversible patterns of the time-series catalogue, one can alsouse Equation (6) for such patterns for the entries of Table 4 mentioning (6).21 eﬁnition 20. A pattern σ has the letter property wrt a letter e if e is aword in L σ , and if any word of L σ contains at least one occurrence of e , i.e. if L σ ∩ { e } 6 = ∅ and if L σ ∩ ( Σ \ e ) ∗ = ∅ . Deﬁnition 21.

A pattern σ has the suffix-unavoidable property wrt a letter e ∈ { ‘<’ , ‘=’ , ‘>’ } if all words in L σ contain at least one occurrence of e , and ifeach suﬃx starting with the letter e of any word of L σ belongs also to L σ , i.e. if L σ ∩ ( Σ \ e ) ∗ = ∅ and if shuﬄe ( L σ , s ) ∧ Σ ∗ s e Σ ∗ ∧ Σ ∗ s ( Σ ∗ \ L σ ) = ∅ . Deﬁnition 22.

A pattern σ has the incompressible property if all properfactors of any word in L σ do not belong to L σ , i.e. if Σ + L σ Σ ∗ ∩ L σ = ∅ and if Σ ∗ L σ Σ + ∩ L σ = ∅ . Deﬁnition 23.

A pattern σ has the factor property if for any word w in L σ allfactors of w , whose length is greater than or equal to the smallest length ω σ of aword in L σ , belong also to L σ , i.e. if shuﬄe ( shuﬄe ( L σ , s ) , s ) ∧ Σ ∗ sΣ ∗ Σ ω σ Σ ∗ sΣ ∗ ∧ Σ ∗ s ( Σ ∗ \ L σ ) sΣ ∗ = ∅ .Example 9 (pattern properties, continuation of Example 4). – Eight out of the reversible patterns of [3] have the letter property.For instance, the patterns Dec , DecSeq and

StrictlyDecSeq all have the letter property wrt { ‘>’ } since ( i ) the word ‘>’ is in L Dec , in L DecSeq andin L StrictlyDecSeq , and ( ii ) any word in L Dec , in L DecSeq or in L StrictlyDecSeq contains at least one occurrence of ‘>’. – out of the reversible patterns of [3] have the suffix-unavoidable property. For instance, the pattern Peak has the suffix-unavoidable prop-erty wrt the letter ‘<’, since ( i ) any occurrence of peak contains at least oneoccurrence of ‘<’, and since ( ii ) any suﬃx, starting with a ‘<’, of a word of L Peak is also a peak. – Six out of the reversible patterns of [3] have the incompressible prop-erty. The pattern DecTerrace has the incompressible property because, ifan occurrence of the letter ‘>’ is removed from any word in L DecTerrace , thecorresponding proper factor is not in L DecTerrace . – Seven out of the reversible patterns of [3] have the factor property. Forinstance, the pattern Zigzag has the factor property because any factorof length greater than or equal to ω Zigzag = 3 of a zigzag is also a zigzag.For each pattern property described in Deﬁnitions 20 to 23 we now show howto check in O ( n ) which sliding windows are empty or not. – Consider a pattern σ that has the letter property wrt a letter e . Firstcompute in one scan the number of occurrences nocc [ k ] of e in x ,k for all k ∈ [1 , n ] ; second, for each sliding window [ i, j ] , check in constant time that nocc [ i ] = nocc [ j ] . – Consider a pattern σ that has the suffix-unavoidable property wrt a let-ter e . First compute in one scan the number of occurrences nocc1 [ k ] of e in22 ,k for all k ∈ [1 , n ] ; second compute in one scan the number of maximal oc-currences nocc2 [ k ] of pattern σ in x ,k for all k ∈ [1 , n ] ; third, for each slidingwindow [ i, j ] , check in constant time that nocc1 [ i ] = nocc1 [ j ] ∨ nocc2 [ i ] = nocc2 [ j ] . – Consider a pattern σ that has the incompressible or the factor property.First compute for each k = 1 , , . . . , n the end end [ k ] of the next patternoccurrence (which will be set to n + 1 if no pattern occurrence ends after k , e.g. end [ n ] = n + 1 ). Second compute for each k = n, n − , . . . , thestart start [ k ] of the previous pattern occurrence (which will be set to if nopattern occurrence starts before k , e.g. start [1] = 0 ). Third, depending onwhether the pattern has the incompressible or the factor property, dothe following check in constant time for each sliding window [ i, j ] : • [ incompressible ] return end [ i ] > j ∨ start [ j ] < i • [ factor ] endi = end [ i ] , startj = start [ j ] if endi > n ∨ startj < then return trueif endi − i ≥ ω σ then i ′ = i else i ′ = endi if j − startj ≥ ω σ then j ′ = j else j ′ = startjendi ′ = end [ i ′ ] , startj ′ = start [ j ′ ] if endi ′ > n ∨ startj ′ < then return truereturn min( j ′ , endi ′ ) − max( i ′ , start [min( j ′ , endi ′ )]) < ω σ Computing the end (resp. start) of the next (resp. previous) pattern occur-rence is done by using a register automaton derived from the transducer [6]which recognises pattern occurrences. Figure 5 give the register automatonassociated with the

Plain and the

Zigzag patterns. In (A), the dotted tran-sition marks the end of a plain. In (B), the dashed (resp. dotted) transitionsindicate that we are inside a zigzag (resp. that a zigzag is ending). Dependingwhether we were in a zigzag or not we set end [ n − to n or to n + 1 . Example 10 (Running automata that compute the end of the next patternoccurrence).

Table 5 (resp. Table 6) shows an example of execution of theregister automaton given in Part (A) (resp. (B)) of Figure 5.

Rather than stating a time-series constraint on each window of size m , whichwould result in an O ( m · n ) space complexity, we now show how to reformulatethe slide_sum_ f _ σ ( m, low , up , x x . . . x n ) constraint as a conjunction ofconstraints with a space complexity of Θ ( n ) . This reformulation was ex-tended to the patterns of Table 1 for Equation (6) to reformulate condition“if no σ -pattern in x i,j ”, but is not described here for space reasons. Theorem 11.

For those time-series constraints for which Equations (4)or (5) holds, the constraint slide_sum_ f _ σ ( m, low , up , x x . . . x n ) canbe reformulated with a space complexity of Θ ( n ) . How to generate a transducer that recognises all maximal pattern occurrences wasdescribed in [17]. roof. For Equation (4), it can be reformulated as the conjunction  sum_ f _ σ ( r, x x . . . x n , −→ r −→ r . . . −→ r n ) ∧ sum_ f _ σ ( r, x n x n − . . . x , ←− r ←− r . . . ←− r n ) ∧∀ i ∈ [1 , n − m + 1] : r i,j = −→ r j + ←− r i − r (with j = i + m − ) ∧ low = min( r ,m r ,m +1 . . . r n − m +1 ,n ) ∧ up = max( r ,m r ,m +1 . . . r n − m +1 ,n ) (20)where ←− r i (resp. −→ r j ) is the exposed register value corresponding to the ﬁrst ar-gument of sum_ f _ σ ( −→ r j , x x . . . x j ) , (resp. sum_ f _ σ r ( ←− r i , x n x n − . . . x i ) ).For Equation (5), we replace in (20) the term r i,j = −→ r j + ←− r i − r by the term r i,j = max(0 , −→ r j + ←− r i − r ) . ⊓⊔ s r< = > > = < (A) transitions: ◦ ∈ { <, = , > } x k ◦ x k +1 end [ k ] = end [ k + 1] x k ◦ x k +1 end [ k ] = k + 1 s a b cd e f ss = >< > = < > = < > = <> = < > = < > = < (B) transitions: end [ n −

1] = n + 1 − in [ n − ◦ ∈ { <, = , > } x k ◦ x k +1 end [ k −

1] = end [ k ] , in [ k ] = 0 x k ◦ x k +1 end [ k −

1] = end [ k ] , in [ k ] = 1 x k ◦ x k +1 end [ k −

1] = k, in [ k ] = 0 Fig. 5: Register automata computing the end of the next pattern maximal oc-currence for (A) the

Plain and (B) the

Zigzag patterns x k s k < > < > = < > < < > < > > < > = < > < >x k +1 k + 1 11 10 9 8 7 6 5 4 3 2 1 end [ k ] 10 10 7 7 7 4 4 4 2 2 0 end [ k + 1] 10 7 7 7 4 4 4 2 2 0 0 (A2) Table 5: Running the register automaton of Figure 5 that computes the end ofthe next plain on (A1) the sequence x = 010100101201 and (A2) on its reverse24 k s k < > < > = < > < < >

1] 5 5 5 5 5 9 9 9 9 12 12 end [ k ] 5 5 5 5 9 9 9 9 12 12 12 in [ k ] 0 0 1 1 0 0 0 1 0 0 1 (B1) x k s k > < > > < > = < > < >x k +1 k

12 11 10 9 8 7 6 5 4 3 2 end [ k −

1] 9 9 9 9 6 6 6 1 1 1 1 end [ k ] 9 9 9 6 6 6 1 1 1 1 1 in [ k ] 0 0 1 0 0 1 0 0 0 1 1 (B2) Table 6: Running the register automaton of Figure 5 that computes the end ofthe next zigzag on (B1) the sequence x = 010100101201 and (B2) on its reverse Based on a detailed analysis of feature and pattern properties of time-series con-straints of the time-series catalogue that use the

Sum aggregator, we came upwith a Θ ( n ) time complexity checker, and a Θ ( n ) space complexity reformula-tion for such constraints. It is an open question how to generalise our results toother aggregators such as min or max . Unlike the sum aggregator, the equality g ( a, x ) = b where a, b are ﬁxed integers and x is a variable does not uniquelydetermine x when g ∈ { min , max } . Acknowledgment

We thank Pierre Flener for some feedback on an early versionof this paper, and Colin de la Higuera for discussions on regular expressions, on theproperties of their languages and on operators such as shuﬄe.

References

1. Alur, R., Fisman, D., Raghothaman, M.: Regular programming for quantitativeproperties of data streams. In: Thiemann, P. (ed.) Programming Languages andSystems - 25th European Symposium on Programming, ESOP 2016, Held as Partof the European Joint Conferences on Theory and Practice of Software, ETAPS2016, Eindhoven, The Netherlands, April 2-8, 2016, Proceedings. Lecture Notes inComputer Science, vol. 9632, pp. 15–40. Springer (2016)2. Arafailova, E., Beldiceanu, N., Douence, R., Carlsson, M., Flener, P., Rodríguez,M.A.F., Pearson, J., Simonis, H.: Global constraint catalog, volume ii, time-seriesconstraints. CoRR abs/1609.08925 (2016), http://arxiv.org/abs/1609.08925

3. Arafailova, E., Beldiceanu, N., Douence, R., Carlsson, M., Flener, P., Rodríguez,M.A.F., Pearson, J., Simonis, H.: Global constraint catalog, volume II, time-seriesconstraints. arXiv preprint arXiv:1609.08925 (2016)4. Beldiceanu, N., Contejean, E.: Introducing Global Constraints in CHIP. Mathl.Comput. Modelling 20(12), 97–123 (1994)5. Beldiceanu, N., Carlsson, M.: Revisiting the cardinality operator and introducingthe cardinality-path constraint family. In: Codognet, P. (ed.) ICLP 2001. LNCS,vol. 2237, pp. 59–73. Springer (2001) . Beldiceanu, N., Carlsson, M., Douence, R., Simonis, H.: Using ﬁnite transduc-ers for describing and synthesising structural time-series constraints. Constraints21(1), 22–40 (January 2016), journal fast track of CP 2015: summary on p. 723 ofLNCS 9255, Springer, 20157. Beldiceanu, N., Carlsson, M., Petit, T.: Deriving ﬁltering algorithms from con-straint checkers. In: Wallace, M. (ed.) CP 2004. LNCS, vol. 3258, pp. 107–122.Springer (2004)8. Bessière, C., Hebrard, E., Hnich, B., Kiziltan, Z., Walsh, T.: SLIDE: A usefulspecial case of the CARDPATH constraint. In: Ghallab, M., et al. (eds.) ECAI 2008.pp. 475–479. IOS Press (2008)9. Carlsson, M., al.: SICStus Prolog User’s Manual. RISE SICS AB, 4.5.1 edn. (April2019)10. Hopcroft, J.E., Motwani, R., Ullman, J.D.: Introduction to Automata Theory,Languages, and Computation. Addison-Wesley, 3rd edn. (2007)11. Lallouet, A., Law, Y.C., Lee, J.H., Siu, C.F.K.: Constraint programming on inﬁnitedata streams. In: Walsh, T. (ed.) IJCAI 2011, Proceedings of the 22nd InternationalJoint Conference on Artiﬁcial Intelligence, Barcelona, Catalonia, Spain, July 16-22,2011. pp. 597–604. IJCAI/AAAI (2011)12. Lee, J.C.H., Lee, J.H.M., Zhong, A.Z.: Augmenting stream constraint program-ming with eventuality conditions. In: Hooker, J.N. (ed.) Principles and Practice ofConstraint Programming - 24th International Conference, CP 2018, Lille, France,August 27-31, 2018, Proceedings. Lecture Notes in Computer Science, vol. 11008,pp. 242–258. Springer (2018)13. Maher, M.J., Narodytska, N., Quimper, C.G., Walsh, T.: Flow-Based Propagatorsfor the sequence and Related Global Constraints. In: Stuckey, P.J. (ed.) Principlesand Practice of Constraint Programming (CP’2008). LNCS, vol. 5202, pp. 159–174.Springer-Verlag (2008)14. Pesant, G.: A regular language membership constraint for ﬁnite sequences of vari-ables. In: Wallace, M. (ed.) CP 2004. LNCS, vol. 3258, pp. 482–495. Springer(2004)15. Picard-Cantin, É., Bouchard, M., Quimper, C., Sweeney, J.: Learning Parametersfor the Sequence Constraint from Solutions. In: Rueher, M. (ed.) Principles andPractice of Constraint Programming (CP’2016). LNCS, vol. 9892, pp. 405–420.Springer-Verlag (2016)16. Régin, J.C., Puget, J.F.: A Filtering Algorithm for Global Sequencing Constraints.In: Smolka, G. (ed.) Principles and Practice of Constraint Programming (CP’97).LNCS, vol. 1330, pp. 32–46. Springer-Verlag (1997)17. Rodríguez, M.A.F., Flener, P., Pearson, J.: Automatic generation of descriptionsof time-series constraints. In: 29th IEEE International Conference on Tools withArtiﬁcial Intelligence, ICTAI 2017, Boston, MA, USA, November 6-8, 2017. pp.102–109. IEEE Computer Society (2017)18. Vaandrager, F.: Model learning. Communications of the ACM 60(2), 86–95 (Febru-ary 2017) List of Feasible Types with Corresponding Witnesses

Counterexamples for Equations (4) , (5) and (6) For each time-series constraint of the time-series constraint catalogue this ap-pendix provides small time series corresponding to counterexamples of the va-lidity of Equations (4), (5) and (6) for all equations missing in Table 4. Forinstance, for nb_decreasing_sequence and Equation (4), we get the fol-lowing counterexample: consider the three windows of size wrt the sequence h , , , − i ; using Equation (4) returns h , , i rather than the expected values h , , i , i.e. on the second subsequence “ , ”, (4) returns − ratherthan the expected value ; Value reﬂects the fact that subsequence “ , ” doesnot contain any decreasing sequence. constraint (4) (5) (6) nb_decreasing_sequence , h , , , − i , h , , i , h , , i , h , , , − i , h , , i , h , , i - sum_width_decreasing_sequence , h , , , , − i , h , , , i , h , − , − , i - - sum_surf_decreasing_sequence , h− , − , − , − , − , i , h− , , , − , i , h− , , , − , i , h− , , − i , h , − i , h , i - sum_max_decreasing_sequence , h , − , − , − i , h , , − i , h , − , − i , h− , − , i , h− , i , h , i - sum_min_decreasing_sequence , h , − , − , − i , h− , , − i , h− , − , − i , h− , , − i , h , − i , h , i - nb_decreasing_terrace , h , , , − i , h , , i , h , − , i - - sum_width_decreasing_terrace , h , , , − i , h , , i , h , − , i - - sum_surf_decreasing_terrace , h , − , − , − i , h , , i , h , , i , h , − , − , − i , h , , i , h , , i - sum_height_decreasing_terrace , h , − , − , − i , h , , i , h , , i , h , − , − , − i , h , , i , h , , i - nb_gorge - - - sum_width_gorge , h , − , , i , h , , i , h , − , i - - sum_surf_gorge , h− , − , , i , h , , i , h , − , i , h , − , i , h− i , h i - sum_min_gorge - , h , − , i , h− i , h i - nb_increasing_sequence , h− , , , i , h , , i , h , , i , h− , , , i , h , , i , h , , i - sum_width_increasing_sequence , h− , , , , i , h , , , i , h , − , − , i - - sum_surf_increasing_sequence , h− , − , − , − , i , h− , , , i , h− , , , i , h− , − , i , h , − i , h , i - sum_max_increasing_sequence , h− , − , − , i , h− , , i , h− , − , i , h− , − , i , h− , i , h , i - sum_min_increasing_sequence , h− , − , − , i , h− , , − i , h− , − , − i , h− , − , i , h , − i , h , i - nb_increasing_terrace , h− , , , i , h , , i , h , − , i - - sum_width_increasing_terrace , h− , , , i , h , , i , h , − , i - - sum_surf_increasing_terrace , h− , − , − , i , h , , i , h , , i , h− , − , − , i , h , , i , h , , i - sum_height_increasing_terrace , h− , − , − , i , h , , i , h , , i , h− , − , − , i , h , , i , h , , i - nb_peak , h− , , , i , h , , i , h , − , i - - sum_width_peak , h− , , , i , h , , i , h , − , i - - sum_surf_peak , h− , − , , − i , h , , i , h , , i , h− , − , − , i , h− , i , h , i - sum_max_peak , h− , , , i , h , , i , h , − , i , h− , − , − , i , h− , i , h , i - onstraint (4) (5) (6) nb_plain , h , − , − , i , h , , i , h , − , i - - sum_width_plain , h , − , − , i , h , , i , h , − , i - - sum_surf_plain , h , − , − , i , h , , i , h , , i , h , − , i , h− i , h i - sum_height_plain , h , − , − , i , h , , i , h , , i , h , − , i , h− i , h i - nb_plateau , h− , , , i , h , , i , h , − , i - - sum_width_plateau , h− , , , i , h , , i , h , − , i - - sum_surf_plateau , h− , , , i , h , , i , h , − , i , h− , − , − , i , h− , i , h , i - sum_height_plateau , h− , , , i , h , , i , h , − , i , h− , − , − , i , h− , i , h , i - nb_proper_plain , h , − , − , i , h , , i , h , − , i - - sum_width_proper_plain , h , − , − , i , h , , i , h , − , i - - sum_surf_proper_plain , h , − , − , i , h , , i , h , , i , h , − , − , i , h , , i , h , , i - sum_height_proper_plain , h , − , − , i , h , , i , h , , i , h , − , − , i , h , , i , h , , i - nb_proper_plateau , h− , , , i , h , , i , h , − , i - - sum_width_proper_plateau , h− , , , i , h , , i , h , − , i - - sum_surf_proper_plateau , h− , , , i , h , , i , h , − , i , h− , − , − , − , i , h , , , i , h , , , i - sum_height_proper_plateau , h− , , , i , h , , i , h , − , i , h− , − , − , − , i , h , , , i , h , , , i - nb_steady_sequence - - - sum_width_steady_sequence - - - sum_surf_steady_sequence - , h− , − , i , h− , i , h , i - sum_height_steady_sequence - , h− , − , i , h− , i , h , i - nb_strictly_decreasing_sequence - - - sum_width_strictly_decreasing_sequence - - - sum_surf_strictly_decreasing_sequence - , h− , , − i , h , − i , h , i - sum_max_strictly_decreasing_sequence - , h− , − , i , h− , i , h , i - sum_min_strictly_decreasing_sequence - , h− , , − i , h , − i , h , i - nb_strictly_increasing_sequence - - - sum_width_strictly_increasing_sequence - - - sum_surf_strictly_increasing_sequence - , h− , − , i , h , − i , h , i - sum_max_strictly_increasing_sequence - , h− , − , i , h− , i , h , i - sum_min_strictly_increasing_sequence - , h− , − , i , h , − i , h , i - nb_summit - - - sum_width_summit , h− , , , i , h , , i , h , − , i - - sum_surf_summit , h− , − , , − i , h , , i , h , , i , h− , − , − , i , h− , i , h , i - sum_max_summit - , h− , − , − , i , h− , i , h , i - nb_valley , h , − , − , i , h , , i , h , − , i - - sum_width_valley , h , − , , i , h , , i , h , − , i - - sum_surf_valley , h− , − , , i , h , , i , h , − , i , h , − , i , h− i , h i - sum_min_valley , h , − , − , i , h , , i , h , , i , h , − , i , h− i , h i - nb_zigzag , h− , , − , i , h , , i , h , − , i , h− , , − , , − i , h , , i , h , , i - sum_width_zigzag , h− , , − , i , h , , i , h , − , i , h− , , − , , − i , h , , i , h , , i - sum_surf_zigzag , h− , , − , i , h , , i , h , , i , h− , , − , i , h , , i , h , , i - sum_max_zigzag , h− , − , − , i , h , , i , h , , i , h− , − , − , i , h , , i , h , , i , h− , , − , , − , , − i , h , , , i , h , , , i sum_min_zigzag , h− , , − , i , h , , i , h , , i , h− , , − , i , h , , i , h , , i , h , − , , − , , − , i , h− , − , − , − i , h− , − , − , − i Evaluating Pattern Properties

This appendix provides the program that computes all the representatives ofa pattern and the program that evaluates the properties of a pattern. Bothprograms (i) convert regular expression formulas of this paper in a sequence ofoperations on ﬁnite automata, and (ii) check that the ﬁnal automaton containsor not an accepting state. % Purpose: Compute the set of representative triple of a pattern and the properties of a pattern% Author: Nicolas Beldiceanu, IMT Atlantique:- use_module(dfa_aux_appendixC).% generate all types used to generate the representative triples (see Figure 3) of the reversible% patterns of Table 1 who have the single letter property% | ? top.% decreasing_terrace-[[out,out,out],[out,out,in],[in,out,out]]% increasing_terrace-[[out,out,out],[out,out,in],[in,out,out]]% plain-[[out,out,out],[out,out,in],[in,out,out]]% plateau-[[out,out,out],[out,out,in],[in,out,out]]% proper_plain-[[out,out,out],[out,out,in],[in,out,out]]% proper_plateau-[[out,out,out],[out,out,in],[in,out,out]]% gorge-[[out,out,suf],[out,out,in],[pre,out,out],[pre,fac,suf],% [pre,pre,in],[in,out,out],[in,suf,suf],[in,in,in]]% summit-[[out,out,suf],[out,out,in],[pre,out,out],[pre,fac,suf],% [pre,pre,in],[in,out,out],[in,suf,suf],[in,in,in]]% peak-[[out,out,out],[out,out,suf],[out,out,in],[pre,out,out],% [pre,fac,suf],[pre,pre,in],[in,out,out],[in,suf,suf],[in,in,in]]% valley-[[out,out,out],[out,out,suf],[out,out,in],[pre,out,out],% [pre,fac,suf],[pre,pre,in],[in,out,out],[in,suf,suf],[in,in,in]]% decreasing_sequence-[[pre,out,suf],[pre,fac,suf],[pre,pre,in],[in,suf,suf],[in,in,in]]% increasing_sequence-[[pre,out,suf],[pre,fac,suf],[pre,pre,in],[in,suf,suf],[in,in,in]]% steady_sequence-[[in,in,in]]% strictly_decreasing_sequence-[[in,in,in]]% strictly_increasing_sequence-[[in,in,in]]% zigzag-[[out,out,out],[out,out,in],[in,out,out],[in,out,in],[in,in,in]]top :- member(Pattern, [decreasing_terrace,increasing_terrace,plain,plateau,proper_plain,proper_plateau,gorge,summit,peak,valley,decreasing_sequence,increasing_sequence,steady_sequence,strictly_decreasing_sequence,strictly_increasing_sequence,zigzag]),reg_exp(Pattern, LPattern),findall(Triple, gen_potential_word_types(LPattern, Triple), Triples),write(Pattern-Triples), nl, fail.gen_potential_word_types(LPattern, Triple) :- % DEFINITION 9Triple = [T1, T2, T3],PotentialLanguages = [out, fac, pre, suf, in],member(T1, PotentialLanguages),member(T2, PotentialLanguages),member(T3, PotentialLanguages),word_language(T1, LPattern, L1),word_language(T2, LPattern, L2),word_language(T3, LPattern, L3), ord_type_language(L1, L2, L3, LPattern, LResult),regex_kernel(LResult, Automaton),(Automaton = kernel([],[]) -> fail ; true).% language of a wordword_language(out, LPattern, Out) :- % DEFINITION 9LEG = {[l],[e],[g]},SigmaStar = *(LEG),SigmaPlus = (LEG + SigmaStar),Out = (SigmaPlus \ (SigmaStar + LPattern + SigmaStar)).word_language(fac, LPattern, Fac) :- % DEFINITION 9LEG = {[l],[e],[g]},SigmaStar = *(LEG),SigmaPlus = (LEG + SigmaStar),Fac = (SigmaPlus + LPattern + SigmaPlus) /\(SigmaStar\(LPattern + SigmaPlus)) /\(SigmaStar\(SigmaPlus + LPattern)) /\(SigmaStar\LPattern).word_language(pre, LPattern, Pre) :- % DEFINITION 9LEG = {[l],[e],[g]},SigmaStar = *(LEG),SigmaPlus = (LEG + SigmaStar),Pre = (LPattern + SigmaPlus) /\(SigmaStar\(SigmaPlus + LPattern)) /\(SigmaStar\LPattern).word_language(suf, LPattern, Suf) :- % DEFINITION 9LEG = {[l],[e],[g]},SigmaStar = *(LEG),SigmaPlus = (LEG + SigmaStar),Suf = (SigmaPlus + LPattern) /\(SigmaStar\(LPattern + SigmaPlus)) /\(SigmaStar\LPattern).word_language(in, LPattern, In) :- % DEFINITION 9LEG = {[l],[e],[g]},SigmaStar = *(LEG),In = (LPattern + SigmaStar) /\(SigmaStar + LPattern).word_type_language(L1, L2, L3, LPattern, LResult) :- % DEFINITION 10 and THEOREM 2LEG = {[l],[e],[g]},SigmaStar = *(LEG),SigmaPlus = (LEG + SigmaStar),Tempo1 = shuffle(shuffle(LPattern,s),s),Tempo2 = (shuffle(L1,s) + [s] + SigmaStar),Tempo3 = ((SigmaStar + [s] + L2 + [s] + SigmaPlus) \/ (SigmaPlus + [s] + L2 + [s] + SigmaStar)),Tempo4 = (SigmaStar + [s] + shuffle(L3,s)),LResult = (Tempo1 /\ Tempo2 /\ Tempo3 /\ Tempo4).% check pattern properties shown in Table 1 and in Examples 4 and 9% | ?- try(convex).% convex(bump_on_decreasing_sequence)% convex(decreasing)% convex(decreasing_sequence)% convex(decreasing_terrace)% convex(dip_on_increasing_sequence)% convex(gorge)% convex(increasing)% convex(increasing_sequence)% convex(increasing_terrace)% convex(inflexion)% convex(peak)% convex(plain)% convex(plateau)% convex(proper_plain)% convex(proper_plateau)% convex(steady)% convex(steady_sequence)% convex(strictly_decreasing_sequence) convex(strictly_increasing_sequence)% convex(summit)% convex(valley)% convex(zigzag)try(convex) :- % DEFINITION 3reg_exp(Pattern, LPattern),(convex(LPattern) -> write(convex(Pattern)), nl ; true),fail.% | ?- try(no_inflexion).% no_inflexion(decreasing)% no_inflexion(decreasing_sequence)% no_inflexion(decreasing_terrace)% no_inflexion(increasing)% no_inflexion(increasing_sequence)% no_inflexion(increasing_terrace)% no_inflexion(steady)% no_inflexion(steady_sequence)% no_inflexion(strictly_decreasing_sequence)% no_inflexion(strictly_increasing_sequence)try(no_inflexion) :- % DEFINITION 4reg_exp(Pattern, LPattern),(no_inflexion(LPattern) -> write(no_inflexion(Pattern)), nl ; true),fail.% | ?- try(one_inflexion).% one_inflexion(gorge)% one_inflexion(inflexion)% one_inflexion(peak)% one_inflexion(plain)% one_inflexion(plateau)% one_inflexion(proper_plain)% one_inflexion(proper_plateau)% one_inflexion(summit)% one_inflexion(valley)try(one_inflexion) :- % DEFINITION 5reg_exp(Pattern, LPattern),(one_inflexion(LPattern) -> write(one_inflexion(Pattern)), nl ; true),fail.% | ?- try(single_letter).% single_letter(decreasing)% single_letter(increasing)% single_letter(steady)try(single_letter) :- % DEFINITION 6reg_exp(Pattern, LPattern),(single_letter(LPattern) -> write(single_letter(Pattern)), nl ; true),fail.% | ?- try(exclude_out_in).% exclude_out_in(decreasing)% exclude_out_in(decreasing_sequence)% exclude_out_in(increasing)% exclude_out_in(increasing_sequence)% exclude_out_in(steady)% exclude_out_in(steady_sequence)% exclude_out_in(strictly_decreasing_sequence)% exclude_out_in(strictly_increasing_sequence)try(exclude_out_in) :- % DEFINITION 7reg_exp(Pattern, LPattern),(exclude_out_in(LPattern) -> write(exclude_out_in(Pattern)), nl ; true),fail.% | ?- try(letter).% letter(decreasing,g)% letter(decreasing_sequence,g)% letter(increasing,l)% letter(increasing_sequence,l)% letter(steady,e)% letter(steady_sequence,e)% letter(strictly_decreasing_sequence,g)% letter(strictly_increasing_sequence,l)try(letter) :- % DEFINITION 20 eg_exp(Pattern, LPattern),member(Letter, [l,e,g]),(letter(LPattern, Letter) -> write(letter(Pattern,Letter)), nl ; true),fail.% | ?- try(suffix_unavoidable).% suffix_unavoidable(decreasing,g)% suffix_unavoidable(decreasing_sequence,g)% suffix_unavoidable(gorge,g)% suffix_unavoidable(increasing,l)% suffix_unavoidable(increasing_sequence,l)% suffix_unavoidable(peak,l)% suffix_unavoidable(plain,g)% suffix_unavoidable(plateau,l)% suffix_unavoidable(proper_plain,g)% suffix_unavoidable(proper_plateau,l)% suffix_unavoidable(steady,e)% suffix_unavoidable(steady_sequence,e)% suffix_unavoidable(strictly_decreasing_sequence,g)% suffix_unavoidable(strictly_increasing_sequence,l)% suffix_unavoidable(summit,l)% suffix_unavoidable(valley,g)try(suffix_unavoidable) :- % DEFINITION 21reg_exp(Pattern, LPattern),member(Letter, [l,e,g]),(suffix_unavoidable(LPattern, Letter) -> write(suffix_unavoidable(Pattern,Letter)), nl ; true),fail.% | ?- try(incompressible).% incompressible(bump_on_decreasing_sequence)% incompressible(decreasing)% incompressible(decreasing_terrace)% incompressible(dip_on_increasing_sequence)% incompressible(increasing)% incompressible(increasing_terrace)% incompressible(plain)% incompressible(plateau)% incompressible(proper_plain)% incompressible(proper_plateau)% incompressible(steady)try(incompressible) :- % DEFINITION 22reg_exp(Pattern, LPattern),(incompressible(LPattern) -> write(incompressible(Pattern)), nl ; true),fail.% | ?- try(factor).% factor(bump_on_decreasing_sequence,5)% factor(decreasing,1)% factor(dip_on_increasing_sequence,5)% factor(increasing,1)% factor(steady,1)% factor(steady_sequence,1)% factor(strictly_decreasing_sequence,1)% factor(strictly_increasing_sequence,1)% factor(zigzag,3)try(factor) :- % DEFINITION 23reg_exp(Pattern, LPattern),pattern_smallest_size(Pattern, Minl),(factor(LPattern, Minl) -> write(factor(Pattern,Minl)), nl ; true),fail.convex(LPattern) :- % DEFINITION 3LEG = {[l],[e],[g]},SigmaStar = *(LEG),SigmaPlus = (LEG + SigmaStar),L1 = (shuffle(shuffle(LPattern,s),s) /\(SigmaStar + [s] + LPattern + SigmaStar + LPattern + [s] + SigmaStar) /\(SigmaStar + [s] + (SigmaPlus\LPattern) + [s] + SigmaStar)),L2 = (shuffle(shuffle(shuffle(shuffle(LPattern,s),s),s),s) /\(SigmaStar + [s] + shuffle(LPattern,s) + [s] + SigmaPlus + [s] + SigmaStar) /\(SigmaStar + [s] + SigmaPlus + [s] + shuffle(LPattern,s) + [s] + SigmaStar) /\ SigmaStar + [s] + shuffle(shuffle(SigmaPlus\LPattern,s),s) + [s] + SigmaStar)),regex_kernel(L1, Automaton1),regex_kernel(L2, Automaton2),Automaton1 = kernel([],[]),Automaton2 = kernel([],[]).no_inflexion(LPattern) :- % DEFINITION 4LEG = {[l],[e],[g]},SigmaStar = *(LEG),L = (LPattern /\ (SigmaStar + [l] + SigmaStar + [g] + SigmaStar)) \/(LPattern /\ (SigmaStar + [g] + SigmaStar + [l] + SigmaStar)),regex_kernel(L, Automaton),Automaton = kernel([],[]).one_inflexion(LPattern) :- % DEFINITION 5Inf1 = (*([l] \/ [e]) + [l] + *([e]) + [g] + *([g] \/ [e])),Inf2 = (*([g] \/ [e]) + [g] + *([e]) + [l] + *([l] \/ [e])),L = (LPattern\(Inf1 \/ Inf2)),regex_kernel(L, Automaton),Automaton = kernel([],[]).single_letter(LPattern) :- % DEFINITION 6LEG = {[l],[e],[g]},L = LPattern\LEG,regex_kernel(L, Automaton),Automaton = kernel([],[]).exclude_out_in(LPattern) :- % DEFINITION 7LEG = {[l],[e],[g]},SigmaStar = *(LEG),SigmaPlus = (LEG + SigmaStar),L1 = (shuffle(shuffle(shuffle(shuffle(LPattern,s),s),s),s) /\(SigmaStar + [s] + SigmaPlus + [s] + shuffle(SigmaPlus\(SigmaStar+LPattern+SigmaStar),s) + [s] + SigmaStar) /\(SigmaStar + [s] + shuffle(LPattern,s) + [s] + SigmaStar + [s] + SigmaStar) /\(SigmaStar + [s] + SigmaPlus + [s] + SigmaPlus + [s] + SigmaStar + [s] + SigmaStar)),L2 = (shuffle(shuffle(shuffle(shuffle(LPattern,s),s),s),s) /\(SigmaStar + [s] + shuffle(SigmaPlus\(SigmaStar+LPattern+SigmaStar),s) + [s] + SigmaPlus + [s] + SigmaStar) /\(SigmaStar + [s] + SigmaStar + [s] + shuffle(LPattern,s) + [s] + SigmaStar) /\(SigmaStar + [s] + SigmaStar + [s] + SigmaPlus + [s] + SigmaPlus + [s] + SigmaStar)),regex_kernel(L1, Automaton1),regex_kernel(L2, Automaton2),Automaton1 = kernel([],[]),Automaton2 = kernel([],[]).letter(LPattern, Letter) :- % DEFINITION 20LEG = {[l],[e],[g]},L1 = (*(LEG\[Letter])) /\ LPattern,L2 = [Letter] /\ LPattern,regex_kernel(L1, Automaton1),regex_kernel(L2, Automaton2),Automaton1 = kernel([],[]),(Automaton2 = kernel([],[]) -> fail ; true).suffix_unavoidable(LPattern, Letter) :- % DEFINITION 21LEG = {[l],[e],[g]},SigmaStar = *(LEG),L1 = (*(LEG\[Letter])) /\ LPattern,L2 = (shuffle(LPattern,s) /\(SigmaStar + [s] + [Letter] + SigmaStar) /\(SigmaStar + [s] + (SigmaStar\LPattern))),regex_kernel(L1, Automaton1),regex_kernel(L2, Automaton2),Automaton1 = kernel([],[]),Automaton2 = kernel([],[]).incompressible(LPattern) :- % DEFINITION 22LEG = {[l],[e],[g]},SigmaStar = *(LEG), igmaPlus = (LEG + SigmaStar),L = ((SigmaPlus + LPattern + SigmaStar) /\ LPattern) \/((SigmaStar + LPattern + SigmaPlus) /\ LPattern),regex_kernel(L, Automaton),Automaton = kernel([],[]).factor(LPattern, Minl) :- % DEFINITION 23LEG = {[l],[e],[g]},SigmaStar = *(LEG),(Minl = 1 -> SigmaMinl = LEG ;Minl = 2 -> SigmaMinl = LEG+LEG ;Minl = 3 -> SigmaMinl = LEG+LEG+LEG ;Minl = 4 -> SigmaMinl = LEG+LEG+LEG+LEG ;Minl = 5 -> SigmaMinl = LEG+LEG+LEG+LEG+LEG ;write(minl_no_implemented(Minl)), nl, false),L = (shuffle(shuffle(LPattern,s),s) /\(SigmaStar + [s] + SigmaStar + SigmaMinl + SigmaStar + [s] + SigmaStar) /\(SigmaStar + [s] + (SigmaStar\LPattern) + [s] + SigmaStar)),regex_kernel(L, Automaton),Automaton = kernel([],[]).% reg_exp(pattern, reg_exp): l for <, e for =, g for >reg_exp(bump_on_decreasing_sequence, [l,l,g,l,l]). % <<><reg_exp(decreasing_sequence, (([g] + *([g] \/ [e]) + [g]) \/ [g])). % > (>|=)* > | >reg_exp(decreasing_terrace, ([g] + [e] + *([e]) + [g])). % > =+ >reg_exp(dip_on_increasing_sequence, [g,g,l,g,g]). % >><>>reg_exp(gorge, (([g]\/([g]+ *([e]\/[g])+[g]))+([l]\/([l]+ *([e]\/[l])+[l])))). % (>|(>(=|>)* >))(<|(<(=|<)*<))reg_exp(increasing, [l]). % | > (>|=)* |=)* >reg_exp(plain, ([g] + *([e]) + [l])). % > =* reg_exp(proper_plain, ([g] + [e] + *([e]) + [l])). % > =+ reg_exp(steady, [e]). % =reg_exp(steady_sequence, ([e] + *([e]))). % =+reg_exp(strictly_decreasing_sequence, ([g] + *([g]))). % >+reg_exp(strictly_increasing_sequence, ([l] + *([l]))). % <+reg_exp(summit, (([l]\/([l]+ *([e]\/[l])+[l]))+([g]\/([g]+ *([e]\/[g])+[g])))). % (<|(< (=|<)*<))(>|(>(=|>)*>))reg_exp(valley, ([g] + *([e] \/ [g]) + *([l] \/ [e]) + [l])). % > (= | >)* (< | =)* )+ (<|<>) | (><)+ (>|><)pattern_smallest_size(bump_on_decreasing_sequence, 5) :- !.pattern_smallest_size(decreasing, 1) :- !.pattern_smallest_size(decreasing_sequence, 1) :- !.pattern_smallest_size(decreasing_terrace, 3) :- !.pattern_smallest_size(dip_on_increasing_sequence, 5) :- !.pattern_smallest_size(gorge, 2) :- !.pattern_smallest_size(increasing, 1) :- !.pattern_smallest_size(increasing_sequence, 1) :- !.pattern_smallest_size(increasing_terrace, 3) :- !.pattern_smallest_size(inflexion, 2) :- !.pattern_smallest_size(peak, 2) :- !.pattern_smallest_size(plain, 2) :- !.pattern_smallest_size(plateau, 2) :- !.pattern_smallest_size(proper_plain, 3) :- !.pattern_smallest_size(proper_plateau, 3) :- !.pattern_smallest_size(steady, 1) :- !.pattern_smallest_size(steady_sequence, 1) :- !.pattern_smallest_size(strictly_decreasing_sequence, 1) :- !.pattern_smallest_size(strictly_increasing_sequence, 1) :- !.pattern_smallest_size(summit, 2) :- !.pattern_smallest_size(valley, 2) :- !.pattern_smallest_size(zigzag, 3) :- !.pattern_smallest_size(P, _) :- write(not_implemented(P)), nl, false. Purpose: Some operations on plain DFA i.e. no registers, no guards% Author: Mats Carlsson, RISE:- module(dfa_aux_appendixC, [regex_kernel/2,kernel_closure/2,kernel_intersection/3,kernel_union/3,kernel_difference/3,kernel_concatenation/3,kernel_shuffle/3,kernel_normalize/2,kernel_string/2,kernel_print_dot/1]).:- use_module(library(lists)).:- use_module(library(ordsets)).:- use_module(library(avl)).:- use_module(library(ugraphs))./***EXPORTED PREDICATES:%% Kernel ::= kernel(SourcesSinks,Arcs)%% where Alphabet should be a plain Prolog list%% Regex ::= [w,o,r,d] // plain Prolog list of atomic symbols%% | {Regex,Regex,...} // union over set of regex%% | *(Regex) // Kleene star%% | (Regex /\ Regex) // intersection%% | (Regex \/ Regex) // union%% | (Regex \ Regex) // difference%% | (Regex + Regex) // concatenation%% | shuffle(Regex,S) // L(Regex) with the symbol S inserted once, somewhere%% | truncate(Regex,1) // the set of strings of L(Regex) truncated to length at most one%% | tail(Regex,1) // the set of strings of L(Regex) truncated to tails of length at most one%% | prefix(Regex) // the set of prefixes of L(Regex)%% | suffix(Regex) // the set of suffixes of L(Regex)%. regex_kernel(+Rgegx, -Kernel)%%%% Computes the normalized kernel that recognizes Regex.%. kernel_closure(+Kernel, -Closure)%%%% Computes the Kleene closure of a kernel.%. kernel_intersection(+Kernel1, +Kernel2, -Intersection)%%%% Computes the intersection of two kernels.%. kernel_union(+Kernel1, +Kernel2, -Union)%%%% Computes the union of two kernels.%. kernel_difference(+Kernel1, +Kernel2, -Difference)%%%% Computes the difference of two kernels.%. kernel_concatenation(+Kernel1, +Kernel2, -Concatenation)%%%% Computes the concatenation of two kernels.%. kernel_shuffle(+Kernel, +Symbol, -Insertion)%% % Computes the kernel corresponding to shuffle(...,Symbol)%. kernel_truncate(+Kernel, +N, -Truncation)%%%% Computes the kernel corresponding to truncate(...,N)%. kernel_tail(+Kernel, +N, -Tail)%%%% Computes the kernel corresponding to tail(...,N)%. kernel_prefix(+Kernel, -Prefix)%%%% Computes the kernel corresponding to prefix(...)%. kernel_suffix(+Kernel, -Suffix)%%%% Computes the kernel corresponding to suffix(...)%. kernel_normalize(+Kernel1, -Kernel2)%%%% Makes a kernel determinate if need be, and minimize it.%. kernel_print_dot(+Kernel)%%%% Prints a kernel as a digraph in the dot language.%. kernel_string(+Kernel, -String)%%%% Generate a string that the kernel recognizes. Enumerate all strings on backtracking.%% N.B. Kernel must be normalized.***/regex_kernel(Regexp, Kernel) :-empty_avl(AVL),regex_kernel(Regexp, Kernel, AVL, _).regex_kernel(Regexp, Kernel, AVL0, AVL) :-avl_fetch(Regexp, AVL0, Kernel), !,AVL = AVL0.regex_kernel(Regexp, Kernel, AVL0, AVL) :-regex_kernel_rec(Regexp, Kernel0, AVL0, AVL1),kernel_normalize(Kernel0, Kernel),avl_store(Regexp, AVL1, Kernel, AVL).regex_kernel_rec((R1\/R2), Kernel) --> !,regex_kernel(R1, K1),regex_kernel(R2, K2),{kernel_union(K1, K2, Kernel)}.regex_kernel_rec((R1/\R2), Kernel) --> !,regex_kernel(R1, K1),regex_kernel(R2, K2),{kernel_intersection(K1, K2, Kernel)}.regex_kernel_rec((R1\R2), Kernel) --> !,regex_kernel(R1, K1),regex_kernel(R2, K2),{kernel_difference(K1, K2, Kernel)}.regex_kernel_rec(*(R1), Kernel) --> !,regex_kernel(R1, K1),{kernel_closure(K1, Kernel)}.regex_kernel_rec((R1+R2), Kernel) --> !,regex_kernel(R1, K1),regex_kernel(R2, K2),{kernel_concatenation(K1, K2, Kernel)}.regex_kernel_rec(shuffle(R1,S), Kernel) --> !,regex_kernel(R1, K1),{kernel_shuffle(K1, S, Kernel)}.regex_kernel_rec(truncate(R1,1), Kernel) --> !,regex_kernel(R1, K1),{kernel_truncate(K1, 1, Kernel)}. egex_kernel_rec(tail(R1,1), Kernel) --> !,regex_kernel(R1, K1),{kernel_tail(K1, 1, Kernel)}.regex_kernel_rec(prefix(R1), Kernel) --> !,regex_kernel(R1, K1),{kernel_prefix(K1, Kernel)}.regex_kernel_rec(suffix(R1), Kernel) --> !,regex_kernel(R1, K1),{kernel_suffix(K1, Kernel)}.regex_kernel_rec({}, Kernel) --> !,regex_kernel([], Kernel).regex_kernel_rec({Tree}, Kernel) --> !,{orify(Tree, Regexp)},regex_kernel(Regexp, Kernel).regex_kernel_rec(String, Kernel) -->{length(String, _)}, !,{Kernel = kernel([source(S1),sink(S4)], Arcs)},( foreach(A,String),foreach(arc(S2,A,S3),Arcs),fromto(S1,S2,S3,S4)do []).orify((X,Y), (R\/S)) :- !,orify(X, R),orify(Y, S).orify(X, X).kernel_closure(Kernel1, Closure) :-kernel_parts(Kernel1, Sources, Sinks, _, Arcs, _),tag_sources_sinks(Sources, Sources, Sinks1, Sources1),tag_sources_sinks([], Sinks, [], Sinks2),ord_union([Sinks1, Sinks2, Sources1], SS3),( foreach(arc(Q3,A,Q4),Arcs),fromto(Arcs1,Arcs2,Arcs6,[]),param(Sinks,Sources)do ( ord_member(Q4, Sinks)-> Arcs5 = [arc(Q3,A,Q4)|Arcs6],( foreach(Q5,Sources),fromto(Arcs2,Arcs3,Arcs4,Arcs5),param(Q3,A)do Arcs3 = [arc(Q3,A,Q5)|Arcs4]); Arcs2 = [arc(Q3,A,Q4)|Arcs6])),Closure = kernel(SS3,Arcs1).kernel_complement(Kernel1, Complement) :-kernel_parts(Kernel1, Sources, Sinks, States, Arcs, _),Complement = kernel(SourcesSinks2,Arcs),ord_subtract(States, Sinks, NotSinks),tag_sources_sinks(Sources, NotSinks, Sources2, Sinks2),append(Sources2, Sinks2, SourcesSinks2).kernel_intersection(Kernel1, Kernel2, Intersection) :-Intersection = kernel(SourcesSinks3,Arcs3),kernel_parts(Kernel1, Sources1, Sinks1, _, Arcs1, _),kernel_parts(Kernel2, Sources2, Sinks2, _, Arcs2, _),pairs(Sources1, Sources2, Sources3, []),closure(Sources3, Sources3, Closure, Arcs1, Arcs2, Arcs3, []),pairs(Sinks1, Sinks2, Sinks3, []),ord_intersection(Sinks3, Closure, Sinks3c),tag_sources_sinks(Sources3, Sinks3c, SS1, SS2),append(SS1, SS2, SourcesSinks3).kernel_union(Kernel1, Kernel2, Union) :-kernel_parts(Kernel1, Sources1, Sinks1, _, Arcs1, _), ernel_parts(Kernel2, Sources2, Sinks2, _, Arcs2, _),append(Sources1, Sources2, Sources12),append(Sinks1, Sinks2, Sinks12),append(Arcs1, Arcs2, Arcs12),tag_sources_sinks(Sources12, Sinks12, TSo12, TSi12),append(TSo12, TSi12, SS12),Union = kernel(SS12,Arcs12).kernel_difference(Kernel1, Kernel2, Difference) :-kernel_parts(Kernel1, _, _, _, _ , Alpha1),kernel_parts(Kernel2, Sources2, Sinks2, _, Arcs2, Alpha2),tag_sources_sinks(Sources2, Sinks2, TSo2, TSi2),append(TSo2, TSi2, SS2),ord_union(Alpha1, Alpha2, Alpha3),ord_subtract(Alpha3, Alpha2, ToAdd2),( foreach(A2,ToAdd2),foreach(arc(_,A2,_),New2)do true),append(Arcs2, New2, Arcs22),kernel_complement(kernel(SS2,Arcs22), K2C),kernel_intersection(Kernel1, K2C, Difference).kernel_concatenation(Kernel1, Kernel2, Concat) :-kernel_parts(Kernel1, Sources1, Sinks1, _, Arcs1, _),kernel_parts(Kernel2, Sources2, Sinks2, _, Arcs2, _),Concat = kernel(SS3,Arcs3),tag_sources_sinks(Sources1, Sinks2, Sources3, Sinks3),( foreach(arc(Q5,A5,R5),Arcs1),fromto(New3,New4,New7,[]),param(Sinks1,Sources2)do ( ord_nonmember(R5, Sinks1) -> New4 = New7; ( foreach(So5,Sources2),fromto(New4,New5,New6,New7),param(Q5,A5)do New5 = [arc(Q5,A5,So5)|New6]))),( ord_disjoint(Sources1, Sinks1) -> Sources4 = []; tag_sources_sinks(Sources2, [], Sources4, [])),append([Sources3, Sources4, Sinks3], SS3),ord_union([Arcs1,Arcs2,New3], Arcs3).kernel_shuffle(Kernel1, Symbol, Insertion) :-kernel_parts(Kernel1, Sources1, _, States1, Arcs1, _),kernel_parts(Kernel1, _, Sinks2, States2, Arcs2, _),Insertion = kernel(SS3,Arcs3),tag_sources_sinks(Sources1, Sinks2, Sources3, Sinks3),( foreach(Q1,States1),foreach(Q2,States2),foreach(arc(Q1,Symbol,Q2),New3),param(Symbol)do true),append(Sources3, Sinks3, SS3),ord_union([Arcs1,Arcs2,New3], Arcs3).kernel_truncate(Kernel1, 1, Truncation) :-kernel_normalize(Kernel1, Kernel2), % precondition!Kernel2 = kernel(SS2,Arcs2),( Arcs2 = [] -> Truncation = Kernel2; memberchk(source(Src), SS2),( foreach(Arc,Arcs2),fromto(Arcs3,Arcs4,Arcs5,[]),param(Src,Q3)do Arc = arc(Q1,A,_), Q1==Src -> Arcs4 = [arc(Src,A,Q3)|Arcs5]; Arcs4 = Arcs5)),(memberchk(sink(Src), SS2) -> Sinks3 = [Src,Q3] ; Sinks3 = [Q3]),tag_sources_sinks([Src], Sinks3, TSS1, TSS2),append(TSS1, TSS2, TSS12),Truncation = kernel(TSS12,Arcs3)).kernel_tail(Kernel1, 1, Tail) :-kernel_normalize(Kernel1, Kernel2), % precondition!Kernel2 = kernel(SS2,Arcs2),( Arcs2 = [] -> Tail = Kernel2; memberchk(source(Src), SS2),( foreach(Arc,Arcs2),fromto(Arcs3,Arcs4,Arcs5,[]),param(SS2,Src,Sink)do Arc = arc(_,A,Q),( memberchk(sink(Q), SS2) -> Arcs4 = [arc(Src,A,Sink)|Arcs5]; Arcs4 = Arcs5)),(memberchk(sink(Src), SS2) -> Sources3 = [Src,Sink] ; Sources3 = [Src]),tag_sources_sinks(Sources3, [Sink], TSS1, TSS2),append(TSS1, TSS2, TSS12),Tail = kernel(TSS12,Arcs3)).%% make all states sinks, keeping sourceskernel_prefix(Kernel1, Prefix) :-Kernel1 = kernel(SS1,Arcs),( foreach(SS,SS1),fromto(SS2,SS3,SS4,SS5)do (SS = source(_) -> SS3 = [SS|SS4] ; SS3 = SS4)),( foreach(arc(Q1,_,Q2),Arcs),fromto(SS5,SS6,SS7,[])do SS6 = [sink(Q1),sink(Q2)|SS7]),sort(SS2, SS8),Prefix = kernel(SS8,Arcs).%% make all states sources, keeping sinkskernel_suffix(Kernel1, Suffix) :-Kernel1 = kernel(SS1,Arcs),( foreach(SS,SS1),fromto(SS2,SS3,SS4,SS5)do (SS = sink(_) -> SS3 = [SS|SS4] ; SS3 = SS4)),( foreach(arc(Q1,_,Q2),Arcs),fromto(SS5,SS6,SS7,[])do SS6 = [source(Q1),source(Q2)|SS7]),sort(SS2, SS8),Suffix = kernel(SS8,Arcs).%% rename states to brand new variables%% if need be, add extra "black hole" state%% ensure that every combo has at least one transition%% output Sources, Sinks, States, Arcs, Alphabet as ordered setskernel_parts(Kernel1, Sources, Sinks, States, Arcs, Alphabet) :-rename_states(Kernel1, Kernel2),Kernel2 = kernel(SourcesSinks,Arcs1),( foreach(Item,SourcesSinks),fromto(Sources1,So1,So2,[]),fromto(Sinks1,Si1,Si2,[]),foreach(Y,Qs4) o ( Item = source(Y) -> So1 = [Y|So2], Si1 = Si2; Item = sink(Y) -> So1 = So2, Si1 = [Y|Si2])),sort(Sources1, Sources),sort(Sinks1, Sinks),sort(Arcs1, Arcs2),( foreach(arc(Q1,A1,Q2),Arcs2),foreach(Q1*A1,Out1),foreach(A1,As),fromto(Qs1,Qs2,Qs3,Qs4)do Qs2 = [Q1,Q2|Qs3]),sort(As, Alphabet),sort(Qs1, States1),sort(Out1, Out2),pairs(States1, Alphabet, Out3, []),ord_subtract(Out3, Out2, Out4),( Out4 = [] -> Arcs = Arcs2, States = States1; ( foreach(Q3*A3,Out4),foreach(arc(Q3,A3,Aux),Arcs3),param(Aux)do true),( foreach(A4,Alphabet),foreach(arc(Aux,A4,Aux),Arcs4),param(Aux)do true),ord_union([Arcs2,Arcs3,Arcs4], Arcs),ord_add_element(States1, Aux, States)).rename_states(kernel(SourcesSinks1,Arcs1), kernel(SourcesSinks2,Arcs2)) :-rename_states(SourcesSinks1, SourcesSinks2, KL1, KL2),rename_states(Arcs1, Arcs2, KL2, []),keysort(KL1, KL3),keyclumped(KL3, KL4),( foreach(_-Clump,KL4)do ( foreach(X,Clump),param(X)do true)).rename_states(L1, L2) -->( foreach(X,L1),foreach(Y,L2)do ( {X = source(Q1)}-> {Y = source(Q2)}, [Q1-Q2]; {X = sink(Q1)}-> {Y = sink(Q2)}, [Q1-Q2]; {X = arc(Q1,A,Q3)}-> {Y = arc(Q2,A,Q4)}, [Q1-Q2,Q3-Q4])).tag_sources_sinks(Sources, Sinks, SS1, SS2) :-( foreach(Q1,Sources),foreach(source(Q1),SS1)do true),( foreach(Q2,Sinks),foreach(sink(Q2),SS2)do true).pairs(Xs, Ys) --> foreach(X,Xs),param(Ys)do ( foreach(Y,Ys),param(X)do [X*Y])).closure([], Closure, Closure, _, _) --> [].closure([P1*P2|L1], Sofar1, Closure, Arcs1, Arcs2) -->{filter_arcs(Arcs1, P1, Arcs3)},{filter_arcs(Arcs2, P2, Arcs4)},{keyclumped(Arcs3, KL1)},{keyclumped(Arcs4, KL2)},( foreach(A-Clump1,KL1),fromto(Incr,S0,S6,[]),param(KL2,P1,P2)do ( foreach(B-Clump2,KL2),fromto(S0,S1,S5,S6),param(A,Clump1,P1,P2)do ( {A==B} ->( foreach(X,Clump1),fromto(S1,S2,S4,S5),param(A,Clump2,P1,P2)do ( foreach(Y,Clump2),fromto(S2,[X*Y|S3],S3,S4),param(A,P1,P2,X)do [arc(P1*P2,A,X*Y)])); {S1 = S5}))),{sort(Incr, Incr1)},{ord_union(Sofar1, Incr1, Sofar2, L2)},{append(L1, L2, L3)},closure(L3, Sofar2, Closure, Arcs1, Arcs2).filter_arcs([], _, []).filter_arcs([arc(P,A,Q)|Arcs], P1, KL) :-compare(K, P, P1),filter_arcs(K, A, Q, Arcs, P1, KL).filter_arcs(<, _, _, Arcs, P1, KL) :-filter_arcs(Arcs, P1, KL).filter_arcs(=, A, Q, Arcs, P1, [A-Q|KL]) :-filter_arcs(Arcs, P1, KL).filter_arcs(>, _, _, _, _, []).%% first, transform to DFA if need be%% then, minimizekernel_normalize(Kernel1, Kernel3) :-ensure_dfa(Kernel1, Kernel2),Kernel2 = kernel(SourcesSinks1,Arcs1),Kernel3 = kernel(SourcesSinks3,Arcs3),make_penta(SourcesSinks1, Arcs1, Penta1),remove_unreachable(Penta1, Penta2),Penta2 = penta(States2,_,_,_,Sinks2),ord_subtract(States2, Sinks2, NonSinks2),( Sinks2\==[], NonSinks2\==[] -> Partition0 = [NonSinks2,Sinks2]; true -> Partition0 = [States2]),refine_partition(Partition0, Partition, Penta2),collapse(Penta2, Partition, Penta3),Penta3 = penta(_,_,ArcsF3,Sources3,Sinks3),avl_to_list(ArcsF3, ArcsL3),( foreach((P-A)-Q,ArcsL3), oreach(arc(P,A,Q),Arcs3)do true),tag_sources_sinks(Sources3, Sinks3, SS1, SS2),append(SS1, SS2, SourcesSinks3),numbervars(Kernel3, 0, _).make_penta(SourcesSinks, Arcs, penta(States,Alfabet,ArcsF,Sources,Sinks)) :-( foreach(Item,SourcesSinks),fromto(Sources0,So1,So2,[]),fromto(Sinks0,Si1,Si2,[])do ( Item = source(Y) -> So1 = [Y|So2], Si1 = Si2; Item = sink(Y) -> So1 = So2, Si1 = [Y|Si2])),( foreach(arc(P,A,Q),Arcs),foreach(A,Alfa0),foreach((P-A)-Q,ArcsFL),fromto(States0,[P,Q|S],S,[])do true),sort(ArcsFL, ArcsFOL),ord_list_to_avl(ArcsFOL, ArcsF),sort(States0, States1),sort(Alfa0, Alfabet),sort(Sources0, Sources),sort(Sinks0, Sinks),ord_union([States1,Sources,Sinks], States).remove_unreachable(Penta1, Penta2) :-Penta1 = penta(_, Alfa,ArcsF1,Sources1,Sinks1),Penta2 = penta(States2,Alfa,ArcsF2,Sources2,Sinks2),avl_to_list(ArcsF1,ArcsFL),( foreach((P-_)-Q,ArcsFL),fromto(EdgesF1,[P-Q|EdgesF2],EdgesF2,AuxF),fromto(EdgesB1,[Q-P|EdgesB2],EdgesB2,AuxB)do true),( foreach(So,Sources1),fromto(AuxF,[(*)-So|AuxF1],AuxF1,[])do true),( foreach(Si,Sinks1),fromto(AuxB,[(*)-Si|AuxB1],AuxB1,[])do true),vertices_edges_to_ugraph([*], EdgesF1, GF),vertices_edges_to_ugraph([*], EdgesB1, GB),reachable(*, GF, ReachF),reachable(*, GB, ReachB),ord_intersection(ReachF, ReachB, ReachFB),ord_del_element(ReachFB, *, States2),ord_intersection(Sources1, States2, Sources2),ord_intersection(Sinks1, States2, Sinks2),( foreach((P1-A1)-Q1,ArcsFL),fromto(ArcsFL2,ArcsFL3,ArcsFL4,[]),param(States2)do ( ord_member(P1,States2),ord_member(Q1,States2) ->ArcsFL3 = [(P1-A1)-Q1|ArcsFL4]; ArcsFL3 = ArcsFL4)),ord_list_to_avl(ArcsFL2, ArcsF2).refine_partition(Part0, Part, Penta) :-( fromto(1,_,D,0),fromto(Part0,Part1,Part2,Part), aram(Penta)do refine_partition1(Part1, Part2, Penta),length(Part1, N1),length(Part2, N2),D is N1-N2).refine_partition1(Part1, Part2, Penta) :-Penta = penta(_,Alfa,ArcsF,_,_),( foreach(Part,Part1),count(I,1,_),fromto(AL,AL1,AL3,[])do ( foreach(S,Part),fromto(AL1,[S-I|AL2],AL2,AL3),param(I)do true)),sort(AL, AOL),ord_list_to_avl(AOL, Map),( foreach(Q-J,AL),foreach((J-SignSet)-Q,KL1),param(Alfa,Map,ArcsF)do ( foreach(A,Alfa),fromto(Sign,Sign1,Sign2,[]),param(Q,Map,ArcsF)do ( avl_fetch(Q-A, ArcsF, R) ->avl_fetch(R, Map, R1),Sign1 = [s(A,R1)|Sign2]; Sign1 = Sign2)),sort(Sign, SignSet)),keysort(KL1, KL2),keyclumped(KL2, KL3),( foreach(_-Clump,KL3),foreach(Clump,Part2)do true).collapse(Penta1, Partition, Penta2) :-Penta1 = penta(States1,Alfa,Arcs1,Sources1,Sinks1),Penta2 = penta(States2,Alfa,Arcs2,Sources2,Sinks2),( foreach(Part,Partition),fromto(AL,AL1,AL3,[])do ( foreach(S0,Part),fromto(AL1,[S0-SI|AL2],AL2,AL3),param(SI)do true)),sort(AL, AOL),list_to_avl(AOL, Map),( foreach(Q1,States1),foreach(R1,States1b),param(Map)do avl_fetch(Q1, Map, R1)),avl_to_list(Arcs1, Arcs1L),( foreach((P-A)-Q,Arcs1L),foreach((R-A)-S,Arcs2L),param(Map)do avl_fetch(P, Map, R),avl_fetch(Q, Map, S)),sort(Arcs2L, Arcs2OL),ord_list_to_avl(Arcs2OL, Arcs2), foreach(Q2,Sources1),foreach(R2,Sources1b),param(Map)do avl_fetch(Q2, Map, R2)),( foreach(Q3,Sinks1),foreach(R3,Sinks1b),param(Map)do avl_fetch(Q3, Map, R3)),sort(States1b, States2),sort(Sources1b, Sources2),sort(Sinks1b, Sinks2)./* NFA to DFA: standard powerset construction algorithm. */ensure_dfa(Kernel1, Kernel2) :-kernel_parts(Kernel1, Sources, Sinks, _, Arcs, Alphabet),( foreach(arc(Q1,A,Q2),Arcs),foreach(Q1*A - Q2,KL1)do true),keyclumped(KL1, KL2),( Sources = [_,_|_] -> true; member(_-[_,_|_], KL2) -> true), !,ord_list_to_avl(KL2, Trans),det_closure([Sources], Alphabet, Trans, [Sources], DStates, [], DArcs),DSources = [Sources],det_select(DStates, Sinks, DSinks),tag_sources_sinks(DSources, DSinks, ESources, ESinks),append(ESources, ESinks, ESS),Kernel2 = kernel(ESS,DArcs).ensure_dfa(Kernel, Kernel).det_closure([], _, _, States, States, Arcs, Arcs).det_closure([R1|Queue], Alphabet, Trans, States0, States, Arcs0, Arcs) :-det_arcs(R1, Alphabet, Trans, Arcs2),sort(Arcs2, Arcs3),( foreach(arc(_,_,R2),Arcs3),foreach(R2,R2s)do true),sort(R2s, R3s),ord_subtract(R3s, States0, New),ord_union(R3s, States0, States1),ord_union(Arcs0, Arcs3, Arcs1),append(Queue, New, Queue1),det_closure(Queue1, Alphabet, Trans, States1, States, Arcs1, Arcs).det_arcs(R1, Alphabet, Trans, Arcs) :-( foreach(A,Alphabet),foreach(Arc,Arcs),param(R1,Trans)do ( foreach(Q1,R1),fromto(Qs1,Qs2,Qs3,[]),param(A,Trans)do avl_fetch(Q1*A, Trans, Q1As),append(Q1As, Qs3, Qs2)),sort(Qs1, Qs4),Arc = arc(R1,A,Qs4)).det_select(All, Key, Selected) :-( foreach(X,All),fromto(Selected,Sel1,Sel2,[]),param(Key) o ( ord_disjoint(X, Key) -> Sel1 = Sel2; Sel1 = [X|Sel2])).kernel_string(Kernel, String) :-Kernel = kernel(SourcesSinks,Arcs),( foreach(Item,SourcesSinks),fromto(Init1,Init2,Init3,Init4),fromto(Sinks1,Si1,Si2,[])do ( Item = source(Y) -> Init2 = [Y-[]|Init3], Si1 = Si2; Item = sink(Y) -> Init2 = Init3, Si1 = [Y|Si2])),sort(Sinks1, Sinks),( foreach(arc(Q2,A,Q3),Arcs),foreach(Q2-(A-Q3),KL1)do true),keysort(KL1, KL2),keyclumped(KL2, KL3),ord_list_to_avl(KL3, Map),kernel_string(Init1, Init4, Sinks, Map, String).kernel_string(Head, Tail1, Sinks, Map, String) :-Head\==Tail1,Head = [State-Stack|Head1],( ord_member(State, Sinks),reverse(Stack, String); avl_fetch(State, Map, Clump)-> ( foreach(A-Q,Clump),fromto(Tail1,Tail2,Tail3,Tail4),param(Stack)do Tail2 = [Q-[A|Stack]|Tail3]),kernel_string(Head1, Tail4, Sinks, Map, String); kernel_string(Head1, Tail1, Sinks, Map, String)).kernel_print_dot(kernel(SourcesSinks,Arcs)) :-write(’digraph automaton {\n’),write(’ size="8.5,11";\n’),write(’ fontsize="24";\n’),write(’ rankdir=LR;\n’),write(’ edge [labelfontsize="10"];\n’),write(’ node [shape=circle];\n’),write(’ source [shape=none, label=""];\n’),write(’ comment [shape=box, label="’),write(’"];\n’),( foreach(SS,SourcesSinks)do ( SS = source(X)-> format(’ source -> ~w;\n’, [X]); SS = sink(X)-> format(’ ~w [shape=doublecircle];\n’, [X]))),( foreach(Arc,Arcs),foreach(arc3(A,C)-B,KL1)do Arc = arc(A,B,C)),keysort(KL1, KL2),keyclumped(KL2, KL3),( foreach(arc3(V,W)-Lets,KL3)do label_dot(Lets, Ldot),format(’ ~w -> ~w [taillabel="~w"];\n’, [V,W,Ldot])),write(’}\n\n’). abel_dot(Lets, Dot3) :-( foreach(L,Lets),fromto(’’,Dot1,Dot2,Dot3)do name(L, Lcodes),atom_codes(A, Lcodes),atom_concat(Dot1, A, Dot2)).end_of_file.%%% Some examples:| ?- regex_kernel([w,o,r,d], K).K = kernel([source(A),sink(B)],[arc(C,d,B),arc(D,o,E),arc(E,r,C),arc(A,w,D)]) ?| ?- regex_kernel({}, K).K = kernel([source(A),sink(A)],[]) ?| ?- regex_kernel({[a],[b,c]}, K).K = kernel([source(A),sink(B)],[arc(A,a,B),arc(A,b,C),arc(C,c,B)]) ?| ?- regex_kernel(*({[a,b]}), K).K = kernel([source(A),sink(A)],[arc(B,b,A),arc(A,a,B)]) ?| ?- regex_kernel({[a],[b]}+{[a],[b]}, K).K = kernel([source(A),sink(B)],[arc(A,a,C),arc(A,b,C),arc(C,a,B),arc(C,b,B)]) ?| ?- regex_kernel(({[a],[b]}+{[a],[b]})/\[a,b], K).K = kernel([source(A),sink(B)],[arc(A,a,C),arc(C,b,B)]) ?| ?- regex_kernel(({[a],[b]}+{[a],[b]})\/[a,b], K).K = kernel([source(A),sink(B)],[arc(A,a,C),arc(A,b,C),arc(C,a,B),arc(C,b,B)]) ?| ?- regex_kernel(({[a],[b]}+{[a],[b]})\[a,b], K).K = kernel([source(A),sink(B)],[arc(A,a,C),arc(A,b,D),arc(C,a,B),arc(D,a,B),arc(D,b,B)]) ?| ?- regex_kernel(*([a]) + [b] + *([a]), K).K = kernel([source(A),sink(B)],[arc(A,a,A),arc(A,b,B),arc(B,a,B)]) ?| ?- regex_kernel(shuffle([a,b],s), K).K = kernel([source(A),sink(B)],[arc(C,a,D),arc(A,a,E),arc(A,s,C),arc(E,b,F),arc(E,s,D),arc(D,b,B),arc(F,s,B)]) ?| ?- regex_kernel(shuffle([a,b],s), K), kernel_string(K,S).K = kernel([source(A),sink(B)],[arc(C,a,D),arc(A,a,E),arc(A,s,C),arc(E,b,F),arc(E,s,D),arc(D,b,B),arc(F,s,B)]),S = [a,b,s] ? ;K = kernel([source(A),sink(B)],[arc(C,a,D),arc(A,a,E),arc(A,s,C),arc(E,b,F),arc(E,s,D),arc(D,b,B),arc(F,s,B)]),S = [a,s,b] ? ;K = kernel([source(A),sink(B)],[arc(C,a,D),arc(A,a,E),arc(A,s,C),arc(E,b,F),arc(E,s,D),arc(D,b,B),arc(F,s,B)]),S = [s,a,b] ? ;no| ?- regex_kernel(prefix([w,o,r,d]), K), findall(S, kernel_string(K,S), Prefixes).K = kernel([source(A),sink(B),sink(C),sink(D),sink(E),sink(A)],[arc(C,d,B),arc(D,o,E),arc(E,r,C),arc(A,w,D)]),Prefixes = [[],[w],[w,o],[w,o,r],[w,o,r,d]] ?| ?- regex_kernel(suffix([w,o,r,d]), K), findall(S, kernel_string(K,S), Suffixes).K = kernel([source(A),sink(B),sink(A)],[arc(C,d,B),arc(D,o,E),arc(E,r,C),arc(A,d,B),arc(A,o,E),arc(A,r,C),arc(A,w,D)]),Suffixes = [[],[d],[r,d],[o,r,d],[w,o,r,d]] ?abel_dot(Lets, Dot3) :-( foreach(L,Lets),fromto(’’,Dot1,Dot2,Dot3)do name(L, Lcodes),atom_codes(A, Lcodes),atom_concat(Dot1, A, Dot2)).end_of_file.%%% Some examples:| ?- regex_kernel([w,o,r,d], K).K = kernel([source(A),sink(B)],[arc(C,d,B),arc(D,o,E),arc(E,r,C),arc(A,w,D)]) ?| ?- regex_kernel({}, K).K = kernel([source(A),sink(A)],[]) ?| ?- regex_kernel({[a],[b,c]}, K).K = kernel([source(A),sink(B)],[arc(A,a,B),arc(A,b,C),arc(C,c,B)]) ?| ?- regex_kernel(*({[a,b]}), K).K = kernel([source(A),sink(A)],[arc(B,b,A),arc(A,a,B)]) ?| ?- regex_kernel({[a],[b]}+{[a],[b]}, K).K = kernel([source(A),sink(B)],[arc(A,a,C),arc(A,b,C),arc(C,a,B),arc(C,b,B)]) ?| ?- regex_kernel(({[a],[b]}+{[a],[b]})/\[a,b], K).K = kernel([source(A),sink(B)],[arc(A,a,C),arc(C,b,B)]) ?| ?- regex_kernel(({[a],[b]}+{[a],[b]})\/[a,b], K).K = kernel([source(A),sink(B)],[arc(A,a,C),arc(A,b,C),arc(C,a,B),arc(C,b,B)]) ?| ?- regex_kernel(({[a],[b]}+{[a],[b]})\[a,b], K).K = kernel([source(A),sink(B)],[arc(A,a,C),arc(A,b,D),arc(C,a,B),arc(D,a,B),arc(D,b,B)]) ?| ?- regex_kernel(*([a]) + [b] + *([a]), K).K = kernel([source(A),sink(B)],[arc(A,a,A),arc(A,b,B),arc(B,a,B)]) ?| ?- regex_kernel(shuffle([a,b],s), K).K = kernel([source(A),sink(B)],[arc(C,a,D),arc(A,a,E),arc(A,s,C),arc(E,b,F),arc(E,s,D),arc(D,b,B),arc(F,s,B)]) ?| ?- regex_kernel(shuffle([a,b],s), K), kernel_string(K,S).K = kernel([source(A),sink(B)],[arc(C,a,D),arc(A,a,E),arc(A,s,C),arc(E,b,F),arc(E,s,D),arc(D,b,B),arc(F,s,B)]),S = [a,b,s] ? ;K = kernel([source(A),sink(B)],[arc(C,a,D),arc(A,a,E),arc(A,s,C),arc(E,b,F),arc(E,s,D),arc(D,b,B),arc(F,s,B)]),S = [a,s,b] ? ;K = kernel([source(A),sink(B)],[arc(C,a,D),arc(A,a,E),arc(A,s,C),arc(E,b,F),arc(E,s,D),arc(D,b,B),arc(F,s,B)]),S = [s,a,b] ? ;no| ?- regex_kernel(prefix([w,o,r,d]), K), findall(S, kernel_string(K,S), Prefixes).K = kernel([source(A),sink(B),sink(C),sink(D),sink(E),sink(A)],[arc(C,d,B),arc(D,o,E),arc(E,r,C),arc(A,w,D)]),Prefixes = [[],[w],[w,o],[w,o,r],[w,o,r,d]] ?| ?- regex_kernel(suffix([w,o,r,d]), K), findall(S, kernel_string(K,S), Suffixes).K = kernel([source(A),sink(B),sink(A)],[arc(C,d,B),arc(D,o,E),arc(E,r,C),arc(A,d,B),arc(A,o,E),arc(A,r,C),arc(A,w,D)]),Suffixes = [[],[d],[r,d],[o,r,d],[w,o,r,d]] ?