[PDF] Automated Verification of Integer Overflow

Abstract

Integer overflow accounts for one of the major source of bugs in software. Verification systems typically assume a well defined underlying semantics for various integer operations and do not explicitly check for integer overflow in programs. In this paper we present a specification mechanism for expressing integer overflow. We develop an automated procedure for integer overflow checking during program verification. We have implemented a prototype integer overflow checker and tested it on a benchmark consisting of already verified programs (over 14k LOC). We have found 43 bugs in these programs due to integer overflow.

Full PDF

aa r X i v : . [ c s . P L ] S e p Automated Veriﬁcation of Integer Overﬂow

Asankhaya Sharma SourceClear

Corresponding author:Asankhaya Sharma Email address: [email protected]

ABSTRACT

Integer overﬂow accounts for one of the major source of bugs in software. Veriﬁcation systems typically assumea well deﬁned underlying semantics for various integer operations and do not explicitly check for integer overﬂowin programs. In this paper we present a speciﬁcation mechanism for expressing integer overﬂow. We develop anautomated procedure for integer overﬂow checking during program veriﬁcation. We have implemented a prototypeinteger overﬂow checker and tested it on a benchmark consisting of already veriﬁed programs (over 14k LOC). Wehave found 43 bugs in these programs due to integer overﬂow.

INTRODUCTION

Numerical errors in software are quite common and yet are ignored by most veriﬁcations systems. Integer overﬂow inparticular has been among the top 25 software bugs (Christey et al., 2011). These errors can lead to serious failures andexploitable vulnerabilities. Program veriﬁcation helps to build reliable software by ensuring that relevant properties ofcan be validated. In Hoare logic style program veriﬁcation we typically specify programs using pre and post conditions.The veriﬁer assumes an underlying well deﬁned semantics and generates proofs of correctness of program. Mostveriﬁcation systems do not check for errors due to undeﬁned behaviors in programs. In C/C++ the integer operationsmay lead to undeﬁned behaviors as speciﬁed by the standard. Undeﬁned behaviors in the C/C++ speciﬁcation leadto confusion among programmers. Moreover many programmers expect wrap around behavior for integer overﬂow(Dietz et al., 2012) and may intentionally write code that leads to such overﬂows. The code written with undeﬁned (inthe language speciﬁcation) intentional overﬂows is not guaranteed to be portable and the behavior may depend on theoptimizations used in various compilers.It is no surprise that automated veriﬁers typically assume a well deﬁned semantics for various integer operations.However in order to increase the completeness of veriﬁcation it is desirable to specify and verify integer overﬂow.The starting point of this work is the HIP/SLEEK veriﬁcation system (Chin et al., 2011) based on separation logic.HIP/SLEEK system can do automated veriﬁcation of shape, size and bag properties of programs (Chin et al., 2012).We extend the domain of integers (extended number line) with two logical inﬁnite constants ∞ and − ∞ , correspondingto positive and negative inﬁnity respectively. Even though this kind extension of integers is common in librariesfor programming languages for portability reasons, it is eventually typically mapped to a ﬁxed value for a particularunderlying architecture (32-bit or 64-bit). In a veriﬁcation setting we ﬁnd it better to enrich the underlying speciﬁcationlogic with these constants and reason with them automatically during entailment. This mechanism allows us to specifyintentional and unintentional integer overﬂow in programs. In particular our key contributions are • A speciﬁcation mechanism for integer overﬂows using logical inﬁnities • Entailment procedure for handling logical inﬁnities • Integrated integer overﬂow checking with automated veriﬁcation • A prototype implementation of an Integer overﬂow checker • Finding 43 integer overﬂow bugs in existing benchmark of veriﬁed programshe rest of the paper is structured as follows. In the next section, we motivate our approach with a few examples.Then we present our speciﬁcation language with logical inﬁnities. This speciﬁcation language is used to describeautomated veriﬁcation with integer overﬂow checking. We also formulate some soundness properties of our system.In the experiments section we present our implementation with a benchmark of already veriﬁed programs. We describesome related work and ﬁnally we conclude in the last section.

MOTIVATING EXAMPLES

We illustrate the integration of integer overﬂow checking in a veriﬁcation system by means of few examples. Thefollowing function increments the value passed to it. void ex1 ( int n ) requires n ≥ = n + ; { return n + ; } If the value of n that is passed to this function is the maximum value that can be represented by the underlyingarchitecture this program will lead to an integer overﬂow. In order to avoid dealing with absolute values of maximumand minimum integers we introduce a logical constant ∞ in the speciﬁcation language. With this constant it is possibleto write the speciﬁcation for the function to avoid integer overﬂow. void ex2 ( int n ) requires 0 ≤ n + < ∞ ensures res = n + ; { return n + ; } Another beneﬁt of adding this constant to the speciﬁcation language is that it allows users to specify intentionalinteger overﬂow. A recent study (Dietz et al., 2012) has found intentional integer overﬂow occurs frequently in realcodes. We allow a user to express integer overﬂow using ∞ constant where such behavior is well deﬁned. Thefollowing example shows how to specify intentional overﬂow. void ex3 ( int n ) requires n ≥ ( n + < ∞ ∧ res = n + ) ∨ ( n + ≥ ∞ ∧ ( true ) ioc ) ; { return n + ; } In this example we use the error calculus from (Le et al., 2013) to specify integer overﬂow with an error status( ioc ). This error status is veriﬁed and propagated during entailment. Details of the error validation and localizationmechanism are given in (Le et al., 2013). Use of logical inﬁnity constants ( ∞ ) enable us to specify intentional integeroverﬂow as an explicit error scenario in the method speciﬁcation. Another beneﬁt of using an enhanced speciﬁcationmechanism (with ∞ constants) is that we can integrate integer overﬂow checking in an expressive logic like separationlogic. This addresses two major problems faced by static integer overﬂow checkers - tracking integer overﬂow throughheap and integer overﬂows inside data structures. As an example consider the following method which returns the sumof values inside a linked list. 2 ata { int val ; node next } ll h root , sum i≡ ( root = null ∧ sum = ) ∨∃ d , q · ( root node h d , q i∗ ll h q , rest i ) ∧ sum = d + rest ∧ sum < ∞ int ex4 ( node x ) requires ll h x , s i ensures ll h x , s i∧ res = s { if ( x == null ) return 0 ; elsereturn x . val + ex4 ( x . next ) ; } We can specify the linked list using a predicate in separation logic ( ll ). In the predicate deﬁnition we use ∞ constant to express the fact that the sum of values in the list cannot overﬂow. With this predicate we can now writethe pre/post condition for the function ex4 . During veriﬁcation we can now check that the sum of values of linked listwill not lead to an integer overﬂow. A common security vulnerability that can be exploited using Integer overﬂow isthe buffer overrun. The following example ( ex5 ) shows how two character arrays can be concatenated. We expressthe bounds on the domain of arrays using a relation dom . This program can lead to an integer overﬂow if the sum ofthe size of two arrays is greater than the maximum integer that can be represented by the underlying architecture. Bycapturing the explicit condition under which this function can lead to an integer overﬂow ( ioc ) we can verify andprevent buffer overrun. We again use the ∞ logical constant in the precondition to specify the integer overﬂow. dom ( char [] c , int low , int high ) dom ( c , low , high ) ∧ low ≤ l ∧ h ≤ high = ⇒ dom ( c , l , h ) int ex5 ( ref char [] buf1 , char [] buf2 , size t len1 , size t len2 ) requires dom ( buf1 , , len1 ) ∧ dom ( buf2 , , len2 ) ∧ len1 + len2 ≤ = ∧ dom ( buf1 , , len1 + len2 ) ; requires dom ( buf1 , , len1 ) ∧ dom ( buf2 , , len2 ) ∧ len1 + len2 > = − ∧ dom ( buf1 , , len1 ) ; requires dom ( buf1 , , len1 ) ∧ dom ( buf2 , , len2 ) ∧ len1 + len2 > ∞ ensures ( true ) ioc ; { char buf [ ] ; if ( len1 + len2 > ) return − ; memcpy ( buf , buf1 , len1 ) ; memcpy ( buf + len1 , buf2 , len2 ) ; buf1 = buf ; return 0 ; } Using ∞ constant as part of the speciﬁcation language we can represent various cases of integer overﬂows in aconcise manner. In this way we also avoid multiple constants like INT MAX, INT MIN etc., typically found in headerﬁles for various architectures. When compared to other approaches, this speciﬁcation mechanism is more suited toﬁnding integer overﬂows during veriﬁcation.Exploitable vulnerabilities caused by integer overﬂows can also be prevented by speciﬁcations preventing overﬂowusing ∞ . The following example (taken from (Dowd et al., 2006)) shows how integer overﬂow checking can prevent3etwork buffer overrun. For brevity we show only part of the functions and omit the speciﬁcation for the method aswell. The function ex6 reads an integer from the network and performs some sanity checks on it. First, the length ischecked to ensure that it’s greater than or equal to zero and, therefore, positive. Then the length is checked to ensurethat it’s less than MAXCHARS. However, in the second part of the length check, 1 is added to the length. This opensa possibility of the following buffer overrun: A value of INT MAX passes the ﬁrst check (because it’s greater than0) and passes the second length check (as INT MAX + 1 can wrap around to a negative value). read() would then becalled with an effectively unbounded length argument, leading to a potential buffer overﬂow situation. This situationcan be prevented by using the speciﬁcations given for the network get int method that ensures that length is alwaysless than ∞ . int network get int ( int sockfd ) requires trueensures res < ∞ ; char ∗ ex6 ( int sockfd ) { char buf ; int length = network get int ( sockfd ) ; if ( ! ( buf = ( char ∗ ) malloc ( MAXCHARS ))) die ( “ malloc : % m ′′ ) ; if ( length < || length + > = MAXCHARS ) { free ( buf ) ; die ( “ badlength : % d ′′ , value ) ; } if ( read ( sockfd , buf , length ) < = ) { free ( buf ) ; die ( “ read : % m ′′ ) ; } return buf ; } SPECIFICATION LANGUAGE

We present the speciﬁcation language of our system which is extended from (Chin et al., 2012) with the addition of aconstant representing logical inﬁnity. The detailed language is depicted in ﬁgure 1. Φ pr ∗→ Φ po captures a precondition Φ pr and a postcondition Φ po of a method or a loop. They are abbreviated from the standard representation requires Φ pr and ensures Φ po , and formalized by separation logic formula Φ . In turn, the separation logic formula is adisjunction of a heap formula and a pure formula ( κ ∧ π ). The pure part π captures a rich constraint from the domainsof Presburger arithmetic (supported by Omega solver (Pugh, 1992)), monadic set constraint (supported by MONAsolver (Klarlund and Moller, 2001)) or polynomial real arithmetic (supported by Redlog solver (Dolzmann and Sturm,1997)). Following the deﬁnitions of separation logic in (Ishtiaq and O’Hearn, 2001; Reynolds, 2002), the heap partprovides notation to denote emp heap, singleton heaps , and disjoint heaps ∗ .The major feature of our system compared to (Ishtiaq and O’Hearn, 2001; Reynolds, 2002) is the ability for userto deﬁne recursive data structures. Each data structure and its properties can be deﬁned by an inductive predicate pred ,that consists of a name p , a main separation formula Φ and an optional pure invariant formula π that must hold forevery predicate instance. In addition to the integer constant k we now support a new inﬁnite constant denoted by ∞ .This enables us to represent positive and negative inﬁnities by ∞ and − ∞ respectively. For the following discussion weassume the existence of an entailment prover for separation logic (like (Chin et al., 2012)) and a solver for Presburgerarithmetic (like (Pugh, 1992)). We now focus only on integrating automated reasoning with the new inﬁnite constant ∞ inside these existing provers. 4 red :: = p ( v ∗ ) ≡ Φ [ inv π ] mspec :: = Φ pr ∗→ Φ po Φ :: = W ( ∃ w ∗ · κ ∧ π ) ∗ κ :: = emp | v c ( v ∗ ) | p ( v ∗ ) | κ ∗ κ π :: = α | π ∧ π α :: = β | ¬ ββ :: = v = v | v = null | a ≤ a | a = a a :: = k | k × v | a + a | − a | max ( a , a ) | min ( a , a ) | ∞ where p is a predicate name ; v , w are variable names ; c is a data type name ; k is an integer constant ; Figure 1.

The Speciﬁcation LanguageAn entailment prover for the speciﬁcation language is used to discharge proof obligations generated during forwardveriﬁcation. The entailment checking for separation logic formulas is typically represented (Chin et al., 2012) asfollows. Φ ⊢ Φ , Φ r This attempts to prove that Φ entails Φ with Φ r as its frame (residue) not required for proving Φ . This entailmentholds, if Φ = ⇒ Φ ∗ Φ r . Entailment provers for separation logic deal with the heap part ( κ ) of the formula andreduce the entailment checking to satisﬁability queries over the pure part ( π ). We now show how this reasoning canbe extended to deal with the new constant representing inﬁnity ( ∞ ). A satisﬁability check over pure formula with ∞ is reduced to a satisﬁability check over a formula without ∞ which can be discharged by using existing solvers (likeOmega). In order to eliminate ∞ from the formula we take help of the equisatisﬁable normalization rules shown inﬁgure 2 and proceed as follows. SAT ( π ) substitute equalities ( v = ∞ )= ⇒ SAT ([ v / ∞ ] π ) normalization = ⇒ SAT ( π ; π norm ) elimintate ∞ = ⇒ SAT ([ ∞ / v ∞ ] π norm ) We start with substituting any equalities with ∞ constants then we apply the normalization rules. The normalizationrules eliminate certain expressions containing ∞ based on comparison with integer constants ( k ) and variables ( v ). Weshow rules for both ∞ and − ∞ in ﬁgure 2. In the normalization rules we use a = a as a shorthand for ¬ ( a = a ) .During normalization, we may generate some equalities involving ∞ (in [ NORM − VAR − INF ] ). In that case, we normalizeagain after substituting the new equalities in the pure formula. Once no further equalities are generated we eliminatethe remaining ∞ constant if any by replacing it with a fresh integer variable v ∞ in the pure formula. The pure formulanow does not contain any inﬁnite constants and a satisﬁability check on the formula can now be done using existingmethods. 5 NORM − INF − INF ] ∞ = ∞ ; true ∞ = ∞ ; false ∞ ≤ ∞ ; true ∞ = − ∞ ; false ∞ = − ∞ ; true ∞ ≤− ∞ ; false − ∞ = − ∞ ; true − ∞ = − ∞ ; false − ∞ ≤− ∞ ; true − ∞ ≤ ∞ ; true [ NORM − CONST − INF ] k = ∞ ; false k = ∞ ; true k ≤ ∞ ; true ∞ ≤ k ; false k = − ∞ ; false k = − ∞ ; true k ≤− ∞ ; false − ∞ ≤ k ; true [ NORM − VAR − INF ] v ≤ ∞ ; true ∞ ≤ v ; v = ∞ v ≤− ∞ ; v = − ∞ − ∞ ≤ v ; true [ NORM − MIN − MAX ] min ( a , ∞ ) ; amax ( a , ∞ ) ; ∞ min ( a , − ∞ ) ; − ∞ max ( a , − ∞ ) ; a Figure 2.

Equisatisﬁable NormalizationEnriching the speciﬁcation language with inﬁnite constants is quite useful as it allows users to specify properties(integer overﬂows) using ∞ as demostrated in the motivating examples. The underlying entailment procedure canautomatically handle ∞ by equisatisﬁable normalization. VERIFICATION WITH INTEGER OVERFLOW

Our core imperative language is presented in ﬁgure 3. A program P comprises of a list of data structure declarations tdecl ∗ and a list of method declarations meth ∗ (we use the superscript ∗ to denote a list of elements). Data structuredeclaration can be a simple node datat or a recursive shape predicate declaration pred as shown in ﬁgure 1.A method is declared with a prototype, its body e , and multiple speciﬁcation mspec ∗ . The prototype comprisesa method return type, method name and method’s formal parameters. The parameters can be passed by value orby reference with keyword ref and their types can be primitive τ or user-deﬁned c . A method’s body consists ofa collection of statements. We provide basic statements for accessing and modifying shared data structures and forexplicit allocation of heap storage. It includes:1. Allocation statement: new c ( v ∗ ) Lookup statement: For simplifying the presentation but without loss of expressiveness, we just provide one-levellookup statement v . f rather than v . f . f .3. Mutation statement: v . f : = v In addition we provide core statements of an imperative language, such as semicolon statement e ; e , functioncall mn ( v ∗ ) , conditional statement if v e e , and loop statement while v e ( mspec ) ∗ . Note that for simplicity, wejust allow boolean variables (but not expression) to be used as the test conditions for conditional statements and loopstatement must be annotated with invariant through mspec ∗ . To illustrate some of the basic operations on integers in thelanguage we also show the addition operation between two integers (unsigned and signed) in ﬁgure 3 as k [ u ] int1 + k [ u ] int2 .We now present the modiﬁcations needed to do forward veriﬁcation with interger overﬂow. The core language usedby our system is a C-like imperative language described in ﬁgure 3. The complete set of forward veriﬁcation rules are6 :: = tdecl ∗ meth ∗ tdecl :: = datat | preddatat :: = data c { ﬁeld ∗ } ﬁeld :: = t vt :: = c | ττ :: = uint | int | bool | float | void meth :: = t mn (([ ref ] t v ) ∗ ) where ( mspec ) ∗ { e } e :: = null | k τ | k [ u ] int + k [ u ] int | v | v . f | v : = e | v . f : = v | new c ( v ∗ ) | e ; e | t v ; e | mn ( v ∗ ) | if v then e else e | while v do e ( mspec ) ∗ Figure 3.

A Core Imperative Languageas given in (Chin et al., 2012). We use P to denote the program being checked. With the pre/post conditions declaredfor each method in P, we can now apply modular veriﬁcation to its body using Hoare-style triples ⊢ { ∆ } e { ∆ } . Weexpect ∆ to be given before computing ∆ since the rules are based on a forward veriﬁer. To capture proof search,we generalize the forward rule to the form ⊢ { ∆ } e { Ψ } where Ψ is a set of heap states, discovered by a search-basedveriﬁcation process (Chin et al., 2012). When Ψ is empty, the forward veriﬁcation is said to have failed for ∆ asprestate. As most of the forward veriﬁcation rules are standard (Nguyen et al., 2007), we only provide those for methodveriﬁcation and method call. Veriﬁcation of a method starts with each precondition, and proves that the correspondingpostcondition is guaranteed at the end of the method. The veriﬁcation is formalized in the rule [ FV − [ METH ] ] : • function prime(V) returns { v ′ | v ∈ V } . • predicate nochange(V) returns V v ∈ V ( v = v ′ ) . If V = {} , nochange(V)=true . • ∃ W · Ψ returns {∃ W · Ψ i | Ψ i ∈ Ψ } . [ FV − [ METH ] ] V = { v m .. v n } W = prime ( V ) ∀ i = , .., p · ( ⊢ { Φ ipr ∧ nochange ( V ) } e { Ψ i } ( ∃ W · Ψ i ) ⊢ κ V , I Φ ipo ∗ Ψ i Ψ i = {} ) t mn (( ref t j v j ) m − j = , ( t j v j ) nj = m ) { requires Φ ipr ensures Φ ipo } pi = { e } At a method call, each of the method’s precondition is checked, ∆ ⊢ κ V , I ρΦ ipr ∗ Ψ i , where ρ represents a substitutionof v j by v ′ j , for all j = ,.., n . The combination of the residue Ψ i and the postcondition is added to the poststate. If aprecondition is not entailed by the program state ∆ , the corresponding residue is not added to the set of states. The test Ψ = {} ensures that at least one precondition is satisﬁed. Note that we use the primed notation for denoting the latestvalue of a variable. Correspondingly, [ v ′ / v i ] is a substitution that replaces the value v i with the latest value of v ′ . [ FV − [ CALL ] ] t mn (( ref t j v j ) m − j = , ( t j v j ) nj = m ) { requires Φ ipr ensures Φ ipo } pi = { e } ∈ P ρ =[ v ′ j / v j ] nj = m ∆ ⊢ κ V , I ρΦ ipr ∗ Ψ i ∀ i = , .., p Ψ = S pi = Φ ipo ∗ Ψ i Ψ = {}⊢ { ∆ } mn ( v .. v n ) { Ψ }

7n order to integrate integer overﬂow checking with automated veriﬁcation we ﬁrst translate the basic operations inthe core language (like integer addition) to method calls to speciﬁc functions which do integer overﬂow checking. Inthis paper we illustrate the veriﬁcation using only the addition overﬂow, however similar translations can be done forother operators like multiplication (Moy et al., 2009) etc.. The addition operation for unsigned integers k uint1 + k uint2 is translated to the method uadd whose speciﬁcation is given below. int uadd ( uint k , uint k ) requires k + k > ∞ ensures ( true ) ioc ; requires k + k ≤ ∞ ensures res = k + k ;The addition of unsigned integers overﬂows when their sum is greater than ∞ . The case of signed integer overﬂowhas several cases. We translate addition of signed integers to the method add . The speciﬁcation of the add methodcovers all the cases for signed integer overﬂow as detailed in (Dannenberg et al., 2010). int add ( int k , int k ) requires k > ∧ k > ∧ k + k > ∞ ∨ k > ∧ k ≤ ∧ k + k < − ∞ ∨ k ≤ ∧ k > ∧ k + k < − ∞ ∨ k ≤ ∧ k ≤ ∧ k + k < ∞ ∧ k = ( true ) ioc ; requires k > ∧ k > ∧ k + k ≤ ∞ ∨ k > ∧ k ≤ ∧ k + k ≥− ∞ ∨ k ≤ ∧ k > ∧ k + k ≥− ∞ ∨ k ≤ ∧ k ≤ ∧ ( k + k ≥− ∞ ∨ k = ) ensures res = k + k ;The speciﬁcation of these methods ( uadd and add ) uses the inﬁnite constants ( ∞ and − ∞ ) from the enrichedspeciﬁcation language given in the previous section. An expressive speciﬁcation language reduces the task of integeroverﬂow checking to just specifying and verifying of appropriate methods. After translation of basic operators intomethod calls, during forward veriﬁcation the [ FV − [ CALL ] ] rule will ensure that we check each operation for integeroverﬂow. Thus a simple encoding of basic operators and translation of the source program before veriﬁcation enablesus to do integer overﬂow checking along with automated veriﬁcation. SOUNDNESS

In this section we outline the soundness properties of our entailment procedure with inﬁnities and the forward ver-iﬁer with integer overﬂow checking. We assume the soundness of the underlying entailment checker and veriﬁer(Chin et al., 2012).

Lemma 1. (Equisatisﬁable Normalization) If π ; π norm then SAT ( π ) = ⇒ SAT ( π norm ) and SAT ( π norm ) = ⇒ SAT ( π ) Proof

We sketch the proof for each normalization rule given in ﬁgure 2.case [ NORM − INF − INF ] : From the ﬁrst rule we get, SAT ( ∞ = ∞ ) ≡ true and the normalization gives ∞ = ∞ ; true , since SAT ( true ) ≡ true we have, SAT ( ∞ = ∞ ) = ⇒ SAT ( ∞ = ∞ ; true ) and SAT ( ∞ = ∞ ; true ) = ⇒ SAT ( ∞ = ∞ ) .Hence the normalization preserves satisﬁability of pure formulas. We can prove the other rules in [ NORM − INF − INF ] similarly.case [ NORM − CONST − INF ] : From the ﬁrst rule we get, SAT ( k = ∞ ) ≡ false and the normalization gives k = ∞ ; false , since SAT ( false ) ≡ false

8e have,

SAT ( k = ∞ ) = ⇒ SAT ( k = ∞ ; false ) and SAT ( k = ∞ ; false ) = ⇒ SAT ( k = ∞ ) .Hence the normalization preserves satisﬁability of pure formulas. We can prove the other rules in [ NORM − CONST − INF ] similarly.case [ NORM − VAR − INF ] : We sketch the proof for the following rule, SAT ( ∞ ≤ v ) ⇐⇒ SAT ( ∞ < v ∨ ∞ = v ) ⇐⇒ SAT ( false ∨ ∞ = v ) we have, SAT ( ∞ ≤ v ) = ⇒ SAT ( ∞ ≤ v ; v = ∞ ) and SAT ( ∞ ≤ v ; v = ∞ ) = ⇒ SAT ( ∞ ≤ v ) Hence the normalization preserves satisﬁability of pure formulas. Other rules from [ NORM − VAR − INF ] can be provensimilarly.case [ NORM − MIN − MAX ] : We sketch the proof for the following rule, SAT ( max ( a , ∞ )) ⇐⇒ SAT (( a > ∞ = ⇒ a ) ∨ ( a ≤ ∞ = ⇒ ∞ )) ⇐⇒ SAT (( false = ⇒ a ) ∨ ( a ≤ ∞ = ⇒ ∞ )) ⇐⇒ SAT (( true ) ∨ ( a ≤ ∞ = ⇒ ∞ )) ⇐⇒ SAT ( a ≤ ∞ = ⇒ ∞ ) ⇐⇒ SAT ( true = ⇒ ∞ ) ⇐⇒ SAT ( ∞ ) we have, SAT ( max ( a , ∞ )) = ⇒ SAT ( max ( a , ∞ ) ; ∞ ) and SAT ( max ( a , ∞ ) ; ∞ ) = ⇒ SAT ( max ( a , ∞ )) Hence the normalization preserves satisﬁability of pure formulas. Other rules from [ NORM − MIN − MAX ] can be provensimilarly. Lemma 2. (Soundness of Integer Overﬂow Checking)

If the program e has an integer overﬂow ( ioc ) then,with forward veriﬁcation ⊢ { ∆ } e { Ψ } , we have ( true ) ioc ∈ Ψ Proof

Provided all basic operators on integers in the program are translated to method calls that check for integeroverﬂows. The soundness of integer overﬂow checking follows from lemma 1 and the soundness of error calculus(Le et al., 2013).The soundness of heap entailment and forward veriﬁcation with separation logic based speciﬁcations is alreadyestablished in (Chin et al., 2012). Lemma 1 establishes that the normalization rules indeed preserve the satisﬁabilityof pure formulas. Lemma 2 then shows that the integer overﬂow checking with forward veriﬁcation is sound. If theprogram has an integer overﬂow the forward veriﬁcation with integer overﬂow checking detects it.

EXPERIMENTS

We have implemented our approach in an OCaml prototype called

HIPioc evaluate automated veriﬁcation using log-ical inﬁnities ( ∞ ) we created benchmark of several programs that use inﬁnite constants as sentinel values in searchingand sorting. As an example the following predicate deﬁnition of a sorted linked list uses ∞ in the base case to expressthat the minimum value in an empty list is inﬁnity. data { int val ; node next } Sortedll h root , min i≡ ( root = null ∧ min = ∞ ) ∨∃ q · ( root node h min , q i∗ Sortedll h q , minrest i ) ∧ min < minrest In addition,

HIPioc allows us to do integer overﬂow checking of programs during veriﬁcation. We have run

HIPioc on several existing veriﬁcation benchmarks which contain different kinds of programs. The benchmarks Available at http://loris-5.d2.comp.nus.edu.sg/SLPAInf/SLPAInf.ova (md5sum 4afb66d65bfa442726717844f46eb7b6)

Sorting ( with ∞ ) are the programs which use ∞ as sentinel value in predicate deﬁnitions as described above and do not contain integeroverﬂows. Comparing the times between HIPioc and previous version we see that the veriﬁcation with ∞ in generaladds some overhead. Benchmark LOC Num o f Time Time Integer FalsePrograms ( Total ) Programs ( Secs ) (

HIPioc ) Over f lows PositivesSorting ( with ∞ )

282 4 5 .

45 5 .

42 0 0

Arrays .

92 76 .

65 1 0

HIP / SLEEK (Chin et al., 2012) 5779 42 56 .

15 78 .

80 4 0

Imm (David and Chin, 2011) 2069 11 120 .

82 126 .

61 18 0

V Perm (Le et al., 2012) 778 14 3 .

43 3 .

46 3 0

Barriers (Hobor and Gherghina, 2012) 1281 10 60 .

54 60 .

83 16 0

SIR (Le et al., 2013) 2616 4 34 .

64 41 .

73 1 1

Total .

95 393 . RELATED WORK

There has been considerable interest in recent years to detect and prevent integer overﬂows in software (Cotroneo and Natella,2012). Dietz et al. (Dietz et al., 2012) present a study of integer overﬂows in C/C++ programs. They ﬁnd out thatintentional and unintentional integer overﬂows are prevalent in system software. Integer overﬂows often lead to ex-ploitable vulnerabilities in programs (Christey et al., 2011). In this paper we presented a method to detect uninten-tional integer overﬂows and provided a mechanism to specify intentional integer overﬂows. Program transformations(Coker and Haﬁz, 2013) can be used to guide the programmer and aid in refactoring the source code to avoid integeroverﬂows. Our focus is on speciﬁcation of intentional integer overﬂows which helps make the conditions under whichthe program may use an integer overﬂow explicit. It also aids in automated veriﬁcation as such cases can be validatedas error scenarios for the program.Most existing techniques for detecting integer overﬂows are focused on dynamic checking and testing of programs(Molnar et al., 2009; Wang et al., 2009; Chen et al., 2009; Brumley et al., 2007). Dynamic analysis suffers from thepath explosion problem and although several improvements in constraints solving have been proposed (Sharma, 2012,2013) the approach cannot guarantee the absence of integer overﬂows. There are not many veriﬁcation or staticanalysis tool that can do integer overﬂow checking. KINT (Wang et al., 2012) is a static analysis tool which candetect integer overﬂows by solving constraints generated from source code of programs. Another static analysis basedapproach by Moy et al. (Moy et al., 2009) uses the Z3 solver to do integer overﬂow checking as part of the PREFIXtool (Bush et al., 2000). A certiﬁed prover for presburger arithmetic extended with positive and negative inﬁnities hasbeen described in (Sharma, 2015; Sharma et al., 2015).Our focus in this paper is on integrating integer overﬂow checking with program veriﬁcation to improve the re-liability of veriﬁed software. Dynamic techniques may not explore all paths in the programs while static techniquessuffer form loss of precision in tracking integer overﬂows. Our speciﬁcation mechanism allows us to integrate integeroverﬂow checking inside a prover for speciﬁcation logic. This allows us to track integer overﬂows through the heapand inside various data structures. The beneﬁt of this integration is that we can detect numeric integer overﬂow errorsin programs with complex sharing and heap manipulation.10

ONCLUSION

Integer overﬂows are a major source of errors in programs. Most veriﬁcation systems do not focus on the underlyingnumeric operations on integers and do not handle integer overﬂow checking. We presented a technique to do integeroverﬂow checking of programs during veriﬁcation. Our speciﬁcation mechanism also allows expressing intentionaluses of integer overﬂows. We implemented a prototype of our proposal inside an existing veriﬁer and found realinteger overﬂow bugs in benchmarks of veriﬁed software.

REFERENCES

Brumley, D., Song, D. X., cker Chiueh, T., Johnson, R., and Lin, H. (2007). Rich: Automatically protecting againstinteger-based vulnerabilities. In

NDSS . 10Bush, W. R., Pincus, J. D., and Sielaff, D. J. (2000). A static analyzer for ﬁnding dynamic programming errors.

Softw.Pract. Exper. , 30(7):775–802. 10Chen, P., Wang, Y., Xin, Z., Mao, B., and Xie, L. (2009). Brick: A binary tool for run-time detecting and locatinginteger-based vulnerability. In

Availability, Reliability and Security, 2009. ARES ’09. International Conference on ,pages 208–215. 10Chin, W.-N., David, C., and Gherghina, C. (2011). A hip and sleek veriﬁcation system. In

OOPSLA Companion ,pages 9–10. 1Chin, W.-N., David, C., Nguyen, H. H., and Qin, S. (2012). Automated veriﬁcation of shape, size and bag propertiesvia user-deﬁned predicates in separation logic.

Sci. Comput. Program. , 77(9):1006–1036. 1, 4, 5, 7, 8, 9, 10Christey, S., Martin, R. A., Brown, M., Paller, A., and Kirby, D. (2011). 2011 CWE/SANS Top 25 Most DangerousSoftware Errors. Technical report, MITRE Corporation. http://cwe.mitre.org/top25. 1, 10Coker, Z. and Haﬁz, M. (2013). Program transformations to ﬁx c integers. In

Proceedings of the 2013 InternationalConference on Software Engineering , ICSE 2013. 10Cotroneo, D. and Natella, R. (2012). Monitoring of aging software systems affected by integer overﬂows. In

SoftwareReliability Engineering Workshops (ISSREW), 2012 IEEE 23rd International Symposium on , pages 265–270. 10Dannenberg, R. B., Dormann, W., Keaton, D., Seacord, R. C., Svoboda, D., Volkovitsky, A., Wilson, T., and Plum, T.(2010). As-if inﬁnitely ranged integer model. In

Proceedings of the 2010 IEEE 21st International Symposium onSoftware Reliability Engineering , ISSRE ’10, pages 91–100, Washington, DC, USA. IEEE Computer Society. 8David, C. and Chin, W.-N. (2011). Immutable speciﬁcations for more concise and precise veriﬁcation. In

OOPSLA ,pages 359–374. 10Dietz, W., Li, P., Regehr, J., and Adve, V. (2012). Understanding integer overﬂow in c/c++. In

Proceedings of the2012 International Conference on Software Engineering , ICSE 2012, pages 760–770, Piscataway, NJ, USA. IEEEPress. 1, 2, 10Dolzmann, A. and Sturm, T. (1997). Redlog: computer algebra meets computer logic.

SIGSAM Bull. , 31:2–9. 4Dowd, M., McDonald, J., and Schuh, J. (2006).

Art of Software Security Assessment, The: Identifying and PreventingSoftware Vulnerabilities . Addison-Wesley Professional. 3Hobor, A. and Gherghina, C. (2012). Barriers in concurrent separation logic: Now with tool support!

Logical Methodsin Computer Science , 8(2). 10Ishtiaq, S. and O’Hearn, P. (2001). BI as an assertion language for mutable data structures. In

ACM POPL , pages14–26, London. 4Klarlund, N. and Moller, A. (2001). MONA Version 1.4 - User Manual. BRICS Notes Series. 4Le, D.-K., Chin, W.-N., and Teo, Y. M. (2012). Variable permissions for concurrency veriﬁcation. In

ICFEM , pages5–21. 10Le, Q. L., Sharma, A., Craciun, F., and Chin, W.-N. (2013). Towards complete speciﬁcations with an error calculus.In

NASA Formal Methods . 2, 9, 10Molnar, D., Li, X. C., and Wagner, D. A. (2009). Dynamic test generation to ﬁnd integer bugs in x86 binary linuxprograms. In

Proceedings of the 18th conference on USENIX security symposium , SSYM’09, pages 67–82, Berkeley,CA, USA. USENIX Association. 10Moy, Y., Bjørner, N., and Sielaff, D. (2009). Modular bug-ﬁnding for integer overﬂows in the large: Sound, efﬁcient,bit-precise static analysis. Technical report, Microsoft Research. 8, 1011guyen, H., David, C., Qin, S., and Chin, W. (2007). Automated Veriﬁcation of Shape And Size Properties viaSeparation Logic. In

VMCAI , pages 251–266. 7Pugh, W. (1992). The Omega Test: A fast practical integer programming algorithm for dependence analysis.

Commu-nications of the ACM , 8:102–114. 4Reynolds, J. (2002). Separation Logic: A Logic for Shared Mutable Data Structures. In

IEEE LICS , pages 55–74. 4Sharma, A. (2012). A critical review of dynamic taint analysis and forward symbolic execution. Technical report,National University of Singapore. 10Sharma, A. (2013). An empirical study of path feasibility queries. arXiv preprint arXiv:1302.4798 . 10Sharma, A. (2015).

Certiﬁed Reasoning for Automated Veriﬁcation . PhD thesis, National University of Singapore. 10Sharma, A., Wang, S., Costea, A., Hobor, A., and Chin, W.-N. (2015). Certiﬁed reasoning with inﬁnity. In

Interna-tional Symposium on Formal Methods , pages 496–513. Springer. 10Wang, T., Wei, T., Lin, Z., and Zou, W. (2009). Intscope: Automatically detecting integer overﬂow vulnerability inx86 binary using symbolic execution. In

NDSS . 10Wang, X., Chen, H., Jia, Z., Zeldovich, N., and Kaashoek, M. F. (2012). Improving integer security for systems withkint. In