Gradient expansion approach to nonlinear superhorizon perturbations II -- a single scalar field --

Abstract

We formulate nonlinear perturbations of a scalar field dominated universe on super-horizon scales. We consider the case of a single scalar field. We take the gradient expansion approach. We adopt the uniform Hubble slicing and derive the general solution valid to $O(\epsilon^2)$, where $\epsilon$ is the expansion parameter associated with a spatial derivative, which includes both the scalar and tensor modes. In particular, the $O(\epsilon^2)$ correction terms to the nonlinear curvature perturbation, which become important in models with a non-slow-roll stage during inflation, are explicitly obtained.


YITP-07-31


Yoshiharu TANAKA* and Misao SASAKI†

Yukawa Institute for Theoretical Physics, Kyoto University, Kyoto 606-8502, Japan

PACS numbers: 98.80.-k, 98.80.Cq

I. INTRODUCTION

The cosmic microwave background (CMB) anisotropies recently observed by WMAP strongly support the inflationary cosmology, and the theoretical predictions of various models of inflation seem to be well estimated by linear perturbation theory, with the cosmological perturbations generated from quantum vacuum fluctuations which are well approximated by Gaussian random fields [1]. Nevertheless, there has been a growing interest in the detection of possible deviations from Gaussian statistics. It was suggested that a deviation from Gaussian statistics may be used to distinguish models of inflation [7], and it may indeed be detected by PLANCK [3] in the near future. Thus, making clear predictions of the non-Gaussianity from inflation has become one of the urgent issues of the inflationary cosmology. Since nonlinearity is essential for the generation of non-Gaussian perturbations, it is necessary to develop a nonlinear cosmological perturbation theory to evaluate the non-Gaussianity from inflation.

Our goal is to formulate a nonlinear theory with which we can calculate non-Gaussianities from any model of inflation. The traditional approach is to develop a second-order perturbation theory [4, 5, 6, 8, 9]. However, here we adopt a different one, the gradient expansion approach, in which nonlinear perturbations are solved by invoking a spatial gradient expansion under the assumption that spatial derivatives are small compared to time derivatives. Technically, we introduce an expansion parameter, $\epsilon$, and associate it with each spatial derivative. Then we expand the field equations in terms of $\epsilon$ and solve them iteratively, order by order in $\epsilon$.
The gradient expansion approach has been developed and studied by many authors previously [11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 29, 31].

In cosmological situations, the gradient expansion approach is valid on scales greater than the Hubble horizon scale, and an advantage of the approach is that we can calculate perturbations to full nonlinear order in their amplitudes. This is particularly useful when dealing with general non-Gaussianity, for which it may be necessary to evaluate not only second-order perturbative corrections but also higher order corrections.

At the leading order in the gradient expansion, i.e. neglecting all the spatial gradients, Lyth, Malik and Sasaki [11] studied nonlinear scalar curvature perturbations, proved the nonlinear $\delta N$ formula, and constructed a gauge invariant (time-slice independent) nonlinear scalar curvature perturbation. Although this leading order approximation is sufficient for a large class of inflation models, there are models for which it is necessary to take into account the next order corrections. For example, in the context of the standard linear perturbation theory, Leach et al. pointed out that there can be an enhancement of the comoving curvature perturbation on superhorizon scales, where it is usually conserved, even in a single field model if the slow-roll condition is temporarily violated [28]. There it was shown that $O(k^2)$ corrections to the curvature perturbation on superhorizon scales, where $k$ is the comoving wavenumber, play a crucial role in the enhancement. Because $O(k^2)$ corrections correspond to $O(\epsilon^2)$ terms in the gradient expansion, this implies that it is necessary to include $O(\epsilon^2)$ terms in such a model. We then expect that the enhancement may give rise to large non-Gaussianity. Indeed, Chen et al. [10] numerically found in single-field inflation that large non-Gaussianity can be generated if the slow-roll condition is temporarily violated, using the third-order action derived by Maldacena [5].

In this paper, focusing on single-field inflation, we formulate the gradient expansion to $O(\epsilon^2)$ on the uniform Hubble slicing. In most of the previous studies, either only the leading order terms in the gradient expansion were discussed or the choice of time-slicing was not quite adequate for the study of non-Gaussianity from inflation. Here we adopt the uniform Hubble slicing because the curvature perturbation on this slicing directly determines the initial condition for the CMB anisotropies and the large scale structure of the universe.

We employ the (3+1)-decomposition of the Einstein equations, and consider a single scalar field with an arbitrary potential. We then derive the general solution for all the variables. As discussed for the case of a perfect fluid in [31], we find that the identification of the tensor mode in the spatial metric is rather arbitrary, depending on how one fixes the spatial coordinates, as a reflection of general covariance, while it can be unambiguously identified in the extrinsic curvature of the metric.

This paper is organized as follows. In Section II, we define the basic variables and describe the assumptions we adopt in the gradient expansion. Then we present the general solution for all the physical quantities to $O(\epsilon^2)$ on the uniform Hubble slicing. In Section III, we discuss the validity of the assumption adopted in Section II by appealing to linear theory and by considering the vacuum fluctuations of the scalar field at and around horizon crossing. We conclude the paper in Section IV. In Appendix A, the basic equations are presented. In Appendix B, the estimation of the orders of physical quantities in powers of $\epsilon$ is given.

* E-mail: yotanaka-AT-yukawa.kyoto-u.ac.jp
† E-mail: misao-AT-yukawa.kyoto-u.ac.jp
In Appendix C, the general solutions for all the physical quantities are derived.

II. GRADIENT EXPANSION

A. Basic variables

In the (3+1)-decomposition, the metric is expressed as

$$ ds^2 = g_{\mu\nu}dx^\mu dx^\nu = (-\alpha^2 + \beta_k\beta^k)\,dt^2 + 2\beta_i\,dx^i dt + \gamma_{ij}\,dx^i dx^j , \tag{2.1} $$

where $\alpha$, $\beta^i$ ($\beta_i = \gamma_{ij}\beta^j$), and $\gamma_{ij}$ are the lapse function, the shift vector, and the spatial metric, respectively. We rewrite $\gamma_{ij}$ as

$$ \gamma_{ij}(t,x^k) = a^2(t)\,\psi^4(t,x^k)\,\tilde\gamma_{ij}(t,x^k)\,; \qquad \det(\tilde\gamma_{ij}) = 1 , \tag{2.2} $$

where the function $a(t)$ is the scale factor of a fiducial homogeneous and isotropic background universe.

The extrinsic curvature $K_{ij}$ is defined by

$$ K_{ij} \equiv -\nabla_i n_j , \tag{2.3} $$

where $n_\mu = (-\alpha, 0, 0, 0)$ is the unit vector normal to the time slices. We decompose the extrinsic curvature as

$$ K_{ij} = \frac{\gamma_{ij}}{3}K + \psi^4 a^2 \tilde A_{ij}\,; \qquad K \equiv \gamma^{ij}K_{ij} , \tag{2.4} $$

where $\tilde A_{ij}$ represents the traceless part of $K_{ij}$. The indices of $K_{ij}$ are to be raised or lowered by $\gamma^{ij}$ and $\gamma_{ij}$, and the indices of $\tilde A_{ij}$ by $\tilde\gamma^{ij}$ and $\tilde\gamma_{ij}$.

The stress-energy tensor for a single scalar field is

$$ T_{\mu\nu} = \nabla_\mu\phi\,\nabla_\nu\phi - \frac{1}{2}g_{\mu\nu}\left(\nabla^\alpha\phi\,\nabla_\alpha\phi + 2V(\phi)\right) . \tag{2.5} $$

We define the local Hubble parameter as $1/3$ of the expansion of the unit normal vector $n^\mu$, which is equal to $-1/3$ of the trace of the extrinsic curvature,

$$ 3H \equiv -K = \frac{3\dot a}{\alpha a} + \frac{6\,\partial_t\psi}{\alpha\psi} - \frac{D_i\beta^i}{\alpha} , \tag{2.6} $$

where a dot denotes $d/dt$ and $D_i$ is the covariant derivative with respect to $\gamma_{ij}$. In the following, we adopt the uniform Hubble slicing. For this slicing, we have

$$ H(t) = \frac{\dot a}{a} , \tag{2.7} $$

and Eq. (2.6) implies

$$ \alpha = 1 + \frac{2\,\partial_t\psi}{H\psi} - \frac{D_i\beta^i}{3H} . \tag{2.8} $$
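As a quick consistency check of Eqs. (2.6)-(2.8), the following minimal sketch evaluates the lapse of Eq. (2.8) at a single point and confirms that inserting it back into Eq. (2.6) returns the background value $\dot a/a$. The function names and the numerical inputs are illustrative assumptions, not quantities taken from the paper.

```python
def lapse_uniform_hubble(H_bg, dpsi_dt, psi, div_beta):
    """Lapse on the uniform Hubble slicing, Eq. (2.8).

    H_bg is (da/dt)/a; div_beta stands for D_i beta^i at the point considered.
    """
    return 1.0 + 2.0 * dpsi_dt / (H_bg * psi) - div_beta / (3.0 * H_bg)


def local_hubble(H_bg, dpsi_dt, psi, div_beta, alpha):
    """Local Hubble rate H = -K/3 from Eq. (2.6)."""
    return (3.0 * H_bg + 6.0 * dpsi_dt / psi - div_beta) / (3.0 * alpha)


# Arbitrary illustrative values (hypothetical, chosen only for the check).
H_bg, dpsi_dt, psi, div_beta = 0.7, 0.013, 1.2, -0.004

alpha = lapse_uniform_hubble(H_bg, dpsi_dt, psi, div_beta)
H_local = local_hubble(H_bg, dpsi_dt, psi, div_beta, alpha)
print(alpha, H_local)
```

With this lapse the local expansion rate is the same at every point, which is the defining property of the slicing.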

B. Expansion scheme

We investigate nonlinear superhorizon perturbations with the gradient expansion approach, which goes by various names in the literature: the quasi-isotropic expansion [12, 13, 19, 21], the anti-Newtonian approximation [14, 15], the spatial gradient expansion [16, 17, 18, 20, 22, 23, 24, 25], or the long wavelength approximation [26, 29].

In this approach, we assume that the characteristic length scale $L$ of inhomogeneities is always much larger than the Hubble horizon scale, $L \gg H^{-1} \sim t$. We introduce a small parameter $\epsilon$, and assume that $L$ is of $O(1/\epsilon)$. This assumption is equivalent to assuming that the magnitude of spatial gradients is given by $\partial_i\psi = \psi \times O(\epsilon)$, $\partial_i\alpha = \alpha \times O(\epsilon)$, etc. In the limit $L \to \infty$, i.e., $\epsilon \to 0$, the universe looks locally like a FLRW spacetime, where 'locally' means as seen on the scale of the Hubble horizon volume. Note that physical quantities which are approximately homogeneous on each Hubble horizon scale can vary nonlinearly on very large scales.

The local homogeneity and isotropy imply that $\beta^i = O(\epsilon)$ and $\partial_t\tilde\gamma_{ij} = O(\epsilon)$, because the local FLRW equations should be realized in the limit $\epsilon \to 0$. However, we further assume that $\beta^i = O(\epsilon^3)$ and $\partial_t\tilde\gamma_{ij} = O(\epsilon^2)$. Technically, these additional assumptions make the analysis of the $O(\epsilon^2)$ correction terms much simpler, as discussed in Appendix C.1. Physically, of course, it is necessary to justify them. The former assumption on $\beta^i$ is just a matter of choice of the spatial coordinates, so it does not cause any loss of generality. In fact, once we obtain the solution, it is straightforward to express it in a more general spatial coordinate system by the coordinate transformation $x^i \to \bar x^i = F^i(t, x^k)$ such that $\partial_t F^i = O(\epsilon)$, corresponding to $\bar\beta^i = O(\epsilon)$ in the new coordinate system. The latter assumption on $\partial_t\tilde\gamma_{ij}$, however, is not simply a matter of choice. In Sec. III, with the help of the result from linear theory, we give a convincing argument, if not a rigorous one, that this assumption is indeed satisfied for perturbations arising from the vacuum fluctuations. To summarize, our basic assumptions are

$$ \beta^i = O(\epsilon^3) , \qquad \partial_t\tilde\gamma_{ij} = O(\epsilon^2) . \tag{2.9} $$

Applying these assumptions to the Einstein-scalar field equations, we find

$$ \psi = O(1) , \qquad \alpha - 1 = O(\epsilon^2) , \qquad \partial_t\psi = O(\epsilon^2) , \qquad \tilde A_{ij} = O(\epsilon^2) . \tag{2.10} $$

These estimates are derived in Appendix B. Here one comment is in order. The fact that $\partial_t\psi = O(\epsilon^2)$ means that $\psi$ is conserved if the $O(\epsilon^2)$ corrections can be neglected. This result was derived by Salopek and Bond [16] for a single scalar field system, and by Lyth, Malik, and Sasaki [11] for more general systems.

C. General solution

Here we present the general solution for all the physical quantities, valid to $O(\epsilon^2)$ in the gradient expansion. We defer the derivation to Appendix C because it is not much different from the one we gave in the previous paper [31], except for the fact that the present paper deals with the case of a scalar field while the previous paper dealt with a perfect fluid.

The general solution is

$$ \alpha = 1 + \frac{2\dot\phi_*}{a^3\dot\phi^3}\left[ {}^{(2)}C(x^k)\left( 2a\dot\phi + \frac{dV}{d\phi}\int_{t_*}^{t} a(t')\,dt' \right) + {}^{(2)}D(x^k)\,\frac{dV}{d\phi} \right] , \tag{2.11} $$

$$ \psi = {}^{(0)}L(x^k)\left( 1 + \frac{1}{2}\int_{t_*}^{t} (\alpha - 1)\,H\,dt' \right) , \tag{2.12} $$

$$ \tilde\gamma_{ij} = {}^{(0)}f_{ik}(x^\ell)\left( \delta^k{}_j - 2\,{}^{(2)}F^k{}_j(x^\ell)\int_{t_*}^{t}\frac{dt'}{a^3(t')}\int_{t_*}^{t'} a(t'')\,dt'' - 2\,{}^{(2)}C^k{}_j(x^\ell)\int_{t_*}^{t}\frac{dt'}{a^3(t')} \right) , \tag{2.13} $$

$$ \tilde A_{ij} = \frac{{}^{(2)}F_{ij}(x^k)}{a^3}\int_{t_*}^{t} a(t')\,dt' + \frac{{}^{(2)}C_{ij}(x^k)}{a^3} , \tag{2.14} $$

$$ \tilde\phi = \phi(t) + {}^{(2)}C(x^k)\,\frac{\dot\phi_*}{a^3\dot\phi}\int_{t_*}^{t} a(t')\,dt' + {}^{(2)}D(x^k)\,\frac{\dot\phi_*}{a^3\dot\phi} , \tag{2.15} $$

where the index $(n)$ is attached to a quantity of $O(\epsilon^n)$, except for the scalar field. For notational simplicity, the full scalar field is denoted by $\tilde\phi$, and the lowest order scalar field ${}^{(0)}\phi$, which depends only on time, is denoted simply by $\phi$. The time $t_*$ is an arbitrary reference time and $\dot\phi_* = \dot\phi(t_*)$. The tensor ${}^{(2)}F_{ij}$ and the scalar ${}^{(2)}C$ are given by Eqs. (C14) and (C34), respectively, as functions of ${}^{(0)}L$ and ${}^{(0)}f_{ij}$. The tensor ${}^{(2)}C_{ij}$ is traceless with respect to ${}^{(0)}f_{ij}$. The function ${}^{(2)}D$ is related to ${}^{(2)}C_{ij}$ through the momentum constraint (C36).

To clarify the physical role of these freely specifiable functions, let us first count the degrees of freedom. Since ${}^{(2)}C$ and ${}^{(2)}F_{ij}$ are determined by ${}^{(0)}L$ and ${}^{(0)}f_{ij}$, they have no degrees of freedom. Since the determinant of ${}^{(0)}f_{ij}$ is unity, it has 5 degrees of freedom, and since ${}^{(2)}C_{ij}$ is traceless, it also has 5 degrees of freedom. In addition we have ${}^{(0)}L$ and ${}^{(2)}D$, each of which counts as 1 degree of freedom. The momentum constraints, which consist of 3 equations, relate ${}^{(2)}C_{ij}$ to ${}^{(2)}D$ and reduce the total number of degrees of freedom by 3. Adding these together, the total number is $5 + 5 + 1 + 1 - 3 = 9$. Of these, 3 are gauge degrees of freedom associated with the purely spatial coordinate transformation $x^i \to \bar x^i = f^i(x^j)$. Thus we may regard ${}^{(0)}f_{ij}$ as containing these 3 gauge degrees of freedom. The correspondence between the nonlinear solutions and the solutions in linear theory can be understood from the time dependence of the nonlinear solutions: ${}^{(0)}L$ and ${}^{(0)}f_{ij}$ represent growing modes, and ${}^{(2)}D$ and ${}^{(2)}C_{ij}$ represent decaying modes.

To summarize, the degrees of freedom contained in the freely specifiable functions can be interpreted as

$$ \begin{aligned} {}^{(0)}L &\;\cdots\; 1 \ \text{(scalar growing mode)} , \\ {}^{(0)}f_{ij} &\;\cdots\; 5 = 2 \ \text{(tensor growing modes)} + 3 \ \text{(gauge modes)} , \\ {}^{(2)}C_{ij} &\;\cdots\; 5 = 2 \ \text{(tensor decaying modes)} + 3 \ \text{(fixed by the momentum constraints)} , \\ {}^{(2)}D &\;\cdots\; 1 \ \text{(scalar decaying mode)} . \end{aligned} \tag{2.16} $$

To fix the gauge completely, one has to impose 3 spatial gauge conditions on ${}^{(0)}f_{ij}$ to extract the physical tensor degrees of freedom. As discussed in Ref. [31], this cannot be done in a spatially covariant way because ${}^{(0)}f_{ij}$ is the metric and any covariant derivative of it vanishes identically. Thus the tensor modes cannot be extracted from ${}^{(0)}f_{ij}$ unless we introduce a certain 'background' metric. This is an important difference from the linear case, in which there exists a background metric.

In contrast, we may identify the tensor modes in the extrinsic curvature, because its transverse-traceless part can be extracted unambiguously [30, 31]. Namely, the transverse part of ${}^{(0)}L^6\tilde A_{ij}$ can be determined uniquely and it can be identified as the tensor modes. As noted in Ref. [31], this implies that the tensor modes in the extrinsic curvature are determined non-locally, and that they exist even for a trivial ${}^{(0)}f_{ij}$, say for ${}^{(0)}f_{ij} = \delta_{ij}$. This generation of tensor modes in the extrinsic curvature is a result of nonlinear interactions of the scalar modes. We note, however, that it is not obvious whether the tensor modes we identified can be called gravitational waves. To make a clear connection between the tensor modes and gravitational waves, it is necessary to evolve the system until the scale of interest becomes sufficiently smaller than the Hubble scale. This is beyond the scope of the present paper.

III. VALIDITY OF THE ASSUMPTION IN THE LINEAR LIMIT

In this section, we discuss the validity of our central assumption, $\partial_t\tilde\gamma_{ij} = O(\epsilon^2)$. With the help of linear theory, we argue that it can be physically justified. We first consider the equation for the curvature perturbation during inflation, and argue that the condition $\partial_t\tilde\gamma_{ij} = O(\epsilon^2)$ is naturally satisfied. Then, considering the quantum fluctuations as the source of the curvature perturbation, we explicitly show that this assumption holds not only in the case of slow-roll inflation but also in the case of the Starobinsky model, in which the slow-roll condition is temporarily violated due to a sudden change in the slope of the potential.

Since the gradient expansion is effective only on superhorizon scales, as the initial condition for the quantities we calculate, we need to know their behavior when their wavelength crosses the Hubble horizon radius. To do so, we assume that linear perturbation theory is a sufficiently good approximation up to and around the Hubble horizon scale. For linearized quantities, we follow the notation of Kodama and Sasaki [2].

As usual, we expand the linearized quantities in terms of spatial harmonics. The background is a spatially flat FLRW universe. We introduce the scalar, vector and tensor harmonics, $Y_k$, $Y^{(1)}_{k\,i}$ and $Y^{(2)}_{k\,ij}$, respectively,

$$ (\Delta + k^2)Y_k = 0 , \qquad (\Delta + k^2)Y^{(1)}_{k\,i} = 0 \,;\ \ \partial^i Y^{(1)}_{k\,i} = 0 , \qquad (\Delta + k^2)Y^{(2)}_{k\,ij} = 0 \,;\ \ \delta^{ij}Y^{(2)}_{k\,ij} = \partial^i Y^{(2)}_{k\,ij} = 0 , \tag{3.1} $$

where $\Delta = \partial^i\partial_i$ is the 3-dimensional Laplacian. For the sake of notational simplicity, we suppress the mode index $k$ below. For a universe dominated by a scalar field, it is known that the vector perturbations are not excited. Therefore, we ignore the vector modes.

We express the metric as

$$ ds^2 = a^2(\eta)\left[ -(1 + 2AY)\,d\eta^2 - 2BY_j\,dx^j d\eta + \left( \delta_{ij} + 2H_L Y\delta_{ij} + 2H_T Y_{ij} + 2H^{(2)}_T Y^{(2)}_{ij} \right) dx^i dx^j \right] , \tag{3.2} $$

where $d\eta = dt/a(t)$ is the conformal time, and

$$ Y_j \equiv -k^{-1}\partial_j Y , \qquad Y_{ij} \equiv k^{-2}\left[ \partial_i\partial_j - \frac{1}{3}\delta_{ij}\Delta \right] Y . \tag{3.3} $$

The correspondences of these to the metric components defined in Section II are

$$ \alpha = 1 + AY , \qquad \beta_j = -aBY_j , \qquad \psi = 1 + \frac{1}{2}H_L Y , \qquad \tilde\gamma_{ij} = \delta_{ij} + 2H_T Y_{ij} + 2H^{(2)}_T Y^{(2)}_{ij} . \tag{3.4} $$

In passing, we note that our assumption $\partial_t\tilde\gamma_{ij} = O(\epsilon^2)$ corresponds to $\partial_t H_T = O(\epsilon^2)$ and $\partial_t H^{(2)}_T = O(\epsilon^2)$ in the linear limit. We also introduce the quantities

$$ \mathcal{R} \equiv H_L + \frac{1}{3}H_T , \tag{3.5} $$

$$ \sigma_g \equiv k^{-1}H_T' - B , \tag{3.6} $$

$$ \mathcal{K} \equiv -A + \frac{k}{3\mathcal{H}}B + \mathcal{H}^{-1}H_L' = -A + \mathcal{H}^{-1}\mathcal{R}' - \frac{k}{3\mathcal{H}}\sigma_g , \tag{3.7} $$

where a prime denotes a conformal time derivative $d/d\eta$. These quantities are known to be independent of the choice of the spatial coordinates; they depend only on the choice of time-slicing. The first one, $\mathcal{R}$, is called the curvature perturbation because the spatial curvature scalar $R[\gamma]$ is given by

$$ R[\gamma] = -\frac{4}{a^2}\Delta[\mathcal{R}Y] . \tag{3.8} $$

The variables $\mathcal{K}$ and $\sigma_g$ represent the perturbations in the extrinsic curvature,

$$ K = -3H(1 + \mathcal{K}Y) , \qquad \tilde A_{ij} = -\frac{k}{a}\sigma_g Y_{ij} . \tag{3.9} $$

Under the gauge transformation induced by an infinitesimal change of time-slicing $\bar\eta = \eta + TY$, the quantities $\mathcal{R}$, $\sigma_g$ and $\mathcal{K}$ transform as

$$ \bar{\mathcal{R}} = \mathcal{R} - \mathcal{H}T , \tag{3.10} $$

$$ \bar\sigma_g = \sigma_g - kT , \tag{3.11} $$

$$ \bar{\mathcal{K}} = \mathcal{K} + \mathcal{H}^{-1}\left( \mathcal{H}^2 - \mathcal{H}' + \frac{k^2}{3} \right) T , \tag{3.12} $$

where $\mathcal{H} \equiv aH$ is the conformal Hubble parameter.

Since the standard linear calculation gives the curvature perturbation on the comoving slicing, we need to relate the quantities on the comoving slicing to those on the uniform Hubble slicing. In particular, analytic expressions for the comoving curvature perturbations in the Starobinsky model resulting from the quantum fluctuations are given in [27]. Here and in what follows, we denote a quantity on the comoving and uniform Hubble slicings by the subscripts $c$ and $H$, respectively. Thus, we first express the geometrical quantities on the comoving slicing in terms of $\mathcal{R}_c$. Then we consider an infinitesimal transformation from the comoving slicing to the uniform Hubble slicing.

The $(0,\mu)$-components of the Einstein equations on the comoving slicing give

$$ \delta G^0{}_0 = \kappa^2\delta T^0{}_0 \,; \qquad 2\left[ 3\mathcal{H}^2 A_c + \mathcal{H}k(\sigma_g)_c - 3\mathcal{H}\mathcal{R}_c' - k^2\mathcal{R}_c \right] = \kappa^2\phi'^2 A_c , \tag{3.13} $$

$$ \delta G^0{}_j = \kappa^2\delta T^0{}_j \,; \qquad \mathcal{H}A_c - \mathcal{R}_c' = 0 , \tag{3.14} $$

where $\kappa^2 = 8\pi G$. From these, we have

$$ A_c = \frac{1}{\mathcal{H}}\mathcal{R}_c' , \qquad k(\sigma_g)_c = \frac{1}{\mathcal{H}}\left( k^2\mathcal{R}_c + \frac{\kappa^2\phi'^2}{2\mathcal{H}}\mathcal{R}_c' \right) , \tag{3.15} $$

where we note that the background equation gives the relation

$$ -a^2\dot H = -a\left( \frac{\mathcal{H}}{a} \right)' = \mathcal{H}^2 - \mathcal{H}' = \frac{\kappa^2}{2}\phi'^2 . \tag{3.16} $$

Then the second equality of Eq. (3.7) gives

$$ \mathcal{K}_c = -\frac{k}{3\mathcal{H}}(\sigma_g)_c = -\frac{1}{3\mathcal{H}^2}\left( k^2\mathcal{R}_c + \frac{\kappa^2\phi'^2}{2\mathcal{H}}\mathcal{R}_c' \right) . \tag{3.17} $$

Since $\mathcal{K}$ is zero on the uniform Hubble slicing by definition, we have

$$ 0 = \mathcal{K}_H = \mathcal{K}_c + \mathcal{H}^{-1}\left( \mathcal{H}^2 - \mathcal{H}' + \frac{k^2}{3} \right) T . \tag{3.18} $$

This gives $T$ in terms of $\mathcal{K}_c$, which in turn is given by Eq. (3.17). The result is

$$ \mathcal{H}T = \frac{1}{\frac{3}{2}\kappa^2\phi'^2 + k^2}\left( k^2\mathcal{R}_c + \frac{\kappa^2\phi'^2}{2\mathcal{H}}\mathcal{R}_c' \right) . \tag{3.19} $$

Thus, we obtain

$$ \mathcal{R}_H = \left( H_L + \frac{1}{3}H_T \right)_H = \mathcal{R}_c - \frac{1}{\frac{3}{2}\kappa^2\phi'^2 + k^2}\left( k^2\mathcal{R}_c + \frac{\kappa^2\phi'^2}{2\mathcal{H}}\mathcal{R}_c' \right) , \tag{3.20} $$

$$ (k\sigma_g)_H = (H_T' - kB)_H = \frac{1}{\mathcal{H}}\,\frac{\frac{3}{2}\kappa^2\phi'^2}{\frac{3}{2}\kappa^2\phi'^2 + k^2}\left( k^2\mathcal{R}_c + \frac{\kappa^2\phi'^2}{2\mathcal{H}}\mathcal{R}_c' \right) . \tag{3.21} $$

Since $B = O(k^3)$ because $\beta^i = O(\epsilon^3)$, the above results imply that our assumption $\partial_t\tilde\gamma_{ij} = O(\epsilon^2)$, which corresponds to $H_T' = O(k^2)H_T$ in the linear limit, is justified if $\mathcal{R}_c' = O(k^2)\mathcal{R}_c$ on superhorizon scales.

Now let us argue that this is indeed so in general. We know that $\mathcal{R}_c$ satisfies the equation

$$ \mathcal{R}_c'' + 2\frac{z'}{z}\mathcal{R}_c' + k^2\mathcal{R}_c = 0 \,; \qquad z \equiv \frac{a\phi'}{\mathcal{H}} = \frac{a\dot\phi}{H} . \tag{3.22} $$

We consider an inflationary universe in which we approximately have $\mathcal{H} \sim -1/\eta$. There are two independent solutions $u$ and $v$, which are functions of $k\eta$. We may fix these solutions such that they behave in the superhorizon limit $k/\mathcal{H} \sim k|\eta| \to 0$ as

$$ u = \left[ 1 + O(k^2) \right] u_0 \,;\quad u_0 = \text{const.} , \qquad v = \left[ 1 + O(k^2) \right] v_0 \,;\quad v_0 \equiv \frac{\int_{\eta}^{0} d\eta'/z^2(\eta')}{\int_{\eta_k}^{0} d\eta'/z^2(\eta')} , \tag{3.23} $$

where $\eta_k \sim -k^{-1}$ is the horizon crossing time, $\mathcal{H}(\eta_k) \equiv k$, and we have used the fact that the long wavelength expansion of Eq. (3.22) gives corrections only in powers of $k^2$. Then the general solution for $\mathcal{R}_c$ is given by

$$ \mathcal{R}_c = Au + Bv , \tag{3.24} $$

where $A$ and $B$ are constants of order unity in the sense of the $k$ expansion. Thus at $k|\eta| \ll 1$, we have

$$ \mathcal{R}_c = Au_0 , \qquad \mathcal{R}_c' = O(k^2)Au_0 + Bv_0' . \tag{3.25} $$

Now, assuming that the violation of the slow-roll condition is not too strong, which is necessary for the spectrum to be approximately scale-invariant, we have $z \propto a \propto \eta^{-1}$, hence

$$ v_0' = -\left( \int_{\eta_k}^{0} d\eta'\,\frac{z^2(\eta)}{z^2(\eta')} \right)^{-1} = O(k^3) . \tag{3.26} $$

Thus we conclude that $\mathcal{R}_c' = O(k^2)\mathcal{R}_c$ on superhorizon scales.

To reinforce the above argument, let us consider the Starobinsky model, in which the slow-roll condition can be significantly violated [27]. The Starobinsky model has a sudden change in the slope of its potential at $\phi = \phi_0$ such that

$$ V(\phi) = \begin{cases} V_0 + A_+(\phi - \phi_0) & \text{for } \phi > \phi_0 , \\ V_0 + A_-(\phi - \phi_0) & \text{for } \phi < \phi_0 , \end{cases} \tag{3.27} $$

where $A_+$, $A_-$ and $\phi_0$ are assumed to be positive so that the scalar field evolves from a large positive value of $\phi$ toward $\phi = 0$. Then the background scalar field $\phi$ satisfies

$$ 3H\dot\phi = \begin{cases} -A_+ & \text{for } \phi > \phi_0 , \\ -\left( A_- + (A_+ - A_-)\,e^{-3H(t - t_0)} \right) & \text{for } \phi < \phi_0 , \end{cases} \tag{3.28} $$

where $t_0$ is the time at which $\phi = \phi_0$, and the de Sitter approximation, $3H^2 = \kappa^2 V_0$, is assumed to be valid. Thus the scalar field slow-rolls at $\phi > \phi_0$, and violates the slow-roll condition temporarily at $\phi < \phi_0$. The evolution is decelerated if $A_+/A_- > 1$, or accelerated if $A_+/A_- < 1$, compared to the slow-roll evolution.

The curvature perturbation on the comoving slicing during the non-slow-roll regime at $\phi \le \phi_0$ is [27]

$$ \mathcal{R}_c = -\frac{iH^2}{\sqrt{2}\,k^{3/2}\dot\phi}\left( \alpha(k)\,e^{-ik\eta}(1 + ik\eta) - \beta(k)\,e^{ik\eta}(1 - ik\eta) \right) , \tag{3.29} $$

$$ \alpha(k) \equiv 1 + \frac{3i}{2}\left( \frac{A_-}{A_+} - 1 \right)\frac{k_0}{k}\left( 1 + \frac{k_0^2}{k^2} \right) , \tag{3.30} $$

$$ \beta(k) \equiv -\frac{3i}{2}\left( \frac{A_-}{A_+} - 1 \right) e^{2ik/k_0}\,\frac{k_0}{k}\left( 1 + \frac{ik_0}{k} \right)^2 , \tag{3.31} $$

$$ |\alpha|^2 - |\beta|^2 = 1 , $$

where $k_0 = a(t_0)H$. On superhorizon scales, we have $k|\eta| \ll 1$. The standard slow-roll case corresponds to $A_+ = A_-$. In this case, we immediately see that $\alpha(k) = 1$, $\beta(k) = 0$, and $\mathcal{R}_c' = O(k^2)\mathcal{R}_c$ on superhorizon scales.

We now turn to the case $A_+ \neq A_-$. We assume $A_-/A_+ = O(1)$ so that the slow-roll condition is not severely violated. For $k < k_0$, we expand the three exponentials in Eq. (3.29) to obtain

$$ \mathcal{R}_c = \frac{3iH^3}{\sqrt{2}\,k^{3/2}A_+}\left( 1 - \frac{A_+}{3H\dot\phi}(k\eta)^2\left[ \frac{A_-}{2A_+} + \left( \frac{A_-}{A_+} - 1 \right)\left( \frac{2}{5(k_0\eta)^2} + k_0\eta - \frac{(k_0\eta)^3}{10} \right) \right] + O(k^3) \right) . \tag{3.32} $$

Thus we have $\mathcal{R}_c' = O(k^2)\mathcal{R}_c$. For $k > k_0$, we have $\alpha(k) = 1 + O(k_0/k)$ and $\beta(k) = O(k_0/k)$. Hence

$$ \mathcal{R}_c' = \left[ -\frac{a\ddot\phi}{\dot\phi} + O(k^2) \right]\mathcal{R}_c = \left[ O\!\left( (k\eta)^2 \right)\left( \frac{k_0}{k} \right)^3 + O(k^2) \right]\mathcal{R}_c . \tag{3.33} $$

Since $k > k_0$, we see that the right-hand side of this is also of $O(k^2)\mathcal{R}_c$. Thus we have shown that $\mathcal{R}_c' = O(k^2)\mathcal{R}_c$ holds even for the Starobinsky model, in which the slow-roll condition can be violated. This in turn implies $H_T' = O(k^2)H_T$.

Next, we consider the tensor perturbation $H^{(2)}_T$. The evolution equation is

$$ H^{(2)\prime\prime}_T + 2\frac{a'}{a}H^{(2)\prime}_T + k^2 H^{(2)}_T = 0 . \tag{3.34} $$

This equation has the same structure as Eq. (3.22) for $\mathcal{R}_c$ if we replace $z$ with $a$. Then we can repeat exactly the same argument and conclude that $H^{(2)\prime}_T = O(k^2)H^{(2)}_T$.

Again, this conclusion can be supported by considering the quantum fluctuations explicitly. When the de Sitter approximation $H = \text{const.}$ holds, the tensor perturbation from the quantum fluctuations during inflation is given by

$$ H^{(2)}_T = \frac{\sqrt{\pi}\,H}{k^{3/2}}\,(k\eta - i)\,e^{-ik\eta} . \tag{3.35} $$

Taking the conformal time derivative of this equation, we obtain at $k|\eta| \ll 1$

$$ H^{(2)\prime}_T = -\frac{\sqrt{\pi}\,H}{k^{3/2}}\,(ik^2\eta)\,e^{-ik\eta} = O(k^2)H^{(2)}_T . \tag{3.36} $$

Thus, provided that linear theory is a good approximation up to scales corresponding to a few e-foldings after horizon crossing, our assumption $\partial_t\tilde\gamma_{ij} = O(\epsilon^2)$ is justified for perturbations originating from the quantum fluctuations inside the horizon.

IV. CONCLUSION

In this paper, taking the gradient expansion approach, we have investigated nonlinear perturbations on superhorizon scales in a universe dominated by a single scalar field. We have derived the general solution for all the physical quantities valid to second order in the spatial gradients on the uniform Hubble slicing. In particular, an expression for the nonlinear curvature perturbation, which plays a central role in the evaluation of a possible non-Gaussianity from inflation, has been obtained.

Parallel to our previous paper [31], we have identified the tensor modes in the extrinsic curvature, while the identification of the tensor modes in the metric is arbitrary unless we specify the background spatial metric. This is an important difference from the case of linear theory, in which the background metric is uniquely given.

In our analysis, we have adopted a non-trivial assumption, $\partial_t\tilde\gamma_{ij} = O(\epsilon^2)$, which cannot be justified within the context of the gradient expansion. To justify it, we have appealed to linear theory, and argued that the assumption $\partial_t\tilde\gamma_{ij} = O(\epsilon^2)$ is naturally satisfied in the linear limit. In particular, as an explicit example, we have considered the Starobinsky model, in which the slow-roll condition is temporarily violated due to a sudden change in the slope of the potential, and we have explicitly shown that the quantum fluctuations indeed satisfy the assumption $\partial_t\tilde\gamma_{ij} = O(\epsilon^2)$.

As mentioned in the Introduction, in models in which the slow-roll condition is temporarily violated, as in the case of the Starobinsky model, the $O(k^2)$ corrections to the curvature perturbation, which correspond to $O(\epsilon^2)$ corrections in the gradient expansion, may play a crucial role in the determination of the final amplitude of the curvature perturbation. Since the result of this paper is valid to $O(\epsilon^2)$, it provides a useful tool to investigate the nonlinear behavior of the curvature perturbation on superhorizon scales for a wide class of models, including those which violate the slow-roll condition. In particular, by matching our result with the quantum fluctuations on scales slightly beyond the horizon scale, the non-Gaussianity arising from the nonlinear dynamics of the scalar field on superhorizon scales can be studied. Applications to specific models of inflation will be considered in a future publication.

Acknowledgements

This work was supported in part by the Monbu-Kagakusho 21st century COE Program "Center for Diversity and Universality in Physics", and by JSPS Grants-in-Aid for Scientific Research (B) No. 17340075, and (A) No. 18204024.

APPENDIX A: BASIC EQUATIONS

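Here we show the basic equations for nonlinear quantities. The Klein-Gordon equation is

$$ \frac{1}{\sqrt{-g}}\frac{\partial}{\partial x^\mu}\left[ \sqrt{-g}\,g^{\mu\nu}\frac{\partial}{\partial x^\nu}\phi \right] - \frac{dV}{d\phi} = 0 . \tag{A1} $$

We employ the (3+1)-formalism of the Einstein equations, in which the dynamical variables are $\gamma_{ij}$ and $K_{ij}$. The $(n,n)$ and $(n,j)$ components of the Einstein equations give the Hamiltonian and momentum constraint equations, respectively, while the $(i,j)$ components give the evolution equations for $K_{ij}$. The evolution equations for $\gamma_{ij}$ are given by the definition of the extrinsic curvature (2.3).

In the present case, the Hamiltonian and momentum constraints are

$$ R - \tilde A_{ij}\tilde A^{ij} + \frac{2}{3}K^2 = 16\pi G E , \tag{A2} $$

$$ D_i\tilde A^i{}_j - \frac{2}{3}D_j K = 8\pi G J_j , \tag{A3} $$

$$ E \equiv T_{\mu\nu}n^\mu n^\nu , \qquad J_j \equiv -T_{\mu\nu}n^\mu\gamma^\nu{}_j . \tag{A4} $$

The evolution equations for $\gamma_{ij}$ are given as

$$ (\partial_t - \beta^k\partial_k)\psi + \frac{\dot a}{2a}\psi = \frac{\psi}{6}\left\{ -\alpha K + \partial_k\beta^k \right\} , \tag{A5} $$

$$ (\partial_t - \beta^k\partial_k)\tilde\gamma_{ij} = -2\alpha\tilde A_{ij} + \tilde\gamma_{ik}\partial_j\beta^k + \tilde\gamma_{jk}\partial_i\beta^k - \frac{2}{3}\tilde\gamma_{ij}\partial_k\beta^k , \tag{A6} $$

where a dot denotes $d/dt$.

The evolution equations for $K_{ij}$ are given as

$$ (\partial_t - \beta^k\partial_k)K = \alpha\left( \tilde A_{ij}\tilde A^{ij} + \frac{1}{3}K^2 \right) - D_k D^k\alpha + 4\pi G\alpha\left( E + S^k{}_k \right) , \tag{A7} $$

$$ \begin{aligned} (\partial_t - \beta^k\partial_k)\tilde A_{ij} ={}& \frac{1}{a^2\psi^4}\left[ \alpha\left( R_{ij} - \frac{\gamma_{ij}}{3}R \right) - \left( D_i D_j\alpha - \frac{\gamma_{ij}}{3}D_k D^k\alpha \right) \right] + \alpha\left( K\tilde A_{ij} - 2\tilde A_{ik}\tilde A^k{}_j \right) \\ &+ \tilde A_{ik}\partial_j\beta^k + \tilde A_{jk}\partial_i\beta^k - \frac{2}{3}\tilde A_{ij}\partial_k\beta^k - \frac{8\pi G\alpha}{a^2\psi^4}\left( S_{ij} - \frac{\gamma_{ij}}{3}S^k{}_k \right) , \end{aligned} \tag{A8} $$

where $R_{ij}$ is the Ricci tensor of the metric $\gamma_{ij}$, $R \equiv \gamma^{ij}R_{ij}$, $D_i$ is the covariant derivative with respect to $\gamma_{ij}$, and

$$ S_{ij} \equiv T_{ij} , \qquad S^k{}_k \equiv \gamma^{kl}S_{lk} . \tag{A9} $$

APPENDIX B: ORDER ESTIMATION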

Here we evaluate the order of magnitude of the basic variables using the equations presented in Appendix A and the assumptions given by Eq. (2.9), namely

$$ \beta^i = O(\epsilon^3) , \qquad \partial_t\tilde\gamma_{ij} = O(\epsilon^2) . \tag{B1} $$

With these assumptions, Eqs. (A3) and (A6) on the uniform Hubble slicing yield

$$ J_j = O(\epsilon^3) . \tag{B2} $$

Using Eq. (2.5), $J_j$ is expressed as

$$ J_j = -\frac{1}{\alpha}\partial_t\phi\,\partial_j\phi + \frac{\beta^l}{\alpha}\partial_l\phi\,\partial_j\phi . \tag{B3} $$

We expand $\phi$ as $\phi = {}^{(0)}\phi + {}^{(1)}\phi + {}^{(2)}\phi + \cdots$. Then, to satisfy Eq. (B2), ${}^{(0)}\phi$ should depend only on time, and ${}^{(1)}\phi$ should vanish. Thus, we obtain

$$ {}^{(0)}\phi = {}^{(0)}\phi(t) , \qquad {}^{(1)}\phi = 0 . \tag{B4} $$

For notational simplicity, we denote the full scalar field by $\tilde\phi$ and ${}^{(0)}\phi$ by $\phi$ in the following. Thus

$$ \tilde\phi(t, x^i) = \phi(t) + {}^{(2)}\phi(t, x^i) + \cdots . \tag{B5} $$

From the $O(\epsilon^0)$ part of the Hamiltonian constraint, Eq. (A2), we have

$$ \frac{1}{3}K^2(t) = 3H^2 = 8\pi G\left( \frac{\dot\phi^2}{2\,{}^{(0)}\alpha^2} + V(\phi) \right) , \tag{B6} $$

where we have also expanded $\alpha$ as ${}^{(0)}\alpha + {}^{(2)}\alpha + \cdots$. From this equation, we find that ${}^{(0)}\alpha$ depends only on time,

$$ {}^{(0)}\alpha = {}^{(0)}\alpha(t) . \tag{B7} $$

Since ${}^{(0)}\alpha$ is spatially homogeneous, we may choose the time coordinate to set it to unity, ${}^{(0)}\alpha = 1$. Thus, from Eq. (A5) and $3H = -K$ we have

$$ 0 = -\frac{3\dot a}{a}\,{}^{(2)}\alpha + \frac{6\dot\psi}{\psi}\left( 1 - {}^{(2)}\alpha \right) + O(\epsilon^4) . \tag{B8} $$

We see $\partial_t\psi = O(\epsilon^2)$ from this equation.

To summarize, the orders of magnitude of the basic metric quantities are

$$ \psi = O(1) , \quad \beta^i = O(\epsilon^3) , \quad \alpha - 1 = O(\epsilon^2) , \quad \partial_t\tilde\gamma_{ij} = O(\epsilon^2) , \quad \partial_t\psi = O(\epsilon^2) , \quad \tilde A_{ij} = O(\epsilon^2) . \tag{B9} $$

As for $\beta^i$, the condition $\beta^i = O(\epsilon^3)$ is the one we use when we solve the equations to $O(\epsilon^2)$ in Appendix C. Since the choice of $\beta^i$ does not affect the temporal behavior, it does not affect the generality of the solution.

APPENDIX C: DERIVATION OF THE GENERAL SOLUTION

1. The leading order solution

We first derive the leading order solution for our basic variables. Here, "leading order" means not the lowest order in the gradient expansion, but the lowest order of each physical quantity. For example, from Eqs. (2.10), the leading order of $\psi$ is $O(\epsilon^0)$, but that of $\tilde A_{ij}$ is $O(\epsilon^2)$.

The $O(\epsilon^0)$ parts of Eqs. (A2) and (A7) are

$$\frac{1}{3}K^2(t) = 3H^2 = 8\pi G\left(\frac{\dot\phi^2}{2} + V(\phi)\right), \tag{C1}$$

$$\dot K = -3\dot H = 3H^2 + 4\pi G\left(2\dot\phi^2 - 2V(\phi)\right). \tag{C2}$$

These equations are indeed the Friedmann equations, and the leading order solution $\phi$ is that of a FLRW spacetime.

Setting $\beta^i = O(\epsilon^3)$, we substitute the order of magnitude evaluation of the variables shown in Eq. (2.10) into the Klein-Gordon equation and find

$$\ddot\phi + 3H\dot\phi + \frac{dV}{d\phi} = 0, \tag{C3}$$

$${}^{(2)}\ddot\phi + 3H\,{}^{(2)}\dot\phi + \frac{d^2V}{d\phi^2}\,{}^{(2)}\phi - \dot\phi\left({}^{(2)}\dot\alpha + 3H\,{}^{(2)}\alpha\right) - 2\ddot\phi\,{}^{(2)}\alpha = 0. \tag{C4}$$

The Hamiltonian and momentum constraint equations give

$$\tilde\gamma^{ij}\tilde D_i\tilde D_j\psi = \frac{1}{8}\tilde\gamma^{kl}\tilde R_{kl}\,\psi - 2\pi G\,\psi^5a^2\left[-\dot\phi^2\,{}^{(2)}\alpha + \dot\phi\,{}^{(2)}\dot\phi + \frac{dV}{d\phi}\,{}^{(2)}\phi\right] + O(\epsilon^4), \tag{C5}$$

$$\tilde D^j\left(\psi^6\tilde A_{ij}\right) = -8\pi G\,\psi^6\dot\phi\,\partial_i\left({}^{(2)}\phi\right) + O(\epsilon^5), \tag{C6}$$

where $\tilde R_{ij}$ is the Ricci tensor with respect to $\tilde\gamma_{ij}$, and $\tilde D_i$ is the covariant derivative with respect to $\tilde\gamma_{ij}$.

The evolution equations for the spatial metric give

$$6\frac{\partial_t\psi}{\psi} - 3H(\alpha - 1) = D_k\beta^k, \tag{C7}$$

$$(\partial_t - \beta^k\partial_k)\tilde\gamma_{ij} = -2\tilde A_{ij} + \tilde\gamma_{ik}\partial_j\beta^k + \tilde\gamma_{jk}\partial_i\beta^k - \frac{2}{3}\tilde\gamma_{ij}\partial_k\beta^k + O(\epsilon^4), \tag{C8}$$

while the evolution equations for the extrinsic curvature give

$$\partial_t\tilde A_{ij} + 3H\tilde A_{ij} = \frac{1}{a^2\psi^4}\left[R_{ij} - \frac{\gamma_{ij}}{3}R\right] + O(\epsilon^4), \tag{C9}$$

$$0 = 3H^2\,{}^{(2)}\alpha + 8\pi G\left[-\left(\dot\phi^2 + V(\phi)\right){}^{(2)}\alpha + 2\dot\phi\,{}^{(2)}\dot\phi - \frac{dV}{d\phi}\,{}^{(2)}\phi\right]. \tag{C10}$$

First, for $\psi$, we have

$$\psi = {}^{(0)}L(x^i) + O(\epsilon^2), \tag{C11}$$

where ${}^{(0)}L$ is an arbitrary function of the spatial coordinates. From Eqs. (C9) and (C11), together with the assumption $\partial_t\tilde\gamma_{ij} = O(\epsilon^2)$, we obtain

$$\tilde\gamma_{ij} = {}^{(0)}f_{ij}(x^k) + O(\epsilon^2), \tag{C12}$$

$$\tilde A_{ij} = \frac{{}^{(2)}F_{ij}}{a^3}\int_{t_*}^{t}a(t')\,dt' + \frac{{}^{(2)}C_{ij}(x^k)}{a^3} + O(\epsilon^4), \tag{C13}$$

$${}^{(2)}F_{ij} \equiv \frac{1}{{}^{(0)}L^4}\left[{}^{(2)}\bar R_{ij} + {}^{(2)}R^L_{ij} - \frac{1}{3}\,{}^{(0)}f_{ij}\,{}^{(0)}f^{kl}\left({}^{(2)}\bar R_{kl} + {}^{(2)}R^L_{kl}\right)\right], \tag{C14}$$

$${}^{(2)}R^L_{ij} \equiv -\frac{2}{{}^{(0)}L}\bar D_i\bar D_j\,{}^{(0)}L - \frac{2}{{}^{(0)}L}\,{}^{(0)}f_{ij}\,\bar\Delta\,{}^{(0)}L + \frac{6}{{}^{(0)}L^2}\bar D_i\,{}^{(0)}L\,\bar D_j\,{}^{(0)}L - \frac{2}{{}^{(0)}L^2}\,{}^{(0)}f_{ij}\,\bar D_k\,{}^{(0)}L\,\bar D^k\,{}^{(0)}L, \tag{C15}$$

where $t_*$ is an arbitrary reference time, ${}^{(0)}f_{ij}$ is an arbitrary symmetric tensor of the spatial coordinates, $\bar R_{ij}$ is the Ricci tensor with respect to ${}^{(0)}f_{ij}$, $\bar D_i$ is the covariant derivative with respect to ${}^{(0)}f_{ij}$, $\bar\Delta$ is the Laplacian with respect to ${}^{(0)}f_{ij}$, and ${}^{(2)}C_{ij}$ is an arbitrary, symmetric and traceless tensor which depends only on the spatial coordinates.
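The time dependence in Eq. (C13) can be checked numerically. The following is only a minimal sketch, not part of the original derivation: it assumes a power-law scale factor $a(t) = t^p$ (an illustrative background, not one used in the paper) and replaces a single component of ${}^{(2)}F_{ij}$ and ${}^{(2)}C_{ij}$ by arbitrary constants `F` and `C`. It then verifies by central finite differences that $A(t) = F a^{-3}\int_{t_*}^{t}a\,dt' + C a^{-3}$ satisfies $\partial_t A + 3HA = F/a^2$, which is the leading order form of Eq. (C9) once the right-hand side is evaluated with Eqs. (C11) and (C12).

```python
# Sanity check of the time dependence in Eq. (C13) against Eq. (C9).
# Assumptions (illustrative only): a(t) = t**P, and constants F, C stand in
# for one component of the spatial tensors F_ij and C_ij.

P = 2.0 / 3.0      # hypothetical power-law exponent of the scale factor
T_STAR = 1.0       # reference time t_*
F, C = 0.7, -0.3   # arbitrary stand-ins for F_ij and C_ij components


def a(t):
    """Scale factor a(t) = t**P."""
    return t ** P


def hubble(t):
    """Hubble rate H = (da/dt)/a = P/t for the power-law background."""
    return P / t


def int_a(t):
    """Closed form of the integral of a(t') from T_STAR to t."""
    return (t ** (P + 1.0) - T_STAR ** (P + 1.0)) / (P + 1.0)


def A(t):
    """Candidate solution: A = F/a^3 * int a dt' + C/a^3."""
    return (F * int_a(t) + C) / a(t) ** 3


def residual(t, h=1e-5):
    """Residual of dA/dt + 3 H A - F/a^2, with a central difference for dA/dt."""
    dA_dt = (A(t + h) - A(t - h)) / (2.0 * h)
    return dA_dt + 3.0 * hubble(t) * A(t) - F / a(t) ** 2


if __name__ == "__main__":
    for t in (1.0, 2.0, 5.0):
        print(f"t = {t}: residual = {residual(t):.3e}")
```

The residual should be at the level of the finite-difference error for any choice of `F` and `C`, reflecting that the `C` term is the homogeneous solution (decaying as $a^{-3}$) while the `F` term is the particular solution sourced by the spatial curvature.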

2. The solution to $O(\epsilon^2)$ in gradient expansion

Now we consider the general solution valid to $O(\epsilon^2)$ in gradient expansion. As we have seen in the previous subsection, among the basic variables we have introduced, the only quantities whose leading order terms are lower than $O(\epsilon^2)$ are $\psi$, $\tilde\gamma_{ij}$, $\alpha$, and $\phi$. Hence what we have to do is to evaluate the next order terms of these variables.

First let us consider $\tilde\gamma_{ij}$. Substituting Eq. (C13) in Eq. (C8), and choosing $\beta^i = O(\epsilon^3)$, we obtain the general solution for $\tilde\gamma_{ij}$,

$$\tilde\gamma_{ij} = {}^{(0)}f_{ij}(x^k) - 2\,{}^{(2)}F_{ij}(x^k)\int^{t}\frac{dt'}{a^3(t')}\int^{t'}a(t'')\,dt'' - 2\,{}^{(2)}C_{ij}(x^k)\int^{t}\frac{dt'}{a^3(t')} + {}^{(2)}f_{ij}(x^k) + O(\epsilon^4), \tag{C16}$$

where ${}^{(2)}f_{ij}$ is an arbitrary symmetric tensor which depends only on the spatial coordinates. We may absorb it into ${}^{(0)}f_{ij}$ without loss of generality. Thus we have

$$\tilde\gamma_{ij} = {}^{(0)}f_{ij}(x^k) - 2\,{}^{(2)}F_{ij}(x^k)\int^{t}\frac{dt'}{a^3(t')}\int^{t'}a(t'')\,dt'' - 2\,{}^{(2)}C_{ij}(x^k)\int^{t}\frac{dt'}{a^3(t')} + O(\epsilon^4). \tag{C17}$$

Here, we note that ${}^{(0)}f_{ij}$ is not completely arbitrary, but its determinant must be unity, $\det({}^{(0)}f_{ij}) = 1 + O(\epsilon^4)$.

To obtain the other variables, we first consider the solution for ${}^{(2)}\phi$. We express ${}^{(2)}\alpha$ in terms of ${}^{(2)}\phi$ by using Eq. (C10), together with Eq. (C1) to eliminate $3H^2$. We obtain

$${}^{(2)}\alpha = \frac{2}{\dot\phi^2}\left(2\dot\phi\,{}^{(2)}\dot\phi - \frac{dV}{d\phi}\,{}^{(2)}\phi\right). \tag{C18}$$

Inserting this into the field equation (C4), we obtain a closed equation for ${}^{(2)}\phi$,

$${}^{(2)}\ddot\phi - \frac{H\dot\phi + 2\,dV/d\phi}{\dot\phi}\,{}^{(2)}\dot\phi - \frac{\dot\phi\,d^2V/d\phi^2 + 2H\,dV/d\phi}{\dot\phi}\,{}^{(2)}\phi = 0. \tag{C19}$$

Although this equation looks difficult to solve at first glance, it turns out that it can be solved analytically. Here, we note that because the lowest order scalar field is only a function of time, the above equation is exactly the same as the one in the long wavelength limit of linear theory. Then, it is known that there exists a particular solution that satisfies

$$\dot\phi\,{}^{(2)}\dot\phi_d - \frac{dV}{d\phi}\,{}^{(2)}\phi_d = 0, \tag{C20}$$

where the subscript $d$ is attached since it corresponds to a decaying mode solution in the linear limit.
This may be integrated easily to give

$${}^{(2)}\phi_d \propto v(t) \equiv \exp\left[\int^{t}dt'\,\frac{dV/d\phi}{\dot\phi}\right] = \frac{\dot\phi(t_*)}{a^3(t)\,\dot\phi(t)}, \tag{C21}$$

where we have normalized the amplitude so that $v = 1/a^3(t_*)$ at $t = t_*$.

Once we know a particular solution, the other independent solution, $u(t)$, can be found by the use of the Wronskian. For two independent solutions $u$ and $v$ of Eq. (C19), the Wronskian,

$$W = \dot u\,v - \dot v\,u, \tag{C22}$$

satisfies

$$\dot W + b(t)\,W = 0, \tag{C23}$$

where $b(t)$ is the coefficient of ${}^{(2)}\dot\phi$ in Eq. (C19),

$$b \equiv -\frac{H\dot\phi + 2\,dV/d\phi}{\dot\phi}. \tag{C24}$$

Thus we obtain

$$\dot u\,v - \dot v\,u = W \propto \exp\left[-\int^{t}b(t')\,dt'\right] = \frac{a}{(a^3\dot\phi)^2} \propto a\,v^2. \tag{C25}$$

Then since

$$\frac{d}{dt}\left(\frac{u}{v}\right) = \frac{\dot u\,v - \dot v\,u}{v^2} = \frac{W}{v^2}, \tag{C26}$$

we readily find

$$u = v\int_{t_*}^{t}\frac{W}{v^2}\,dt' = \frac{\dot\phi(t_*)}{a^3(t)\,\dot\phi(t)}\int_{t_*}^{t}a(t')\,dt', \tag{C27}$$

where we have normalized $u$ so that $(u/v)^{\cdot} = a(t_*)$ at $t = t_*$. Thus, the general solution for ${}^{(2)}\phi$ is given by

$${}^{(2)}\phi = C(x^k)\,u(t) + D(x^k)\,v(t). \tag{C28}$$

Given the general solution for ${}^{(2)}\phi$, the remaining variables ${}^{(2)}\alpha$ and ${}^{(2)}\psi$ are determined as follows. Equation (C18) gives the solution for ${}^{(2)}\alpha$ as

$${}^{(2)}\alpha = \frac{2}{\dot\phi^2}\left[\left(2\dot\phi\,\dot u - \frac{dV}{d\phi}u\right)C(x^k) + \left(2\dot\phi\,\dot v - \frac{dV}{d\phi}v\right)D(x^k)\right] = \frac{2\dot\phi_*}{\dot\phi^3a^3}\left[\left(2a\dot\phi + \frac{dV}{d\phi}\int_{t_*}^{t}a(t')\,dt'\right)C(x^k) + \frac{dV}{d\phi}D(x^k)\right]. \tag{C29}$$

In terms of this solution, Eq. (C7) gives the $O(\epsilon^2)$ part of $\psi$ as

$$\dot\psi = \frac{H}{2}\,{}^{(0)}L\,{}^{(2)}\alpha, \tag{C30}$$

where we have set $\beta^i = O(\epsilon^3)$. Integrating this equation, we obtain

$$\psi = {}^{(0)}L\left[1 + \frac{1}{2}\int_{t_*}^{t}{}^{(2)}\alpha\,H\,dt'\right] + {}^{(2)}L(x^k), \tag{C31}$$

where ${}^{(2)}L(x^k)$ is an arbitrary spatial function of $O(\epsilon^2)$, which we may absorb into ${}^{(0)}L$ without loss of generality.

Up to now we have not considered the Hamiltonian and momentum constraint equations. The constraint equations will relate the quantities ${}^{(0)}L$, ${}^{(0)}f_{ij}$, ${}^{(2)}C_{ij}$, ${}^{(2)}C$ and ${}^{(2)}D$. First let us focus on the Hamiltonian constraint, Eq. (C5). Using Eqs.
(C10) and (C11), it becomes

$$\frac{1}{a^2\,{}^{(0)}L^5}\left[-8\,{}^{(0)}f^{ij}\bar D_i\bar D_j\,{}^{(0)}L + {}^{(0)}f^{kl}\,{}^{(2)}\bar R_{kl}\,{}^{(0)}L\right] = 48\pi G\left[-\dot\phi\,{}^{(2)}\dot\phi + \frac{dV}{d\phi}\,{}^{(2)}\phi\right]. \tag{C32}$$

Comparing this equation with Eq. (C20), we see that the right-hand side vanishes for the decaying mode solution $v$. Thus inserting the general solution (C28) into the above, we obtain

$$\frac{1}{a^2\,{}^{(0)}L^5}\left[-8\,{}^{(0)}f^{ij}\bar D_i\bar D_j\,{}^{(0)}L + {}^{(0)}f^{kl}\,{}^{(2)}\bar R_{kl}\,{}^{(0)}L\right] = -48\pi G\,\frac{\dot\phi(t_*)}{a^2}\,C(x^k). \tag{C33}$$

Thus $D(x^k)$ is not constrained by the Hamiltonian constraint, while $C(x^k)$ is determined in terms of ${}^{(0)}L$ and ${}^{(0)}f_{ij}$ as

$$\begin{aligned}
{}^{(2)}C &= -\frac{1}{48\pi G\,{}^{(0)}L^5\,\dot\phi_*}\left[-8\,{}^{(0)}f^{ij}\bar D_i\bar D_j\,{}^{(0)}L + {}^{(0)}f^{kl}\,{}^{(2)}\bar R_{kl}\,{}^{(0)}L\right] + O(\epsilon^4) \\
&= -\frac{1}{48\pi G\,{}^{(0)}L^4\,\dot\phi_*}\,{}^{(0)}f^{kl}\left[{}^{(2)}R^L_{kl} + {}^{(2)}\bar R_{kl}\right] + O(\epsilon^4) \\
&= -\frac{1}{48\pi G\,\dot\phi_*}\,R\left[L^4f\right] + O(\epsilon^4),
\end{aligned} \tag{C34}$$

where $\dot\phi_* = \dot\phi(t_*)$ and $R\left[L^4f\right]$ is the Ricci scalar of the metric ${}^{(0)}L^4\,{}^{(0)}f_{ij}$.

The $O(\epsilon^3)$ part of the momentum constraint (C6) yields

$$\bar D^j\left[{}^{(0)}L^6\,{}^{(2)}F_{ij}\right] = -8\pi G\,{}^{(0)}L^6\,\dot\phi_*\,\partial_i\left({}^{(2)}C(x^k)\right), \tag{C35}$$

$$\bar D^j\left[{}^{(0)}L^6\,{}^{(2)}C_{ij}\right] = -8\pi G\,{}^{(0)}L^6\,\dot\phi_*\,\partial_i\left({}^{(2)}D(x^k)\right). \tag{C36}$$

The latter equation implies ${}^{(2)}D$ is not arbitrary but expressed in terms of ${}^{(0)}L$, ${}^{(0)}f_{ij}$ and ${}^{(2)}C_{ij}$, while the former equation is found to be consistent with Eq. (C34). This consistency is a result of the Bianchi identities.

[1] D. N. Spergel et al., arXiv:astro-ph/0603449.
[2] H. Kodama and M. Sasaki, Prog. Theor. Phys. Suppl. , 1 (1984).
[3] E. Komatsu and D. N. Spergel, Phys. Rev. D , 063002 (2001) [arXiv:astro-ph/0005036].
[4] V. Acquaviva, N. Bartolo, S. Matarrese and A. Riotto, Nucl. Phys. B , 119 (2003) [arXiv:astro-ph/0209156].
[5] J. M. Maldacena, JHEP , 013 (2003) [arXiv:astro-ph/0210603].
[6] N. Bartolo, S. Matarrese and A. Riotto, Phys. Rev. D , 043503 (2004) [arXiv:hep-ph/0309033].
    K. A. Malik and D. H. Lyth, JCAP , 008 (2006) [arXiv:astro-ph/0604387].
[7] N. Bartolo, E. Komatsu, S. Matarrese and A. Riotto, Phys. Rept. , 103 (2004) [arXiv:astro-ph/0406398].
[8] K. Tomita, Phys. Rev. D , 103506 (2005) [Erratum-ibid. D , 029901 (2006)] [arXiv:astro-ph/0509518].
    K. Tomita, Phys. Rev. D , 043526 (2005) [arXiv:astro-ph/0505157].
    K. Tomita, Phys. Rev. D , 083504 (2005) [arXiv:astro-ph/0501663].
    K. Nakamura, Phys. Rev. D , 101301 (2006) [arXiv:gr-qc/0605107].
    K. Nakamura, arXiv:gr-qc/0605108.
    K. A. Malik, JCAP , 005 (2005) [arXiv:astro-ph/0506532].
    H. Noh and J. c. Hwang, Phys. Rev. D , 104011 (2004).
[9] D. Seery and J. E. Lidsey, JCAP , 003 (2005) [arXiv:astro-ph/0503692].
[10] X. Chen, R. Easther and E. A. Lim, arXiv:astro-ph/0611645.
[11] D. H. Lyth, K. A. Malik and M. Sasaki, JCAP , 004 (2005) [arXiv:astro-ph/0411220].
[12] E. M. Lifshitz and I. M. Khalatnikov, Adv. Phys. , 185 (1963).
[13] V. A. Belinsky, I. M. Khalatnikov and E. M. Lifshitz, Adv. Phys. , 639 (1982).
[14] K. Tomita, Prog. Theor. Phys. , 1503 (1972).
[15] K. Tomita, Prog. Theor. Phys. , 730 (1975).
[16] D. S. Salopek and J. R. Bond, Phys. Rev. D , 3936 (1990).
[17] G. L. Comer, N. Deruelle, D. Langlois and J. Parry, Phys. Rev. D , 2759 (1994).
[18] N. Deruelle and D. Langlois, Phys. Rev. D , 2007 (1995) [arXiv:gr-qc/9411040].
[19] V. Muller, H. J. Schmidt and A. A. Starobinsky, Class. Quant. Grav. , 1163 (1990).
[20] O. Iguchi, H. Ishihara and J. Soda, Phys. Rev. D , 3337 (1997) [arXiv:gr-qc/9606012].
    O. Iguchi and H. Ishihara, Phys. Rev. D , 3216 (1997) [arXiv:gr-qc/9611047].
[21] I. M. Khalatnikov, A. Y. Kamenshchik, M. Martellini and A. A. Starobinsky, JCAP , 001 (2003) [arXiv:gr-qc/0301119].
[22] D. S. Salopek, Phys. Rev. D , 3214 (1991).
[23] D. S. Salopek and J. M. Stewart, Class. Quant. Grav. , 1943 (1992).
    J. Parry, D. S. Salopek and J. M. Stewart, Phys. Rev. D , 2872 (1994) [arXiv:gr-qc/9310020].
[24] J. Soda, H. Ishihara and O. Iguchi, Prog. Theor. Phys. , 781 (1995) [arXiv:gr-qc/9509008].
[25] Y. Nambu and A. Taruya, Class. Quant. Grav. , 705 (1996) [arXiv:astro-ph/9411013].
    A. Taruya and Y. Nambu, Prog. Theor. Phys. , 295 (1996) [arXiv:gr-qc/9510010].
[26] M. Sasaki and T. Tanaka, Prog. Theor. Phys. , 763 (1998) [arXiv:gr-qc/9801017].
[27] A. A. Starobinsky, JETP Lett. , 489 (1992) [Pisma Zh. Eksp. Teor. Fiz. , 477 (1992)].
[28] S. M. Leach, M. Sasaki, D. Wands and A. R. Liddle, Phys. Rev. D , 023512 (2001) [arXiv:astro-ph/0101406].
[29] M. Shibata and M. Sasaki, Phys. Rev. D , 084002 (1999) [arXiv:gr-qc/9905064].
[30] J. W. York, J. Math. Phys. , 456 (1973).
[31] Y. Tanaka and M. Sasaki, Prog. Theor. Phys. 117