An efficient surrogate-aided importance sampling framework for reliability analysis
Wang-Sheng LIU, Wen-Jun CAO, and Sai Hung CHEUNG
School of Civil & Environmental Engineering, Nanyang Technological University, Block N1, 50 Nanyang Avenue, Singapore 639798
Department of Civil and Environmental Engineering, National University of Singapore, Block E1A, No. 1 Engineering Drive 2, Singapore 117576
ETH Zurich, Future Cities Laboratory, Singapore-ETH Centre, Singapore 138602
Abstract:
Surrogates built in lieu of expensive-to-evaluate performance functions can greatly accelerate reliability analysis. This paper proposes a new two-stage framework for surrogate-aided reliability analysis named Surrogates for Importance Sampling (S4IS). In the first stage, a coarse surrogate is built to gain information about the failure regions; the second stage zooms into the important regions and improves the accuracy of the failure probability estimator by adaptively selecting support points therein. Learning functions are proposed to guide the selection of support points such that exploration and exploitation can be dynamically balanced. As a generic framework, S4IS has the potential to incorporate different types of surrogates (Gaussian processes, support vector machines, neural networks, etc.). The effectiveness and efficiency of S4IS are validated by five illustrative examples, which involve system reliability, highly nonlinear limit-state functions, small failure probabilities and moderately high dimensionality. The implementation of S4IS is made available to download at (the link will be added once the paper is accepted for publication).
Keywords: reliability analysis, stochastic sampling, metamodel, active learning, design of experiment
Introduction

(Background: reliability analysis) Reliability analysis is essential for ensuring that a system performs as designed when subjected to different sources of uncertainty. In the context of engineering, the measure of reliability is the failure probability. Given a model whose uncertain parameters are represented by a vector of $n$ random variables $\boldsymbol{\Theta} = [\Theta_1, \cdots, \Theta_n]^T$, the failure probability $P_F$ for a specific failure event $F$ can be written as:

$$P_F = \int_{\Omega} I_F(\boldsymbol{\theta}) f(\boldsymbol{\theta})\,\mathrm{d}\boldsymbol{\theta} = \mathbb{E}[I_F(\boldsymbol{\Theta})] \quad (1)$$

where $\boldsymbol{\theta} = [\theta_1, \cdots, \theta_n]^T \in \Omega \subseteq \mathbb{R}^n$ denotes the parameter value of $\boldsymbol{\Theta}$ in the parameter space $\Omega$; $I_F(\cdot)$ is the binary indicator function of the failure event $F$, which equals one if $\boldsymbol{\theta}$ belongs to the failure region $\Omega_F$ and zero otherwise; $f(\cdot)$ is the joint probability density function (PDF) of $\boldsymbol{\Theta}$; and $\mathbb{E}[\cdot]$ denotes the expectation for $\boldsymbol{\Theta} \sim f(\boldsymbol{\theta})$. The failure region $\Omega_F$ is defined by the performance function $g(\boldsymbol{\theta})$, i.e., $\Omega_F = \{\boldsymbol{\theta}: g(\boldsymbol{\theta}) \le 0\}$. The corresponding limit-state function $g(\boldsymbol{\theta}) = 0$ divides the parameter space $\Omega$ into the failure and safe regions.

(Reliability methods) A considerable amount of literature has focused on developing efficient and accurate ways to calculate the failure probability. One widely used set of reliability methods is based on the linear or quadratic approximation of the limit-state function around the most probable point (MPP) using a Taylor series expansion, usually known as the first- or second-order reliability method (FORM or SORM), respectively [1, 2]. However, the error introduced by the approximation tends to be large when the limit-state function is highly nonlinear and/or there exist multiple MPPs in the failure region; furthermore, the computational cost of the MPP search increases drastically with the number of random variables. Another set of reliability methods is based on stochastic sampling.
Among them, Monte Carlo Simulation (MCS) has been acknowledged as a robust technique where the failure probability is approximated statistically through stochastic sampling. In its crude form, MCS first simulates $N$ samples from the PDF $f(\boldsymbol{\theta})$; it then evaluates the corresponding values of the performance function $g(\boldsymbol{\theta})$ and counts the number of failure samples $N_F$; finally, the estimator $\hat{P}_F$ is given by $N_F/N$. Though $\hat{P}_F$ is asymptotically unbiased, the convergence rate of crude MCS is low. For example, to estimate a small failure probability of $10^{-4}$, a very large sample size on the order of $10^{6}$ is required for the estimator to achieve a Coefficient of Variation (CoV) close to 10%. Advanced stochastic simulation techniques have been developed to reduce the required sample size, including Subset Simulation (SS) [3], Importance Sampling (IS) [4], and Spherical Subset Simulation (S³) [5]. Despite the remarkable improvement in efficiency, they still require a moderately large number of samples to be evaluated (e.g., in SS, the sample size in each level is typically set to 1000).

(Motivation for surrogates) In practice, the performance function $g(\boldsymbol{\theta})$ of a complex system often involves implicit and time-demanding numerical models, such as the finite element models widely used by engineers to predict the responses of a system. What is worse, in the attempt to better mimic the behavior of a system, numerical models tend to become more and more sophisticated, making model evaluation increasingly computationally expensive. Consequently, the aforementioned reliability methods are likely to be inapplicable to complex systems when computational resources are limited.

(Review of surrogates and learning strategies) To alleviate this problem, surrogates (also referred to as metamodels or response surfaces) can be built in lieu of the original expensive performance functions. In the literature, different surrogates have been employed to perform reliability analysis, such as polynomials [6, 7], Gaussian processes (GP or Kriging) [8-12], polynomial chaos expansions (PCE) [13, 14], artificial neural networks (ANN) [15, 16] and support vector machines (SVM) [17]. The most critical part of efficiently building a surrogate is arguably the adaptive infill strategy, also referred to as adaptive design of experiments or a refinement scheme in some literature. Instead of densely filling the parameter space with support points, an adaptive infill strategy often starts with a coarse surrogate built upon a small number of support points, and then refines it progressively using judiciously selected support points.
Many GP-based adaptive infill strategies have been reported in the last decade in conjunction with stochastic-sampling-based reliability methods. For example, Bichon et al. [9] presented the Efficient Global Reliability Analysis (EGRA) method, where an Expected Feasibility Function (EFF) is maximized by a global optimization algorithm to select the best next infill support point in each iteration. More recently, Echard et al. [18] proposed a new method, AK-MCS, which combines GP with crude Monte Carlo Simulation. In this method, an active learning function called the U-function was developed to guide the selection of sequential support points. Later, a modified version, AK-IS [12], was proposed using the same learning function, which resorts to importance sampling to solve small-failure-probability cases. Alternatively, Dubourg et al. [11] adaptively built a GP surrogate as a substitute for the performance function and used it to construct a quasi-optimal importance sampling density. Surrogate-aided reliability analysis is very parsimonious with the help of progressive refinement, but most adaptive infill strategies in the current literature are customized for GP and specific reliability analysis problems only. This paper presents a new framework for surrogate-aided reliability analysis named Surrogates for Importance Sampling (S4IS). The framework is robust to different types of reliability analysis problems (e.g., high dimensionality and small failure probability) and has the potential to incorporate different types of surrogates. The estimation of the failure probability and the building of the surrogate are performed in two stages. In Stage 1, support points are generated to build a coarse surrogate which can identify multiple failure regions; the next stage focuses more on the important regions to improve the failure probability estimator.
The adaptive infill of support points in the two stages is guided by two different learning functions such that a dynamic balance between exploration and exploitation can be achieved. The proposed learning functions are convex combinations of several objectives. The remainder of this paper is organized as follows. Section 2 introduces the importance sampling technique for reliability analysis as well as the existing learning functions used to guide the infill of support points. In Section 3, the proposed two-stage framework S4IS is discussed in detail. To validate the proposed framework, five illustrative examples are investigated in Section 4, involving a system reliability problem, highly nonlinear limit-state functions, different reliability levels and moderately high dimensionality. The performance of S4IS is compared with crude MCS, FORM and AK-IS.

Surrogate-aided importance sampling
Importance sampling
It is common that the system failure $F$ in Equation (1) is a rare event, meaning that the failure region $\Omega_F$ lies in the tail part of $f(\boldsymbol{\theta})$. In this case, crude MCS may not be able to simulate sufficient failure samples to achieve a small variance of the estimated $P_F$. Alternatively, Importance Sampling (IS) [4] is widely used as a variance reduction technique which computes the expectation with respect to a proposed instrumental PDF. Given an instrumental PDF $q(\boldsymbol{\theta})$ supported on $\Omega \subseteq \mathbb{R}^n$, the failure probability in Equation (1) can be rewritten as:

$$P_{F,IS} = \int_{\Omega} I_F(\boldsymbol{\theta}) \frac{f(\boldsymbol{\theta})}{q(\boldsymbol{\theta})}\, q(\boldsymbol{\theta})\,\mathrm{d}\boldsymbol{\theta} = \mathbb{E}_q\!\left[ I_F(\boldsymbol{\theta}) \frac{f(\boldsymbol{\theta})}{q(\boldsymbol{\theta})} \right] \quad (2)$$

Instead of sampling from the original PDF $f(\boldsymbol{\theta})$, we can sample from $q(\boldsymbol{\theta})$ and weight $I_F(\boldsymbol{\theta})$ with the likelihood ratio $f(\boldsymbol{\theta})/q(\boldsymbol{\theta})$. The IS estimator $\hat{P}_{F,IS}$ can then be expressed as:

$$\hat{P}_{F,IS} = \frac{1}{N_{IS}} \sum_{i=1}^{N_{IS}} I_F\big(\boldsymbol{\theta}^{(i)}\big) \frac{f\big(\boldsymbol{\theta}^{(i)}\big)}{q\big(\boldsymbol{\theta}^{(i)}\big)}, \quad \boldsymbol{\theta}^{(i)} \sim q(\boldsymbol{\theta}) \quad (3)$$

where $\{\boldsymbol{\theta}^{(i)};\ i: 1 \to N_{IS}\}$ denotes a set of $N_{IS}$ i.i.d. (independent and identically distributed) samples from $q(\boldsymbol{\theta})$. The IS estimator is unbiased and its variance is estimated by:

$$\mathrm{Var}\big[\hat{P}_{F,IS}\big] = \frac{1}{N_{IS}(N_{IS}-1)} \sum_{i=1}^{N_{IS}} \left[ I_F\big(\boldsymbol{\theta}^{(i)}\big) \frac{f\big(\boldsymbol{\theta}^{(i)}\big)}{q\big(\boldsymbol{\theta}^{(i)}\big)} - \hat{P}_{F,IS} \right]^2 = \frac{1}{N_{IS}-1} \left[ \frac{1}{N_{IS}} \sum_{i=1}^{N_{IS}} \frac{I_F\big(\boldsymbol{\theta}^{(i)}\big) f\big(\boldsymbol{\theta}^{(i)}\big)^2}{q\big(\boldsymbol{\theta}^{(i)}\big)^2} - \hat{P}_{F,IS}^2 \right] \quad (4)$$

The selection of the instrumental PDF is of crucial importance in reducing the variance. It is not difficult to derive that minimizing the variance gives the optimal instrumental PDF

$$q^*(\boldsymbol{\theta}) = \frac{I_F(\boldsymbol{\theta}) f(\boldsymbol{\theta})}{\int_{\Omega} I_F(\boldsymbol{\theta}) f(\boldsymbol{\theta})\,\mathrm{d}\boldsymbol{\theta}} = \frac{I_F(\boldsymbol{\theta}) f(\boldsymbol{\theta})}{P_F} \quad (5)$$

Although this zero-variance optimal density in Equation (5) is not usable without knowing $P_F$, it can guide the selection of $q(\boldsymbol{\theta})$ and provide insights into the design of the importance sampling scheme. In practice, quasi-optimal instrumental PDFs are commonly used.

Building the surrogate adaptively

(Surrogate for what) Assume the performance function of interest $g(\boldsymbol{\theta})$ is a time-demanding black-box function.
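Before surrogates enter the picture, the plain IS estimator and its variance estimate of Equations (3) and (4) can be sketched in a few lines of numpy. This is our own minimal illustration with a toy tail event, not the paper's implementation:

```python
import numpy as np

def norm_pdf(x, mu=0.0, sigma=1.0):
    """Univariate normal density (kept dependency-free for this sketch)."""
    return np.exp(-0.5 * ((x - mu) / sigma) ** 2) / (sigma * np.sqrt(2.0 * np.pi))

def is_estimator(indicator, f_pdf, q_pdf, q_sampler, n_is, rng):
    """Importance-sampling estimate of P_F (Eq. (3)) and its variance (Eq. (4))."""
    theta = q_sampler(n_is, rng)                         # theta^(i) ~ q
    w = indicator(theta) * f_pdf(theta) / q_pdf(theta)   # I_F times likelihood ratio
    p_hat = w.mean()
    var_hat = (np.mean(w ** 2) - p_hat ** 2) / (n_is - 1)
    return p_hat, var_hat

# toy rare event: theta ~ N(0,1), failure when theta > 3 (P_F = 1 - Phi(3) ~ 1.35e-3);
# the instrumental density N(3,1) is centred on the failure boundary
rng = np.random.default_rng(0)
p_hat, var_hat = is_estimator(
    indicator=lambda t: (t > 3.0).astype(float),
    f_pdf=norm_pdf,
    q_pdf=lambda t: norm_pdf(t, mu=3.0),
    q_sampler=lambda n, r: r.normal(3.0, 1.0, n),
    n_is=100_000,
    rng=rng,
)
```

With the instrumental density centred on the failure region, the estimated variance is orders of magnitude below the crude MCS variance $p(1-p)/N$ for the same sample size, which is precisely the variance reduction that motivates Equation (5).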
The main goal is to build a surrogate $\hat{g}(\boldsymbol{\theta})$ of the original function from a dataset of support points $\mathcal{S} = \{(\boldsymbol{\theta}^{(i)}, y^{(i)});\ i: 1 \to N_S\}$ so that all failure regions $\Omega_F$ over the parameter space $\Omega$ can be identified accurately. In other words, in contrast with general-purpose surrogates that are optimized to be accurate over the whole parameter space, surrogates for reliability analysis focus on the accuracy of the failure region boundaries $\partial\Omega_F$ defined by the limit-state function $g(\boldsymbol{\theta}) = 0$. Given the surrogate $\hat{g}(\boldsymbol{\theta})$, the binary indicator function in Equations (2) and (3) can be approximated by:

$$\hat{I}_F(\boldsymbol{\theta}) = \begin{cases} 1, & \hat{g}(\boldsymbol{\theta}) \le 0 \\ 0, & \hat{g}(\boldsymbol{\theta}) > 0 \end{cases} \quad (6)$$

(Active and passive infill strategy) An infill strategy should be employed to select a finite number of support points aiming to gain the maximal information for building the surrogate. Instead of using dense support points to fit a general-purpose surrogate, an adaptive infill strategy starts with a small set of support points to fit an initial surrogate, and new support points are then sequentially selected to refine it. This approach proves far more efficient than the non-adaptive one since it is customized for the analysis of interest (e.g., reliability analysis in this paper).

(How to build adaptively) Finding the best next support point(s) in each iteration is guided by some criteria or objectives in terms of optimization. In general, on one hand, we expect a new support point to explore the under-explored regions where existing support points are sparse; on the other hand, the previously explored regions should be exploited to improve the accuracy over the regions of interest. These two criteria are referred to as exploration and exploitation, respectively. In the context of reliability analysis, exploration involves adding more support points to sparsely populated regions in order to identify the failure region(s), and exploitation involves zooming into the regions that contribute most to the integral for estimating the failure probability.
As exploration and exploitation are conflicting goals, the problem of finding the next support point(s) is essentially a multi-objective optimization problem. Various learning functions have been proposed as objectives to solve this multi-objective optimization problem. An early learning function called the Expected Feasibility Function (EFF) was introduced by Bichon et al. [9] in the Efficient Global Reliability Analysis (EGRA). EFF converts the multi-objective problem to a single-objective problem, acting as an indicator of how well the true value is expected to satisfy the limit-state function $g(\boldsymbol{\theta}) = 0$ within a vicinity of $\pm\epsilon$:

$$EFF(\boldsymbol{\theta}) = \int_{-\epsilon}^{+\epsilon} \big(\epsilon - |\hat{g}(\boldsymbol{\theta})|\big)\, f_{\hat{G}}\,\mathrm{d}\hat{g}(\boldsymbol{\theta}) \quad (7)$$

where $\hat{g}(\boldsymbol{\theta})$ is a realization of a Gaussian Process (GP) distributed as $f_{\hat{G}}$, and $\epsilon$ is set to be proportional to the standard deviation of $\hat{g}(\boldsymbol{\theta})$ (e.g., $\epsilon = 2\sigma_{\hat{G}}$ in [9]). The EFF in Equation (7) can be expressed in analytical form when the random variables $\boldsymbol{\Theta}$ are transformed into the uncorrelated standard normal space via isoprobabilistic transformations [19-21]. The support point with the maximum expected feasibility is found by a global optimization method. It can be observed from Equation (7) that the optimization of EFF favors points that are close to the limit-state boundary (small $|\hat{g}(\boldsymbol{\theta})|$) and have a large prediction variance (large $\sigma_{\hat{G}}$). Similarly, another learning function in terms of the prediction mean and variance of a GP surrogate, called the U-function, was introduced in AK-MCS [18] and AK-IS [12]:

$$U(\boldsymbol{\theta}) = \frac{|\mu_{\hat{G}}(\boldsymbol{\theta})|}{\sigma_{\hat{G}}(\boldsymbol{\theta})} \quad (8)$$

where $\mu_{\hat{G}}(\boldsymbol{\theta})$ and $\sigma_{\hat{G}}(\boldsymbol{\theta})$ denote the prediction mean and standard deviation of the GP, respectively. The next point(s) can be obtained by minimizing $U(\boldsymbol{\theta})$. A sampling-based approach replaces the global optimization step of EGRA: the values of $U(\boldsymbol{\theta})$ are calculated for a large number of candidate support points drawn from the original distribution of the random variables. The above two learning functions interpret exploitation as closeness to the limit-state boundary but neglect the fact that high-density regions in the failure domain contribute more to the failure probability than low-density ones. To address this issue, Sun et al. [22] combined the statistical information provided by the GP surrogate with the joint PDF of the random variables, proposing a new learning function named the Least Improvement Function (LIF).

The proposed framework
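For concreteness, before the framework itself is presented, the U-function of Equation (8) can be sketched on a toy one-dimensional problem. The tiny GP interpolator below is our own stand-in for a full Kriging implementation:

```python
import numpy as np

def rbf(A, B, ls=1.0):
    """Squared-exponential kernel matrix between the row vectors of A and B."""
    d2 = ((A[:, None, :] - B[None, :, :]) ** 2).sum(-1)
    return np.exp(-0.5 * d2 / ls ** 2)

class TinyGP:
    """Minimal GP interpolator giving the posterior mean and std needed by Eq. (8)."""
    def __init__(self, X, y, ls=1.0, nugget=1e-10):
        self.X, self.ls = X, ls
        K = rbf(X, X, ls) + nugget * np.eye(len(X))
        self.L = np.linalg.cholesky(K)
        self.alpha = np.linalg.solve(self.L.T, np.linalg.solve(self.L, y))

    def predict(self, Xs):
        Ks = rbf(Xs, self.X, self.ls)
        mu = Ks @ self.alpha
        v = np.linalg.solve(self.L, Ks.T)
        var = np.clip(1.0 - (v ** 2).sum(0), 1e-24, None)
        return mu, np.sqrt(var)

def u_function(gp, candidates):
    """U learning function (Eq. (8)): low U marks points whose predicted sign is
    most uncertain, i.e. near the limit state and/or with a high predictive std."""
    mu, sigma = gp.predict(candidates)
    return np.abs(mu) / np.maximum(sigma, 1e-12)

# toy 1D performance function g(x) = x - 1, support points away from the limit state
X = np.array([[-2.0], [0.0], [2.0], [3.0]])
gp = TinyGP(X, (X - 1.0).ravel())
cand = np.linspace(-2.0, 3.0, 51).reshape(-1, 1)
x_next = cand[np.argmin(u_function(gp, cand))]  # next support point, near x = 1
```

At existing support points the predictive standard deviation collapses, so $U$ is huge there; the minimum lands near the predicted zero crossing of the surrogate, which is exactly the exploitation behavior described above.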
In the existing framework described in [12] and [23], the first stage consists of the search for the most probable point (MPP) and reliability analysis using FORM; in the second stage, IS is performed with a multivariate Gaussian distribution as the instrumental PDF, centered on the MPP and with the identity matrix as covariance. However, this framework can only incorporate a Gaussian Process (GP) as the surrogate; furthermore, it is not able to identify multiple MPPs or multiple failure regions. In this paper, a new two-stage framework for surrogate-aided importance sampling called Surrogates for Importance Sampling (S4IS) is presented, as illustrated in Figure 1. In the first stage, a coarse surrogate is built adaptively to identify all failure regions in the parameter space. The second stage starts with the construction of the Gaussian Mixture (GM) instrumental PDF for IS based on the identified failure regions, then proceeds to refine the surrogate mainly over the important regions to reduce the error of the IS estimator. The two learning functions proposed in this framework are formulated as convex combinations of different exploration and exploitation objectives and are applicable to different types of surrogates. Also, the learning functions in the two stages balance exploration and exploitation dynamically, with the one in the second stage focusing more on exploitation to ensure fast convergence. As shown in Figure 1, infill support points in Stage 1 scatter more uniformly over the whole parameter space than those in Stage 2.
Figure 1: Schematic diagram of the proposed two-stage framework Surrogates for Importance Sampling (S4IS)
Stage 1

Initialization
Throughout this paper, an isoprobabilistic transformation $T$ [19-21] is applied to the original random variables $\boldsymbol{\Theta}$ to transform them into independent standard normal random variables $\boldsymbol{U}$, i.e., $\boldsymbol{U} = T(\boldsymbol{\Theta})$. The IS estimator in Equation (3) can be rewritten as:

$$\hat{P}_{F,IS} = \frac{1}{N_{IS}} \sum_{i=1}^{N_{IS}} I_F\big(\mathbf{u}^{(i)}\big) \frac{\varphi\big(\mathbf{u}^{(i)}\big)}{q\big(\mathbf{u}^{(i)}\big)}, \quad \mathbf{u}^{(i)} \sim q(\mathbf{u}) \quad (9)$$

where $\varphi(\cdot)$ is the multivariate standard normal PDF, and the failure region in $\boldsymbol{U}$-space is given by $\Omega_F = \{\mathbf{u}: g(T^{-1}(\mathbf{u})) \le 0\}$. To explore the parameter space, a large number $N_{c1}$ of samples (in this paper, $N_{c1} = \min(10^{6}, 10^{n})$) are randomly generated from a distribution $q_1(\mathbf{u})$ in the first stage. They are treated as the inputs of candidate support points $\mathbf{U}_{c1} = \{\mathbf{u}^{(i)} \sim q_1(\mathbf{u});\ i: 1 \to N_{c1}\}$. After selecting a small subset $\mathbf{U}_{S1,0}$ from $\mathbf{U}_{c1}$, the outputs of the subset are evaluated:

$$\mathbf{y}_{S1,0} = \big\{ y^{(i)} = g\big(T^{-1}(\mathbf{u}^{(i)})\big);\ \mathbf{u}^{(i)} \in \mathbf{U}_{S1,0} \big\} \quad (10)$$

where $\mathbf{y}_{S1,0}$ denotes the outputs of the time-demanding performance function for $\mathbf{U}_{S1,0}$. Given the dataset of initial support points $\mathcal{S}_{1,0} = \{(\boldsymbol{\theta}_{S1,0}, \mathbf{y}_{S1,0})\} = \{(T^{-1}(\mathbf{U}_{S1,0}), \mathbf{y}_{S1,0})\}$, the initial surrogate $\hat{g}_{1,0}$ can be built to predict the output:

$$\hat{y} = \hat{g}_{1,0}(\boldsymbol{\theta}) = \hat{g}_{1,0}\big(T^{-1}(\mathbf{u})\big) \quad (11)$$

In this paper, we suggest the size of $\mathbf{U}_{S1,0}$ to be $N_{S1,0} = \max[12, (n+1)(n+2)/2]$, where $n$ is the dimension of $\mathbf{u}$. The inputs of candidate support points $\mathbf{U}_{c1}$ in the first stage could be simulated from the standard normal distribution, i.e., $q_1(\mathbf{u}) = \varphi(\mathbf{u})$. However, for cases of small failure probabilities, only a few or even none of $\mathbf{U}_{c1}$ would scatter in the failure regions, leading to at least two problems: (i) since the inputs of support points are selected from the candidates, the surrogate fitted using the corresponding support points will hardly be accurate near the limit-state boundary; (ii) multiple failure regions cannot be effectively identified. In this paper, $\mathbf{U}_{c1}$ are generated uniformly in the hypercubic space ranging from -5 to 5 in each dimension so as to explore the whole parameter space, i.e., $q_1(\mathbf{u}) = \frac{1}{10^n}$, $\mathbf{u} \in [-5,5]^n$.

Adaptive refinement of the surrogate
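Before the refinement loop, the Stage-1 initialization above (uniform candidates on $[-5,5]^n$ and the suggested initial design size) can be sketched as follows; this is our own illustration of the stated rules:

```python
import numpy as np

def initial_design_size(n_dim):
    """Suggested number of initial support points: max(12, (n+1)(n+2)/2)."""
    return max(12, (n_dim + 1) * (n_dim + 2) // 2)

def stage1_candidates(n_dim, rng, cap=10**6):
    """Stage-1 candidate inputs drawn uniformly on [-5, 5]^n (q_1(u) = 1/10^n),
    with the candidate count N_c1 = min(cap, 10^n)."""
    n_c1 = int(min(cap, 10 ** n_dim))
    return rng.uniform(-5.0, 5.0, size=(n_c1, n_dim))
```

For a two-dimensional problem this gives twelve initial support points chosen from a hundred uniform candidates; the quadratic growth of the design size mirrors the number of terms in a full second-order polynomial in $n$ variables.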
At the beginning of iteration $j$ ($j = 1, 2, \cdots$), the current surrogate $\hat{g}_{1,j}$ has been fitted using the dataset of support points $\mathcal{S}_{1,j} = \{(\boldsymbol{\theta}_{S1,j}, \mathbf{y}_{S1,j})\} = \{(T^{-1}(\mathbf{U}_{S1,j}), \mathbf{y}_{S1,j})\}$ (for $j = 1$, $\hat{g}_{1,j} = \hat{g}_{1,0}$ and $\mathcal{S}_{1,j} = \mathcal{S}_{1,0}$). Our aim is to find the best next input of support point from the candidates $\mathbf{U}_{c1} \backslash \mathbf{U}_{S1,j}$. The existing inputs of support points $\mathbf{U}_{S1,j}$ are excluded from the candidate set $\mathbf{U}_{c1}$ to avoid duplication. The learning function is essential to the selection of the location of new support points. For the first stage, the learning function balances exploitation and exploration by minimizing the distance to the limit-state boundary (exploitation) while maximizing the distance to the existing support points (exploration):

$$LF_1(\mathbf{u}) = \big|\hat{g}_{1,j}\big(T^{-1}(\mathbf{u})\big)\big| - \big\|\mathbf{u}, \mathbf{U}_{S1,j}\big\|_{\min} \quad (12)$$

where $\|\mathbf{u}, \mathbf{U}_{S1,j}\|_{\min}$ is the minimum distance between $\mathbf{u}$ and $\mathbf{U}_{S1,j}$ in iteration $j$. Therefore, the new input of support point selected from $\mathbf{U}_{c1} \backslash \mathbf{U}_{S1,j}$ is obtained as:

$$\mathbf{u}_j^* = \underset{\mathbf{u}^{(i)} \in \mathbf{U}_{c1} \backslash \mathbf{U}_{S1,j}}{\operatorname{argmin}} LF_1\big(\mathbf{u}^{(i)}\big) \quad (13)$$

Equation (12) differs from the learning functions in Equations (7) and (8) in that exploration is conducted by maximizing the minimal distance between the new and existing inputs of support points rather than maximizing the prediction variance. Since the learning function in Equation (12) does not rely on the prediction statistics provided by a GP, it can be applied to other types of surrogates (SVM, ANN, PCE, etc.). Also, it converts the multi-objective optimization problem into a single-objective one by an equally weighted sum of objectives.
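A minimal sketch of Equations (12) and (13), using the Euclidean distance, may look as follows (the toy surrogate and points are ours, not from the paper):

```python
import numpy as np

def min_dist(candidates, support):
    """Minimum Euclidean distance from each candidate to the support set."""
    d = np.linalg.norm(candidates[:, None, :] - support[None, :, :], axis=-1)
    return d.min(axis=1)

def lf1(g_hat, candidates, support):
    """Stage-1 learning function (Eq. (12)): a small |g_hat| exploits the
    predicted limit state; subtracting the min-distance term rewards exploration."""
    return np.abs(g_hat(candidates)) - min_dist(candidates, support)

# toy surrogate g_hat(u) = u_1 - 1 (limit state u_1 = 1); with one support point at
# the origin, the candidate on the limit state AND far from the support wins
g_hat = lambda U: U[:, 0] - 1.0
support = np.array([[0.0, 0.0]])
cand = np.array([[1.0, 0.1], [1.0, 3.0], [4.0, 4.0]])
u_next = cand[np.argmin(lf1(g_hat, cand, support))]
```

Of the two candidates lying exactly on the predicted limit state, the one farther from the existing support point is selected, which is the intended exploration behavior.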
The convex combination of objectives ensures that the optimal solution is Pareto optimal. Various distance metrics can be used to measure the distance between two points $\mathbf{u}^{(i)}$ and $\mathbf{u}^{(j)}$ in real-valued vector spaces. The Euclidean distance is suggested in this paper, which is calculated by:

$$\big\|\mathbf{u}^{(i)}, \mathbf{u}^{(j)}\big\| = \sqrt{\sum_{m=1}^{n} \big(u_m^{(i)} - u_m^{(j)}\big)^2} \quad (14)$$

Therefore, the minimal distance $\|\mathbf{u}, \mathbf{U}_{S1,j}\|_{\min}$, where $\mathbf{U}_{S1,j}$ is a set of inputs of support points, is given by:

$$\big\|\mathbf{u}, \mathbf{U}_{S1,j}\big\|_{\min} = \min_{\mathbf{u}^{(i)} \in \mathbf{U}_{S1,j}} \big\|\mathbf{u}, \mathbf{u}^{(i)}\big\| \quad (15)$$

The selected input $\mathbf{u}_j^*$ in Equation (13) is evaluated by the implicit performance function to obtain the output $y_j^* = g\big(T^{-1}(\mathbf{u}_j^*)\big)$. For the next iteration, the dataset of support points is updated as $\mathcal{S}_{1,j+1} = \mathcal{S}_{1,j} \cup \{(T^{-1}(\mathbf{u}_j^*), y_j^*)\}$ and the corresponding updated surrogate $\hat{g}_{1,j+1}$ is built accordingly.

Stopping criteria
In each iteration $j$ ($j = 1, 2, \cdots$), the failure probability is estimated by:

$$\hat{P}_F^{1,j} = \frac{1}{N_{c1}} \sum_{i=1}^{N_{c1}} \hat{I}_F\big(T^{-1}(\mathbf{u}^{(i)})\big) \frac{\varphi\big(\mathbf{u}^{(i)}\big)}{q_1\big(\mathbf{u}^{(i)}\big)}, \quad \mathbf{u}^{(i)} \in \mathbf{U}_{c1} \quad (16)$$

where the indicator function based on the surrogate $\hat{g}_{1,j}$ is written as:

$$\hat{I}_F\big(T^{-1}(\mathbf{u})\big) = \begin{cases} 1, & \hat{g}_{1,j}\big(T^{-1}(\mathbf{u})\big) \le 0 \\ 0, & \hat{g}_{1,j}\big(T^{-1}(\mathbf{u})\big) > 0 \end{cases} \quad (17)$$

Note that $\hat{P}_F^{1,j}$ may not be a good estimator because $\varphi(\cdot)$ and $q_1(\cdot)$ can have different supports, but it can be treated as a measure of how well all candidates $\mathbf{U}_{c1}$ for the first stage have been classified into the safe and failure regions. To decide in which iteration to terminate Stage 1, an early stopping criterion is used in this paper. We stop at iteration $j_{it1}$ if

$$\frac{\left| \hat{P}_F^{1,j_{it1}} - \frac{1}{\eta_1} \sum_{j=j_{it1}+1-\eta_1}^{j_{it1}} \hat{P}_F^{1,j} \right|}{\frac{1}{\eta_1} \sum_{j=j_{it1}+1-\eta_1}^{j_{it1}} \hat{P}_F^{1,j}} \le \epsilon_1 \quad (18)$$

where $\eta_1$ is a positive integer and $\epsilon_1$ is the tolerance on the relative error of the estimated failure probability with respect to the mean failure probability over the previous $\eta_1$ iterations. The setting $\eta_1 = 5$ works well for the proposed framework, and $\epsilon_1$, which controls the accuracy of the built surrogate, is set to 0.01. Another stopping criterion is $j_{it1}$ exceeding the maximal allowable number of iterations $[j_{it1}]$ for Stage 1, i.e., $j_{it1} > [j_{it1}]$.

At the end of the first stage, we have (i) a coarse estimator of the failure probability $\hat{P}_F^{1,j_{it1}}$; (ii) a coarse surrogate $\hat{g}_{1,j_{it1}}$; (iii) failure samples generated by evaluating all candidates $\mathbf{U}_{c1}$ using $\hat{g}_{1,j_{it1}}$. These results are further utilized in the second stage to improve the estimation of the failure probability, as discussed in the next section. It is also interesting to note that Stage 1 degenerates to AK-MCS [18] if $q_1(\mathbf{u}) = \varphi(\mathbf{u})$ in Equation (16), though with a different learning function.

High dimensionality
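Before turning to high dimensionality, the early-stopping rule of Equation (18) can be sketched as follows; the exact averaging window is our reading of the summation limits:

```python
import numpy as np

def stage1_converged(p_history, eta=5, tol=0.01):
    """Early-stopping check of Eq. (18): the current failure probability
    estimate must sit within a relative tolerance of the mean of the last
    eta estimates (the window convention is our interpretation)."""
    if len(p_history) < eta:
        return False
    recent_mean = float(np.mean(p_history[-eta:]))
    return abs(p_history[-1] - recent_mean) / recent_mean <= tol
```

A flat history of estimates triggers termination, while a sudden change in the latest estimate (as when a new failure region is discovered) keeps the refinement loop running.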
Sufficient inputs of candidate support points should be generated to ensure effective exploration. Considerable attention must be paid to the input size $N_{c1}$ when the dimension $n$ is large ($n \ge 10$). Specifically, for the space defined by $[-5,5]^n$, an input size of $10^n$ is required to achieve a density of one point per unit volume. Even though these candidate inputs are evaluated only by the surrogate, which requires relatively little computing effort, the computational burden is likely to be prohibitive when $n$ becomes large for users without high-performance computers. An alternative solution for exploration is to combine the search for multiple most probable points (MPPs) with FORM as discussed in [1]. Unlike sampling-based exploration, [1] searches for multiple MPPs via optimization by taking advantage of the first-order gradient information of the performance function, making it very efficient even in high-dimensional spaces. As a result, the MPPs as well as a FORM estimator of the failure probability can be obtained. The support points generated during the optimization procedure are used to fit the initial surrogate for Stage 2. This exploration solution is suggested when $n$ is large and computers with large memory and computing power are not available.

Stage 2
Instrumental probability density function
Similar to [10], in this paper we use a Gaussian Mixture (GM) as the instrumental PDF $q_2(\mathbf{u})$ in Stage 2 in order to take multiple MPPs into account. The GM is made up of $K$ equally weighted Gaussian distributions centered on the MPPs $\mathbf{U}_{MPP}$:

$$q_2(\mathbf{u}) = \frac{1}{K} \sum_{t=1}^{K} \mathcal{N}\big(\mathbf{u} \,\big|\, \mathbf{u}_{MPP,t}, \mathbf{I}\big) \quad (19)$$

where the mean $\mathbf{u}_{MPP,t}$ is the MPP of cluster $t$ and $\mathbf{I}$ represents the identity matrix. The MPPs $\mathbf{U}_{MPP}$ are determined approximately from the failure samples of Stage 1, denoted $\mathbf{U}_{c1,F} = \{\mathbf{u}^{(i)} \in \mathbf{U}_{c1};\ \hat{g}_{1,j_{it1}}(T^{-1}(\mathbf{u}^{(i)})) \le 0\}$ (or based on [1]). Specifically, the $K$-means algorithm [24] is first used to partition $\mathbf{U}_{c1,F}$ into $K$ clusters. In each cluster $t$, the failure sample with the largest $\varphi(\mathbf{u})$ is selected as the MPP for this cluster, i.e.,

$$\mathbf{u}_{MPP,t} = \underset{\mathbf{u}^{(i)} \in \mathbf{U}_{c1,F(t)}}{\operatorname{argmax}} \varphi\big(\mathbf{u}^{(i)}\big) \quad (20)$$

where $\mathbf{U}_{c1,F(t)}$ is the set of failure samples in cluster $t$. It should be noted that the number of clusters $K$ may not equal the number of failure regions. This is not a problem, as shown in the examples, if we set a reasonably large $K$: when $K$ is larger than the number of failure regions, connected failure regions are simply partitioned, and samples simulated from the GM $q_2(\mathbf{u})$ will still scatter over all the failure regions. For initialization in Stage 2, the initial surrogate is $\hat{g}_{2,0} = \hat{g}_{1,j_{it1}}$ and the initial dataset of support points is $\mathcal{S}_{2,0} = \mathcal{S}_{1,j_{it1}}$. To calculate the IS estimator and further refine the surrogate, a large number of samples $\mathbf{U}_{c2} = \{\mathbf{u}^{(i)} \sim q_2(\mathbf{u});\ i: 1 \to N_{c2}\}$ are generated from $q_2(\mathbf{u})$ and evaluated by $\hat{g}_{2,0}$. The initial IS estimator in the second stage is then given by:

$$\hat{P}_F^{2,0} = \frac{1}{N_{c2}} \sum_{i=1}^{N_{c2}} \hat{I}_F\big(T^{-1}(\mathbf{u}^{(i)})\big) \frac{\varphi\big(\mathbf{u}^{(i)}\big)}{q_2\big(\mathbf{u}^{(i)}\big)}, \quad \mathbf{u}^{(i)} \in \mathbf{U}_{c2} \quad (21)$$

Also, the samples $\mathbf{U}_{c2}$ serve as the inputs of candidate support points in Stage 2, whose outputs will be evaluated by the original performance function if selected. The sample size of $\mathbf{U}_{c2}$ is large in general (e.g., $N_{c2} = 10000$).

Adaptive refinement of the surrogate
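Before the refinement itself, the construction of the GM instrumental PDF in Equations (19) and (20) can be sketched as follows; this is our own minimal implementation, with a plain Lloyd's k-means (deterministic farthest-point initialization) standing in for the algorithm of [24]:

```python
import numpy as np

def phi(U):
    """Multivariate standard normal density with independent components."""
    n = U.shape[1]
    return np.exp(-0.5 * (U ** 2).sum(1)) / (2.0 * np.pi) ** (n / 2.0)

def kmeans(U, K, n_iter=50):
    """Lloyd's algorithm with deterministic farthest-point initialization."""
    centers = [U[0]]
    for _ in range(1, K):
        d = np.min([np.linalg.norm(U - c, axis=1) for c in centers], axis=0)
        centers.append(U[d.argmax()])
    centers = np.array(centers)
    for _ in range(n_iter):
        labels = np.linalg.norm(U[:, None] - centers[None], axis=-1).argmin(1)
        centers = np.array([U[labels == t].mean(0) for t in range(K)])
    return labels

def build_gm(failure_samples, K, rng):
    """Eqs. (19)-(20): cluster Stage-1 failure samples, take the highest-phi
    sample of each cluster as its MPP, and mix unit-covariance Gaussians."""
    labels = kmeans(failure_samples, K)
    mpps = np.array([failure_samples[labels == t][
        phi(failure_samples[labels == t]).argmax()] for t in range(K)])

    def q2_pdf(U):
        # N(u | m, I) equals the standard normal density shifted by m
        return np.mean([phi(U - m) for m in mpps], axis=0)

    def q2_sample(n):
        comp = rng.integers(K, size=n)
        return mpps[comp] + rng.standard_normal((n, mpps.shape[1]))
    return mpps, q2_pdf, q2_sample
```

With two well-separated clusters of failure samples, the two recovered MPPs sit in the high-$\varphi$ corners of their respective clusters, and samples from $q_2$ cover both failure regions.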
Differing from the learning function $LF_1(\cdot)$ of the first stage, the learning function in the second stage puts more emphasis on exploitation by zooming into the important regions. Observing Equation (21), we find that within the failure regions, the larger the likelihood ratio $\varphi/q_2$ is, the more a sample contributes to the IS estimator. Taking this into account, the learning function in Stage 2 adds another exploitation term to $LF_1(\cdot)$:

$$LF_2(\mathbf{u}) = \big|\hat{g}_{2,j}\big(T^{-1}(\mathbf{u})\big)\big| - \big\|\mathbf{u}, \mathbf{U}_{S2,j}\big\|_{\min} - \log\!\left[\frac{\varphi(\mathbf{u})}{q_2(\mathbf{u})}\right] \quad (22)$$

Like $LF_1(\cdot)$, the multi-objective optimization is converted into a single-objective optimization problem by a weighted convex combination of objectives. In iteration $j$ ($j = 1, 2, \cdots$), the current surrogate $\hat{g}_{2,j}$ has been fitted using the dataset of support points $\mathcal{S}_{2,j} = \{(\boldsymbol{\theta}_{S2,j}, \mathbf{y}_{S2,j})\} = \{(T^{-1}(\mathbf{U}_{S2,j}), \mathbf{y}_{S2,j})\}$ (for $j = 1$, $\hat{g}_{2,j} = \hat{g}_{2,0}$ and $\mathcal{S}_{2,j} = \mathcal{S}_{2,0}$). The new input of support point is selected from $\mathbf{U}_{c2} \backslash \mathbf{U}_{S2,j}$ by minimizing the learning function:

$$\mathbf{u}_j^* = \underset{\mathbf{u}^{(i)} \in \mathbf{U}_{c2} \backslash \mathbf{U}_{S2,j}}{\operatorname{argmin}} LF_2\big(\mathbf{u}^{(i)}\big) \quad (23)$$

Again, $\mathbf{u}_j^*$ is evaluated by the original performance function, and the new support point $(T^{-1}(\mathbf{u}_j^*), y_j^*)$ is appended to $\mathcal{S}_{2,j}$ to update the surrogate.
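A minimal sketch of Equation (22) follows (the toy densities and points are ours); note that when $q_2 = \varphi$ the log term vanishes and $LF_2$ reduces to $LF_1$:

```python
import numpy as np

def lf2(g_hat, candidates, support, phi_pdf, q2_pdf):
    """Stage-2 learning function (Eq. (22)): LF_1 plus a -log(phi/q_2) term that
    favours candidates with a large likelihood ratio, i.e. those dominating the
    IS estimator of Eq. (21)."""
    d = np.linalg.norm(candidates[:, None, :] - support[None, :, :], axis=-1)
    return (np.abs(g_hat(candidates)) - d.min(axis=1)
            - np.log(phi_pdf(candidates) / q2_pdf(candidates)))

# toy 2D setup: standard normal density, surrogate g_hat(u) = u_1 - 1, one support
# point at the origin; taking q_2 = phi makes the log term vanish identically
phi = lambda U: np.exp(-0.5 * (U ** 2).sum(1)) / (2.0 * np.pi)
g_hat = lambda U: U[:, 0] - 1.0
support = np.array([[0.0, 0.0]])
cand = np.array([[1.0, 0.1], [1.0, 3.0]])
vals = lf2(g_hat, cand, support, phi, phi)
```

In an actual Stage 2 run, $q_2$ would be the Gaussian mixture of Equation (19), so candidates with $\varphi$ large relative to $q_2$ receive an extra negative contribution and are selected sooner.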
Stopping criteria
Similar to the first stage, the iterative procedure terminates when either the IS estimator converges or the number of iterations $j_{it2}$ exceeds the predefined maximal allowable number of iterations $[j_{it2}]$. The convergence criterion is expressed as:

$$\frac{\left| \hat{P}_F^{2,j_{it2}} - \frac{1}{\eta_2} \sum_{j=j_{it2}+1-\eta_2}^{j_{it2}} \hat{P}_F^{2,j} \right|}{\frac{1}{\eta_2} \sum_{j=j_{it2}+1-\eta_2}^{j_{it2}} \hat{P}_F^{2,j}} \le \epsilon_2 \quad (24)$$

where $\eta_2 = 5$ and $\epsilon_2 = 0.001$.

Flowchart
The implementation of S4IS is summarized in Figure 2. To begin, a large number of inputs of candidate support points (or candidates) are generated from $q_1(\cdot)$ for Stage 1; these are later selected adaptively to refine the surrogate in each iteration. To explore the whole parameter space effectively, heavy-tailed distributions $q_1(\cdot)$, such as uniform distributions, are suggested. Based on the information about the failure regions obtained in Stage 1, Stage 2 starts with the construction of the instrumental probability density function $q_2(\cdot)$ for IS. Again, after inputs of support points are selected from the candidates generated from $q_2(\cdot)$, they are evaluated by the performance function to further refine the surrogate. A dynamic balance between exploration and exploitation is achieved across the two stages by (i) different candidates to be evaluated by the performance function; (ii) different learning functions to select the best inputs of support points.

Figure 2: Flowchart of the proposed framework S4IS. Filled rectangles represent where the performance function is evaluated.

Illustrative examples
In this section, five examples are discussed to illustrate the effectiveness and efficiency of S4IS for different types of reliability analysis problems, featuring system reliability, highly nonlinear limit-state functions, small failure probabilities, and moderately high dimensionality. The efficiency is measured in terms of $N_{call}$, i.e., the number of performance function evaluations required to achieve a small Coefficient of Variation (CoV). For all examples below, we use a Gaussian Process (GP) as the surrogate, and ten replicates are run to compute the statistics of the final estimator of the failure probability.
Example 1: a series system with four failure regions [18, 25]
This example involves a series system in which the performance function returns the smallest value of four component performance functions. The purpose of this example is to validate the proposed framework for system reliability analysis with multiple failure regions. The performance function is:

\[
g(\theta)=\min\begin{cases}
3+0.1(\theta_1-\theta_2)^2-(\theta_1+\theta_2)/\sqrt{2}\\
3+0.1(\theta_1-\theta_2)^2+(\theta_1+\theta_2)/\sqrt{2}\\
(\theta_1-\theta_2)+3\sqrt{2}\\
(\theta_2-\theta_1)+3\sqrt{2}
\end{cases} \qquad (25)
\]
Stage 1

Initialization:
1. Generate a large number of inputs of candidate support points Θ_c1 = {θ^(i) ~ q_1(θ); i = 1, …, N_c1} from q_1(·).
2. Select the inputs of the initial support points Θ_s1,0 ⊂ Θ_c1.
3. Evaluate the performance function at Θ_s1,0 to obtain the initial support points {(θ, g(θ)): θ ∈ Θ_s1,0}.
4. Build the initial surrogate ĝ_1,0 based on the initial support points.

Adaptive refinement of the surrogate:
1. Evaluate the current surrogate ĝ_1,j for the candidates Θ_c1.
2. Estimate the failure probability P̂_F1,j.
3. Find the input of the next support point by minimizing the learning function: θ* = argmin over θ^(i) ∈ Θ_c1 \ Θ_s1,j of LF(θ^(i)).
4. Check the stopping criteria. If the procedure continues:
   a) Update the support points by adding the new one (θ*, g(θ*)).
   b) Update the surrogate to ĝ_1,j+1.
   c) Return to Step 1.

Stage 2

Initialization:
1. Construct the instrumental probability density function q_2(·).
2. Generate a large number of inputs of candidate support points for Stage 2: Θ_c2 = {θ^(i) ~ q_2(θ); i = 1, …, N_c2}.
3. Initialize the Stage 2 surrogate as ĝ_2,0 = ĝ_1,end, where ĝ_1,end is the final surrogate of Stage 1.

Adaptive refinement of the surrogate:
1. Evaluate ĝ_2,j for the candidates Θ_c2.
2. Estimate the failure probability P̂_F2,j.
3. Find the input of the next support point by minimizing the learning function: θ* = argmin over θ^(i) ∈ Θ_c2 \ Θ_s2,j of LF(θ^(i)).
4. Check the stopping criteria. If the procedure continues:
   a) Update the support points by adding the new one (θ*, g(θ*)).
   b) Update the surrogate to ĝ_2,j+1.
   c) Return to Step 1.
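The adaptive refinement loop shared by the two stages can be sketched in Python as follows. This is a minimal illustration, not the reference implementation: `fit_surrogate`, `learning_fn`, and `estimate_pf` are placeholder names standing in for the stage-specific surrogate fit, learning function, and failure-probability estimator, and the stopping rule mirrors the form of the convergence criterion of Eq. (24).

```python
import numpy as np

def converged(pf_history, n_s=5, eps_s=0.001):
    """Stopping rule in the spirit of Eq. (24): the deviation of the largest
    of the last n_s failure-probability estimates from their mean, relative
    to that mean, must fall below the tolerance eps_s."""
    if len(pf_history) < n_s:
        return False  # not enough iterations yet
    recent = pf_history[-n_s:]
    mean = sum(recent) / n_s
    return abs(max(recent) - mean) / mean <= eps_s

def refine(g, candidates, x, y, fit_surrogate, learning_fn,
           estimate_pf, max_iter=100):
    """Adaptive refinement common to both stages of S4IS (illustrative).

    g            : expensive performance function (one call per iteration)
    candidates   : (N, d) array of candidate support-point inputs
    x, y         : current support-point inputs and their g-values
    fit_surrogate, learning_fn, estimate_pf : stage-specific callables
    """
    surrogate = fit_surrogate(x, y)
    used = set()
    pf_history = []
    for _ in range(max_iter):
        pf_history.append(estimate_pf(surrogate, candidates))
        scores = learning_fn(surrogate, candidates).astype(float)
        scores[list(used)] = np.inf        # exclude existing support points
        best = int(np.argmin(scores))      # input of the next support point
        used.add(best)
        x = np.vstack([x, candidates[best]])
        y = np.append(y, g(candidates[best]))  # evaluate the true g
        surrogate = fit_surrogate(x, y)        # refit with the new point
        if converged(pf_history):
            break
    return surrogate, x, y, pf_history
```

In practice the surrogate would be a GP and the learning functions those proposed in the paper; the skeleton only shows how candidate selection, performance-function evaluation, and refitting interleave.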
Here θ_1 and θ_2 are parameter values of two independent standard normal random variables Θ_1 ~ N(0,1) and Θ_2 ~ N(0,1). Figure 3 shows how the support points are adaptively selected to construct the surrogate in S4IS. They are scattered over all four failure regions, especially near the four failure boundaries. As a result, as can be seen from Figure 3, the original limit-state function is well approximated by the surrogate. Table 1 compares the results from S4IS with those from crude MCS, FORM, and AK-IS [18]. The failure probability from MCS, P̂_F,MCS, is taken as the ground truth of the problem, and the relative error ε_r is calculated by:

\[
\varepsilon_r=\frac{\left|\hat{P}_{F,\mathrm{MCS}}-\hat{P}_F\right|}{\hat{P}_{F,\mathrm{MCS}}} \qquad (26)
\]

where P̂_F is the failure probability obtained from the reliability analysis by the other method under comparison. Due to the nonlinearity of the limit-state function and the multiple failure regions shown in Figure 3, the failure probability from FORM is inaccurate, with ε_r = 69.8%. No improvement is observed for AK-IS because its importance sampling in the second stage is based on the FORM results of the first stage. For S4IS, an estimator with a large CoV is obtained in Stage 1 from the coarse surrogate; the estimator is then improved significantly in Stage 2, resulting in ε_r = 0.5% and CoV = 1.06%. S4IS is also more efficient than AK-IS, requiring on average only 60.6 evaluations of the performance function by the end of Stage 2. It should be pointed out that for system reliability problems, we suggest using a multi-output surrogate or multiple surrogates to approximate all component performance functions and then applying the min (for series systems) or max (for parallel systems) function, rather than approximating the aggregated performance function only. The reason is that the min or max of simple component performance functions may become too complex to approximate accurately, leading to a large error in the final failure probability estimator.
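For reference, the four-branch series system of Eq. (25) can be implemented directly as follows (an illustrative sketch; the function name is ours). The system-level value is the minimum over the four component performance functions, so failure of any single component (g_k ≤ 0) implies system failure.

```python
import numpy as np

def g_series(theta):
    """Four-branch series-system performance function of Example 1, Eq. (25)."""
    t1, t2 = theta
    branches = [
        3 + 0.1 * (t1 - t2) ** 2 - (t1 + t2) / np.sqrt(2),
        3 + 0.1 * (t1 - t2) ** 2 + (t1 + t2) / np.sqrt(2),
        (t1 - t2) + 3 * np.sqrt(2),
        (t2 - t1) + 3 * np.sqrt(2),
    ]
    return min(branches)  # series system: smallest component value
```

Following the recommendation above, one would fit a surrogate per branch and take the min of the surrogate predictions, rather than fitting a single surrogate to `g_series` itself.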
Figure 3: Adaptively selected support points and the limit-state function using S4IS in Example 1

Table 1: Comparison of the results using different methods for Example 1

Method           ε_r      N_call
MCS              --       --
FORM             69.8%    12
AK-IS            73.6%    71.1
S4IS (Stage 1)   17.1%    35.7
S4IS (Stage 2)   0.5%     60.6
Example 2: a nonlinear oscillator [18, 25, 26]
Example 2 investigates the effect of a highly nonlinear limit-state function on the performance of S4IS. The oscillator of interest is shown in Figure 4, and the statistical properties of its six random variables are summarized in Table 2. The performance function can be expressed as:

\[
g(\theta)=3r-\left|\frac{2F_1}{m\omega_0^2}\sin\!\left(\frac{\omega_0 t_1}{2}\right)\right| \qquad (27)
\]

where θ = [c_1, c_2, m, r, t_1, F_1]^T and ω_0 = √((c_1 + c_2)/m).

Figure 4: Schematic diagram of the oscillator in Example 2

Table 2: Statistical characteristics of random variables in Example 2

Random variable   Distribution   Mean   Standard deviation
c_1               Normal         1      0.1
c_2               Normal         0.1    0.01
m                 Normal         1      0.05
r                 Normal         0.5    0.05
t_1               Normal         1      0.2
F_1               Normal         1      0.2
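The oscillator performance function of Eq. (27) can be coded directly as below (our own illustrative implementation; the function name is not from the paper):

```python
import math

def g_oscillator(c1, c2, m, r, t1, F1):
    """Nonlinear oscillator performance function of Example 2, Eq. (27).

    omega0 is the natural frequency of the single-degree-of-freedom system;
    failure occurs when the peak response magnitude 2*F1/(m*omega0**2) *
    |sin(omega0*t1/2)| exceeds three times the displacement limit r.
    """
    omega0 = math.sqrt((c1 + c2) / m)
    return 3 * r - abs(2 * F1 / (m * omega0 ** 2) * math.sin(omega0 * t1 / 2))
```

At the mean values of Table 2 the system is safe (g > 0); increasing the force amplitude F1 drives g negative.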
The results of the reliability analysis using different methods are reported in Table 3. While the relative error of the estimated failure probability from FORM is large, both AK-IS and S4IS result in accurate estimators with CoV < 5.0% and ε_r < 1.0%. Furthermore, S4IS requires on average only 53.3 calls to the performance function, about half the number required by AK-IS.

Table 3: Comparison of the results using different methods for Example 2

Method           ε_r      N_call
MCS              --       --
FORM             9.1%     39
AK-IS            0.2%     91.4
S4IS (Stage 1)   17.2%    34.2
S4IS (Stage 2)   0.9%     53.3
Example 3: a multimodal nonlinear function [9]
This example involves a highly nonlinear and multimodal performance function defined by:

\[
g(\theta)=-\frac{(\theta_1^2+4)(\theta_2-1)}{20}+\sin\!\left(\frac{5\theta_1}{2}\right)+2 \qquad (28)
\]

The two independent random variables follow the normal distributions Θ_1 ~ N(1.5,1) and Θ_2 ~ N(2.5,1). As can be seen from Figure 5, the original limit-state function as a whole is approximated accurately by the surrogate, with the multiple modes identified. What is interesting to observe in this figure is that the surrogate is particularly accurate over the important region, where the density of the distribution of the random variables is large. This is a natural result of the adaptive infill strategy, since the second stage of S4IS zooms into the important region and places support points there. Table 4 summarizes the results of the different reliability analysis methods. Both AK-IS and S4IS achieve a small relative error and variance in the failure probability estimation. In this example, S4IS greatly outperforms AK-IS in terms of efficiency, as the optimization procedure in the first stage of AK-IS takes hundreds of iterations to find the most probable point (MPP).

Figure 5: Adaptively selected support points and the limit-state function using S4IS in Example 3. The dotted lines are the contour lines of the joint probability density function.

Table 4: Comparison of the results using different methods for Example 3
Method           ε_r      N_call
MCS              --       --
FORM             277.6%   695
AK-IS            0.2%     985.9
S4IS (Stage 1)   34.5%    46.2
S4IS (Stage 2)   1.7%     71.4
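The multimodal performance function of Eq. (28) is straightforward to implement (an illustrative sketch; the function name is ours):

```python
import math

def g_multimodal(t1, t2):
    """Highly nonlinear, multimodal performance function of Example 3, Eq. (28).

    The sine term creates multiple local modes of the limit state along t1,
    while the polynomial term drives failure for large t2.
    """
    return -(t1 ** 2 + 4) * (t2 - 1) / 20 + math.sin(5 * t1 / 2) + 2
```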
Example 4: the influence of reliability levels [11, 27]
The purpose of this example is to investigate the influence of different reliability levels on the performance of S4IS, particularly when the reliability level is very high (small failure probability). The performance function reads:

\[
g(\theta)=\min\left\{\,c-1-\theta_2+\exp\!\left(-\frac{\theta_1^2}{10}\right)+\left(\frac{\theta_1}{5}\right)^4,\;\frac{c^2}{2}-\theta_1\theta_2\,\right\} \qquad (29)
\]

The two independent random variables in the performance function follow the standard normal distribution. Increasing the constant c raises the reliability level accordingly. To investigate the influence of different reliability levels, three cases (c = 3, c = 4, and c = 5) are discussed. Comparative results are reported in Table 5 to Table 7. S4IS performs consistently well across reliability levels: the coarse estimator P̂_F obtained in Stage 1 is improved significantly in Stage 2. In this example, raising the reliability level from P̂_F = 3.470 × 10⁻³ (c = 3) to P̂_F = 9.485 × 10⁻⁷ (c = 5) adds only a little computational effort (from N_call = 72.8 to N_call = 118.6) to achieve high accuracy. Again, FORM and AK-IS tend to be misled by the single MPP they find, resulting in inaccurate estimators.

Table 5: Comparison of the results using different methods for Example 4 (c = 3)
Method           ε_r      N_call
MCS              --       --
FORM             61.1%    7
AK-IS            57.9%    97.6
S4IS (Stage 1)   13.6%    44.9
S4IS (Stage 2)   1.8%     72.8
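The level-dependent performance function of Eq. (29) can be implemented as follows (our own illustrative sketch; the function name and the default value of c are ours). Increasing c pushes the limit state away from the origin of the standard normal space and thus lowers the failure probability.

```python
import math

def g_levels(t1, t2, c=3):
    """Series performance function of Example 4, Eq. (29).

    g1 governs failure through large t2; g2 through a large product t1*t2.
    The system fails when either component performance function is <= 0.
    """
    g1 = c - 1 - t2 + math.exp(-t1 ** 2 / 10) + (t1 / 5) ** 4
    g2 = c ** 2 / 2 - t1 * t2
    return min(g1, g2)
```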
Table 6: Comparison of the results using different methods for Example 4 (c = 4)

Method           ε_r      N_call
MCS              --       --
FORM             65.5%    7
AK-IS            50.8%    110.3
S4IS (Stage 1)   1.2%     54.4
S4IS (Stage 2)   0.6%     83.2
Table 7: Comparison of the results using different methods for Example 4 (c = 5)

Method           ε_r      N_call
MCS              --       --
FORM             69.8%    7
AK-IS            76.0%    92.4
S4IS (Stage 1)   36.1%    63.4
S4IS (Stage 2)   4.7%     118.6
Example 5: the influence of dimensionality [11, 28]
The last example is characterized by an analytical performance function in which the number of random variables n can be changed without altering the reliability level much. The performance function reads as follows:

\[
g(\theta)=\left(n+6\sqrt{n}\right)-\sum_{i=1}^{n}\theta_i \qquad (30)
\]

This example involves n independent lognormal random variables with mean 1 and standard deviation 2. Three cases (n = 2, n = 10, and n = 50) are investigated using the different methods. The results are summarized in Table 8 to Table 10. In S4IS, for the moderately high dimensional cases (n = 10 and n = 50), the combination of the search for multiple MPPs and FORM [1] is adopted in Stage 1, as discussed in Section 3.1.4. On this example, the total required number of evaluations N_call in S4IS increases slowly with the number of random variables. For n = 50, fewer than two hundred evaluations are required, which is generally affordable in the setting of simulating real-world systems. The results using the other reliability analysis methods are compared with S4IS. For FORM, although the performance function is linear in the original space, it becomes nonlinear when the original random variables are transformed into standard normal ones. As can be seen, FORM fails when the dimension is high because of the nonlinearity of the limit-state function in the U-space. AK-IS produces accurate estimators of the failure probability for this example even when the dimension is high, but it requires many more calls to the performance function than S4IS (for n = 50, N_call = 1845.2 in AK-IS). This is not surprising, as the stopping criterion of AK-IS relies on the convergence of the minimum U-function value over all IS samples, while S4IS checks the convergence of the estimator directly.

Table 8: Comparison of the results using different methods for Example 5 (n = 2)
Method           ε_r      N_call
MCS              --       --
FORM             22.0%    20
AK-IS            0.04%    59.0
S4IS (Stage 1)   0.2%     16.4
S4IS (Stage 2)   0.1%     23.9
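The dimension-scalable performance function of Eq. (30) can be written generically for any n (an illustrative sketch; the function name is ours):

```python
import math

def g_highdim(theta):
    """High-dimensional performance function of Example 5, Eq. (30).

    theta is a sequence of n lognormal parameter values; the threshold
    n + 6*sqrt(n) grows with the dimension so that the reliability level
    stays roughly constant as n changes.
    """
    n = len(theta)
    return (n + 6 * math.sqrt(n)) - sum(theta)
```

Since the sum of the lognormal variables has mean n and standard deviation 2·√n, the threshold sits three standard deviations above the mean for every n, which is what keeps the failure probability of comparable order across the three cases.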
Table 9: Comparison of the results using different methods for Example 5 (n = 10)

Method           ε_r      N_call
MCS              --       --
FORM             63.4%    35
AK-IS            1.2%     678.2
S4IS (Stage 1)   44.5%    38
S4IS (Stage 2)   0.2%     48.6
Table 10: Comparison of the results using different methods for Example 5 (n = 50)

Method           ε_r      N_call
MCS              --       --
FORM             92.0%    --
AK-IS            1.6%     1845.2
S4IS (Stage 1)   88.0%    --
S4IS (Stage 2)   1.0%     --