[PDF] Measuring and estimating interaction between exposures on dichotomous outcome of a population

Abstract

In observational studies for the interaction between exposures on dichotomous outcome of a population, one usually uses one parameter of a regression model to describe the interaction, leading to one measure of the interaction. In this article, we use the conditional risk of outcome given exposures and covariates to describe the interaction and obtain five different measures for the interaction in observational studies, i.e. difference between the marginal risk differences, ratio of the marginal risk ratios, ratio of the marginal odds ratios, ratio of the conditional risk ratios, and ratio of the conditional odds ratios. By using only one regression model for the conditional risk of outcome given exposures and covariates, we obtain the maximum-likelihood estimates of all these measures. By generating approximate distributions of the maximum-likelihood estimates of these measures, we obtain interval estimates of these measures. The method is presented by studying the interaction between a therapy and the environment on eradication of Helicobacter pylori among Vietnamese children.

Full PDF

11 Measuring and estimating interaction between exposures on dichotomous outcome of a population

Xiaoqin Wang , Weimin Ye and Li Yin Department of Electronics, Mathematics and Natural Sciences, University of Gävle, 801 76, Gävle, Sweden Department of Medical Epidemiology and Biostatistics, Karolinska Institute, Box 281, SE 171 77, Stockholm, Sweden * Corresponding author. E-mail: [email protected], Phone: +46-8-52486187, Fax: +46-8- 311101 Summary:

Helicobacter pylori among Vietnamese children.

Key words:

Interaction; Interaction measure; Point estimate; Interval estimate; Regression model Subject classification codes: 62F25, 62E17, 62F10

1. Introduction

The interaction between exposures on an outcome refers to the situation where the effect of one exposure on the outcome depends on the assignment of another exposure [21, 22, 24]. Suppose a hypothetical randomized trial in which one could assign two exposures z and z and wished to investigate the effect of two exposures z and z on an outcome y of certain population, where the variables z , z and y are all dichotomous, i.e. 𝑧 = 0, 1 , 𝑧 = 0, 1 and 𝑦 = 0, 1 . In the randomized trial, covariates are essentially unassociated with exposures ( z , z ) and thus are not confounders. In this case, one may use the marginal risk pr( 𝑦 = 1 | 𝑧 , 𝑧 ) to measure the effect of z and then the interaction between z and z . Here the marginal risk pr( 𝑦 = 1 | 𝑧 , 𝑧 ) is marginal with respect to covariates and thus conditional on exposures ( z , z ) only. One may measure the effect of 𝑧 by the marginal risk difference MRD( 𝑧 ) = pr( 𝑦 = 1 | 𝑧 = 1, 𝑧 ) − pr( 𝑦 = 1 | 𝑧 = 0, 𝑧 ) and the interaction by difference between the marginal risk differences DMRD = MRD( 𝑧 =1) − MRD( 𝑧 =0). One may also measure the effect of 𝑧 by the marginal risk ratio MRR( 𝑧 ) = pr( 𝑦 = 1 | 𝑧 = 1, 𝑧 )/ pr( 𝑦 = 1 | 𝑧 = 0, 𝑧 ) and the interaction by ratio of the marginal risk ratios RMRR = MRR( 𝑧 =1) / MRR( 𝑧 =0). One may also measure the effect of 𝑧 by the marginal odds ratio MOR( 𝑧 ) = pr( 𝑦 = 1 | 𝑧 = 1, 𝑧 ) {1 − pr( 𝑦 = 1 | 𝑧 = 1, 𝑧 )} ⁄ pr( 𝑦 = 1 | 𝑧 = 0, 𝑧 ) {1 − pr( 𝑦 = 1 | 𝑧 = 0, 𝑧 )} ⁄ and the interaction by ratio of the conditional odds ratios RMOR = MOR( 𝑧 =1) / MOR( 𝑧 =0). Oftentimes, DMRD is called the biological interaction because its expression in terms of probability differences has a close relationship with some well-known classification of biological mechanisms [7, 21]. However, the other two measures RMRR and RMOR of the interaction are also commonly used, have causal interpretation in the framework of Rubin causal model [19, 20, 23], and reflect different aspects of the underlying the interaction [24]. Now suppose an observational study in which one investigates the effect of the exposure ( z , z ) on the outcome y of the population above. In the observational study, covariates may be associated with both the outcome y and the exposure ( z , z ) and therefore are confounders. For illustrative purpose, we consider the case of one cofounder x . To adjust for x in estimation of the effect of ( z , z ), one needs to model the conditional risk pr( 𝑦 = 1 | 𝑧 , 𝑧 , 𝑥 )} given not only ( z , z ) but also x . When sample size is small, one often obtains a simple model for pr( 𝑦 = 1 | 𝑧 , 𝑧 , 𝑥 ), for instance, g{pr( 𝑦 = 1 | 𝑧 , 𝑧 , 𝑥 )} = α + 𝛽 𝑧 + 𝛽 𝑧 + 𝛽 ( 𝑧 ∗ 𝑧 ) + 𝜃 𝑥 where g (. ) is a link function. The model contains one term for the product 𝑧 ∗ 𝑧 , but no terms for the products between x and ( z , z ). The parameter 𝛽 is called the statistical interaction which describes deviation of this model from the main effect model containing no term for any product between variables [7, 21, 22]. If there are no unmeasured confounders than the confounder x , then 𝛽 or a function of 𝛽 has causal interpretation and measures the interaction on any stratum 𝑥 = 0, 1 and thus on the population. With the identity link function g ( 𝑝 ) = 𝑝 , the parameter 𝛽 is difference between the conditional risk differences (DCRD), which is also equal to difference between the marginal risk differences (DMRD) because risk differences are collapsible. With the log link function g ( 𝑝 ) =log( 𝑝 ) , the exponential exp ( 𝛽 ) is ratio of the conditional risk ratios (RCRR), which is also equal to ratio of the marginal risk ratios ( RMRR). With the logistic link function g ( 𝑝 ) = logit ( 𝑝 ) , the exponential exp ( 𝛽 ) is ratio of the conditional odds ratios (RCOR), but which is not equal to ratio of the marginal odds ratios (RMOR) due to non-collapsibility of odds ratios. See, for instance, [5, 10, 12, 14] for collapsibility of measures of the exposure effect. In the method above, one first has a model and then uses a parameter of the model to measure the interaction. As a consequence, one uses one model to estimate only one measure of the interaction. When sample size is large, the model inevitably becomes complex, for instance, g{pr( 𝑦 = 1| 𝑧 , 𝑧 , 𝑥 )} = α + 𝛽 𝑧 + 𝛽 𝑧 + 𝛽 ( 𝑧 ∗ 𝑧 ) + 𝜃 𝑥 + 𝜃 ( 𝑧 ∗ 𝑥 ) + 𝜃 ( 𝑧 ∗ 𝑥 ) which contains, in addition to one term for the product 𝑧 ∗ 𝑧 , two terms for the products 𝑧 ∗ 𝑥 and 𝑧 ∗ 𝑥 . The parameter 𝛽 or a function of 𝛽 still measures the interaction. With a linear function, 𝛽 is DCRD which is equal to DMRD. With a log link function, exp ( 𝛽 ) is RCRR but no longer equal to RMRR. With a logistic link function, exp ( 𝛽 ) is RCOR but still not equal to RMOR. In practical researches, one usually omits the last two terms in the model despite 𝜃 ≠ and 𝜃 ≠ , leading to a mis-specified model for pr( 𝑦 = 1| 𝑧 , 𝑧 , 𝑥 ). Then the parameter 𝛽 does not measure the interaction. Furthermore, the model may even contain a triple-product term 𝜃 ( 𝑧 ∗ 𝑧 ∗ 𝑥 ) . In this case, the parameter 𝛽 measures the interaction on stratum 𝑥 = 0 but not on the population. If one omits this triple-product term despite 𝜃 ≠ , one obtains a mis-specified model for pr( 𝑦 = 1| 𝑧 , 𝑧 , 𝑥 ), but then, 𝛽 does not measure the interaction on stratum 𝑥 = 0 nor on the population. With these mis-specified models, it would not be possible to have consistent estimate of any measure of the interaction. In this article, we measure and estimate the interaction between exposures in two separate steps. In the measuring step, we express all the five measures of the interaction, DMRD, RMRR, RMOR, RCRR and RCOR, in terms of the conditional risk pr ( 𝑦 = 1| 𝑧 , 𝑧 , 𝒙 ) , rather than model parameters, where 𝒙 is a vector of covariates. In the estimating step, we estimate DMRD, RMRR, RMOR, RCRR and RCOR through pr( 𝑦 = 1| 𝑧 , 𝑧 , 𝒙 ), which can be described by any model. As a result, we can use one model in one study to estimate all these measures. Our method of measuring and estimating the interaction is extension of a well-known method of measuring and estimating the exposure effect. In the latter method, one expresses a measure of the exposure effect in terms of pr ( 𝑦 = 1| 𝑧 , 𝑧 , 𝒙 ) and then estimates the measure through pr ( 𝑦 =1| 𝑧 , 𝑧 , 𝒙 ) [1,6, 8, 13]. The difficulty with this method is interval estimation for the measure of the exposure effect. The common method for the interval estimation is the parametric or non-parametric bootstrap method, but it is highly difficult to correct the finite-sample bias arising from the bootstrap sampling with dichotomous outcome [3, 4, 9]. To avoid complexity of the bootstrap method for interval estimation of a measure of the interaction, we use distribution of the ML estimates of the model parameters to obtain distribution of the ML estimate of the measure and then the corresponding confidence interval. We are going to present the method through an observational study embedded in a randomized trial, in which we investigated the interaction between a therapy and the environment on eradication of Helicobacter pylori among Vietnamese children. The interaction between a therapy and the environment on eradication of

Helicobacter pylori among Vietnamese children 2.1 Medical background and the data

In a randomized trial [18], researchers studied two triple therapies – (lansoprazole, amoxicillin, metronidazole) and (lansoprazole, amoxicillin, clarithromycin) – for their abilities to eradicate

Helicobacter pylori among Vietnamese children. From several children hospitals in Hanoi, restricting body weight to a range between 13 kg and 45 kg, a sample was collected between May 2005 and January 2006. In an observational substudy embedded in this randomized trial, they focused on one treatment arm of the triple therapy (lansoprazole, amoxicillin, metronidazole). They analyzed the effect of high versus low doses of the therapy and the effect of the environment encoded by geographic areas in which these children lived. In this article, we study how the environment interacted with the therapy on eradication of

Helicobacter pylori among Vietnamese children. The treatment arm comprised 109 patients. The therapy eradicated

Helicobacter pylori through systemic circulation, so the researchers assigned the therapy to the children according to their body weights. According to the pediatric procedure, children with 13 kg < body weight < 23 kg received the therapy once daily and those with 23 kg ≤ weight < 45 kg twice daily. Of medical relevance was dose in unit body weight. Following the earlier report on the same data [18], we categorized the children into receiving low versus high doses of the therapy by taking the middle point of each weight category as the threshold. Hence children with 13 kg < body weight ≤

18 kg in the first weight category and 23 kg ≤ body weight < 34 kg in the second weight category were considered as receiving the high dose ( z = 1) of the therapy whereas those with 18 kg < body weight < 23 kg in the first weight category and 34 kg ≤ body weight < 45 kg in the second weight category as receiving the low dose ( z = 0). Let z = z = 𝒛 = ( z , z ) = (therapy, environment). The outcome was successful ( y =

1) versus unsuccessful ( y =

0) eradication of

Helicobacter pylori . In this article, we estimate the interaction between exposures z and z on the outcome y = z and thus was not an independent covariate. The covariate age ( x ) was associated with body weight and thus with the assignment of exposure z . Furthermore, age was known to have an effect on the outcome y and thus was a confounding covariate. In addition to age, other possible confounding covariates were recorded, which were gender ( x ) and antibiotic resistance to metronidazole ( x ). Age was categorized into younger ( x = versus older ( x =

0) than 9 years. Let x = 1 indicate female and x = 0 male of the gender. Antibiotic resistance to metronidazole was categorized into non-resistant ( x = versus resistant ( x = 𝒙 = ( x , x , x ) be the set of all the recorded covariates. The data of this study is given in Table 1. In this article, we aim at the interaction of causal interpretation, that is, we compare the potential outcomes y ( 𝒛 ) of the patients in the population under different exposures 𝒛 = ( z , z ) = (0, 0), (0, 1), (1, 0), (1, 1). Because it is only possible to observe potential outcome of a patient under one value of the exposure 𝒛 , the following supposition is needed to allow for estimation of the interaction [19, 20, 23]. Supposition

The assignment of exposures 𝒛 is strongly ignorable given the covariates 𝒙 and the probability pr( 𝒛 | 𝒙 ) of 𝒛 given 𝒙 is larger than zero . Therefore the risk pr { 𝑦 ( 𝒛 )| 𝒙 } of the potential outcome y ( 𝒛 ) in stratum 𝒙 under exposure 𝒛 is equal to the risk pr( y | 𝒛 , 𝒙 ) of the observable outcome y in stratum ( 𝒙 , 𝒛 ), that is, pr { 𝑦 ( 𝒛 )| 𝒙 } = pr( 𝑦 | 𝒛 , 𝒙 ) . The supposition is also called the assumption of no unmeasured confounding covariates given the covariates x [11, 21]; see [24] for examples of the supposition and their causal directed acyclic graphs. Measuring interaction on population 3.1 Conditional measures of interaction on population

The conditional odds CO( 𝒛 ; 𝒙 ) of the potential outcome y ( 𝒛 ) = 𝒙 = ( x , x , x ) under exposure 𝒛 = ( z , z ) is CO( 𝒛 ; 𝒙 ) = pr { 𝑦 ( 𝒛 ) = 1| 𝒙 } pr { 𝑦 ( 𝒛 ) = 0| 𝒙 } , (1) where pr { 𝑦 ( 𝒛 ) = 0| 𝒙 } = − pr { 𝑦 ( 𝒛 ) = 1| 𝒙 } . The conditional odds ratio is

COR( 𝑧 ; 𝒙 ) = CO( 𝑧 = 1, 𝑧 ; 𝒙 ) CO( 𝑧 = 0, 𝑧 ; 𝒙 ), (2) which measures the effect of the therapy z on stratum 𝒙 in which each patient were assigned environment z Then the ratio of the conditional odds ratios is RCOR ( 𝒙 ) = COR ( 𝑧 = 1; 𝒙 ) COR ( 𝑧 = 0; 𝒙 ), (3) which measures the interaction between the exposures z and z on stratum 𝒙 . Taking the average of RCOR( x ) over 𝒙 of the population, we obtain the ratio of the conditional odds ratio on the population RCOR = � pr ( 𝒙 ) 𝒙 RCOR ( 𝒙 ), (4) which measures the interaction on the population on the conditional odds ratio scale. In particular, if RCOR( 𝒙 ) takes the same value for all 𝒙 values, then RCOR( 𝒙 ) = RCOR. Under Supposition in section 2.2, we have pr { 𝑦 ( 𝒛 )| 𝒙 } = pr( 𝑦 | 𝒛 , 𝒙 ) . Inserting this equality into (1) and then using (2)-(4), we obtain RCOR = � pr ( 𝒙 ) 𝒙 � pr ( 𝑦 = 1 | 𝑧 = 1, 𝑧 = 1, 𝒙 ) pr ( 𝑦 = 0 | 𝑧 = 0, 𝑧 = 1, 𝒙 ) pr ( 𝑦 = 0 | 𝑧 = 1, 𝑧 = 1, 𝒙 ) pr ( 𝑦 = 1 | 𝑧 = 0, 𝑧 = 1, 𝒙 ) �� pr ( 𝑦 = 1 | 𝑧 = 1, 𝑧 = 0, 𝒙 ) pr ( 𝑦 = 0 | 𝑧 = 0, 𝑧 = 0, 𝒙 ) pr ( 𝑦 = 0 | 𝑧 = 1, 𝑧 = 0, 𝒙 ) pr ( 𝑦 = 1 | 𝑧 = 0, 𝑧 = 0, 𝒙 ) � . (5) The conditional risk ratio is

CRR( 𝑧 ; 𝒙 ) = pr { 𝑦 ( 𝑧 = 1, 𝑧 ) = 1| 𝒙 } pr { 𝑦 ( 𝑧 = 0, 𝑧 ) = 1| 𝒙 }, (6) which measures the effect of z on stratum 𝒙 in which each patient were assigned z . The ratio of the conditional risk ratios is RCRR ( 𝒙 ) = CRR ( 𝑧 = 1; 𝒙 ) CRR ( 𝑧 = 0; 𝒙 ), (7) which measures the interaction on stratum 𝒙 . Taking the average of RCRR( 𝒙 ) over 𝒙 of the population, we obtain the ratio of the conditional risk ratios on the population RCRR = � pr( 𝒙 ) 𝒙 RCRR( 𝒙 ), (8) which measures the interaction on the population on the conditional risk ratio scale. In particular, if RCRR( 𝒙 ) takes the same value for all 𝒙 values, then RCRR( 𝒙 ) = RCRR. Inserting the equality pr { 𝑦 ( 𝒛 )| 𝒙 } = pr( 𝑦 | 𝒛 , 𝒙 ) into (6) and then using (6)-(8), we obtain RCRR = � pr( 𝒙 ) 𝒙 � pr ( 𝑦 = 1 | 𝑧 = 1, 𝑧 = 1, 𝒙 ) pr ( 𝑦 = 1 | 𝑧 = 0, 𝑧 = 1, 𝒙 ) �� pr ( 𝑦 = 1 | 𝑧 = 1, 𝑧 = 0, 𝒙 ) pr ( 𝑦 = 1 | 𝑧 = 0, 𝑧 = 0, 𝒙 ) � . (9) We may also use the conditional risk difference CRD ( 𝑧 ; 𝒙 ) = pr { 𝑦 ( 𝑧 = 1, 𝑧 ) = 1| 𝒙 } − pr { 𝑦 ( 𝑧 = 0, 𝑧 ) = 1| 𝒙 } (10) to measure the effect of z on stratum x in which each patient were assigned 𝑧 , and the difference between the conditional risk differences DCRD ( 𝒙 ) = CRD ( 𝑧 = 1; 𝒙 ) − CRD ( 𝑧 = 0; 𝒙 ) (11) to measure the interaction on stratum 𝒙 , and the average of DCRD( 𝒙 ) over 𝒙 of the population DCRD = � pr( x )DCRD ( 𝒙 ) 𝒙 (12) to measure the interaction on the population on the conditional risk difference scale. In particular, if DCRD( 𝒙 ) takes the same value for all 𝒙 values, then DCRD( 𝒙 ) = DCRD. Inserting the equality pr { 𝑦 ( 𝒛 )| 𝒙 } = pr( 𝑦 | 𝒛 , 𝒙 ) into (10) and then using (10)-(12), we obtain DCRD = � pr( 𝒙 ) 𝒙 [{ pr ( 𝑦 = 1 | 𝑧 = 1, 𝑧 = 1, 𝒙 ) − pr ( 𝑦 = 1 | 𝑧 = 0, 𝑧 = 1, 𝒙 )} (13) − { pr ( 𝑦 = 1 | 𝑧 = 1, 𝑧 = 0, 𝒙 ) − pr ( 𝑦 = 1 | 𝑧 = 0, 𝑧 = 0, 𝒙 )}]. Marginal measures of interaction on population

The population risk is the risk pr { 𝑦 ( 𝒛 ) = 1} of the potential outcome y ( 𝒛 ) = 𝒛 = ( z , z ). We rewrite the population risk pr { 𝑦 ( 𝒛 ) = 1} by pr { 𝑦 ( 𝒛 ) = 1} = � pr { 𝑦 ( 𝒛 ) = 1| 𝒙 } 𝒙 pr ( 𝒙 ). Under Supposition in section 2.2, we have pr { 𝑦 ( 𝒛 ) = 1| 𝒙 } = pr( 𝑦 = 1 | 𝒛 , 𝒙 ) . Therefore we obtain pr { 𝑦 ( 𝒛 ) = 1} = � pr( 𝑦 = 1 | 𝒛 , 𝒙 ) 𝒙 pr ( 𝒙 ). Let PR ( 𝑦 = 1 | 𝒛 ) = ∑ pr( 𝑦 = 1 | 𝒛 , 𝒙 ) 𝒙 pr ( 𝒙 ) , which is called the population-adjusted risk [6, 8, 13]. Then under Supposition we have that the population-adjusted risk is same as the population risk, namely PR ( 𝑦 = 1 | 𝒛 ) = pr { 𝑦 ( 𝒛 ) = 1} = � pr( 𝑦 = 1 | 𝒛 , 𝒙 ) 𝒙 pr ( 𝒙 ) . (14) The marginal odds MO( 𝒛 ) of the potential outcome y ( 𝒛 ) = 𝒛 is MO( 𝒛 ) = pr { 𝑦 ( 𝒛 ) = 1} pr { 𝑦 ( 𝒛 ) = 0} , (15) where pr { 𝑦 ( 𝒛 ) = 0} = 1 − pr { 𝑦 ( 𝒛 ) = 1} . The marginal odds ratio is MOR( 𝑧 ) = MO( 𝑧 = 1, 𝑧 ) MO( 𝑧 = 0, 𝑧 ) , (16) which measures the effect of the therapy z on the population in which all patients were assigned environment z . The ratio of marginal odds ratios is RMOR = MOR ( 𝑧 = 1) MOR ( 𝑧 = 0) , (17) which measures the interaction between the exposures z and z on the population on the marginal odds ratio scale. Using formulas (14)-(17), we obtain RMOR = � PR ( 𝑦 = 1 | 𝑧 = 1, 𝑧 = 1) PR ( 𝑦 = 0 | 𝑧 = 0, 𝑧 = 1) PR ( 𝑦 = 0 | 𝑧 = 1, 𝑧 = 1) PR ( 𝑦 = 1 | 𝑧 = 0, 𝑧 = 1) �� PR ( 𝑦 = 1 | 𝑧 = 1, 𝑧 = 0) PR ( 𝑦 = 0 | 𝑧 = 0, 𝑧 = 0) PR ( 𝑦 = 0 | 𝑧 = 1, 𝑧 = 0) PR ( 𝑦 = 1 | 𝑧 = 0, 𝑧 = 0) � . (18) The marginal risk ratio is MRR( 𝑧 ) = pr { 𝑦 ( 𝑧 = 1, 𝑧 ) = 1} pr { 𝑦 ( 𝑧 = 0, 𝑧 ) = 1} , (19) which measures the effect of z on the population in which each patient were assigned z . The ratio of marginal risk ratios is RMRR = MRR ( 𝑧 = 1) MRR ( 𝑧 = 0) , (20) which measures the interaction on the population on the marginal risk ratio scale. Using (14), (19) and (20), we obtain RMRR = � PR ( 𝑦 = 1 | 𝑧 = 1, 𝑧 = 1) PR ( 𝑦 = 1 | 𝑧 = 0, 𝑧 = 1) �� PR ( 𝑦 = 1 | 𝑧 = 1, 𝑧 = 0) PR ( 𝑦 = 1 | 𝑧 = 0, 𝑧 = 0) � . (21) The marginal risk difference is MRD( 𝑧 ) = pr { 𝑦 ( 𝑧 = 1, 𝑧 ) = 1} − pr { 𝑦 ( 𝑧 = 0, 𝑧 ) = 1} , (22) which measures the effect of 𝑧 on the population in which each patient were assigned 𝑧 . The difference of the marginal risk differences is DMRD = MRD( 𝑧 = 1) − MRD( 𝑧 = 0), (23) which measures the interaction on the population on the marginal risk difference scale. Using formulas (14), (22) and (23), we obtain DMRD = PR ( 𝑦 = 1 | 𝑧 = 1, 𝑧 = 1) − PR ( 𝑦 = 1 | 𝑧 = 0, 𝑧 = 1) (24) − PR ( 𝑦 = 1 | 𝑧 = 1, 𝑧 = 0) + PR ( 𝑦 = 1 | 𝑧 = 0, 𝑧 = 0). In randomized trial, the covariates 𝒙 are essentially unassociated with the exposure 𝒛 = ( z , z ), i.e. pr( x | 𝒛 ) = pr( x ). As a result, formula (14) becomes pr { 𝑦 ( 𝒛 ) = 1} = PR ( 𝑦 = 1 | 𝒛 ) = pr ( 𝑦 = 1 | 𝒛 ), where pr( y = | 𝒛 ) is the marginal risk of the outcome y = 𝒛 . In this case, the marginal measures of the interaction can be obtained from the marginal risk pr( y = | 𝒛 ) instead of pr( y = | 𝒛 , x ). Therefore the marginal measures we have obtained in this subsection are the same as those we described in the introduction for randomized trial. However, in many situations like the example of this article, randomized trial is not possible, and one can only rely upon observational studies and therefore needs to use the population-adjusted risk PR( 𝑦 = 1 | 𝒛 ) to obtain marginal measures of the interaction on population. From (4), we see that RCOR compares the potential outcomes y ( 𝒛 ) of the patients in the population under different exposures 𝒛 = ( z , z ) = (0, 0), (0, 1), (1, 0), (1, 1) and thus has causal interpretation in the framework of Rubin Causal Model (see, for instance, [19, 20, 23]). Similarly, from (8), (17), (20) and (23), we see that RCRR, RMOR, RMRR and DMRD all have causal interpretation. These measures are symmetric: we obtain the same formulas for them if we switch the order of exposures z and z . We have RCOR ≠ RMOR and RCRR ≠ RMRR because the odds ratio and the risk ratio are not collapsible [5, 10, 12, 14]. We always have DCRD = DMRD because the risk difference is collapsible. Thus we have five different measures of the interaction. These measures reflect different aspects of the same underlying interaction: the null hypotheses, i.e. RCOR = 1, RCRR = 1, RMOR = 1, RMRR = 1 and DMRD = 0, do not imply one another. The conditional measures RCOR and RCRR describe the interaction on population in terms of the conditional risk pr{ y ( 𝒛 ) = | 𝒙 }, emphasizing the interaction on individual covariate-specific strata of the population. The marginal measures RMOR, RMRR and DMRD express interaction on population in terms of the population risk pr { 𝑦 ( 𝒛 ) = 1} , emphasizing the interaction on the population as a whole. According to formulas (5) and (9), the conditional measures RCOR and RCRR can be expressed in terms of pr( y = | 𝒛 , 𝒙 ) and pr( x ). According to (18), (21) and (24), the marginal measures RMOR, RMRR and DMRD can be expressed in terms of PR( y = | 𝒛 ) which in turn can be expressed in terms of pr( y = | 𝒛 , 𝒙 ) and pr( x ) according to (14). Therefore all the five measures can be expressed in terms of pr( y = | 𝒛 , 𝒙 ) and pr( x ), implying that we can estimate these measures by estimating pr( y = | 𝒛 , 𝒙 ) and pr( x ) through observed data. In other words, we can use any model to estimate these measures if the model is correctly specified for pr( y = | 𝒛 , 𝒙 ).

4. Estimating interaction on population 4.1 Regression model

By fitting the data introduced in Section 2 to a logistic model, we obtain the following model for the risk pr ( 𝑦 = 1 | 𝒛 , 𝒙 ) of y = 𝒛 , 𝒙 ) Log � pr ( 𝑦 = 1 | 𝒛 , 𝒙 )1 − pr ( 𝑦 = 1 | 𝒛 , 𝒙 ) � = α + 𝛽 𝑧 + 𝛽 𝑧 + 𝛽 ( 𝑧 ∗ 𝑧 ) + 𝜃 𝑥 + 𝜃 𝑥 + 𝜃 𝑥 + 𝜃 ( 𝑧 ∗ 𝑥 ). (25) In this model, we include, in addition to one term for the product 𝑧 ∗ 𝑧 , another term for the product 𝑧 ∗ 𝑥 , because of the small p-value, 0.10, for the likelihood ratio-based significance test of 𝜃 = 0. Let π = ( α, β , β , β , θ , θ , θ , θ ) be the set of all model parameters. The ML estimate 𝜋� = ( 𝛼� , 𝛽̂ , 𝛽̂ , 𝛽̂ , 𝜃� , 𝜃� , 𝜃� , 𝜃� ) and its approximate covariance matrix Σ� (i.e. the inverse of the observed information) are given in Table 2. Combining model (25) with formula (9), we have that exp( β ) is equal to RCOR. In addition to RCOR, we are also interested in other aspects of the interaction on the population described by RCRR, RMOR, RMRR and DMRD, which are not functions of β . In particular, since the outcome y is common as seen in Table 1, the conditional odds ratios are poor approximations of the conditional risk ratios, so we cannot convert conditional odds ratios into other measures of interaction. If we use a log-linear model log{pr ( 𝑦 = 1 | 𝒛 , 𝒙 )} = α + 𝛽 𝑧 + 𝛽 𝑧 + 𝛽 ( 𝑧 ∗ 𝑧 ) + 𝜃 𝑥 + 𝜃 𝑥 + 𝜃 𝑥 + 𝜃 ( 𝑧 ∗ 𝑥 ) then exp( β ) is equal to RCRR according to (9), but the other measures are not functions of β of this log-linear model. If we use a linear model pr ( 𝑦 = 1 | 𝒛 , 𝒙 ) = α + 𝛽 𝑧 + 𝛽 𝑧 + 𝛽 ( 𝑧 ∗ 𝑧 ) + 𝜃 𝑥 + 𝜃 𝑥 + 𝜃 𝑥 + 𝜃 ( 𝑧 ∗ 𝑥 ) then β is equal to DCRD according to (13), but the other measures are not functions of β of this linear model. Furthermore, the ML estimations of the parameters in the log-linear and linear models do not converge even in the absence of the term for the product 𝑧 ∗ 𝑥 . Robust methods could improve the convergence but need additional assumptions such as the Poisson distribution for the frequency of y = 1 (see, for instance, [2]). Consequently, we are not able to obtain the ML estimates of RCRR and DMRD by these two models without additional assumptions. If the term for the product 𝑧 ∗ 𝑥 in model (25) is omitted despite θ ≠

0, then we obtain a mis-specified model for the conditional risk pr( y = | 𝒛 , x ) Log � pr ( 𝑦 = 1 | 𝒛 , 𝒙 )1 − pr ( 𝑦 = 1 | 𝒛 , 𝒙 ) � = α + 𝛽 𝑧 + 𝛽 𝑧 + 𝛽 ( 𝑧 ∗ 𝑧 ) + 𝜃 𝑥 + 𝜃 𝑥 + 𝜃 𝑥 . (26) This model may lead to biased estimate of pr( y = | 𝒛 , x ) and thus biased estimates of RCOR and RCRR according to formulas (5) and (9). Furthermore, this model may lead to mis-specification of PR( y = | 𝒛 ) according to formula (14) and thus to biased estimates of RMOR, RMRR and DMRD according to formulas (18), (21) and (24). In the rest of this section, we are going to use models (25) and (26) separately to estimate RCOR, RCRR, DMRD, RMOR and RMRR based on the data introduced earlier. First we describe the procedure of obtaining the ML estimates of the conditional measures RCOR and RCRR based on model (25). In model (25) replacing the parameters π = ( α, β , β , β , θ , θ , θ , θ ) by the ML estimates 𝜋� = ( 𝛼� , 𝛽̂ , 𝛽̂ , 𝛽̂ , 𝜃� , 𝜃� , 𝜃� , 𝜃� ) given by Table 2, we obtain the ML estimate pr � ( 𝑦 = 1 | 𝒛 , 𝒙 ) of the conditional risk pr( y = | 𝒛 , 𝒙 ). In formulas (5) and (9) replacing pr( y = | 𝒛 , 𝒙 ) by pr � ( 𝑦 = 1 | 𝒛 , 𝒙 ) and pr( 𝒙 ) by the proportion of 𝒙 in the sample, we obtain the ML estimates RCOR � = 8.85 and RCRR � = 1.60, which are presented in Table 3. To obtain interval estimate of RCOR, we generate approximate distribution of the ML estimate RCOR � . Normal distribution is poor approximation to the distribution of RCOR � , because RCOR ranges from 0 to +∞. We have similar situations for the other measures of the interaction. On the other hand, the model parameters π = ( α, β , β , β , θ , θ , θ , θ ) range from − ∞ to +∞, so normal distribution is good approximation to the distribution of the ML estimate 𝜋� = ( 𝛼� , 𝛽̂ , 𝛽̂ , 𝛽̂ , 𝜃� , 𝜃� , 𝜃� , 𝜃� ) (see, for instance, Lindsey, 1996). Let p be a random variable which follows the normal distribution N ( 𝜋� , Σ� ) , namely, 𝑝 ~ N ( 𝜋� , Σ� ) , where 𝜋� and Σ� in N ( 𝜋� , Σ� ) are given in Table 2. We are going to use this normal distribution to construct approximate distributions of RCOR � and RCRR � in this subsection and those of RMOR � , RMRR � and DMRD � in the next subsection. Now we describe the iteration procedure of generating approximate distribution of RCOR � . First, we draw p from the normal distribution N( 𝜋� , Σ� ) and replace π by p in model (25) to obtain a value of pr � ( 𝑦 = 1 | 𝒛 , 𝒙 ) . Second, we replace pr ( 𝑦 = 1 | 𝒛 , 𝒙 ) by pr � ( 𝑦 = 1 | 𝒛 , 𝒙 ) in (5) to obtain a value of RCOR � . We iterate the procedure, 1000 times in this article, to obtain an approximate distribution of RCOR � . The obtained approximate distribution is used to obtain the confidence interval of RCOR. Similarly, we use the normal distribution 𝑝 ~ N ( 𝜋� , Σ� ) and model (25) and formula (9) to obtain an approximate distribution of RCRR � and then the confidence interval of RCRR. The obtained approximate distributions of RCOR � and RCRR � are shown in Figure 1. The 95 % confidence interval of RCOR is (0.66, 127.04) while that of RCRR is (0.74, 4.60). They are also presented in Table 3, together with 50 % confidence intervals of RCOR and RCRR. Replacing model (25) by model (26) in the above procedure, we obtain the ML estimates and the confidence intervals of RCOR and RCRR based on model (26). The results are presented in Table 3. First we describe the procedure of obtaining the ML estimates of the marginal measures RMOR, RMRR and DMRD based on model (25). In model (25) replacing the parameters π = ( α, β , β , β , θ , θ , θ , θ ) by the ML estimates 𝜋� = ( 𝛼� , 𝛽̂ , 𝛽̂ , 𝛽̂ , 𝜃� , 𝜃� , 𝜃� , 𝜃� ) given in Table 2, we obtain the ML estimate pr � ( 𝑦 = 1 | 𝒛 , 𝒙 ) of the conditional risk pr( y = | 𝒛 , 𝒙 ). In formula (14) replacing pr( y = | 𝒛 , 𝒙 ) by pr � ( 𝑦 = 1 | 𝒛 , 𝒙 ) and pr( 𝒙 ) by the proportion of 𝒙 in the sample, we obtain the ML estimate PR � ( 𝑦 = 1 | 𝒛 ) of the population-adjusted risk PR( y = | 𝒛 ). In formulas (18), (21) and (24) replacing PR( y = | 𝒛 ) by PR � ( 𝑦 = 1 | 𝒛 ) , we obtain the ML estimates RMOR � = 8.62, RMRR � = 1.58 and RMRD � = 0.34. These estimates are also presented in Table 3. Now we describe the iteration procedure of generating approximate distributions of RMOR � . First, we draw p from N( 𝜋� , Σ� ) and replace π by p in model (25) to get pr � ( 𝑦 = 1 | 𝒛 , 𝒙 ) . Second, we replace pr ( 𝑦 = 1 | 𝒛 , 𝒙 ) by pr � ( 𝑦 = 1 | 𝒛 , 𝒙 ) in (14) to get PR � ( 𝑦 = 1 | 𝒛 ) . Finally, we replace PR ( 𝑦 = 1 | 𝒛 ) by PR � ( 𝑦 = 1 | 𝒛 ) in formula (18) to get RMOR � . We iterate the procedure, 1000 times in this article, to get an approximate distribution of RMOR � . The approximate distribution is used to get the confidence interval of RMOR. Similarly, we use the normal distribution 𝑝 ~ N ( 𝜋� , Σ� ) and formulas (25), (14) and (21) to get an approximate distribution of RMRR � and then the confidence interval of RMRR, and the normal distribution and formulas (25), (14) and (24) to get an approximate distribution of DMRD � and then the confidence interval of DMRD. The obtained approximate distributions of RMOR � , RMRR � and DMRD � are shown in Figure 1. The 95 % confidence interval of RMOR is (0.72, 88.51), that of RMRR is (0.77, 3.40), and that of DMRD is ( − In practical researches, one usually uses model (26), which omits the term for the product 𝑧 ∗ 𝑥 , to measure and estimate the interaction as described in the introduction. In this case, one can only have RCOR. Because the outcome y = 1 is common in this study, one cannot convert RCOR into RCRR or DMRD. Despite the common outcome y = 1 and the term for the product 𝑧 ∗ 𝑥 , we have used one logistic model (25) to estimate all the measures of interaction, RCOR, RCRR, RMOR, RMRR and DMRD. Figure 1 shows the approximate distributions of the ML estimates RCOR � , RCRR � , RMOR � , RMRR � and DMRD � based on model (25). Table 3 summarizes the ML estimates and the 50 % and 95 % confidence intervals for the five measures RCOR, RCRR, RMOR, RMRR and DMRD based on model (25) in comparison to the mis-specified model (26). By comparing the second column with the third column in Table 3, we see that model (26) causes considerable biases to all the five measures. These biases might be exacerbated if more terms for complex products between exposures and covariates were needed to model pr( y = | 𝒛 , 𝒙 ) as typical of a large sample. Our method can avoid these biases by being able to use complex models. Model (25) fits the data best and thus provides the most efficient point and interval estimates for these measures of interaction. A clear advantage of using one model in one study to estimate different measures of interaction is that one can avoid different assumptions behind different models, which may complicate interpretations of these measures. Despite somewhat wide confidence intervals, all the five measures of interaction based on model (25) suggest that the therapy would have higher efficacy of eradicating Helicobacter pylori if children lived in the rural environment than if children lived in the urban environment. This finding suggests that urbanization may have contributed to the low efficacy of the therapies containing antibiotics among children in developing countries. To our knowledge, little is seen in the literature about how the environment interacts with therapies on eradication of the bacterium [16, 17].

5. Discussion and conclusions

An underlying interaction between exposures on a certain population has different aspects described by different measures. Depending on their specific research subjects, researchers need to know one or several aspects of the interaction. In this article, we have used one model in one study to obtain point and interval estimates for five different measures. The outcome can be common. The model may contain terms for the products between exposures and covariates. The estimation is based on maximum likelihood. The method can be implemented by using any software that generates normal distribution. In the literature, two methods are available to obtain confidence intervals of various measures of exposure effect, i.e. the normal approximation method and the bootstrap method [1, 6, 8, 13], but which seem less used to obtain confidence intervals for measures of interaction. The normal approximation method is to derive approximate variance of the ML estimate of a measure of exposure effect by the delta method and then use the variance to obtain normal approximation confidence interval of the measure of exposure effect. To obtain normal approximation confidence interval of a measure of interaction, one needs to derive approximate variance of the ML estimate of the measure of interaction, but the derivation is tedious particularly for complex models. The bootstrap method is to generate bootstrap samples by parametric or non-parametric bootstrap method and then use the bootstrap samples to obtain the bootstrap distribution of a measure of exposure effect and then the bootstrap confidence interval of the measure of exposure effect. However, it is highly difficult to correct the finite-sample bias arising from the bootstrap sampling with dichotomous outcome [3, 4, 9], and the difficulty would be exacerbated for a measure of interaction. In our method, we generate approximate distribution of the ML estimate of a measure of interaction and use the distribution to obtain the confidence interval for the measure of interaction. This method provides a simple but reliable approach to interval estimation of the measure of interaction. Although it is beyond the scope of this article, it is of interest to derive the corrected bootstrap confidence interval and the normal approximation confidence interval and compare them with the maximum-likelihood-based confidence interval obtained in this article.

References [1] P.C. Austin,

Absolute risk reductions, relative risks, relative risk reductions, and numbers needed to treat can be obtained from a logistic regression model , Journal of Clinical Epidemiology 63 (2010), pp. 2--6. [2] G.S. Bieler, G.G. Brown, R.L. Williams and D.J. Brogan,

Estimating Model-Adjusted Risks, Risk Differences, and Risk Ratios From Complex Survey Data , American Journal of Epidemiology 171 (2010), pp. 618--623. [3] J. Carpenter and J. Bithell,

Bootstrap confidence intervals: when, which, what? A practical guide for medical statisticians , Statistics in Medicine 19 (2000), pp. 1141--1164. [4] A.C. Davison and D.V. Hinkley,

Bootstrap Methods and their Application , Cambridge University Press, Cambridge, 1997 [5] M.H. Gail, S. Wieand and S. Piantadosi,

Biased estimates of treatment effect in randomized experiments with nonlinear regressions and omitted covariates , Biometrika

71 (1984), pp. 431-- 444. [6] B.I. Graubard and E.L. Korn,

Predictive Margins with Survey Data , Biometrics 55 (1999), pp. 652--659. [7] S. Greenland,

Interactions in Epidemiology: Relevance, Identification and Estimation,

Epidemiology 20 (2009), pp. 14–17. [8] S. Greenland,

Model-based Estimation of Relative Risk and Other Epidemiologic measures in Studies of Common Outcomes and Case-Control Studies , American Journal of Epidemiology

160 (2004a), pp. 301--305. [9] S. Greenland,

Interval estimation by simulation as an alternative to and extension of confidence intervals , International Journal of Epidemiology

33 (2004b), pp. 1389–1397. [10] S. Greenland, J.M. Robins and J. Pearl,

Confounding and collapsibility in causal inference , Statistical Science 14 (1999), pp. 29–46. [11] S. Greenland and J.M. Robins,

Identifiability, exchangeability, and epidemiological confounding , International Journal of Epidemiology 15 (1986), pp. 413–419. [12] J. Guo and Z. Geng,

Collapsibility of logistic regression coefficients , Journal of Royal Statistical Society B 57 (1995), pp. 63--267. [13] P.W. Lane and J.A. Nelder,

Analysis of Covariance and Standardization as Instance of Prediction , Biometrics

36 (1982), pp. 613--621. [14] Y. Lee and J.A. Nelder,

Conditional and marginal models: another view , Statistical Science

19 (2004), pp. 219--228. [15] J.K. Lindsey,

Parametric Statistical Inference,

Oxford, Clarendon Press, 1996 [16] P. Malfertheiner, F. Megraud, C. O'Morain, F. Bazzoli, E. El-Omar and D. Graham,

Current concepts in the management of Helicobacter pylori infection: the Maastricht III Consensus Report , Gut 56 (2007), pp. 772--781. [17] F. Megraud,

H. pylori antibiotic resistance: prevalence, importance, and advances in testing,

Gut 53(2004), pp. 1374—1384. [18] T.V.H. Nguyen, C. Bengtsson, G.K. Nguyen, T.T.H. Hoang, D.C. Phung, M. Sorberg and M. Granstrom,

Evaluation of Two Triple-Therapy Regiments with Metronidazole or Clarithromycin for the Eradication of H. pylori Infection in Vietnamese Children, a Randomized, Double Blind Clinical Trial , Helicobacter 13 (2008), pp. 550--556. [19] P.R. Rosenbaum,

Observational studies , New York, Springer, 1995. [20] P.R. Rosenbaum and D.B. Rubin,

The central role of the propensity score in observational studies for causal effects,

Biometrika 70 (1983), 41--55. [21] K.J. Rothman, S. Greenland and T.L. Lash,

Modern Epidemiology (3rd edition),

Lippincott Williams & Wilkins, Philadelphia, 2008 [22] K.J. Rothman, S. Greenland and A.M. Walker,

Concepts of interaction , American Journal of Epidemiology 112 (1980), pp. 467--470. [23] D.B. Rubin, X. Wang, L. Yin and E. Zell,

Estimating the Effect of Treating Hospital Type on Cancer Survival in Sweden Using Principal Stratification in The HANDBOOK OF APPLIED BAYESIAN ANALYSIS (Eds. T. O’Hagan and M. West), Oxford University Press, Oxford, 2010. [24] T.J. VanderWeele,

On the Distinction Between Interaction and Effect Modification , American Journal of Epidemiology 20 (2009), pp. 863--871. Table 1

Successful eradications / total patients stratified by the covariates 𝑥 (age), 𝑥 (gender), and 𝑥 (antibiotic resistance) for the treatment ( 𝑧 , 𝑧 ) (therapy, environment). Covariates

Treatment ( 𝒛 , 𝒛 ) 𝒙 𝑥 𝑥 (0, 0) (0, 1) (1, 0) (1, 1)

0 0 0 3/4 6/7 0/3 8/8 0 0 1 3/4 1/1 1/1 0 1 0 1/5 3/4 2/3 3/3 0 1 1 2/4 1/2 1/1 1 0 0 1/2 1/1 3/8 6/6 1 0 1 3/3 0/1 1/1 3/3 1 1 0 2/7 0/2 5/8 0/2 1 1 1 1/2 0/3 6/9 1/1 Table 2

ML estimates and its approximate covariance matrix for parameters of the model (25) * * The covariance matrix is obtained with adjustment of over-dispersion in the framework of the ML estimation of a mis-specified model or the generalized estimating equation.

Parameters α β β β θ θ θ θ Estimates 1.19 − − − α − − − − − β − − − − β − − − β − − − − θ − − − − θ − − − − − θ − − − − − θ − − − − Table 3

ML estimates and confidence intervals for the five measures of the interaction describing how efficacy of the therapy on eradication of

Helicobacter pylori would increase if children lived in the rural environment of Vietnam, based on models (25) and (26).

Interaction ML estimates 50 % and 95 % confidence intervals Model (25) Model (26) Model (25) Model (26) RCOR 8.85 6.05 (0.66, 3.61, 21.95, 127.04) (0.40, 2.18, 13.89, 72.00) RCRR 1.60 1.47 (0.74, 1.22, 2.27, 4.60) (0.60, 1.07, 2.05, 4.18) RMOR 8.62 5.47 (0.72, 3.50, 15.93, 88.51) (0.42, 2.04, 11.32, 39.45)

RMRR 1.58 1.41 (0.77, 1.23, 1.90, 3.40) (0.63, 1.06, 1.80, 3.10)

RMRD 0.34 0.27 ( − − Figure 1

Approximate distributions of the ML estimates -- R COR � , R CRR � , R MOR � , R MRR � and D MR D � -- for the five measures of the interaction between the therapy and the environment on the eradication of Helicobacter pylori among Vietnamese children.

RCOR � RCRR � RMOR � RMRR � DMRD ��