[PDF] Estimating confidence regions of common measures of (baseline, treatment effect) on dichotomous outcome of a population

Abstract

In this article we estimate confidence regions of the common measures of (baseline, treatment effect) in observational studies, where the measure of baseline is baseline risk or baseline odds while the measure of treatment effect is odds ratio, risk difference, risk ratio or attributable fraction, and where confounding is controlled in estimation of both baseline and treatment effect. To avoid high complexity of the normal approximation method and the parametric or non-parametric bootstrap method, we obtain confidence regions for measures of (baseline, treatment effect) by generating approximate distributions of the ML estimates of these measures based on one logistic model.

Full PDF

1 Estimating confidence regions of common measures of (baseline, treatment effect) on dichotomous outcome of a population

Li Yin and Xiaoqin Wang

2* 1

Department of Medical Epidemiology and Biostatistics, Karolinska Institute, Box 281, SE-171 77, Stockholm, Sweden Department of Electronics, Mathematics and Natural Sciences, University of Gävle, SE-801 76, Gävle, Sweden (Email: [email protected]). * Corresponding author

Abstract

Keywords: baseline measure; effect measure; confidence region; logistic model

1 Introduction

Suppose that one conducts a randomized trial to investigate the effect of a dichotomous treatment z on a dichotomous outcome y of certain population, where 𝑧 = 0, 1 indicate the active respective control treatments while 𝑦 = 0, 1 indicate positive respective negative outcomes. With a sufficiently large sample, covariates are essentially unassociated with treatments z and thus are not confounders. Let R 𝑧 = pr( 𝑦 = 1 | 𝑧 ) be the risk of 𝑦 = 1 given z . Then R 𝑧 is marginal with respect to covariates and thus conditional on treatment z only, so R 𝑧 is also called marginal risk. One can use R 𝑧 to obtain the measures of baseline and treatment effect which are called marginal measures. The comment measures of baseline are the baseline risk R = pr( 𝑦 = 1 | 𝑧 = 0) and the baseline odds O = R /(1 − R ) . The common measures of treatment effect are the odds ratio OR = O O where O 𝑧 = R 𝑧 /(1 − R 𝑧 ) is the odds under treatment z , and the risk difference RD = R − R , the risk ratio RR = R R , and the attributable fraction AF = 1 − R R . When presenting treatment effect, one also presents baseline in order to learn significance of the treatment effect to the population. For instance, RD = − R is 1 % versus 50 %, so one should present both RD and R . A similar situation occurs in which RR = 0.5 when R is 1 % versus 50 %. The common measures of (baseline, treatment effect) are (O , OR) (R , RD) , (R , RR) and (R , AF) . All these measures have causal interpretation in the framework of the Rubin causal model and reflect different aspects of the same underlying (baseline, treatment effect) (Rosenbaum and Rubin, 1983; Rosenbaum, 1995; Rubin et al., 2009). Now suppose that one conducts an observational study to investigate the effect of the treatment z on the outcome y of the population above and wishes to describe (baseline, treatment effect) by the same measures (O , OR) (R , RD) , (R , RR) and (R , AF) as in the randomized trial. However, in the observational study, covariates may be associated with both y and z and therefore are confounders. In this case, one needs to use the conditional risk pr( 𝑦 = 1| 𝑧 , 𝒙 ) given not only z but also x , instead of the marginal risk pr( 𝑦 = 1| 𝑧 ) , to obtain these measures. As a result, it is highly difficult to estimate these measures as described below. For illustration, consider the case in which there is only one covariate x . To estimate the measures (O , OR) (R , RD) , (R , RR) and (R , AF) through pr( 𝑦 = 1| 𝑧 , 𝒙 ) , the most common model is logistic model, for instance, Log � pr ( 𝑦 = 1 | 𝑧 , 𝒙 )1 − pr ( 𝑦 = 1 | 𝑧 , 𝒙 ) � = α + β 𝑧 + θ𝑥 + 𝛾 ( 𝑧𝑥 ) Here exp( α ) is the baseline odds in stratum 𝑧 = 0, 𝑥 = 0 ; exp( β ) is the conditional odds ratio describing the treatment effect in stratum 𝑥 = 0 . Noticeably, the measures R , O , OR, RD, RR and AF as parameters of interest are not equal to the model parameters or their exponentials and therefore cannot be estimated as parameters of this model. Even if 𝛾 = 0 , i.e. there is no product term between z and x , the conditional odds ratio exp( β ) is not equal to the marginal odds ratio OR; see rich literatures about non-collapsibility of odds ratios (e.g. Gail, 1984; Greenland et al., 1999; Lee and Nelder, 2004; Austin et al., 2007; Austin, 2007). Also see literatures about convertibility and non-convertibility from exp( β ) to RR (Greenland, 1987; Zhang and Yu, 1998; McNutt et al., 2003; Greenland, 2004a); notice literatures about linear or log-linear models in estimation of RD or RR (Bieler et al., 2010; Spiegelman and Hertzmark, 2005; Cheung, 2007; Zou, 2004). On the other hand, one can always express R , O , OR, RD, RR and AP in terms of the parameters of a logistic model. The ML estimates of R , O , OR, RD, RR and AP can be obtained from those of the model parameters. The usual methods of obtaining the interval estimate of treatment effect are the normal approximation method and the bootstrap method (Lane and Nelder, 1982; Graubard and Korn, 1999; Greenland, 2004a; Greenland, 2004b; Austin, 2010; Gehrmanna et al., 2010). In the normal approximation method, one derives approximate variance of the ML estimate of a measure of treatment effect and then uses the variance to obtain normal approximation confidence interval for the measure. However, the derivation of the variance is tedious and the expressions of the variance are different for different logistic models containing different products between z and x . In the bootstrap method, one generates bootstrap samples and then uses the bootstrap samples to obtain bootstrap distribution and thus the confidence interval of a measure of treatment effect. However, it is highly difficult to correct finite-sample bias due to the bootstrap sampling, which typically occurs with the asymmetric – i.e. screwed -- distribution of the ML estimate as is the case for dichotomous outcome (Davison and Hinkley, 1997; Carpenter and Bithell, 2000; Greenland, 2004b). In this article, we use only one logistic model to obtain the confidence regions for all the measures (O , OR) , (R , RD) , (R , RR) and (R , AP) of (baseline, treatment effect) in observational studies. Instead of deriving covariance matrix of the two-dimensional ML estimate or correcting the finite-sample bias of the two-dimensional bootstrap sampling, we use approximate normal distribution of the ML estimate of the logistic model parameters to generate approximate distribution of the ML estimate of a measure of (baseline, treatment effect) and then use the obtained distribution to calculate confidence region for this measure. We are going to present the method through an observational study embedded in a randomized trial, in which we investigated the effect of high versus low doses of a therapy on eradication of Helicobacter pylori among Vietnamese children.

2 Eradication of

Helicobacter pylori among Vietnamese children

In a randomized trial, researchers studied two triple therapies – (lansoprazole, amoxicillin, metronidazole) and (lansoprazole, amoxicillin, clarithromycin) – for their abilities to eradicate

Helicobacter pylori among Vietnamese children (Nguyen et al., 2008). From several children hospitals in Hanoi, restricting body weight to a range between 13 kg and 45 kg, a sample was collected between May 2005 and January 2006. In a substudy, researchers focused on one treatment arm of the triple therapy (lansoprazole, amoxicillin, metronidazole) and analyzed the treatment effect of high versus low doses of the therapy on eradication of

Helicobacter pylori . The treatment arm comprised 109 patients. The therapy eradicated

Helicobacter pylori through systemic circulation, so the researchers assigned the therapy to the children according to their body weights. According to the pediatric procedure, children with 13 kg < body weight < 23 kg received the therapy once daily and those with 23 kg ≤ weight < 45 kg twice daily. Of medical relevance was dose in unit body weight. Following the earlier research on the same data, we categorized the children into receiving high versus low doses of the therapy by taking the middle point of each weight category as the threshold (Nguyen et al., 2008). Hence children with 13 kg < body weight ≤

18 kg in the first category and 23 kg ≤ body weight < 34 kg in the second category were considered as receiving the high dose ( z = 1) of the therapy whereas those with 18 kg < body weight < 23 kg in the first category and 34 kg ≤ body weight < 45 kg in the second category as receiving the low dose ( z = 0). The outcome was successful ( y =

1) versus unsuccessful ( y =

0) eradication of

Helicobacter pylori . In the context of this substudy, body weight was synonymous to treatment z and thus was not a covariate. The covariate age ( x ) was associated with body weight and thus with the assignment of treatments z . Furthermore, age was known to have effect on the outcome y and thus was a confounder. Therefore this substudy had the nature of an observational study. Other possible confounding covariates were documented, which were gender ( x ), geographic area ( x ) and antibiotic resistance to metronidazole ( x ). Age was categorized into younger ( x =

1) than 9 years versus older ( x = x = x = 0 male of the gender. Geographic area was categorized into rural ( x = versus urban ( x = x = versus resistant ( x = 𝒙 = ( x , x , x , x ) be the set of the documented covariates. The data of the substudy is given in Table 1. The conditional risk pr ( 𝑦 = 1 | 𝑧 , 𝒙 ) of 𝑦 = 1 in stratum ( 𝑧 , 𝒙 ) is modeled by logistic model. By the likelihood ratio-based significance testing of the model parameters, we obtain Log { pr ( 𝑦 = 1 | 𝑧 , 𝒙 )1 − pr ( 𝑦 = 1 | 𝑧 , 𝒙 )} = α + β 𝑧 + 𝜃 𝑥 + 𝜃 𝑥 + 𝜃 𝑥 + 𝜃 𝑥 + 𝜃 ( 𝑧 ∗ 𝑥 ) + 𝜃 ( 𝑧 ∗ 𝑥 ). (1) In addition to the main effect terms for the treatment z and the covariates 𝒙 , we include in this model two product terms: one for the treatment--gender ( z * x ) and one for the treatment--geographic area ( z * x ), because of the small p-values, 0.10 and 0.11, for the significance test of θ = 0 and θ = 0 respectively. Let 𝜋 = ( 𝛼 , 𝛽 , 𝜃 , 𝜃 , 𝜃 , 𝜃 , 𝜃 , 𝜃 ) be the set of all the model parameters. The ML estimate 𝜋� = ( 𝛼� , 𝛽̂ , 𝜃� , 𝜃� , 𝜃� , 𝜃� , 𝜃� , 𝜃� ) and its approximate covariance matrix Σ� (i.e. the inverse of the observed information) are presented in Table 2. Confidence regions of (baseline, treatment effect)

Here we use the conditional risk pr ( 𝑦 = 1 | 𝑧 , 𝒙 ) to obtain the measures (O , OR) , (R , RD) , (R , RR) and (R , AF) of (baseline, treatment effect) introduced in the introduction. Let 𝑦 𝑧 be the potential outcome of each patient in the population under treatment z = 0, 1 . Denote the risk pr ( 𝑦 𝑧 = 1) of the potential outcome 𝑦 𝑧 = 1 by R 𝑧 . Then R is the baseline risk, i.e. the risk of 𝑦 𝑧 = 1 in the population under z =

0. Because it is only possible to observe potential outcome of a patient under either z = 0 or z = 1, certain assumption is needed to allow for estimation of R 𝑧 (Greenland et al., 1999; Greenland and Robins, 1986; Rosenbaum and Rubin, 1983; Rosenbaum, 1995; Rubin et al., 2009). In the medical context of this study, it is reasonable to assume that there is no other confounding covariate than the documented covariates 𝒙 (Nguyen et al., 2008). The assumption implies pr ( 𝑦 𝑧 = 1| 𝒙 ) = pr ( 𝑦 = 1 | 𝑧 , 𝒙 ) which states that the patients in stratum x , who have not received treatment z , would have the same risk pr ( 𝑦 = 1 | 𝑧 , 𝒙 ) as those who have received treatment z, if they had received treatment 𝑧 . Therefore we can express the risk R 𝑧 = pr ( 𝑦 𝑧 = 1) of the potential outcome 𝑦 𝑧 = 1 by R 𝑧 = � pr ( 𝑦 𝑧 = 1| 𝒙 ) pr ( 𝒙 ) = 𝒙 � pr ( 𝑦 = 1| 𝑧 , 𝒙 ) pr ( 𝒙 ), (2) 𝒙 and in particular, the baseline risk R by R = � pr ( 𝑦 = 1| 𝑧 = 0, 𝒙 ) pr ( 𝒙 ), 𝒙 where pr ( 𝒙 ) is the probability of x in the population. Formula (2) implies that we can estimate R 𝑧 or its function by modeling pr ( 𝑦 = 1| 𝑧 , 𝒙 ) and pr ( 𝒙 ) through the observed data. Now we express the measures of baseline and treatment effect in terms of pr ( 𝑦 = 1| 𝑧 , 𝒙 ) and pr ( 𝒙 ) . The odds of the population under treatment z has the form O 𝑧 = R 𝑧 − R 𝑧 = ∑ pr ( 𝑦 = 1 | 𝑧 , 𝒙 )pr( 𝒙 ) 𝒙 − ∑ pr ( 𝑦 = 1 | 𝑧 , 𝒙 )pr( 𝒙 ) 𝒙 , 𝑧 = 0,1 (3) Hence the baseline odds is O = R − R = ∑ pr ( 𝑦 = 1 | 𝑧 = 0, 𝒙 )pr( 𝒙 ) 𝒙 − ∑ pr ( 𝑦 = 1 | 𝑧 = 0, 𝒙 )pr( 𝒙 ) 𝒙 . (4) The odds ratio is OR = O O = ∑ pr ( 𝑦 = 1 | 𝑧 = 1, 𝒙 )pr( 𝒙 ) 𝒙 − ∑ pr ( 𝑦 = 1 | 𝑧 = 1, 𝒙 )pr( 𝒙 ) 𝒙 ∑ pr ( 𝑦 = 1 | 𝑧 = 0, 𝒙 )pr( 𝒙 ) 𝒙 − ∑ pr ( 𝑦 = 1 | 𝑧 = 0, 𝒙 )pr( 𝒙 ) 𝒙 � . (5) The risk difference is

RD = R − R = � { pr ( 𝑦 = 1| 𝑧 = 1, 𝒙 ) − pr ( 𝑦 = 1| 𝑧 = 0, 𝒙 )} pr ( 𝒙 ). (6) 𝒙 The risk ratio is

RR = R R = ∑ pr ( 𝑦 = 1 | 𝑧 = 1, 𝒙 )pr( 𝒙 ) 𝒙 ∑ pr ( 𝑦 = 1 | 𝑧 = 0, 𝒙 )pr( 𝒙 ) 𝒙 . (7) Finally, the attributable fraction is

AF=1 − R R = 1 − ∑ pr ( 𝑦 = 1 | 𝑧 = 0, 𝒙 )pr( 𝒙 ) 𝒙 ∑ pr ( 𝑦 = 1 | 𝑧 = 1, 𝒙 )pr( 𝒙 ) 𝒙 . (8) In particular, in randomized trial, the treatment z is not associated with the covariates x , so that pr( 𝒙 ) = pr( 𝒙 | 𝑧 ) . Insertion of this into (2) yields the risk pr( 𝑦 = 1 | 𝑧 ) of 𝑦 = 1 in stratum z , R 𝑧 = � pr ( 𝑦 = 1| 𝑧 , 𝒙 ) pr ( 𝒙 | 𝑧 ) = pr(y = 1|z) 𝒙 . Therefore we see that R 𝑧 and thus R , O , OR, RD, RR and AF defined here are equal to those defined in the introduction for randomized trial. No matter if it is a randomized trial or an observational study, all these measures have causal interpretation in the Rubin causal model (Rosenbaum and Rubin, 1983; Rosenbaum, 1995; Rubin et al., 2009). From model (1), we obtain pr ( 𝑦 = 1 | 𝑧 , 𝒙 ) = exp{ α + β 𝑧 + 𝜃 𝑥 + 𝜃 𝑥 + 𝜃 𝑥 + 𝜃 𝑥 + 𝜃 ( 𝑧 ∗ 𝑥 ) + 𝜃 ( 𝑧 ∗ 𝑥 )}1 + exp{ α + β 𝑧 + 𝜃 𝑥 + 𝜃 𝑥 + 𝜃 𝑥 + 𝜃 𝑥 + 𝜃 ( 𝑧 ∗ 𝑥 ) + 𝜃 ( 𝑧 ∗ 𝑥 )}. (9) We are going to use (9) to express the measures of baseline and treatment effect in terms of the model parameters 𝜋 = ( 𝛼 , 𝛽 , 𝜃 , 𝜃 , 𝜃 , 𝜃 , 𝜃 , 𝜃 ) . We shall ignore variability of the covariates x , so that we replace the probability pr ( 𝒙 ) by the proportion prop( 𝒙 ) and treat the proportion as constant. Inserting pr ( 𝑦 = 1 | 𝑧 , 𝒙 ) of (9) into (2), we obtain R 𝑧 ( 𝑧 = 0,1 ) as a function of 𝜋 , R 𝑧 ( 𝜋 ) = � pr ( 𝑦 = 1| 𝑧 , 𝒙 ) pr op( 𝒙 ). (10) 𝒙 Inserting pr ( 𝑦 = 1 | 𝑧 , 𝒙 ) of (9) into (3)-(5), we obtain O 𝑧 and OR as functions of 𝜋 , O 𝑧 ( 𝜋 ) = R 𝑧 ( 𝜋 )1 − R 𝑧 ( 𝜋 ) , 𝑧 = 0,1 (11) OR( 𝜋 ) = O ( 𝜋 )O ( 𝜋 ) (12) Similarly, from (6)-(8), we obtain

RD( 𝜋 ) = R ( 𝜋 ) − R ( 𝜋 ), (13) RR( 𝜋 ) = R ( 𝜋 )R ( 𝜋 ) , (14) AF( 𝜋 ) = 1 − R ( 𝜋 )R ( 𝜋 ) . (15) Replacing 𝜋 in R ( 𝜋 ) , O ( 𝜋 ) , OR( 𝜋 ) , RD( 𝜋 ), RR( 𝜋 ) and AR( 𝜋 ) by the ML estimate 𝜋� given in Table 2, we obtain the ML estimates R � = R ( 𝜋� ) , O � = O ( 𝜋� ) , OR � = OR( 𝜋� ) , RD � = RD( 𝜋� ) , RR � = RR( 𝜋� ) and AP � = AP( 𝜋� ) . These estimates are R � = 0.53 , O � = 1.13 , OR � = 2.11 , RD � = 0.17 , RR � = 1.33 , and AP � = 0.25 , which are also presented in Figures 1-4 and Table 3. Parameters of a logistic model have good asymptotic normality: normal distribution is good approximation to distribution of the ML estimate of the parameters (e.g. Lindsey, 1996). We are going to use the approximate normal distribution of the ML estimate of the model parameters to generate approximate distribution of the ML estimates of measures of (baseline, treatment effect). Let p be a random variable which follows the normal distribution N( 𝜋� , Σ� ) , namely, 𝑝 ~ N( 𝜋� , Σ� ) , where 𝜋� and Σ� in N( 𝜋� , Σ� ) are given in Table 2. Replacing 𝜋 in {O ( 𝜋 ), OR( 𝜋 )} by p gives {O ( 𝑝 ), OR( 𝑝 )} . By using the normal distribution of p , we generate the distribution of {O ( 𝑝 ), OR( 𝑝 )} , which approximates the distribution of (O � , OR � ) . Using the same method, we also obtain the approximate distributions of (R � , RD � ) , (R � , RR � ) and (R � , AF � ) respectively. In this article, the confidence region of a measure of (baseline, treatment effect) is defined as the smallest area given a confidence level (1 − 𝛼 ) , i.e. the smallest area in which the ML estimate of the measure has the probability (1 − 𝛼 ) . Given the distribution of the ML estimate of a measure of (baseline, treatment effect), we can obtain the confidence region of the measure by a variety of methods. Here we are going to obtain confidence regions of (O , OR) , (R , RD) , (R , RR) and (R , AP) by using convertibility of these measures. As will be seen in Section 4, this method can be easily implemented by using any software that generates normal distribution. It is clear from (3)-(5) that (R , R ) can be converted into (O ,OR) . On the other hand, from (4), we obtain R = O . (16) Combining with (3) and (5), we obtain R = (O )(OR)1 + (O )(OR). (17) Thus (O , OR) can be converted into (R , R ) . Therefore (O , OR) and (R , R ) can be converted into each other. Furthermore, from (6)-(8), we see that (R , R ) can be converted to the measures (R , RD) , (R , RR) and (R , AP) , and vice versa. As a result, the measures (R , RD) , (R , RR) , (O , OR) and (R , AP) can be converted, i.e. obtained, from one another. Similarly, from (12)-(15), we see that the measures of (baseline, treatment effect) as function of the model parameters 𝜋 , {O ( 𝜋 ), OR( 𝜋 )} , {R ( 𝜋 ), RD( 𝜋 )} , {R ( 𝜋 ), RR( 𝜋 )} and {R ( 𝜋 ), AP( 𝜋 )} , are convertible from one another. Because R � = R ( 𝜋� ) , O � = O ( 𝜋� ) , OR � = OR( 𝜋� ) , RD � = RD( 𝜋� ) , RR � = RR( 𝜋� ) and AP � = AP( 𝜋� ) , the ML estimates, (R � , RD � ) , (R � , RR � ) , (O � , OR � ) and (R � , AF � ) , are convertible with one another. Therefore the distributions of these ML estimates are convertible with one another and so are their corresponding confidence regions. To obtain confidence regions of (R , RD) , (R , RR) , (O , OR) and (R , AP) , we consider the logarithm of (O , OR) , i.e. {log(O ), log(OR)} , as an additional measure of (baseline, treatment effect). Because the logarithm is a monotonic function, {log(O ), log(OR)} is convertible with any other measure of (baseline, treatment effect), and so is its ML estimate {log(O � ), log(OR � )} with the ML estimate of any other measure, and similarly for the distribution of {log(O � ), log(OR � )} and its corresponding confidence region. Because log(O ) and log(OR) range from −∞ to + ∞ , the ML estimate {log(O � ), log(OR � )} is fairly normal even for finite sample (e.g. Lindsey, 1996) and thus the corresponding confidence region can be well approximated by an ellipse. Due to the convertibility of confidence regions, we can convert the elliptical confidence region of {log(O ), log(OR)} to non-elliptical confidence regions of (O , OR) , (R , RD) , (R , RR) and (R , AP) . In the next section, we shall describe the procedure of obtaining point estimates and confidence regions of measures of (baseline, treatment effect).

4 Procedure for obtaining confidence regions of (baseline, treatment effect) 4.1 Approximate distributions for the ML estimates of baseline and treatment effect

First we draw p from the normal distribution 𝑝 ~N( 𝜋� , Σ� ) . Second, we replace 𝜋 by the obtained p in formulas (11) and (12) to get { O ( 𝑝 ) , OR( 𝑝 )} which approximates ( O � , OR � ). We iterate the procedure, 1000 times in this article, to get 1000 pairs of values of (O � , OR � ) . All these 1000 pairs form an approximate (joint) distribution of (O � , OR � ) . All values of O � in those 1000 pairs form an approximate (marginal) distribution of O � . All values of OR � in those 1000 pairs form an approximate (marginal) distribution of O R � . Similarly, we obtain the distributions of other measures of baseline and treatment effect. The approximate distributions for ( O � , OR � ), O � and OR � are presented in Figure 1; those for ( R � , RD � ), R � and RD � in Figure 2; those for ( R � , RR � ) and RR � in Figure 3; those for ( R � , AF � ) and AF � in Figure 4. By taking logarithm of the 1000 values of � O � , OR � � , O � and OR � , we obtain the distributions of {log(O � ), log(OR � )} , log(O � ), log(OR � ) , which are presented in Figure 5. From the distribution of {log(O � ), log(OR � )} , we calculate the mean and covariance matrix and then construct the following normal distribution N �� , � − − �� . From this normal distribution, we obtain the ellipse formula {log(O ) − − � − − � � log(O ) − − � = 𝜒 (1 − 𝛼 ) from which we obtain the (1 − 𝛼 ) confidence region of {log(O ), log(OR)} , which is plotted in Figure 5. Now we convert this elliptical confidence region into non-elliptical confidence regions for other measures of (baseline, treatment effect) by the following procedure: a) converting {log(O ), log(OR)} to (O , OR) , b) using formulas (16) and (17) to obtain (R , R ) , and c) using formulas (6)-(8) to obtain (R , RD) , (R , RR) and (R , AP) . The 50% and 95 % confidence regions for (O , OR) , (R , RD) , (R , RR) and (R , AP) are shown in Figures 1-4 respectively. Here we are going to calculate the confidence intervals of measures of treatment alone in order to compare them with the confidence regions of measures of (baseline, treatment effect) above. From (5)-(8) or (12)-(15), we see that the measures of treatment effect, OR, RD, RR and AF, are not convertible from one another, except for RR and AF. Hence we need to use distribution of the ML estimate of each measure of treatment effect to obtain (1 − 𝛼 ) confidence interval of the measure. We obtain that the 95 % confidence interval of OR is (0.79, 4.78), that for RD is ( − − The confidence region of a measure of (baseline, treatment effect) indicates possible values for the measure at the (1 − 𝛼 ) confidence level. Statistically, all the confidence regions for ( O , OR), ( R , RD), ( R , RR) and ( R , AF) are equivalent: each point of a confidence region can be converted to a unique point of another confidence region. Use of a specific confidence region is entirely determined by the specific medical context. The confidence interval of a measure of treatment effect indicates possible values for the measure at certain (1 − 𝛼 ) confidence level regardless of values for baseline. For instance, the confidence interval of RD in Table 3 indicates possible values for RD, i.e. ( − R . If we interpret the confidence interval of a measure of treatment effect together with a value of a measure of baseline, we may get misleading conclusion. For instance, it is misleading to claim a 95 % confidence interval ( − R = 0.68 , because, at the upper limit of the confidence interval, RD + R = 0.36 + 0.68 is larger than one.

5 Conclusions and discussions

To learn significance of treatment effect to the population, one needs to report both baseline and treatment effect. Because the ML estimate of a measure of baseline is highly correlated with that of a measure of treatment effect, one needs to report baseline and treatment effect jointly. To our knowledge, little is seen in the literature of joint estimation of even one measure of (baseline, treatment effect) by a model in observational studies. In this article, we express common measures of (baseline, treatment effect) in terms of the conditional risk of outcome given treatments and covariates, which is then expressed by a logistic model. Therefore we can use only one logistic model to control confounding covariates and estimate these measures together. Because the expressions of these measures are smooth bounded functions of the logistic model parameters, the performance of the logistic model in estimating confidence region is determined by the performance of the logistic model in estimating its own model parameters. Because parameters of a logistic model have good asymptotic normality, we use approximate normal distribution of the ML estimates of the logistic model parameters to generate approximate distributions of the ML estimates of various measures of (baseline, treatment effect) and then use the obtained distributions to calculate confidence regions of these measures. Our method can be easily implemented by any software that generates normal distribution in comparison to high complexity of the normal approximation method and the bootstrap method for estimating confidence regions of measures of (baseline, treatment effect). To conclude the article, we give two immediate extensions of our approach to estimation of confidence regions for measures of (baseline, treatment effect). First, we can replace the covariate-based logistic model by the propensity score-based logistic model to obtain the confidence regions. Due to a relatively small sample size and relatively few confounding covariates of the illustrative example, we have used the covariate-based logistic model to control the confounding covariates. When sample is large with many covariates, one sometimes uses the propensity score of treatment given covariates to control confounding covariates (Rosenbaum and Rubin, 1983; Rosenbaum, 1995). In this case, we can obtain confidence regions for measures of (baseline, treatment effect) by using the propensity score-based logistic model and considering the propensity score as the sole covariate in the model. Second, we can use other models than logistic model to obtain confidence regions for measures of (baseline, treatment effect). For instance, sometimes, one finds that some model, e.g. log-linear model, describes the data better than logistic model. In this case, we can obtain confidence regions for measures of (baseline, treatment effect) by replacing the logistic model by the log-linear model in our method. Acknowledgement

The authors are grateful to Professor Marta Granström for the data and the relevant medical information. This research received no specific grant from any funding agency in the public, commercial, or not-for-profit sectors.

References

Austin, P. C. (2010). Absolute risk reductions, relative risks, relative risk reductions, and numbers needed to treat can be obtained from a logistic regression model.

Journal of Clinical Epidemiology Statistics in Medicine Statistics in Medicine American Journal of Epidemiology . Statistics in Medicine Cheung, Y. B. (2007). A Modified Least-Squares Regression Approach to the Estimation of Risk Difference. American Journal of Epidemiology 166: 1337-1344.

Davison, A. C. and Hinkley, D. V. (1997).

Bootstrap Methods and their Application . Cambridge: Cambridge University Press. Gail, M. H., Wieand, S. and Piantadosi, S. (1984). Biased estimates of treatment effect in randomized experiments with nonlinear regressions and omitted covariates.

Biometrika Gehrmanna, U., Kussb, O., Wellmannc, J. and Ralf Bender, R. (2010). Logistic regression was preferred to estimate risk differences and numbers needed to be exposed adjusted for covariates.

Journal of Clinical Epidemiology Biometrics American Journal of Epidemiology

International Journal of Epidemiology Greenland, S. (1987). Interpretation and choice of effect measures in epidemiologic analyses.

American Journal of Epidemiology

Statistical Science International Journal of Epidemiology Biometrics Statistical Science Lindsey, J. K. (1996).

Parametric Statistical Inference.

Oxford: Clarendon Press. McNutt, L. A., Wu, C., Xue, X. and Hafner, J. P. (2003). Estimating the relative risk in cohort studies and clinical trials of common outcomes.

American Journal of Epidemiology o ̈ rberg, M. and Granstr o ̈ m, M. (2008). Evaluation of Two Triple-Therapy Regiments with Metronidazole or Clarithromycin for the Eradication of H. pylori

Infection in Vietnamese Children, a Randomized Double Blind Clinical Trial.

Helicobacter Rosenbaum, P. R. (1995).

Observational studies . New York: Springer. Rosenbaum, P. R. and Rubin, D. B. (1983). The central role of the propensity score in observational studies for causal effects.

Biometrika Spiegelman, D. and Hertzmark, E. (2005). Easy SAS calculations for risk or prevalence ratios and differences.

American Journal of Epidemiology

Statistical methods in medical research doi:10.1177/0962280213502146. Wang, X., Jin, Y. and Yin, L. (2014b). Point and interval estimations of marginal risk difference by logistic model.

Communications in statistics - Theory and Methods , to appear. Zhang, J., Yu, K. F. (1998). What’s the relative risk? A method of correcting the odds ratio in cohort studies of common outcomes.

JAMA Zou, Y. (2014). A Modified Poisson Regression Approach to Prospective Studies with Binary Data.

American Journal of Epidemiology Table 1

Eradication of

Helicobacter pylori among Vietnamese children: successful eradications / total patients on levels of the covariates 𝑥 -- 𝑥 for high respective low doses of the treatment z . Covariates Treatment 𝑥 (age) 𝑥 (gender) 𝑥 (geographic) 𝑥 (resistance) 𝑧 = 1 (high) 𝑧 = 0 (low) 0 0 0 0 0/3 3/4 0 0 0 1 1/1 3/4 0 0 1 0 8/8 6/7 0 0 1 1 1/1 0 1 0 0 2/3 1/5 0 1 0 1 1/2 2/4 0 1 1 0 3/3 3/4 0 1 1 1 1/1 1 0 0 0 3/8 1/2 1 0 0 1 1/1 3/3 1 0 1 0 6/6 1/1 1 0 1 1 3/3 0/1 1 1 0 0 5/8 2/7 1 1 0 1 6/9 1/2 1 1 1 0 0/2 0/2 1 1 1 1 1/1 0/3 Table 2

ML estimates and their approximate covariance matrix for parameters of model (1) .

Parameters α β θ θ θ θ θ θ Estimates 1.19 − − − α − − − − − β − − − − θ − − − − θ − − − − − θ − − − θ − − − − − θ − − − − θ − − − − Table 3

ML estimates and confidence intervals for measures of baseline and treatment effect based on model (1). ML estimate 50 % and 95 % Confidence intervals Baseline measure R O − AF 0.25 ( − Figure 1 (a) The scatterplot for approximate distribution of the ML estimate ( O � , OR � ), and the 50% and 95% confidence regions of (O , OR) . (b) Approximate distribution for the ML estimate O � . (c) Approximate distribution for the ML estimate OR � . The point estimate of (O , OR) is equal to (1.13, 2.11) . (a) (b) (c) O � O R � O � D e n s it y D e n s it y O R � Figure 2 (a) The scatterplot for approximate distribution of the ML estimate ( R � , RD � ), and the 50% and 95% confidence regions of (R , RD) . (b) Approximate distribution for the ML estimate R � . (c) Approximate distribution for the ML estimate RD � . The point estimate of (R , RD) is equal to (0.53, 0.17) . (a) (b) (c) R � R D � R � D e n s it y D e n s it y R D � Figure 3 (a) The scatterplot for approximate distribution of the ML estimate ( R � , RR � ), and the 50% and 95% confidence regions of (R , RR) . (b) Approximate distribution for the ML estimate RR � . The point estimate of (R , RR) is equal to (0.53, 1.33) . (a) (b) R � RR � D e n s it y R R � Figure 4 (a) The scatterplot for approximate distribution of the ML estimate ( R � , AF � ), and the 50% and 95% confidence regions of (R , AF) . (b) Approximate distribution for the ML estimate AF � . The point estimate of (R , AF) is equal to (0.53, 0.25) . (a) (b) R � A F � D e n s it y AF � Figure 5 (a) The scatterplot for approximate distribution of the ML estimate � log(O � � , log(OR � )} , and the 50% and 95% confidence regions of {log(O ), log(OR)} . (b) Approximate distribution for the ML estimate log(O � ) . (c) Approximate distribution for the ML estimate log(OR � ) . The point estimate of {log(O ), log(OR)} is equal to (0.12, 0.75) . (a) (b) (c) log(O � ) l o g ( O R � ) log(O � ) D e n s it y D e n s it y log(O R � ))