[PDF] Discussion of "Nonparametric generalized fiducial inference for survival functions under censoring"

Abstract

The following discussion is inspired by the paper Nonparametric generalized fiducial inference for survival functions under censoring by Cui and Hannig. The discussion consists of comments on the results, but also indicates it's importance more generally in the context of fiducial inference. A two page introduction to fiducial inference is given to provide a context.

Full PDF

aa r X i v : . [ s t a t . O T ] M a y Discussion of ‘Nonparametric generalized ﬁducialinference for survival functions under censoring’

G. Taraldsen and B.H. LindqvistDepartment of Mathematical SciencesNorwegian University of Science and TechnologyNTNU, NO-7491 Trondheim, [email protected] and [email protected] 27, 2019

Abstract

The following discussion is inspired by the paper

Nonparametric generalizedﬁducial inference for survival functions under censoring by Cui and Hannig. Thediscussion consists of comments on the results, but also indicates it’s importancemore generally in the context of ﬁducial inference. A two page introduction toﬁducial inference is given to provide a context.

Keywords:

Foundations and philosophical topics (62A01); Bayesian; Fiducial; Fre-quentist

We expect that many readers are not familiar with ﬁducial inference. This is in contrastto the well founded alternatives given by Bayesian and classical inference known to everystatistician today. Fiducial inference has not yet been established as a general theory, butthere has been considerable progress on this during the last decades, as also demonstratedby Cui and Hannig (2019). To discuss their contribution we need to provide a contextgiven by ﬁducial inference as we see it today.The original ﬁducial argument of Fisher (1930, p.532) starts by considering the relation u = F ( x ) (1)where F is the cumulative distribution function for the observation x . Fisher considersin particular the case where x is the empirical correlation of a sample of size n from the1ivariate Gaussian distribution. In this case F is strictly decreasing from 1 down to 0as a function of the unknown correlation θ . From this, Fisher argues that 1 − F ( x | θ )is the cumulative ﬁducial distribution for θ , and that π x ( θ ) = − ∂ θ F ( x | θ ) is the ﬁducialdensity of θ given x .Fisher’s argument uses the fact that equation (1) gives a correspondence between auniform law for u and the sampling law for x . The argument explains, in fact, thatthe percentiles of the ﬁducial distribution give conﬁdence intervals, and hence that theﬁducial distribution is a conﬁdence distribution in this case. Even though Fisher himselfabandoned this interpretation in later works, it must be seen as one of the pioneeringworks that lead to the theory of conﬁdence intervals and hypothesis testing as usedtoday. It is, as far as we know, the ﬁrst paper that calculates exact conﬁdence intervalsand explain them as such.Fiducial inference, in the version considered here, is given by replacing the relation (1)by a ﬁducial model x = θu (2)This economic notation is used by Dawid and Stone (1982, p.1055) when they deﬁne a functional model . It is a generalization of the structural models of Fraser (1968) whoconsiders the case where the model space Ω Θ is a group, and θu is the action of θ on u .Cui and Hannig (2019, eq.1) refer to equation (2) as a data generating equation . Samplesfrom a known distribution for u gives samples from the distribution of the observation x .In modern statistics, the possibility of simulating data from a statistical model is mostcentral, and any such algorithm is in fact a ﬁducial model.Equation (1) can be inverted to give x = θu = F − ( u ), where F depends on θ .Fisher’s initial model is hence a special case of a ﬁducial model. Consider for a momentthe following problem: The observation x is given and known to be generated from the ﬁducial model (2)by sampling u from a known distribution. How would You quantify Your un-certainty about the unknown model parameter θ ? It is clear that both u and θ are still uncertain, and it is reasonable, we claim, to quantifythese uncertainties by a joint distribution for ( u, θ ) such that equation (2) holds. Deﬁne θ = xu − to be a measurable selection solution of equation (2) for those ( x, u ) that allowsa solution. Assume, as we will exemplify below, that there exists a ﬁducial distributionfor u x derived from the original distribution of u and the observation x . A ﬁducialdistribution for the model θ can then be deﬁned to be the distribution of θ x = x ( u x ) − (3)The ﬁducial distribution quantiﬁes the uncertainty of θ given the assumed ﬁducial modeland given the observation x . This interpretation of the ﬁducial is what Fisher (1973,p.54-55) aimed at in his ﬁnal writing on this: By contrast, the ﬁducial argument uses the observations only to change thelogical status of the parameter from one in which nothing is known of it, and no robability statement about it can be made, to the status of a random variablehaving a well-deﬁned distribution. The correlation coeﬃcient example treated initially by Fisher is such that the ﬁducialequation (2) deﬁnes a one-one correspondence between any two variables when the thirdis ﬁxed. In this case, a simple ﬁducial model, the distribution of u x can be set equalto the original distribution of u . Fiducial samples are obtained simply by solving theﬁducial equation for each sample u and returning the solution θ x = xu − .Another example is given by x = θu = θ + u , where θ is an element of a subspace Ω Θ of a Hilbert space Ω X . An important class of problems is obtained by letting Ω Θ be theimage space of the design matrix in linear regression. In this case, the ﬁducial equationwill fail to have solutions for all ( x, u ). Let P be the orthogonal projection on Ω Θ , andlet Q = 1 − P . Deﬁne the law of u x to be the conditional law of u given Qu = Qx . Theﬁducial is then θ x = x [ u x ] − = x − u x .The previous example includes the general case of a location parameter, and in par-ticular inference based on sampling from the Gaussian distribution with unknown meanand known variance. As demonstrated by Fraser (1968), this can be seen as a particularcase of a group Ω Θ acting on the observation space Ω X , and cases with unknown variancecan also be included by considering other group actions. It follows in these cases, as alsofor the simple ﬁducial models, that the ﬁducial is a conﬁdence distribution. Furthermore,Taraldsen and Lindqvist (2013) have proved that classical optimal actions, if they exist,are determined by the ﬁducial if the loss is invariant. Incidentally, the previous alsoexemplify a nonparametric ﬁducial in the sense given by an inﬁnite dimensional Ω Θ .The previous indicate that a ﬁducial model (2) can be used to obtain a distributionwith interpretation similar to a Bayesian posterior as intended originally by Fisher. Italso show that conﬁdence distributions and classical optimal actions can be obtained byﬁducial arguments. Finally, a ﬁducial model (2) can also be used as a method for samplingfrom a Bayesian posterior. In a Bayesian set-up the joint distribution of ( u, θ ) is speciﬁed,and the distribution of u used above must be identiﬁed with the conditional distributionof u given θ . Sampling from the posterior can be done by sampling u conditionally given x and then θ given ( u, x ). In the case of group actions with prior equal to the rightinvariant prior this gives that the posterior coincides with the ﬁducial. Cui and Hannig (2019) consider failure distributions based on right censored data in anonparametric case. For simplicity, and since we will focus on the theoretical principles,we will focus on the uncensored case. Before leaving the censored case we will emphasizeits importance in applications, and add, as we see it, that the ﬁducial model for this caseis most natural. The ease of including this in the analysis is by itself a most convincingargument for the success of ﬁducial inference as demonstrated by Cui and Hannig (2019).The obvious choice, in retrospect, is to base nonparametric ﬁducial inference onFisher’s original ﬁducial relation in equation (1). The data is given by an ordered sam-ple x that obeys the ﬁducial relation u i = F ( x i ), or equivalently the ﬁducial model3 i = F − ( u i ). Here u ≤ · · · ≤ u n is the order statistic of a random sample from theuniform distribution on [0 , F is given by a measurable selection solution of this ﬁducial relation.We can and will restrict attention to the case where it is assumed that F is absolutelycontinuous in accordance with Cui and Hannig (2019, Assumption 2). In this case itfollows hence that the ﬁducial distribution for u x equals the original distribution for u asin Fishers original ﬁducial argument for the correlation coeﬃcient. In contrast to Fishersoriginal argument there is here an inﬁnity of possible randomized measurable selectionsolutions. It can, additionally, be observed that the given ﬁducial model is equivalentwith a group model x = θv : Ω Θ is the group of increasing and diﬀerentiable transfor-mations θ of the positive real line and v ≤ · · · ≤ v n is the order statistic of a randomsample from the standard exponential distribution.A particular absolutely continuous ﬁducial F I is determined by log-linear interpo-lation as described by Cui and Hannig (2019). This gives ﬁducial distributions for anyparameters of interest, and in particular for F ( x ) for a ﬁxed x and the percentiles x α for a ﬁxed α . The case with k samples can be treated similarly by the joint ﬁducial for F , . . . , F k . It is straightforward, in principle, to calculate corresponding ﬁducial intervalsor regions and corresponding ﬁducial p -values. This is exempliﬁed by Cui and Hannig(2019) by a series of examples for k = 1 ,

2, and good frequentist properties are demon-strated as compared with existing methodology. The group model structure opens thequestion: Is optimal equivariant inference possible?The demonstrations, and the previous two paragraphs, constitute, in our opinion, themain message of the paper. Many more examples can, and should, be published based onconcrete applied problems, and the indicated natural route for nonparametric inference.An alternative approach is to take your favorite book on nonparametric inference andimplement and experiment with corresponding ﬁducial solutions.Proofs of stated coverage in the ﬁnite sample case are absent, but for k > k = 1 case seems possible to analyse completely, and the methodol-ogy should then be compared with similar results for the uncensored case presented bySchweder and Hjort (2016, Chap.11). It should be noted that Schweder and Hjort (2016)only consider conﬁdence distributions for real valued parameters, and not for the unknown F itself. It is, in fact, unknown if the ﬁducial for F is a conﬁdence distribution in a strictsense. The group model structure gives a starting point for investigating this further.All of these questions are related to the choice of a measurable selection solution. Isthere a natural choice? Is there a best choice? This question should be investigated inconcrete data situations. It can be observed that the choice F I is quick and convenient,but each realization is so special that it is not realistic in most situations. An alternative,which is still quick and convenient, is given by monotonic spline interpolation. Theﬁducial distribution given by F I has defects when considered as a ﬁducial distribution for F , but the simulations demonstrate that resulting ﬁnite dimensional ﬁducials of certainfocus parameters have excellent properties.In summary, what is the possible role of the ﬁducial argument and distribution? The4ollowing Bayesian-Fiducial-Frequentist list give guidance: (B) Alternative algorithms for Bayesian analysis. (F)

A posterior ﬁducial state interpreted as Fisher intended. (F)

Alternative algorithms for frequentist analysis.All of this, seen in retrospect, is excellently presented and exempliﬁed by Fraser (1968)for classical linear models. We believe that Cui and Hannig (2019) have taken the ﬁrstimportant step for similar results in the nonparametric case. Their main technical resultproves that the nonparametric ﬁducial is asymptotically a conﬁdence distribution.

We take the opportunity of expressing our thanks for the invitation to comment on theinteresting and thought-provoking paper by Cui and Hannig (2019). This paper will serveas motivation for further developments of the theory of ﬁducial inference as initiated byFisher in the

Inverse probability paper from 1930. The importance of the 1930 paper byFisher, lies, according to Fisher (1950), in retrospect, in setting forth a new mode of rea-soning from observations to their hypothetical causes. We congratulate Cui and Hannigwith a successful demonstration of a ﬁducial argument in a nonparametric problem. Inconclusion, we can wholeheartedly and repeatedly agree with Efron (1998, p.107):

This is all quite speculative, but here is a safe prediction for the 21st century:statisticians will be asked to solve bigger and more complicated problems. Ibelieve that there is a good chance that objective Bayes methods will be de-veloped for such problems, and that something like ﬁducial inference will playan important role in this development. Maybe Fisher’s biggest blunder willbecome a big hit in the 21st century!

Additionally, we believe that the addition of nonparametric ﬁducial inference, as intro-duced by Cui and Hannig (2019), will play an important part of this adventure.

References

Cui, Y. and J. Hannig (2019). Nonparametric generalized ﬁducial inference for survivalfunctions under censoring.

Biometrika (to appear) .Dawid, A. P. and M. Stone (1982). The functional-model basis of ﬁducial inference (withdiscussion).

The Annals of Statistics 10 (4), 1054–1074.Efron, B. (1998). R. A. Fisher in the 21st century (with discussion).

Statist. Sci. 13 ,95–122.Fisher, R. (1950).

Contributions to Mathematical Statistics . London: Chapman and Hall.5isher, R. A. (1930). Inverse probability.

Proc. Camb. Phil. Soc. 26 , 528–535.Fisher, R. A. (1973).

Statistical methods and scientiﬁc inference . Hafner press.Fraser, D. A. S. (1968).

The structure of inference . John Wiley.Schweder, T. and N. L. Hjort (2016).

Conﬁdence, Likelihood, Probability: StatisticalInference with Conﬁdence Distributions.

Cambridge University Press.Taraldsen, G. and B. H. Lindqvist (2013). Fiducial theory and optimal inference.