UltraNest -- a robust, general purpose Bayesian inference engine
Johannes Buchner
1 Max Planck Institute for Extraterrestrial Physics, Giessenbachstrasse, 85741 Garching, Germany. 2 Millennium Institute of Astrophysics, Vicuña MacKenna 4860, 7820436 Macul, Santiago, Chile. 3 Pontificia Universidad Católica de Chile, Instituto de Astrofísica, Casilla 306, Santiago 22, Chile. 4 Excellence Cluster Universe, Boltzmannstr. 2, D-85748 Garching, Germany
Authors of papers retain copyright and release the work under a Creative Commons Attribution 4.0 International License (CC BY 4.0).
Summary
UltraNest is a general-purpose Bayesian inference package for parameter estimation and model comparison. It allows fitting arbitrary models specified as likelihood functions written in Python, C, C++, Fortran, Julia or R. With a focus on correctness and speed (in that order), UltraNest is especially useful for multi-modal or non-Gaussian parameter spaces, computationally expensive models, and robust pipelines. Parallelisation to computing clusters and resuming incomplete runs are supported.
Statement of need
When scientific models are compared to data, two tasks are important: 1) constraining the model parameters, and 2) comparing the model to other models. While several open-source Bayesian model fitting packages are available that can easily be tied to existing models, it is difficult to run them such that the result is reliable and user interaction is minimized. A chicken-and-egg problem is that one does not know a priori the posterior distribution of a given likelihood, prior and data set, and therefore cannot choose a sampler that performs well. For example, Markov Chain Monte Carlo convergence checks may suggest good results, while in fact another distant but important posterior peak has remained unseen. Current and upcoming large astronomical surveys require characterising a large number of highly diverse objects, which requires reliable analysis pipelines. This is what UltraNest was developed for.
Overview
Nested sampling (Skilling 2004) allows Bayesian inference on arbitrary user-defined likelihoods. In addition to computing parameter posterior samples, it also estimates the marginal likelihood ("evidence", Z). Bayes factors between two competing models, B = Z1/Z2, are a measure of the relative prediction parsimony of the models, and form the basis of Bayesian model comparison. By performing a global scan of the parameter space from the worst to the best fits to the data, nested sampling also performs well in multi-modal settings.

In the last decade, several variants of nested sampling have been developed. These differ in (1) how better and better fits are found while respecting the priors, (2) whether it is allowed to go back to worse fits and explore the parameter space more, and (3) diagnostics through tests and visualisations. UltraNest develops novel, state-of-the-art techniques for all of the above. They are especially remarkable for being free of tuning parameters and theoretically justified.

Currently available efficient nested sampling implementations such as MultiNest (Feroz, Hobson, and Bridges 2009) and its open-source implementations (e.g., nestle, dynesty) rely on the heuristic multi-ellipsoidal rejection algorithm, which has shown biases when the likelihood contours are not ellipsoidal (Buchner 2014; Nelson et al. 2020). UltraNest instead implements better motivated, self-diagnosing algorithms, and improved, conservative uncertainty propagation. In other words, UltraNest prioritizes robustness and correctness, and maximizes speed second.
UltraNest was developed for potentially complex posteriors where the user is willing to invest computation to obtain a gold-standard exploration of the entire posterior distribution in one run. The package provides feature parity with other packages (such as MultiNest), e.g., circular parameters, resuming incomplete runs, and efficient multi-core computation on clusters, and provides additional convenience features for visualisation and diagnostics.
Figure 1:
Method
Nested sampling methods are systematically reviewed in Buchner et al., submitted, highlighting also the approaches used in UltraNest.

The basic outline of vanilla nested sampling (see Skilling 2004 for details) is as follows: A set of N live points is drawn from the prior. The unit hypercube is used as a natural space (u-space), and inverse cumulative prior transforms convert the point coordinates to physical parameter units (v-space). The likelihood L is then evaluated. Nested sampling then repeatedly identifies the current worst fit and replaces it with a better fit, through a likelihood-constrained prior sampling (LRPS) procedure. At each iteration (represented by the removed, dead point), the prior space investigated shrinks by approximately V_{i+1}/V_i = (N − 1)/N, starting from the entire prior volume V_0 = 1. Through a Lebesgue integration, the dead point becomes a posterior sample with weight w_i = L_i × V_i, and we can estimate Z_i = Σ_{j=1}^{i} w_j, yielding the posterior distribution and evidence. The iteration procedure can terminate when the weight of the live points, e.g., estimated as w_live = V_{i+1} × max_{j=1,…,N} L_{live,j}, is small (w_live ≪ Z_i).

Reactive Nested Sampling

Instead of iterating with a fixed array of live points, UltraNest uses a tree. The root of the tree represents the entire prior volume, and its child nodes are samples from the entire prior. A breadth-first search is run, which keeps a stack of the opened nodes sorted by likelihood. The volume shrinkage is the ratio of the size of the stack after removing the currently visited node to the size of the stack with the children of the currently visited node added. When encountering a node, attaching a child to it is decided by several criteria.
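The vanilla scheme above can be sketched in a few lines of Python. This is a toy illustration of the iteration, weighting, and termination steps, not UltraNest's implementation; the Gaussian `log_likelihood` is a made-up placeholder, and the LRPS step is naive rejection from the full prior:

```python
import numpy as np

rng = np.random.default_rng(42)

def log_likelihood(theta):
    # toy 2-d Gaussian likelihood centred at 0.5 (placeholder model)
    return -0.5 * np.sum((theta - 0.5) ** 2) / 0.04

N, ndim = 50, 2
live_u = rng.uniform(size=(N, ndim))       # live points in u-space
live_logl = np.array([log_likelihood(u) for u in live_u])

V = 1.0          # current prior volume estimate
Z = 0.0          # evidence accumulator
posterior = []   # (sample, weight) pairs

for i in range(2000):
    worst = int(np.argmin(live_logl))
    Lmin = live_logl[worst]
    V_next = V * (N - 1) / N           # expected shrinkage per iteration
    w = np.exp(Lmin) * (V - V_next)    # dead-point weight
    Z += w
    posterior.append((live_u[worst].copy(), w))
    # LRPS step: draw a new prior sample with L > Lmin
    # (naive rejection from the whole prior; real samplers use regions)
    while True:
        u = rng.uniform(size=ndim)
        logl = log_likelihood(u)
        if logl > Lmin:
            break
    live_u[worst], live_logl[worst] = u, logl
    V = V_next
    # terminate when the live points' remaining weight is negligible
    if np.exp(live_logl.max()) * V < 1e-3 * Z:
        break

print("ln Z estimate:", np.log(Z))
```

The naive rejection step becomes exponentially slower as the contour shrinks, which is precisely why the region and step sampling methods described below exist.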
These include the termination criterion, the minimum number of live points, whether the maximum number of model evaluations has already been reached, and whether more points are desired because of the number of identified clusters or because more posterior resolution is wanted in a particular likelihood interval.

The tree formulation easily allows resuming a run later. This includes attaching more live points where desired, e.g., to improve the number of effective samples or the Z uncertainty. For this, the uncertainty contribution is measured for each dead point. The dead points are then randomly sampled proportional to their contribution. Then, a child is added to the parents of the selected nodes. This decreases the weight of the selected node, as its volume is reduced.

Reactive Nested Sampling is a flexible generalisation of Dynamic Nested Sampling (Higson et al. 2017), which used a simple heuristic for identifying where to add more live points. The tree formulation of Reactive Nested Sampling makes implementing error propagation and a variable number of live points straightforward.
Integration procedure
UltraNest tries hard to be conservative and provides uncertainties on Z and the posterior weights. Instead of a single Reactive Nested Sampling explorer, it employs several, which are randomly blinded to some parts of the tree. In particular, they see a bootstrapped subsample of the root edges (called roots in the code).

Additionally, instead of static volume shrinkage, UltraNest incorporates the scatter in volume shrinkages by drawing samples from a Beta(1, N) distribution (Skilling 2004). Each explorer provides for each sample a weight estimate (0 if it is blind to it); together these provide an uncertainty on the posterior distribution. Each explorer also provides a Z estimate; the dispersion quantifies the Z uncertainty.

The bootstrapped integrators are an evolution over single-bulk evidence uncertainty measures, and include the scatter in volume estimates (by beta sampling) and in likelihood values (by bootstrapping).

LRPS procedures in UltraNest
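The beta-sampled shrinkage described under Integration procedure above can be illustrated with a toy sketch (not UltraNest's internals): for a fixed, made-up dead-point likelihood sequence, repeating the integration with random shrinkage factors yields a spread of Z estimates, whose dispersion quantifies the volume-induced uncertainty. Note that the largest of N uniform variates follows Beta(N, 1) in numpy's parameter convention:

```python
import numpy as np

rng = np.random.default_rng(1)

def evidence_realisation(loglikes, n_live, rng):
    """One realisation of Z for a fixed dead-point likelihood sequence,
    drawing per-iteration shrinkage factors t_i from a beta distribution
    (the largest of n_live uniforms is Beta(n_live, 1) here)."""
    t = rng.beta(n_live, 1, size=len(loglikes))
    V = np.concatenate([[1.0], np.cumprod(t)])
    dV = V[:-1] - V[1:]                # volume of each shell
    return np.sum(np.exp(loglikes) * dV)

n_live = 100
loglikes = np.linspace(-10.0, 0.0, 600)   # made-up monotone sequence
Z_samples = [evidence_realisation(loglikes, n_live, rng)
             for _ in range(100)]
print("Z = %.4f +- %.4f" % (np.mean(Z_samples), np.std(Z_samples)))
```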
The live points all fulfill the current likelihood threshold; therefore, they can be used to trace out the neighbourhood where a new, independent prior sample can be generated that also fulfills the threshold. Region-based sampling performs rejection sampling from constructed geometries.
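As a toy sketch of this region-based rejection sampling (the sphere radius, likelihood, and overlap correction are illustrative, not UltraNest's code): propose uniformly from a hypersphere around a random live point, accept with probability 1/k where k spheres contain the proposal (so the union is sampled uniformly), then apply the likelihood threshold:

```python
import numpy as np

rng = np.random.default_rng(5)

def sample_from_region(live_u, radius, Lmin, log_likelihood, rng=rng):
    """Draw a new point uniformly from the union of spheres around the
    live points, subject to the likelihood threshold (toy version)."""
    n, ndim = live_u.shape
    while True:
        center = live_u[rng.integers(n)]
        # uniform draw inside one sphere: random direction, scaled radius
        direction = rng.normal(size=ndim)
        direction /= np.linalg.norm(direction)
        u = center + direction * radius * rng.uniform() ** (1.0 / ndim)
        # count how many live-point spheres contain this proposal
        k = np.sum(np.linalg.norm(live_u - u, axis=1) <= radius)
        if rng.uniform() > 1.0 / k:
            continue                    # correct for sphere overlaps
        if log_likelihood(u) > Lmin:
            return u                    # fulfills the threshold

def log_likelihood(u):
    # placeholder Gaussian likelihood for the demonstration
    return -0.5 * np.sum((u - 0.5) ** 2) / 0.01

live_u = rng.normal(0.5, 0.1, size=(50, 2))
u_new = sample_from_region(live_u, 0.1, -5.0, log_likelihood)
print(u_new)
```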
Region construction
Mukherjee, Parkinson, and Liddle (2006) proposed heuristically fitting an ellipsoid to the live points to achieve this, which is expanded by an empirically determined factor. Shaw, Bridges, and Hobson (2007) extended this with multiple ellipsoids constructed via a recursive splitting heuristic, which are then again expanded by a factor. This works well for many problems. However, these heuristics are not well justified, and do not detect when
they work poorly. Buchner (2014) explored hyper-rectangle posteriors and a heavy-tailed distribution which showcase that the ellipsoids cut off too much of the prior space.

Buchner (2014) proposed a more robust algorithm, RadFriends. RadFriends creates a hyper-sphere around each live point. The radius of the spheres is determined by cross-validation: some live points are randomly left out following a bootstrapping procedure, and based on spheres around the remainder, the radius is chosen so that the former are recovered. This is repeated over many rounds, and the largest radius kept. In this fashion, some guarantees for the robustness of the region are available. An animation of this procedure is available. As a side-effect, RadFriends automatically provides parameter-free unsupervised clustering, which relies only on the uniformity assumption of the u-space.

MLFriends (Buchner 2019) makes several efficiency improvements to RadFriends: Firstly, when a secondary likelihood peak is dying out, RadFriends tends to drastically increase its radius to be able to recapture it from the other peak(s). This is avoided by not allowing the radius to increase. Secondly, instead of spheres, MLFriends learns a metric (hence the name), initially by taking the sample variance of the live points. In later iterations, the identified clusters are overlaid by subtracting their mean, and the sample variance of the co-centered points is taken as the metric. This helps identify the shape within clusters and discards more space between the clusters.

The bootstrapping approach can also be applied to the single ellipsoid method: the sample covariance of the selected live points identifies the shape of the ellipse, while the left-out live points identify the scale. A single ellipsoid performs best in ellipsoidal posteriors (such as Gaussians).
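The bootstrapped radius determination can be sketched as follows (a simplified illustration of the RadFriends idea, not the package's implementation; the number of bootstrap rounds is an arbitrary choice here):

```python
import numpy as np

rng = np.random.default_rng(0)

def radfriends_radius(live_u, n_bootstrap=30, rng=rng):
    """Choose a hypersphere radius by bootstrapping: leave some live
    points out, then pick the radius just large enough that every
    left-out point lies inside a sphere around a kept point. The
    largest such radius over all rounds is returned."""
    n = len(live_u)
    radius = 0.0
    for _ in range(n_bootstrap):
        idx = rng.integers(n, size=n)              # bootstrap resample
        kept = np.unique(idx)
        left_out = np.setdiff1d(np.arange(n), kept)
        if len(left_out) == 0:
            continue
        # distance of each left-out point to its nearest kept point
        d = np.linalg.norm(
            live_u[left_out, None, :] - live_u[None, kept, :], axis=2)
        radius = max(radius, d.min(axis=1).max())
    return radius

live_u = rng.uniform(size=(100, 2))
r = radfriends_radius(live_u)
print("radius:", r)
```

Spheres of this radius around the live points then form the sampling region, and points falling in no sphere are rejected without a likelihood evaluation.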
MLFriends performs best in complex posterior shapes with low dimension. UltraNest therefore combines multiple region constructions, and uses their intersection: (1) MLFriends, (2) a bootstrapped single ellipsoid in u-space, (3) a bootstrapped single ellipsoid in v-space. The last one drastically helps when one parameter constraint scales with another (e.g., funnel shapes). By altering how the parameter is applied in the likelihood function and how its prior is transformed, the user can, without altering the posterior, help produce ellipsoidal shapes. The better parameterization narrows the sampling region, and leads to efficiency gains.

Region sampling
Samples are then drawn either from the entire prior, the single u-space ellipsoid, or MLFriends ellipsoids (accounting for overlaps), and filtered by the other constraints (including the transformed v-space ellipsoid). UltraNest dynamically switches away from slow methods (e.g., sampling from the unit hypercube) when they become inefficient.

Once a region proposal is chosen, the likelihood function is evaluated, and proposals below the current threshold are rejected. The proposal process is repeated until success. Instead of handling one proposal point at a time, many points are proposed and filtered at once (vectorization). Because Python function calls can be costly, this speeds up UltraNest, e.g., in comparison to dynesty.

Step sampling
Besides rejection sampling, UltraNest supports several types of Monte Carlo random walks. These are more efficient in some problems, in particular in high dimensions (d > …):
• Slice sampling (similar to Polychord, Handley, Hobson, and Lasenby 2015)
• Hit-and-run sampling
• Constrained Hamiltonian Monte Carlo with No-U-Turn sampling (similar to NoGUTS, Griffiths and Wales 2019)
The user can choose whether to filter proposals with the constructed regions. This has some function call overhead, but can reduce likelihood evaluations, which is especially important for slow models. Variations that alter some parameters more often than others (fast-slow) are also implemented.

Diagnostics & Visualisations
Run-time visualisation
During the run, UltraNest by default shows a visualisation of where the current live points are distributed. Thus, already during a run, the user can identify where the fit is spending its time, and choose to abort the fit and alter the model. In Jupyter notebooks, a visualisation widget is shown; when run in the terminal, it is shown in the standard output. Additionally, log output gives information on the progress. Both can be turned off.
Posterior visualisations
Publication-ready corner/pairs plots and trace plots (Higson et al. 2019) are created, based on code from corner (Foreman-Mackey 2016) and dynesty (Speagle 2020). The weighted posterior samples (or a resampled set with uniform weighting) can, however, also be plotted with any other tool.
Diagnostic tests
Fowlie, Handley, and Su (2020) proposed a test for identifying LRPS biases during a run, by checking whether new children are inserted into the sorted stack at uniformly distributed positions (insertion order or rank). Buchner et al., in prep., presents a similar test based on rank statistics, which is slightly more sensitive and easier to implement. UltraNest prints the p-value of this test during the run. More experience is needed to identify reasonable rolling windows and thresholds for applying this test.
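The idea behind such rank-uniformity tests can be sketched with simulated insertion ranks (a toy illustration using a Kolmogorov-Smirnov test, assuming scipy is available; UltraNest's actual statistic differs):

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(11)

n_live, n_iter = 400, 1000

# Under a correct LRPS procedure, the rank at which each new point is
# inserted into the likelihood-sorted live-point stack is uniform.
good_ranks = rng.integers(0, n_live, size=n_iter)

# A simulated biased sampler that proposes too close to the current
# likelihood threshold only ever inserts at low ranks.
bad_ranks = rng.integers(0, n_live // 2, size=n_iter)

def rank_pvalue(ranks, n_live):
    """Kolmogorov-Smirnov test of insertion ranks against uniformity."""
    return stats.kstest((ranks + 0.5) / n_live, "uniform").pvalue

print("p-value, correct sampler:", rank_pvalue(good_ranks, n_live))
print("p-value, biased sampler:", rank_pvalue(bad_ranks, n_live))
```

A small p-value for the biased sampler flags that new points are not being drawn uniformly from the constrained prior.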
Features
Parallelisation
UltraNest supports parallelisation from laptops to computing clusters with the Message Passing Interface (MPI). Programs using UltraNest can be run with MPI without modification, if the mpi4py Python library is installed. The region construction, step sampling, and multi-explorer approaches are all parallelised across cores, and are kept on the same cores for efficiency.
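For example, a script using UltraNest (here a hypothetical my_analysis.py) can be launched on several cores via mpiexec, assuming mpi4py is installed:

```shell
# run the same script on 4 MPI ranks; work is split across them
mpiexec -np 4 python3 my_analysis.py
```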
Resuming
UltraNest can optionally write to a folder, where it stores summary statistics of the posterior and evidence, posterior chains, already sampled points, diagnostic log files, and visualisations. Resuming a run, also with different algorithm parameters, is supported. Previously sampled points are stored in compressed HDF5, which saves disk space and avoids file corruption. It also avoids two processes trying to operate on the same file at once.
Languages

In the GitHub repository, wrappers are provided for models (prior transforms and likelihood functions) written in:
• Python
• C
• C++
• Fortran
• Julia
• R
• Javascript

For Julia, the dedicated wrapper package UltraNest.jl is available (Schulz and Buchner 2020). For Javascript, a simpler implementation of MLFriends is available (Buchner 2018), which allows building interactive Bayesian inference websites.
Documentation
Extensive documentation is available. This includes several practical tutorials highlighting good practices of data analysis in astro-statistics, including:
• Fitting linear or power-law correlations
• Inferring parameter distributions from many uncertain parameter measurements (hierarchical Bayesian models)
• Different types of model comparison
• Prior and posterior predictive checks
• Uncertainty quantification
Basic Example
Finally, a simple example of running UltraNest is given in the following Python snippet:

import ultranest

sampler = ultranest.ReactiveNestedSampler(param_names, my_likelihood, my_prior_transform)
results = sampler.run()
print('evidence estimate:', results['logz'])
print('first parameter posterior mean:', results['samples'][:,0].mean())
Here, param_names is a list of strings, my_prior_transform is a function mapping a sample from the unit cube to physical parameters, and my_likelihood is the likelihood function receiving such parameters and returning their likelihood value. In case an output folder is used and resumed from:

import ultranest

sampler = ultranest.ReactiveNestedSampler(param_names, my_likelihood, my_prior_transform,
    log_dir="myanalysis", resume=True)
results = sampler.run()
sampler.print_results()
sampler.plot()
More advanced uses are showcased in the documentation, including speed-ups, parallelisation, and step samplers.
Acknowledgements

I am very thankful to Fred Beaujean, Josh Speagle and J. Michael Burgess for insightful conversations.
References
Buchner, Johannes. 2014. "A statistical test for Nested Sampling algorithms." Statistics and Computing, July, 1–10. https://doi.org/10.1007/s11222-014-9512-y.

———. 2018. "Ultranest-Js." GitHub repository. https://github.com/JohannesBuchner/ultranest-js.

———. 2019. "Collaborative Nested Sampling: Big Data versus Complex Physical Models." Publications of the Astronomical Society of the Pacific 131 (1004): 108005. https://doi.org/10.1088/1538-3873/aae7fc.

Feroz, F., M. P. Hobson, and M. Bridges. 2009. "MULTINEST: an efficient and robust Bayesian inference tool for cosmology and particle physics." Monthly Notices of the Royal Astronomical Society 398 (October): 1601–14. https://doi.org/10.1111/j.1365-2966.2009.14548.x.

Foreman-Mackey, Daniel. 2016. "corner.py: Scatterplot Matrices in Python." The Journal of Open Source Software.

Fowlie, Andrew, Will Handley, and Liangliang Su. 2020. "Nested Sampling Cross-Checks Using Order Statistics." arXiv E-Prints, June, arXiv:2006.03371. http://arxiv.org/abs/2006.03371.

Griffiths, Matthew, and David J. Wales. 2019. "Nested Basin-Sampling." Journal of Chemical Theory and Computation.

Higson, Edward, Will Handley, Mike Hobson, and Anthony Lasenby. 2017. "Dynamic Nested Sampling: An Improved Algorithm for Parameter Estimation and Evidence Calculation." arXiv E-Prints, April. http://arxiv.org/abs/1704.03459.

Mukherjee, P., D. Parkinson, and A. R. Liddle. 2006. "A Nested Sampling Algorithm for Cosmological Model Selection." The Astrophysical Journal 638 (February): L51–L54. https://doi.org/10.1086/501068.

Nelson, Benjamin E., Eric B. Ford, Johannes Buchner, Ryan Cloutier, Rodrigo F. Díaz, João P. Faria, Nathan C. Hara, Vinesh M. Rajpaul, and Surangkhana Rukdee. 2020. "Quantifying the Bayesian Evidence for a Planet in Radial Velocity Data." The Astronomical Journal 159 (2): 73. https://doi.org/10.3847/1538-3881/ab5190.

Schulz, Oliver, and Johannes Buchner. 2020. "UltraNest.jl." GitHub repository. https://github.com/bat/UltraNest.jl.

Shaw, J. R., M. Bridges, and M. P. Hobson. 2007. "Efficient Bayesian inference for multimodal problems in cosmology." Monthly Notices of the Royal Astronomical Society 378 (July): 1365–70. https://doi.org/10.1111/j.1365-2966.2007.11871.x.

Skilling, J. 2004. "Nested Sampling." Edited by Rainer Fischer, Roland Preuss, and Udo von Toussaint. AIP Conference Proceedings 735 (1): 395. https://doi.org/10.1063/1.1835238.