[PDF] MRIReco.jl: An MRI Reconstruction Framework written in Julia

Abstract

Purpose: The aim of this work is to develop a high-performance, flexible and easy-to-use MRI reconstruction framework using the scientific programming language Julia. Methods: Julia is a modern, general purpose programming language with strong features in the area of signal / image processing and numerical computing. It has a high-level syntax but still generates efficient machine code that is usually as fast as comparable C/C++ applications. In addition to the language features itself, Julia has a sophisticated package management system that makes proper modularization of functionality across different packages feasible. Our developed MRI reconstruction framework MRIReco.jl can therefore reuse existing functionality from other Julia packages and concentrate on the MRI-related parts. This includes common imaging operators and support for MRI raw data formats. Results: MRIReco.jl is a simple to use framework with a high degree of accessibility. While providing a simple-to-use interface, many of its components can easily be extended and customized. The performance of MRIReco.jl is compared to the Berkeley Advanced Reconstruction Toolbox (BART) and we show that the Julia framework achieves comparable reconstruction speed as the popular C/C++ library. Conclusion: Modern programming languages can bridge the gap between high performance and accessible implementations. MRIReco.jl leverages this fact and contributes a promising environment for future algorithmic development in MRI reconstruction.

Full PDF

aa r X i v : . [ phy s i c s . m e d - ph ] J a n ORIGINAL ARTICLEJournal Section

MRIReco.jl: An MRI Reconstruction Frameworkwritten in Julia

Tobias Knopp | Mirco Grosser Section for Biomedical Imaging, UniversityMedical Center Hamburg-Eppendorf,Hamburg, Germany Institute for Biomedical Imaging, HamburgUniversity of Technology, Hamburg,Germany

Correspondence

Tobias Knopp, Lottestrasse 55, 22529Hamburg, GermanyEmail: [email protected]

Funding information - Word count

Purpose:

The aim of this work is to develop a high-perfor-mance, ﬂexible and easy-to-use MRI reconstruction frame-work using the scientiﬁc programming language Julia.

Methods:

Julia is a modern, general purpose programminglanguage with strong features in the area of signal / imageprocessing and numerical computing. It has a high-level syn-tax but still generates eﬃcient machine code that is usually asfast as comparable C/C++ applications. In addition to the lan-guage features itself, Julia has a sophisticated package man-agement system that makes proper modularization of func-tionality across diﬀerent packages feasible. Our developedMRI reconstruction framework MRIReco.jl can therefore reuseexisting functionality from other Julia packages and concen-trate on the MRI-related parts. This includes common imag-ing operators and support for MRI raw data formats.

Results:

MRIReco.jl is a simple to use framework with a highdegree of accessibility. While providing a simple-to-use in-terface, many of its components can easily be extended andcustomized. The performance of MRIReco.jl is compared tothe Berkeley Advanced Reconstruction Toolbox (BART) andwe show that the Julia framework achieves comparable re-construction speed as the popular C/C++ library.

Conclusion:

Modern programming languages can bridge thegap between high performance and accessible implementa-tions. MRIReco.jl leverages this fact and contributes a promis-ing environment for future algorithmic development in MRI reconstruction.

KEYWORDS magnetic resonance imaging, image reconstruction, open source,Julia, numerical computing | INTRODUCTION

Magnetic resonance imaging (MRI) is a radiation-free tomographic imaging modality allowing for both high spatialresolution and high soft-tissue contrast. This, in combination with the large number of available contrasts, makes it anindispensable tool for many clinical imaging applications. In recent times, acquisition times and spatial resolution havebeen pushed further through the introduction of new, advanced signal processing techniques such as compressedsensing (CS) [1, 2] or structured matrix completion [3, 4, 5]. In a similar way, algorithmic developments have stimu-lated the development of new techniques for the quantitative imaging of tissue parameters, such as relaxation times,magnetic susceptibility or apparent diﬀusion coeﬃcients [6, 7, 8, 9]. An important catalyst for this development is theavailability of open source software. The latter facilitates the development of new methods while taking away theneed of a full, self-implemented MRI signal processing pipeline.An important aspect to keep in mind is that improvements in image quality and / or reduction in scan time areoften only possible by computationally intensive algorithms shifting the bottleneck in latency from acquisition toreconstruction. To reduce the reconstruction time to a manageable level massive parallelization including the usageof GPGPU (General Purpose Graphical Purpose Units) [10, 11] hardware acceleration are often necessary. While thisallows for acceleration of MRI reconstruction by a factor of 10-1000, it at the time increases the complexity of thesoftware system.Researchers developing new image reconstruction algorithms usually face two conﬂicting goals. On the one hand,one seeks a programming environment that allows for an easy translation of mathematical notation into programmingcode. On the other hand, the program should run as fast as possible. The ﬁrst goal can be accomplished by using a high-level (HL) programming environment. However, this is often accompanied by suboptimal computational performance.On the other hand, computation speed can be ensured by working in a low-level (LL) programming environment, albeitat the cost of an increased system complexity. For this reason, one can observe that HL languages, such as Matlaband Python/Numpy [12], are popularly used by mathematical/theoretical researchers. In contrast, the LL approach isoften used by the vendor of an imaging system, since the product needs to runs in a suﬃciently short reconstructiontime. In that case, the increase in system complexity is thus addressed by increasing the development resources.A popular approach to reconcile both goals is to use a mix of a HL and a LL programming environment. In thishybrid approach, only hot loops that take most of the time are implemented in the LL language, while the remainingpart is implemented in the HL programming environment. Popular examples are Matlab with its Mex interface, i.e. acombination of Matlab and C/C++ and Python with its diﬀerent ways to call C code (e.g. ctypes). Note that the hybridapproach is not restricted to user code but that entire frameworks are built in this way. Examples include Numpy [12],which is mainly written in C, and machine learning frameworks such as TensorFlow [13] and PyTorch [14], which areimplemented in C/C++ with Python language bindings.The hybrid approach has many advantages but it also has issues, which the authors in [15] summarize in the twolanguage problem :• For someone programming in the HL language it is often unclear what the LL implementation looks like. Thus, . Knopp et al. 3 the programming interface is often very high level, which is ﬁne in most situations but becomes problematic ifcustom algorithms need to be implemented.• Bridging from one language to the other often has a high computational cost and only amortizes for large problemsizes. This leads to the situation that code is often vectorized to minimize the number of calls into the LL language.Beyond that it should be noted that two-language solutions are more complex to set up since multiple compilers anda proper management of dependencies are needed. For researchers this has the consequence that using an existingframework is easy. On the other hand developing new methods/algorithms can be challenging as this often requiresprogramming skills in the LL language used.In the realm of MRI, the hybrid approach is very popular. Prominent examples are the packages Gadgetron [16]and BART [17], which both have many features and follow the two-language principle with a core written in C/C++and bindings to diﬀerent HL languages. In contrast, SigPy [18] is a HL package entirely written in Python and allowsfor rapid prototyping of MRI reconstruction algorithms.In this paper, we present an MRI reconstruction framework, which yields high performance while allowing for easyprototyping of new algorithms. At the same time the framework is implemented in a single programming language,thus avoiding the two-language problem. To achieve this, our framework is written in the programming language Julia,which was invented by researchers at the MIT in 2012 with the aim of solving aforementioned two-language problem[15]. The aim of this paper is two-fold. First, we describe our framework with its main components and we outlinehow it can be used for the development of new algorithms. Second, we perform comparisons in order to illustratethat the computation speed of our framework indeed matches that of popular frameworks such as BART. | METHODS

When developing an MRI reconstruction framework, it is beneﬁcial to formulate a set of design goals to guide thedesign. For instance, the developers of Gadgetron formulated the following list of properties, which an MRI recon-struction framework should fulﬁll: Free / open Modular Flexible Cross platform High performance Facilitate prototyping Facilitate deploymentFor a proper deﬁnition of each of the design goals we refer the reader to [16].

MRIReco.jl has the same design goalsalthough the deployment aspect is currently not yet addressed. Instead

MRIReco.jl has the following additional designgoals: Reuse of components : The goal is to use as many existing software components as possible if and only if they aresuitable for the task and appropriately maintained. For instance transformations like the FFT as well as iterativesolvers like the conjugated gradients method (CG) are independent of MRI and used in other domains. This design

T. Knopp et al. goal not only implies reusing existing software components but also to put non-MRI speciﬁc functionality intodedicated packages if such a package does not yet exist. Hackability : It should be possible for software developers to access low-level parts of the software stack anddevelop extensions or even make modiﬁcations to the source code easily. We note that hackability only slightlyoverlaps with prototyping mentioned in [16]. Hackability in particular means that the gap between being a userand being a developer of a software framework is small.

Accessibility : The software components should be easy to install with only few instructions. It should be simpleto access and modify the existing code.

Testing : The code should be properly tested using continuous integration services.The last two design goals are certainly also followed by Gadgetron and BART. We note, however, that the two-languageprinciple imposes limits on how well the ﬁrst two of the additional design goals can be met. For instance, we notethat hackability is diﬃcult to address in a two language solution, since code modiﬁcations often require knowledge ofboth the HL and the LL language used.To make design goal 8 more concrete we note that image reconstruction frameworks, even for diﬀerent imagingmodalities, often share a large number of generic building blocks. These shared components include common trans-formations such as the FFT, standard linear algebra tools, and optimization algorithms to solve the reconstructionproblem at hand. In order to share these building blocks it is advantageous to put them into dedicated software pack-ages. Moreover, this allows developers to reuse building blocks implemented by experts of the given ﬁeld at hand andit reduces potential errors that could arise when self-implementing all building blocks. Finally, we note that the reuseof standard libraries leads to a higher level of scrutiny for the latter. In turn, this decreases the likelihood of persistentbugs. | Julia

To meet the design goals outlined in the previous section, we focus on a new programming language / environmentcalled Julia. Julia was invented by researchers at the MIT in 2012 [15] and has the aim of solving the two-languageproblem discussed in Sec. 1. Its most important features include:• Julia uses a just-in-time (JIT) compiler based on the LLVM compiler infrastructure [19] and is therefore capableof generating eﬃcient machine code.• Julia has a sophisticated type system including type inference [20], which allows the compiler to identify staticallytyped code fragments and in turn emit eﬃcient machine code.• Julia code can be both HL and LL. While hot code paths should be kept type stable for maximizing performance,it is ﬁne to mix this with a dynamic programming style where convenient code is more important than high per-formance.• The syntax of Julia is very similar to Matlab and thus familiar to a wide range of researchers.• Julia can call C/C++ code with no overhead, it is thus possible to cross language boundaries even for scalarfunctions.For more details on Julia we refer the reader to Ref. [15].Julia was designed from the ground up to be capable of generating eﬃcient machine code. In combination withthe HL features of the language, this allows us to develop a hackable framework (design goal 9) while retaining a high . Knopp et al. 5 performance. In contrast, many dynamically typed languages like Python are challenging to JIT compile since theycontain language features preventing emitting eﬃcient machine code. Therefore, one of the most successful PythonJIT compilers Numba [21] uses an approach where the user needs to manually annotate the code to be compiled andonly a subset of the language can be used. Here, again, the key for reaching high performance is the ability of thesystem to determine the types of all variables in a function body. A more general JIT compiler for Python that doesnot need code annotations is PyPy [22].Another feature of Julia is its built-in package manager. The latter can take care of all the dependencies of acode package. As a consequence, Julia packages are considered to be cheap and it is possible to achieve a ﬁnegrained modularization across packages. This modularization is pushed by a large community of package developers,which enrich the power of the Julia programming environment substantially. By exploiting this aspect of the Juliaprogramming environment,

MRIReco.jl manages to fulﬁll design goal 8 (reuse of components). Beyond that the clearversioning performed by the julia package manager can greatly simplify reproducible research. This stands in contrastto other approaches, where it is in the hand of the user to download the correct versions of the dependencies at hand. | Functionality

MRIReco.jl oﬀers a wide range of functionality while concentrating mainly on basic building blocks that have provento be useful for MRI image reconstruction. In particular

MRIReco.jl currently oﬀers:•

Simulation:

Methods for simulating MRI data based on software phantoms. This includes support for taking intoaccount B inhomogeneity, R ∗ relaxation and multiple receive coils. Moreover, basic support for simulating multi-echo sequences is implemented. Simulation can be performed using a direct evaluation of the imaging equationas well as a faster method that applies common approximations such as the NFFT.• Imaging Operators:

Basic Cartesian and non-Cartesian imaging operators based on FFT and NFFT. Oﬀ-resonanceand relaxation term aware imaging operators are available as well. These can be evaluated using both data-driven[23] and analytical [24, 25] approximations. All operators can be combined to form encoding operators for moreextended acquisitions such as parallel imaging [26] and multi-echo data acquisition.•

Coil Estimation: MRIReco.jl implements methods for determining sensitivity maps including the sum-of-squaresmethod and ESPIRiT [27].•

Iterative Reconstruction:

Iterative reconstruction algorithms using solvers such as CGNR [28], FISTA [29], ADMM[30], and the split Bregman method [31] are available. For regularization,

MRIReco.jl oﬀers TV, ℓ , ℓ priors includ-ing wavelet sparsiﬁcation. Density compensation weighting is available as a method for speeding up convergenceof iterative solvers [32, 33]. Sampling density weights can be computed for arbitrary trajectories based on themethod described in [34]. MRIReco.jl is implemented in a Julia idiomatic way and uses multiple dispatch to execute diﬀerent code paths de-pending on the type of function arguments. We note that this is a similar form of polymorphism known in classicalobject-orientated languages but that the dispatch is more generic. On purpose we have not introduced a furtherabstraction mechanism like a pipeline architecture [16]. Such patterns are useful to turn statically compiled programsinto dynamic ones by introducing a new pipeline language. They are, however, not needed in dynamic programminglanguages like Matlab, Python, and Julia. In fact, pipelining can be written in Julia as simple function composition.An overview of the internal architecture and a typical data ﬂow within

MRIReco.jl is given in ﬁgure 1. In thefollowing sections we sketch individual parts of the framework using small code snippets to illustrate some of the

T. Knopp et al.

Raw Data SimulationAcquisition DataReconstructionImage Data

BrukerFile ISMRMRDRawAcquisitionData ParameterssimulationAcquisitionData Trajectoryreconstruction ParametersAxisArray(Image) DICOMNIfTI

FIGURE 1 Overview of the internal architecture and the typical data ﬂow in

MRIReco.jl . An

AcquisitionData object can be constructed either from a ﬁle through a

RawAcquisitionData object or using simulation . The

AcquisitionData object and additional reconstruction parameters are passed to the reconstruction method thatyields an

AxisArray object that can be stored in DICOM or NIfTI ﬁles.design principles. For more detailed information we refer to the documentation and the source code of the package. | Datatypes

In order to eﬃciently work with MRI data

MRIReco.jl introduces two distinct datatypes for representing the latter.

RawAcquisitionData describes the data as it is stored in a data ﬁle. Since this data is typically not stored in a formsuitable for reconstruction, it is ﬁrst converted into the type

AcquisitionData , describing the pre-processed data.The latter can then be passed to the reconstruction method in order to obtain an image.

Trajectory is anotherimportant datatype, which is used for describing the MRI sampling trajectory. We outline each datatype in moredetail next. . Knopp et al. 7 unprocessed data preprocessed data mutable struct RawAcquisitionDataparams::Dict{String,Any}profiles::Vector{Profile}endmutable struct Profilehead::AcquisitionHeadertraj::Matrix{Float32,2}data::Matrix{ComplexF32,2}end mutable struct AcquisitionDatasequenceInfo::Dict{Symbol,Any}traj::Vector{Trajectory}kdata::Array{Matrix{ComplexF64},3}subsampleIndices::Vector{Int64}encodingSize::Vector{Int64}fov::Vector{Float64}endmutable struct Trajectoryname::Stringnodes::Matrix{Float64}times::Vector{Float64}TE::Float64AQ::Float64numProfiles::Int64numSamplingPerProfile::Int64numSlices::Int64cartesian::Boolcircular::Boolend

FIGURE 2 Datatypes used to represent MRI data. The left-hand side contains the julia deﬁnition of the datatype

RawAcquisitionData used for storing unprocessed MRI data. The right-hand side contains the correspondingdeﬁnition of the type

AcquisitionData used for describing the pre-processed MRI data.

T. Knopp et al. | Raw Data

RawAcquisitionData is a datatype closely resembling the ISMRMRD data format [35]. Its julia deﬁnition is containedin the left-hand side of Fig. 2. The syntax profiles::Vector{Profile} means that the ﬁeld profiles is of the type

Vector{Profile} . The philosophy of the ISMRMRD format is to store all global metadata in an XML header. Thisheader has a static structure but it can also be extended by custom ﬁelds. In Julia the appropriate data structure forstoring key-value pairs is a

Dict , i.e. an associative array. In the parameter params all data that are present in theXML header of an ISMRMRD ﬁle are stored. If possible the content of each parameter is converted to an appropriateJulia datatype. For convenience, the structure of the ISMRMRD header is ﬂattened, since this makes all parametersdirectly accessible. For instance the parameter 128128128 is available in the length 3 vector head["reconSize"] .Each measurement proﬁle is stored in the type

Profile . It describes the data measured after a single excitationduring an MRI experiment. It has members head , traj , and data , which exactly correspond to the structures speciﬁedby the ISMRMRD ﬁle format. The members of the Profile datatype are also bit-compatible with corresponding HDF5structs in an ISMRMRD ﬁle. | Pre-Processed Data

The goal of the

RawAcquisitionData datatype is to have a very ﬂexible representation for storing a great varietyof measurements. As a matter of fact, this generic representation can be inconvenient when used for reconstruc-tion. For instance, the proﬁles can be stored in a diﬀerent order than required for image reconstruction. For thisreason,

MRIReco.jl uses an additional datatype

AcquisitionData to store the data in a way that is convenient forreconstruction. The deﬁnition of

AcquisitionData can be found on the right hand side of Fig. 2. It contains thesequence information stored in a dictionary, the k -space trajectory, the k -space data, several parameters describingthe dimensionality of the data, and some additional index vectors. The k -space data kdata has three dimensionsencoding contrasts/echoes slices repetitionsEach element is a matrix whose dimensions encode k -space nodes . Knopp et al. 9 SpiralCartesian Radial

FIGURE 3 Exemplary trajectories available in

MRIReco.jl . channels/coilsFor undersampled data, the indices of the measured samples are stored in the ﬁeld subsampleIndices . We note thatboth the traj and the subsampleIndices ﬁelds are deﬁned as vectors with one entry for each contrast/echo.The encoded space is stored in the ﬁeld encodingSize . It is especially relevant for non-Cartesian trajectorieswhere it is not clear upfront, how large the grid size for reconstruction is to be chosen. Finally, the parameter fov describes the physical size of the encoding grid. | Trajectory

The type

Trajectory describes the sampling trajectory of the imaging sequence. Its deﬁnition is also contained onthe right-hand side of Fig. 2. The most important ﬁeld is nodes , which contains the sampling points as a matrixwhere the ﬁrst dimension encodes the dimensionality of the k -space and the second dimension encodes the numberof sampling points. This general structure allows implementing arbitrary sampling trajectories. Moreover, MRIReco.jl provides convenient constructors for the most common types of trajectories. For instance

MRIReco.jl has supportfor Cartesian, radial, spiral, dual density spiral, variable density spiral, and perturbed spiral trajectories. Beside 2Dtrajectories, the framework also implements 3D trajectories like the kooshball or the stack of stars trajectory. Aselection of exemplary trajectories is illustrated in Fig. 3. | Image Data

MRIReco.jl returns image reconstruction results in the form of an

AxisArray , which is special datatype allowing toencode dimensions of an array. For image processing of reconstructed data one can use the package

Images.jl . Storageof this data is possible using

NIfTi.jl , DICOM.jl or HDF5.jl . | Raw Data File Handling

The ﬁle handling in

MRIReco.jl is build around the ISMRMRD ﬁle format for which full read and write support isimplemented. In addition, a ﬁle reader for proprietary ﬁles from the vendor Bruker is implemented. The following code example shows how to convert a Bruker MRI ﬁle into an ISMRMRD ﬁle: f = BrukerFile(filenameBruker) raw = RawAcquisitionData(f) fout = ISMRMRDFile("outputfile.h5") save(fout, raw)

We note that the raw data object raw can also be created by a simulation. It is thus possible to cache simulated datain a ﬁle, which is useful to save computation time associated with more complex simulations. | Reconstruction Building Blocks

In general, MRI image reconstruction aims to recover a discrete image of the transverse magnetization m ∈ Ã N froma given set of measurements s ∈ Ã M and a signal encoding model described by a linear operator H ∈ Ã M × N . Thus,one seeks a solution to an inverse problem of the formargmin m k s − H m k + R ( m ) , (1)where R denotes a regularization function expressing prior knowledge about the solution. As a consequence mostimage reconstruction schemes can be formulated using a set of basic building blocks• Linear Operators:

These are used to describing the action of the signal encoding operator H and its adjoint H H .Moreover, they describe transformations applied to m within the regularization term R ( m ) (e.g. sparsifying trans-forms used in CS).• Solvers:

An optimization algorithm used to solve problem (1).•

Proximal Maps:

These are associated with the given regularization functions.The interface in

MRIReco.jl is designed in such a way that all relevant parameters can be passed to the reconstructionmethod via a dictionary. The reconstruction method uses these parameters to form the main building blocks and thensolves the corresponding image reconstruction problem. In the following sections, we provide more information onthe use and implementation of aforementioned building blocks. | Linear Operators

Linear operators are used in multiple places of a reconstruction pipeline. Most importantly, the signal encoding oper-ator H is a discrete approximation of the underlying signal model s p ( t ) = ∫ Ò d c p ( r ) m ( r ) e − z ( r ) t e − π i k ( t )· r d r . (2)Here s p ( t ) denotes the demodulated signal received in the p -th of P receive coils at time t . Moreover, m ( r ) is thetransverse magnetization of the object at position r and c p ( r ) are the receive coil sensitivities. Finally, the term z ( r ) ∈ Ã contains the R ∗ and B maps in its respective real and imaginary parts. As a second application, linear operators can . Knopp et al. 11 be applied to m in the regularizer R . This commonly happens in compressed-sensing-type reconstructions, where theimage needs to be transformed into a sparse representation.All operators are implemented in a matrix-free manner. This means that an operator is characterized solely byits action when applied to a vector. This allows to evaluate an operator using eﬃcient algorithms, such as the FFTand NFFT, while avoiding storage of the underlying matrix representation. Other common matrix operations areimplemented in complete analogy. For instance, one can apply a linear operator using the * -operator or form itsadjoint using the postﬁx ’ . Similarly, linear operators can be composed using either of the operators * and ◦ , and theirsize can be determined using the function size . MRIReco.jl provides implementations of the operators that are most commonly used for MRI image reconstruction.These include•

FFTOp : A multidimensional FFT operator.•

NFFTOp : A multidimensional NFFT operator.•

FieldmapNFFTOp : An extension of the NFFT operator taking into account complex ﬁeldmaps.•

SensitivityOp : An operator, which multiples an image by a set of coil-sensitivities as used in SENSE-type imagereconstructions.•

SamplingOp : An operator for (sub)sampling ( k -space) data. The operator is used for compressed-sensing recon-struction.• WeightingOp : A weighting operator used for tasks such as sampling density compensation.Additional operators are reexported from the Julia package

SparsityOperators.jl . These include common sparsifyingtransforms such as the Wavelet transform and ﬁnite diﬀerences operators.In addition to theses methods,

MRIReco.jl provides high-level constructors that compose aforementioned oper-ators and return signal encoding operators for the diﬀerent reconstruction schemes. These are automatically calledby the reconstruction methods implemented. Alternatively, each operator can be built manually by calling the cor-responding constructor. In this way the preimplemented operators can be used as building blocks when developingnew algorithms.Finally, we note that iterative solvers often require repeated application of the normal operator N = H H H ofthe encoding operator. Thus, algorithms can sometimes be accelerated by optimizing the normal operator instead ofoptimizing the encoding operator itself. A relevant example is the case when H is an NFFT. In that case the corre-sponding normal operator has a Toeplitz structure. Thus, multiplication with the normal operator can be carried outusing two FFTs with an oversampling factor of 2 [36]. To allow for this kind of optimization, MRIReco.jl reexports thetype normalOperator , which is implemented in the packages

SparsityOperators.jl and

RegularizedLeastSquares.jl . Itsdefault implementation performs a sequential multiplication with the underlying operator followed by a multiplica-tion with the adjoint. When an optimized implementation of the normal operator exists, the latter can be used bysimply overloading the constructor function normalOperator . | Solvers

In order to solve the given reconstruction problem at hand,

MRIReco.jl uses the infrastructure provided by the package

RegularizedLeastSquares.jl . The latter implements popular iterative optimization methods, such as the CGNR, FISTA and We note that the term operator is used here for both the binary mathematical operation and the linear mapping.

ADMM. For all of the reconstruction methods implemented in

MRIReco.jl the solver can be determined by assigningits name to the parameter :solver in the dictionary containing the reconstruction parameters. | Regularization

To describe regularization functions,

MRIReco.jl uses the type

Regularization from the package

RegularizedLeast-Squares.jl . Most notably, this type contains a function to compute the associated proximal map of the regularizationfunction. This approach is very generic in the sense that most common solvers incorporate regularization in the formof such a proximal map.For convenience,

RegularizedLeastSquares.jl implements several common regularization functions such as TV, ℓ , ℓ and low rank regularization. Analogously to the solver to be used, these regularization functions can be speciﬁedby assigning their respective names to the parameter :regularization (e.g. params[:regularization] = "L1" ).Alternatively, one can also assign one (or more) Regularization objects to aforementioned parameter. This allowsthe incorporation of custom regularization functions. In this case, the main work to be done is the implementationof the corresponding proximal map. Finally, the preimplemented regularization objects can serve as building blockswhen developing new optimization algorithms. | High-Level Reconstruction

Next we sketch a high-level simulation and reconstruction with

MRIReco.jl . As outlined before, the interface is de-signed in such a way that all parameters are passed to the routine simulation and reconstruction via a parameterdictionary. This approach is very generic and allows specifying a large set of parameters without having to use longargument lists. The dictionary can also be stored in an XML or TOML ﬁle.The example simulation uses a × pixel sized Shepp-Logan phantom, 8 birdcage coil sensitivities, and a variabledensity spiral trajectory with 1 interleave and 2048 samples. This corresponds to an 8-fold undersampling. After sim-ulating k -space data, reconstruction is performed using a SENSE-type compressed sensing reconstruction. Since theShepp-Logan phantom is piece-wise constant, edge-preserving TV-regularization is used to mitigate undersamplingartifacts. In the example code, the latter is speciﬁed by the assignment params[:regularization] = "TV" . The re-construction problem is then solved using ADMM with 10 inner CG-iterations and 30 outer iterations. The resultingimage is shown on the right-hand side of Fig. 4. | Low-Level Reconstruction

The high-level reconstruction outlined in the previous section allows performing image reconstruction for a widerange of MRI imaging scenarios. This is achieved by a dispatch to diﬀerent individual reconstruction methods, suchas direct reconstruction, iterative single-channel reconstruction, iterative multi-channel reconstruction or iterativemulti-echo reconstruction. Alternatively, the building blocks in

MRIReco.jl can be used in a more low-level way toimplement custom reconstruction methods. In the next example we illustrate this using the example of a simplegridding reconstruction. We assume that a dataset has been either loaded or simulated. With the

AcquisitionData at hand, a gridding reconstruction could then be implemented using the following code snippet. First, the densitycompensation weights are determined using the method samplingDensity . Next the sampling trajectory is extractedfrom the acqData and the gridding operator is formed by calling the function

NFFTOp . Afterwards, the k -space datais extracted from acqData and weighted with the obtained density compensation weights. Finally, the reconstructed . Knopp et al. 13 N = 128x = shepp_logan(N)

Phantom smaps = birdcageSensitivity(N,8,3.0) params = Dict{Symbol, Any}()params[:simulation] = "fast"params[:trajName] = "SpiralVarDens"params[:numProfiles] = 1params[:windings] = 16params[:numSamplingPerProfile] = 2048params[:senseMaps] = smaps acqData = simulation(x, params)

Simulation params[:reco] = "multiCoil"params[:reconSize] = (N,N)params[:regularization] = "TV"params[: λ ] = 2.0e-3params[:solver] = "admm"params[:iterationsInner] = 10params[:iterations] = 30params[: ρ ] = 1.0e-1params[:absTol] = 1.0e-5params[:relTol] = 1.0e-4 y = reconstruction(acqData, params) Reconstruction

FIGURE 4 Example of a high-level simulation and reconstruction script using the software package

MRIReco.jl . Inthe lower part of the ﬁgure the generating code snippets are shown while the resulting data is shown in the top part.The example starts by generating a × Shepp–Logan phantom (left column), which is used for simulating8-fold undersampled k -space data (middle column). Finally, a simple TV -regularized iterative parallel imagingreconstruction is performed (right column). image is obtained by applying the adjoint of the imaging operator to the weighted k -space data. This illustrates thatthe notation within MRIReco.jl is very close to what one expects from the mathematical description. using MRIReco reconSize = acqData.encodingSize[1:2]weights = samplingDensity(acqData, reconSize)[1]tr = trajectory(acqData)F = NFFTOp(reconSize, tr)kdata = kData(acqData,1,1,1) .* (weights.^2)reco = adjoint(F) * kdata | Parallelization

One important way to speed up reconstruction is to use some form of parallel code execution. Nowadays mostcomputing hardware provides multiple cores that can operate in parallel. There are basically three types of parallelcomputing: shared memory parallelism, distributed memory parallelism and the usage of specialized computing hard-ware that has massively parallelized circuits (usually a GPU). Julia allows for implementing all three forms of parallelcomputing.

MRIReco.jl supports the use of multi-threading (i.e. shared memory parallelism) to speed up image reconstruction.Multi-threading is a very popular form of parallelism since it does not require additional hardware and can be imple-mented without majorly rewriting an existing codebase. In particular it does not require to copy memory between pro-cesses, or systems, which is required both in distributed memory systems and GPU computing. Julia’s multi-threadingcapabilities are based on a prototype developed in [37] and ﬁrst published as an experimental feature in Julia v0.6.Unlike in Python, the threads run truly in parallel and thus it is not required to write C/C++ to use concurrency. Whilewe focus on multi-threading in this work, we note that with version 0.6 the library NFFT.jl gained GPU support. Thelatter will be integrated into

MRIReco.jl in the future, such that it will be possible to perform image reconstructionon the GPU.Multi-threading usually complicates the implementation, and also needs to be done with care to get decentspeedups. One major challenge arises when diﬀerent components within a software stack perform independentparallelization. For instance, low-level libraries such as BLAS and FFTW have dedicated thread pools that are notaware of a parallelism in the high-level code calling BLAS and FFTW. These issues can be solved by using centralizedthread pool implementations like oﬀered for C applications by OpenMP [38], Cilk [39], and Intel TBB [40]. In version1.3, Julia introduced a similar system called partr (parallel tasks runtime) that allows for nested parallelism.To achieve decent speed-ups irrespective of the reconstruction setting, multi-threading is implemented in dif-ferent parts of

MRIReco.jl . On a low level, several imaging operators, such as the multi-coil imaging operator andthe ﬁeldmap-aware imaging operator, require the computation of multiple NFFTs. By parallelizing the correspondingloops, multiple NFFTs can be computed simultaneously resulting in signiﬁcant speed-ups. On a higher level, one oftenwants to perform multiple independent reconstructions. This includes the reconstruction of multiple slices of a 2d . Knopp et al. 15 acquisition or the reconstruction of multiple coil images, when no parallel imaging is applied. As a consequence, allloops over independent reconstructions are parallelized as well. To avoid interference with the Julia thread pool, thethread pools of

BLAS and

FFTW are disabled when loading

MRIReco.jl . For FFTW this is not strictly necessary sincethis C library has already been adapted to be partr -aware.For the implementation of parallel loops we use the @spawn macro and rewrite serial loops of the form for n=1:N end into the form @sync for n=1:NThreads.@spawn begin endend

In case of data dependencies a mutex is used to lock access to the critical section. Tasks in Julia do not directly mapto threads and thus it is usually ﬁne to spawn more tasks than threads are available in the thread pool. Note, however,that task creation of course also has a small overhead so that this form of loop parallelization should only be donewhen the body of the for loop contains a suﬃcient number of operations. For instance the loop over the coils in aSENSE reconstruction needs to perform a Fourier transform in the function body, which is a suﬃcient workload andthus the overhead of task creation can be neglected. | Availability and Platform Support

MRIReco.jl is developed within a public Git repository hosted at Github. The project contains online documentationthat can be accessed from the project homepage. Bug reports, feature requests and comments can be made using anissue tracker. Any commit made to

MRIReco.jl is tested using continuous integration services.

MRIReco.jl is supposedto run on any operating system and platform that Julia itself supports. Currently, the test suite runs successfully onLinux, OS X, and Windows.The software package is licensed under the MIT license , as are most parts of Julia and its package ecosystem.The MIT license is a permissive license and allows to use the code even in a closed-source application. MRIReco.jl has only one GPL dependency (FFTW [41]), which would need to be replaced prior inclusion into a closed sourceapplication . https://github.com/MagneticResonanceImaging/MRIReco.jl https://opensource.org/licenses/MIT Massachusetts Institute of Technology (MIT) and Intel (MKL, Math Kernel Library) provide binary compatible FFTW implementations that can be used inclosed-source applications. | Experimental Evaluation

After giving an overview of the functionality and implementation of

MRIReco.jl we next apply

MRIReco.jl to openlyavailable MRI data and perform a comparison with an existing MRI reconstruction framework.The ﬁrst test aims at testing/demonstrating the full reconstruction pipeline from loading MRI data in the com-monly used ISMRMRD format to performing the ﬁnal image reconstruction. For this purpose, we downloaded apublicly available MRI dataset from the database http://mridata.org [42]. The dataset contains data of a humanknee acquired using a 3d FSE sequence and an 8-channel receive coil on a GE scanner. The acquisition was performedusing a ﬁeld of view of 160 mm x 160 mm x 124.8 mm and a matrix size of 320 x 274 x 208. The repetition timeand echo time are 1400 ms and 20 ms, respectively. The data was measured using variable density Poisson disk sam-pling with a fully sampled calibration area of size 35x35 and an overall undersampling factor of 7.13. After loadingthe data, the data was converted to 2d data by applying a Fourier transform along the readout direction. Next, coilsensitivy maps were obtained using the ESPIRiT implementation contained in

MRIReco.jl . Finally, image slices werereconstructed using a SENSE-type reconstruction with ℓ -regularization in the Wavelet domain. For the reconstruc-tion we employed a regularization parameter of 0.2. The corresponding reconstruction problem was solved using theADMM with 30 iterations and 10 iterations of the inner CG method.Secondly, we perfomed a comparison of MRIReco.jl with the popular C/C++ reconstruction framework BART [17].As a model problem we chose an iterative SENSE reconstruction using the radial dataset published as part of theISMRM reproducibility challenge 1 [43]. It consists of a brain dataset acquired with 12 coils using a radial trajectorywith 96 proﬁles and 512 samples per proﬁle. The coil sensitivities are estimated from the data itself using the ESPIRiTimplementation that is part of each framework. As was proposed in the reproducibility challenge, the fully sampleddataset was retrospectively undersampled by reduction factors between R = 1 and R = 4 . For both frameworks,images were recovered using an iterative ℓ regularized SENSE reconstruction based on the conjugated gradientmethod. In all cases we used 20 iterations and a regularization parameter of . . It is run on a workstation equippedwith two AMD EPYC 7702 CPUs running at 2.0 GHz (256 cores in total) and a main memory of 1024 GB.The data/software that support the ﬁndings of this study are openly available in MRIReco.jl at . /zenodo. , SHA-1 hash 72fbbd0485fde314df75b30a61b42ab9606ca188,and MRIRecoBenchmarks at . /zenodo. , SHA-1 hash b24b60ca512589ec6284b31d8e7c07628c2c5886. To be precise, the reconstruction of theknee dataset is contained in the examples folder in MRIReco.jl , while the code for the comparison with BART is con-tained in

MRIRecoBenchmarks . | RESULTS3.1 | Reconstruction Results of the Knee Data Set

Fig. 5 shows images of the reconstructed knee dataset for some exemplary slices. These results illustrate a typicalapplication for researchers, who wish to test their reconstruction methods not only using simulation data but alsodata from one or more MRI scanners. As is illustrated in the example code accessible in the Git repository, this can byachieved quite easily by using

MRIReco.jl in conjunction with the ISMRMRD format. Thus, all that is needed to workwith raw data is to convert the latter from its vendor speciﬁc format into the ISMRMRD format. Afterwards, the datacan be reconstructed using either one of the preimplemented reconstruction methods of

MRIReco.jl or by a customreconstruction method making use of the low-level building blocks provided by the package. . Knopp et al. 17 slice 50 slice 100 slice 150 slice 200

FIGURE 5 Results of the ℓ -Wavelet reconstruction of the knee data set. The reconstructed images are shown forthe slices 50, 100, 150 and 200 (along readout direction). | Runtime Performance

Designing a proper performance benchmark for independent software frameworks can be a challenging task sinceeach of the frameworks often diﬀers not only in the implementation but also in the choice of implemented algorithm-s/optimizations and in the choice of default reconstruction parameters. Since both BART and

MRIReco.jl implementmulti-threading we performed benchmarks for diﬀerent numbers of threads (1,4,8,12). For each framework, the re-construction is performed multiple times and the minimum time is used for comparison. In this way, both frameworksare benchmarked under idealized but comparable conditions. In particular taking the minimum time of several runsconsiders hot CPU caches.BART by default uses the Toeplitz optimization for eﬃcient multiplication with the normal matrix. During thedevelopment of the benchmark we implemented this feature as well to match the implementation in BART. In additionto the Toeplitz optimization, which implies using FFTs with an oversampling of factor σ = 2 . , MRIReco.jl also allowsrunning the code without the Toeplitz optimization but with a smaller oversampling factor such as σ = 1 . (see [44] forinvestigation of oversampling factor sizes). Although this may slightly reduce the accuracy of the NFFT approximation,we observe that in practice the approximation error is so small that it is not visually perceptible in the reconstructedimages.The results of the performance comparison are summarized in Fig. 6 for the reduction factors R = 1 and R = 4 .One can see that both reconstruction frameworks achieve very similar reconstruction times. MRIReco.jl is slightlyfaster for 1 and 4 threads while BART is faster for 12 threads. When comparing the result with and without Toeplitzoptimization one can see that for R = 1 both code paths achieve similar performance while for R = 4 the non-Toeplitzreconstruction with an oversampling factor of σ = 1 . is clearly faster. This can be explained by the smaller size ofthe FFT and the fact that for R = 4 signiﬁcantly fewer points need to be gridded compared to the case where R = 1 . | Reconstruction Accuracy

Reconstruction results are summarized in Fig. 7. Shown are the results for reduction factors R = 1 , , , using thesame reconstruction parameters. For MRIReco.jl the images obtained using the Toeplitz optimization are shown. Forboth frameworks only a minor decrease of image quality can be observed with increasing reduction factor. Whenlooking at diﬀerence maps between reduction factor R = 1 and higher reduction factors (row two and four) one cansee that R = 3 shows a smaller deviation in outer image regions of the head than reduction factors R = 2 and R = 4 . T i m e [ s ] R = 1

BARTMRIReco.jlMRIReco.jl* T i m e [ s ] R = 4

FIGURE 6 Performance comparison between BART and

MRIReco.jl shown are the minimum reconstruction timesof an iterative SENSE reconstruction for diﬀerent numbers of threads ranging from to . On the right, the resultsfor reduction factor R = 4 are shown while on the left, the results without data reduction can be seen. Orangeshows the reconstruction times for BART while dark blue shows the reconstruction times for MRIReco.jl both usingthe Toeplitz optimization. Light blue shows reconstruction time for

MRIReco.jl without the Toeplitz approach usingan oversampling factor σ = 1 . .When comparing the reconstruction results of the two frameworks one can hardly see a diﬀerence. Only the outerregions beyond the head look slightly diﬀerent, which is likely caused by diﬀerences in the coil estimation algorithm.However, these were not investigated in detail in this paper. Even the diﬀerence maps compared to the R = 1 reconstruction look very similar, which indicates that both frameworks implement the iterative SENSE reconstructionin a similar way. This is further supported by the diﬀerence maps between the two reconstruction frameworks shownin the ﬁfth row. These show mostly noise in the head region and one cannot identify the structure of the underlyingimage. Hence, there appears to be no bias between the diﬀerent reconstructions. | Syntactic Comparison

In order to provide a rough estimate for the complexity of the user interface, Table 1 summarizes the number of codelines required to perform the central tasks associated with the reconstruction of the brain dataset. This count excludescode lines speciﬁc to the benchmarking, such as setting up the benchmark or running multiple trials of the reconstruc-tion. The results show that both frameworks require a similar number of code lines to perform reconstruction. Themain diﬀerence arises from the fact that the MRIReco.jl-implementation uses a dictionary to store reconstruction pa-rameters, which is not required for BART. We note however that passing the parameters as a dictionary is done solelyfor the purpose of keeping the code readable. Alternatively, the parameters could be passed directly as keyword argu-ments as done by BART. We conclude that both frameworks allow to perform image reconstruction with a comparablycomplex interface and we acknowledge that the complexity of the ﬁnal code depends a lot on the preferences of theuser at hand. . Knopp et al. 19

R = 1 R = 2 R = 3 R = 4 B A R T D i ﬀ B A R T M R I R e c o . j l D i ﬀ M R I R e c o . j l M R I R e c o . j l - B A R T FIGURE 7 Comparison of SENSE reconstructions of a public brain imaging dataset using BART and

MRIReco.jl . Thereconstruction results are shown in rows one and three, while rows two and four show diﬀerence maps to thereference reconstruction ( R = 1 ). Row ﬁve shows diﬀerence maps between the images reconstructed using bothframeworks for the same reduction factor. task BART MRIReco.jldata loading & conversion 5 7gridding & ESPIRiT 4 2undersampling data 3 1reconstruction parameters 0 10reconstruction 1 1sum 13 21TABLE 1 Number of code lines needed to perform the central tasks associated with the reconstruction of the braindataset. | DISCUSSION

MRIReco.jl started as an experiment of the authors to check the suitability of Julia for developing an MRI reconstructionframework. Initial success was measured in a very rapid development experience while still generating programsthat can compete in terms of runtime speed with equivalent C/C++ programs. From this point the framework wasdeveloped into a state making it not only usable for the authors themselves but also for other users. We characterizethe status of the project as usable but not yet ﬁnished. This implies that interfaces of upcoming versions might slightlychange and that parts of the documentation are still incomplete.In a performance benchmark it was shown that

MRIReco.jl achieves similar and in some cases even better per-formance than the state-of-the-art C/C++ MRI reconstruction framework BART. We note that the benchmark wasperformed for a very speciﬁc reconstruction algorithm (iterative SENSE) and therefore its results are not directly ap-plicable to other aspects of each of the frameworks.One key philosophy of

MRIReco.jl is to reuse existing code and keep the code base small. This makes the packagemore maintainable as it avoids code duplication and keeps the responsibility for code in the software repository whereit belongs. This approach has many advantages, but it also increases the complexity in version management. Therefore,this approach requires a package management system that is capable of handling complex version dependencies.Julia addresses this aspect with the package manager, which is an integral part of the programming environment.One important new responsibility in a package with many dependencies is that it requires the authors to follow thedevelopments of the depending packages. This is simple if the dependencies are managed by oneself, but it canincrease the complexity if not. Fortunately, the Julia dependency system allows to pin packages to certain versionssuch that breaking API changes in depending packages can be avoided.Compared to existing packages

MRIReco.jl has more in common with SigPy [18] than with Gadgetron or BART.SigPy allows to easily combine linear operators, optimization algorithms, and proximal mappings and keeps this or-thogonal functionality in separate code units. Since

MRIReco.jl was developed independently of SigPy both packageshave not yet inﬂuenced each other. Both projects are licensed under a permissive license and it is therefore possibleto reuse components, which may beneﬁt both packages. We emphasise that it is simple to convert between Pythonand Julia code since both programming languages use similar notation.One of the weak points of

MRIReco.jl is that no functionality for deployment on an MRI scanner is implementedso far. There are basically three diﬀerent implementations thinkable. The ﬁrst is to write a server in Julia and com-municate with the host server via TCP/IP. A similar architecture has been implemented by the authors in [45] for thetomographic imaging modality magnetic particle imaging where data is measured in real-time and directly processed. . Knopp et al. 21

The TCP-based data acquisition server can be found at https://github.com/tknopp/RedPitayaDAQServer . Thesecond possibility is to embed

MRIReco.jl into an existing C/C++ program. This can be done, since Julia has an em-bedding ABI/API that can be called from a variety of programming languages, in particular from C/C++. In this wayit would be possible to integrate

MRIReco.jl into for instance Gadgetron and implement a

JuliaGadget similar to theexisting

PythonGadget . Since Julia arrays have a C-compatible binary format, it is possible to pass data from C to Juliaby only passing the pointer to the data. Similarly it would also be possible to integrate

MRIReco.jl with BART. Finally,deployment can be achieved using automated oﬄine reconstruction techniques such as Yarra [46] and Autorec [47].Julia itself as a language has evolved since its introduction in 2012 into a stable, featureful language that cancompete with more widely developed languages such as Python and C/C++. One of the remaining issues of Juliacompared to other programming languages is its latency. Code compilation in Julia is currently done right beforecode execution and thus can add a certain amount of latency. The latency of packages is reduced to some extend byso-called pre-compilation but currently (Julia version 1.5.3) only an intermediate representation and not the nativemachine code is stored. The remaining latency issue can be mitigated by the package

Revise.jl , which allows to cachecompiled code during a julia session. An alternative for deployment usage is the package

PackageCompiler.jl , whichallows to either compile packages into the system image of Julia or to generate standalone executables. | CONCLUSIONS

In conclusion we have introduced a new image reconstruction framework for magnetic resonance imaging data, whichis both performant and accessible. The framework is implemented purely in the programming language Julia. Our testshave shown that the latter is very suitable for implementing such a framework. Two key aspects of

MRIReco.jl are itsmodularity and its open interface. These make it a useful tool not only for performing image reconstruction but also forthe development and testing of new reconstruction methods. In a comparison with the MRI reconstruction frameworkBART we have shown that

MRIReco.jl reaches similar performance and yields similar image quality, which proves thatJulia can reach a similar performance as comparable C/C++ implementations not only in micro benchmarks but alsoin real-world applications. references [1] Lustig M, Donoho D, Pauly JM. Sparse MRI: The application of compressed sensing for rapid MR imaging. Mag-netic Resonance in Medicine: An Oﬃcial Journal of the International Society for Magnetic Resonance in Medicine2007;58(6):1182–1195.[2] Lustig M, Pauly JM. SPIRiT: iterative self-consistent parallel imaging reconstruction from arbitrary k-space. Magneticresonance in medicine 2010;64(2):457–471.[3] Jin KH, Lee D, Ye JC. A general framework for compressed sensing and parallel MRI using annihilating ﬁlter basedlow-rank Hankel matrix. IEEE Transactions on Computational Imaging 2016;2(4):480–495.[4] Haldar JP. Low-rank modeling of local k -space neighborhoods (LORAKS) for constrained MRI. IEEE transactions onmedical imaging 2013;33(3):668–681.[5] Shin PJ, Larson PE, Ohliger MA, Elad M, Pauly JM, Vigneron DB, et al. Calibrationless parallel imaging reconstructionbased on structured low-rank matrix completion. Magnetic resonance in medicine 2014;72(4):959–970.[6] Doneva M, Börnert P, Eggers H, Stehning C, Sénégas J, Mertins A. Compressed sensing reconstruction for magneticresonance parameter mapping. Magnetic Resonance in Medicine 2010;64(4):1114–1120. [7] Zhang T, Pauly JM, Levesque IR. Accelerating parameter mapping with a locally low rank constraint. Magnetic resonancein medicine 2015;73(2):655–661.[8] Schweser F, Sommer K, Deistung A, Reichenbach JR. Quantitative susceptibility mapping for investigating subtle sus-ceptibility variations in the human brain. Neuroimage 2012;62(3):2083–2100.[9] Mani M, Jacob M, Kelley D, Magnotta V. Multi-shot sensitivity-encoded diﬀusion data recovery using structured low-rank matrix completion (MUSSELS). Magnetic resonance in medicine 2017;78(2):494–507.[10] Stone SS, Haldar JP, Tsao SC, Sutton B, Liang ZP, et al. Accelerating advanced MRI reconstructions on GPUs. Journal ofparallel and distributed computing 2008;68(10):1307–1318.[11] Sorensen TS, Atkinson D, Schaeﬀter T, Hansen MS. Real-time reconstruction of sensitivity encoded radial magneticresonance imaging using a graphics processing unit. IEEE transactions on medical imaging 2009;28(12):1974–1985.[12] Van Der Walt S, Colbert SC, Varoquaux G. The NumPy array: a structure for eﬃcient numerical computation. Computingin Science & Engineering 2011;13(2):22.[13] Abadi M, Barham P, Chen J, Chen Z, Davis A, Dean J, et al. Tensorﬂow: A system for large-scale machine learning. In:12th { USENIX } symposium on operating systems design and implementation ( { OSDI } . Knopp et al. 23 [25] Knopp T, Eggers H, Dahnke H, Prestin J, Sénégas J. Iterative oﬀ-resonance and signal decay correction for improvedmulti-echo imaging in MRI. IEEE Trans Med Imaging 2009;28(3):394–404.[26] Pruessmann KP, Weiger M, Scheidegger MB, Boesiger P. SENSE: sensitivity encoding for fast MRI. Magnetic resonancein medicine 1999;42(5):952–962.[27] Uecker M, Lai P, Murphy MJ, Virtue P, Elad M, Pauly JM, et al. ESPIRiT—an eigenvalue approach to autocalibratingparallel MRI: where SENSE meets GRAPPA. Magnetic resonance in medicine 2014;71(3):990–1001.[28] Hestenes MR, Stiefel E. Methods of conjugate gradients for solving linear systems. Journal of Research of the NationalBureau of Standards 1952;49(1).[29] Beck A, Teboulle M. A fast iterative shrinkage-thresholding algorithm for linear inverse problems. SIAM journal onimaging sciences 2009;2(1):183–202.[30] Parikh N, Boyd S, et al. Proximal algorithms. Foundations and Trends® in Optimization 2014;1(3):127–239.[31] Goldstein T, Osher S. The split Bregman method for L1-regularized problems. SIAM journal on imaging sciences2009;2(2):323–343.[32] Pruessmann KP, Weiger M, Börnert P, Boesiger P. Advances in sensitivity encoding with arbitrary k-space trajectories.Magnetic Resonance in Medicine: An Oﬃcial Journal of the International Society for Magnetic Resonance in Medicine2001;46(4):638–651.[33] Knopp T, Kunis S, Potts D. A note on the iterative MRI reconstruction from nonuniform k-space data. Internationaljournal of biomedical imaging 2007;2007.[34] Pipe JG, Menon P. Sampling density compensation in MRI: rationale and an iterative numerical solution. Magnetic Reso-nance in Medicine: An Oﬃcial Journal of the International Society for Magnetic Resonance in Medicine 1999;41(1):179–186.[35] Inati SJ, Naegele JD, Zwart NR, Roopchansingh V, Lizak MJ, Hansen DC, et al. ISMRM Raw data format: A proposedstandard for MRI raw datasets. Magnetic resonance in medicine 2017;77(1):411–421.[36] Fessler JA, Sutton BP. Nonuniform fast Fourier transforms using min-max interpolation. IEEE transactions on signalprocessing 2003;51(2):560–574.[37] Knopp T. Experimental multi-threading support for the Julia programming language. In: Proceedings of the 1st FirstWorkshop for High Performance Technical Computing in Dynamic Languages IEEE Press; 2014. p. 1–5.[38] Dagum L, Menon R. OpenMP: an industry standard API for shared-memory programming. IEEE computational scienceand engineering 1998;5(1):46–55.[39] Blumofe RD, Joerg CF, Kuszmaul BC, Leiserson CE, Randall KH, Zhou Y. Cilk: An eﬃcient multithreaded runtime system.Journal of parallel and distributed computing 1996;37(1):55–69.[40] Kukanov A, Voss MJ. The Foundations for Scalable Multi-core Software in Intel Threading Building Blocks. Intel Tech-nology Journal 2007;11(4).[41] Frigo M, Johnson SG. FFTW: An adaptive software architecture for the FFT. In: Proceedings of the 1998 IEEE Interna-tional Conference on Acoustics, Speech and Signal Processing, ICASSP’98 (Cat. No. 98CH36181), vol. 3 IEEE; 1998. p.1381–1384.[42] Ong F, Amin S, Vasanawala S, Lustig M. Mridata. org: An open archive for sharing MRI raw data. In: Proc. Intl. Soc. Mag.Reson. Med, vol. 26; 2018. p. 1. [43] Maier O, Baete SH, Fyrdahl A, Hammernik K, Harrevelt S, Kasper L, et al. CG-SENSE revisited: Results from the ﬁrstISMRM reproducibility challenge. Magnetic Resonance in Medicine 2020;.[44] Eggers H, Boernert P, Boesiger P. Comparison of gridding-and convolution-based iterative reconstruction algorithmsfor sensitivity-encoded non-Cartesian acquisitions. In: Proceedings of the 10th Annual Meeting of ISMRM, Honolulu;2002. p. 743.[45] Gräser M, Thieben F, Szwargulski P, Werner F, Gdaniec N, Boberg M, et al. Human-sized magnetic particle imaging forbrain applications. Nature communications 2019;10:1–9.[46] Block KT, Wiggins R, Yarra - A Toolbox for Clinical MRI Research;. Accessed: 2020-01-20. http://http://yarraframework.comhttp://http://yarraframework.com