A Feasibility Study on Deep Learning Based Individualized 3D Dose Distribution Prediction

Abstract

Purpose: Radiation therapy treatment planning is a trial-and-error, often time-consuming process. An optimal dose distribution based on a specific anatomy can be predicted by pre-trained deep learning (DL) models. However, dose distributions are often optimized based on not only patient-specific anatomy but also physician preferred trade-offs between planning target volume (PTV) coverage and organ at risk (OAR) sparing. Therefore, it is desirable to allow physicians to fine-tune the dose distribution predicted based on patient anatomy. In this work, we developed a DL model to predict the individualized 3D dose distributions by using not only the anatomy but also the desired PTV/OAR trade-offs, as represented by a dose volume histogram (DVH), as inputs. Methods: The desired DVH, fine-tuned by physicians from the initially predicted DVH, is first projected onto the Pareto surface, then converted into a vector, and then concatenated with mask feature maps. The network output for training is the dose distribution corresponding to the Pareto optimal DVH. The training/validation datasets contain 77 prostate cancer patients, and the testing dataset has 20 patients. Results: The trained model can predict a 3D dose distribution that is approximately Pareto optimal. We calculated the difference between the predicted and the optimized dose distribution for the PTV and all OARs as a quantitative evaluation. The largest average error in mean dose was about 1.6% of the prescription dose, and the largest average error in the maximum dose was about 1.8%. Conclusions: In this feasibility study, we have developed a 3D U-Net model with the anatomy and desired DVH as inputs to predict an individualized 3D dose distribution. The predicted dose distributions can be used as references for dosimetrists and physicians to rapidly develop a clinically acceptable treatment plan.


A Feasibility Study on Deep Learning–Based Individualized 3D Dose Distribution Prediction

Jianhui Ma, Dan Nguyen, Ti Bai, Michael Folkerts, Xun Jia, Weiguo Lu, Linghong Zhou 1,a) and Steve Jiang 2,a)

1 School of Biomedical Engineering, Southern Medical University, Guangzhou, Guangdong 510515, China
2 Medical Artificial Intelligence and Automation (MAIA) Laboratory, Department of Radiation Oncology, University of Texas Southwestern Medical Center, Dallas, TX 75235, USA

a) Co-corresponding authors. Emails: [email protected] and [email protected]

Abstract

Purpose:

Radiation therapy treatment planning is a trial-and-error, often time-consuming process. An approximately optimal dose distribution corresponding to a specific patient’s anatomy can be predicted by using pre-trained deep learning (DL) models. However, dose distributions are often optimized based not only on patient-specific anatomy but also on physicians’ preferred trade-offs between planning target volume (PTV) coverage and organ at risk (OAR) sparing or among different OARs. Therefore, it is desirable to allow physicians to fine-tune the dose distribution predicted based on patient anatomy. In this work, we developed a DL model to predict the individualized 3D dose distributions by using not only the patient’s anatomy but also the desired PTV/OAR trade-offs, as represented by a dose volume histogram (DVH), as inputs.

Methods:

In this work, we developed a modified U-Net network to predict the 3D dose distribution by using patient PTV/OAR masks and the desired DVH as inputs. The desired DVH, fine-tuned by physicians from the initially predicted DVH, is first projected onto the Pareto surface, then converted into a vector, and then concatenated with feature maps encoded from the PTV/OAR masks. The network output for training is the dose distribution corresponding to the Pareto optimal DVH. The training/validation datasets contain 77 prostate cancer patients, and the testing dataset has 20 patients.

Results:

The trained model can predict a 3D dose distribution that is approximately Pareto optimal while having the DVH closest to the input desired DVH. We calculated the difference between the predicted dose distribution and the optimized dose distribution that has a DVH closest to the desired one for the PTV and for all OARs as a quantitative evaluation. The largest average error in mean dose was about 1.6% of the prescription dose, and the largest average error in the maximum dose was about 1.8% of the prescription dose.

Conclusions:

In this feasibility study, we have developed a 3D U-Net model with the patient’s anatomy and the desired DVH curves as inputs to predict an individualized 3D dose distribution that is approximately Pareto optimal while having the DVH closest to the desired one. The predicted dose distributions can be used as references for dosimetrists and physicians to rapidly develop a clinically acceptable treatment plan.

1. Introduction

With the rapid development of external beam radiotherapy, the treatment planning procedure has become increasingly complicated for many tumor sites. In the current treatment planning workflow, a treatment planner works towards a good quality plan in a trial-and-error fashion by using a commercial treatment planning system. In the meantime, many rounds of consultation between planner and physician are often needed to reach a plan that meets the physician’s satisfaction for a particular patient, mainly because medicine, to some degree, is still an art, and a physician’s preferences for a particular patient cannot be quantified and precisely conveyed to the planner. Consequently, planning can take up to a week for complex cases, and plan quality may be poor and can vary significantly according to the varying levels of the physician’s and the planner’s skills, the quality of their communication, etc.

(This work was initially presented at the Annual Conference of the American Association of Physicists in Medicine in Nashville, TN, 2018.)

To rapidly produce a treatment plan with consistent plan quality, knowledge-based planning (KBP) was developed to build a heuristic correlation between patient anatomy and the best achievable dose volume histogram (DVH) based on the treatment plans of previously treated patients. This provides useful clues as to how a plan of acceptable quality should look for any particular patient by looking at common features among similar patients treated in the past. Yet such methods heavily rely on the selection of handcrafted features to train the linear regression model. These handcrafted features, such as distance-to-target histograms, overlapping volume histograms, and organ shapes, are oversimplified representations of patient anatomy (e.g., they only focus on the pair-wise relationship between the target and one particular organ at a time) and thus cannot precisely predict the treatment plan with the best achievable quality.
Moreover, current KBP approaches can only predict DVH curves, not 3D dose distributions; these DVH curves provide an incomplete description of the plan quality.

Over the past couple of years, deep learning (DL) has been used for predicting dose distribution. Nguyen et al. first explored the feasibility of predicting dose distributions from organ contours by utilizing a modified U-Net model for prostate cancer and then extended their work to more complicated head and neck cancer cases. Their model can automatically extract critical features from a patient’s anatomy without any handcrafted parameters to precisely predict the dose distributions. Barragán-Montero et al. extended this work to develop a more general model that considers variable beam configurations in addition to patient anatomy to predict dose distributions for lung intensity-modulated radiation therapy (IMRT), thus indicating a potentially easier clinical implementation with no need to train specific models for different beam settings. Similar ideas have been implemented by different research groups and applied to various clinical scenarios.

In theory, there is an unlimited number of optimal treatment plans corresponding to a particular patient anatomy, due to the different trade-offs between the coverage of the planning target volume (PTV) and the sparing of various organs at risk (OARs). These optimal plans constitute the so-called Pareto surface. Therefore, dose prediction should also consider, in addition to patient anatomy, the trade-off preferred by the attending physician for a particular patient. There are also other clinical factors that must be considered in the planning process. Therefore, when a dose prediction model is used to generate a dose distribution and the corresponding DVH based on a patient’s anatomy, the physician may often need to tune the results based on some specific considerations for the particular patient to produce the directive for the planner to follow.

Nguyen et al. trained a neural network for generating Pareto optimal dose distributions by proposing a differentiable loss function based on the DVH and combining it with an adversarial loss to train deep neural networks. Although this model can predict Pareto optimal dose distributions, it requires structure prioritization weights as part of the input.
These weights are essentially abstract concepts related to, but not equal to, clinically relevant metrics and values, which makes the model less accessible to physicians.

In this work, we propose to develop and test the feasibility of another DL model to predict the optimal dose distribution from the patient anatomy and PTV/OAR trade-offs represented by a set of desired DVH curves. We envision the following clinical workflow, illustrated in Figure 1: 1) a conventional anatomy-based DL model will be used to predict a dose distribution first, 2) if the predicted dose distribution is not the desired one, the physician will tune the DVH of the predicted dose distribution to reflect the individualized planning goals for the particular patient, and 3) the proposed DL model will be used to predict a refined dose distribution by using the patient anatomy and tuned DVH as inputs. Steps 2) and 3) can be repeated until the desired dose distribution and DVH are achieved.

The remainder of this paper is organized as follows. Section 2 first introduces the framework of our method and the network architecture. Then, it describes the dataset and the training configuration. The performance of our proposed method is presented in Section 3. Section 4 discusses and summarizes the strengths and weaknesses of the proposed model.

Figure 1. The envisioned clinical workflow based on the proposed DL model.

2. Methods and Materials

We first mathematically formulate our task. Basically, given patient CT anatomy $X$, we would like to predict the associated 3D dose distribution $Y$, which can be achieved by maximizing the following conditional probability:

$$\max\, P(Y \mid X) \tag{1}$$

As mentioned above, in practice, problem (1) is underdetermined and therefore has multiple optimal solutions corresponding to different trade-offs among PTV coverage and OAR sparing. The DL models previously developed for dose prediction generate the optimal dose distributions by averaging all PTV/OAR trade-offs presented in the training dataset. In this work, we introduce an extra condition, i.e., a set of DVH curves that can represent the desired trade-offs, to constrain the solution space to one specific point on the Pareto surface. In other words, the objective function can now be expressed as:

$$\max\, P(Y_D \mid X, D) \tag{2}$$

where $D$ represents the desired DVH curves and $Y_D$ is a point on the Pareto surface corresponding to the sample pair $(X, D)$. Now the problem (2) is well-defined and can be solved in a learning-and-prediction fashion.

To be specific, suppose we have a training dataset $\{X_i, D_{ij}, Y_{ij} \mid i \in 1, \cdots, M;\ j \in 1, \cdots, N\}$, where $i$ indexes the patients and $j$ indexes the sampling point on the Pareto surface. We assume that there are $M$ patients in the dataset and $N$ data points on the Pareto surface for each patient, meaning we have $N$ pairs of Pareto optimal DVH curves and dose distributions. Then, problem (2) can be solved based on the above training dataset by training a convolutional neural network (CNN) $\phi_w(X, D)$ parameterized by $w$.
If one uses mean squared error (MSE) as the cost function, the network parameter $w$ can be calculated by minimizing the following cost function:

$$\bar{w} = \arg\min_{w} \sum_{i=1}^{M} \sum_{j=1}^{N} \left\| \phi_w(X_i, D_{ij}) - Y_{ij} \right\|_2^2 \tag{3}$$

Once the network $\phi_{\bar{w}}(X, D)$ is trained, in theory, it can produce the optimal dose distribution if it is fed as inputs the patient anatomy $X$ and the DVH vectors $D$ that describe the desired trade-off between PTV coverage and OAR sparing.

One may notice that, based on our envisioned clinical workflow, we want to predict a dose distribution that is approximately Pareto optimal, although the physician-tuned DVH $\mathcal{D}_i$ is unlikely to correspond to a Pareto optimal dose distribution. Therefore, when training the model, we need to first project $\mathcal{D}_i$ onto the Pareto surface of patient $i$ to find the nearest Pareto optimal DVH $D_{ij}$ and the corresponding dose distribution $Y_{ij}$ ($j \in 1, \cdots, N$) from the training dataset (Fig. 2). $Y_{ij}$ is termed “Pareto optimal dose distribution” in Figure 2 and used as the ground truth for model training. In this work, we use the $l$-norm to conduct the DVH projection operation, i.e., we solve the following problem to select the Pareto optimal DVH $D_{ij}$ from the training dataset:

$$S_i = \arg\min_{S_i} \left\| \mathbf{D}_i S_i - \mathcal{D}_i \right\| \tag{4}$$

where each column of the matrix $\mathbf{D}_i$ represents one point on the Pareto surface, which corresponds to an optimal DVH for patient $i$, i.e., $\mathbf{D}_i = [D_{i0}, D_{i1}, \cdots, D_{ij}, \cdots, D_{iN}]$. $S_i$ is an $N$-length one-hot vector, which is used to indicate the selection state. Let us assume that one can generate $K$ desired DVHs for each training patient. We denote the $k$th desired DVH of patient $i$ as $\mathcal{D}_{ik}$, and the corresponding one-hot vector can be calculated as $S_{ik}$ according to Equation (4). Then, the associated dose distribution can be calculated as $\mathcal{Y}_{ik} = \mathbf{Y}_i S_{ik}$, where $\mathbf{Y}_i = [Y_{i0}, Y_{i1}, \cdots, Y_{ij}, \cdots, Y_{iN}]$.
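Because the one-hot vector in Equation (4) simply selects one stored plan, the projection reduces to a nearest-neighbor search over the sampled Pareto optimal DVHs of a patient. The following is a minimal sketch of that idea, not the authors' implementation: the function name `project_onto_pareto`, the toy data, and the use of the Euclidean distance are all assumptions made for illustration, since the order of the norm is not legible in the text.

```python
import math


def project_onto_pareto(desired_dvh, pareto_dvhs, pareto_doses):
    """Select the sampled Pareto plan whose DVH vector is closest to desired_dvh.

    Returns (one_hot, nearest_dvh, nearest_dose), which play the roles of
    S_i, D_i S_i and Y_i S_i in Equation (4). The Euclidean distance used
    here is an assumption, not a detail confirmed by the paper.
    """
    if not pareto_dvhs or len(pareto_dvhs) != len(pareto_doses):
        raise ValueError("need one dose distribution per Pareto optimal DVH")
    # Distance from the desired DVH to every stored Pareto optimal DVH.
    distances = [math.dist(desired_dvh, dvh) for dvh in pareto_dvhs]
    # Index of the nearest plan; ties resolve to the lowest index.
    nearest = min(range(len(distances)), key=distances.__getitem__)
    one_hot = [1 if j == nearest else 0 for j in range(len(pareto_dvhs))]
    return one_hot, pareto_dvhs[nearest], pareto_doses[nearest]


# Toy example: three sampled plans, each with a 3-element DVH vector.
pareto_dvhs = [[1.0, 0.8, 0.2], [1.0, 0.6, 0.1], [1.0, 0.9, 0.5]]
# Strings stand in for the 3D dose arrays of the three plans.
pareto_doses = ["dose_plan_0", "dose_plan_1", "dose_plan_2"]
one_hot, dvh, dose = project_onto_pareto([1.0, 0.65, 0.15], pareto_dvhs, pareto_doses)
print(one_hot, dose)
```

In this toy case the desired DVH lies closest to the second stored plan, so that plan's dose distribution would serve as the training label.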
By using this projection operation, we now convert our original training dataset with $N$ pairs of Pareto optimal DVH curves and dose distributions, $\{X_i, D_{ij}, Y_{ij} \mid i \in 1, \cdots, M;\ j \in 1, \cdots, N\}$, into an augmented training dataset $\{X_i, \mathcal{D}_{ik}, \mathcal{Y}_{ik} \mid i \in 1, \cdots, M;\ k \in 1, \cdots, K\}$. Here, the data points $\{X_i, D_{ij}, Y_{ij}\}$ in the original training dataset are generated and saved before model training, while the desired DVH $\mathcal{D}_{ik}$ and the association relationship $\{\mathcal{D}_{ik}, S_{ik}\}$, and hence $\{\mathcal{D}_{ik}, \mathcal{Y}_{ik}\}$ in the augmented training dataset, can be calculated in an on-the-fly fashion during the training phase by solving problem (4). In this training strategy, the cost function can be expressed as:

$$\bar{w} = \arg\min_{w} \sum_{i=1}^{M} \sum_{k=1}^{K} \left\| \phi_w(X_i, \mathcal{D}_{ik}) - \mathcal{Y}_{ik} \right\|_2^2 \tag{5}$$

For the testing phase, one just needs to feed the network with the patient contours and a desired (unlikely Pareto optimal) DVH to predict an approximately Pareto optimal dose distribution with a DVH close to the desired one, without needing the DVH projection operation, i.e., $\bar{Y}_i = \phi_{\bar{w}}(X_i, \mathcal{D}_{ik})$.

Figure 2. The workflow of the proposed method for model training and testing.

Figure 3 details the architecture design for the deep neural network employed in Figure 2. In this work, we use a modified 3D U-Net architecture with an encoder (left half) and a decoder (right half). In more detail, the encoder first extracts the features from the patient anatomy input, which is a multi-channel representation, each channel representing the contour of an OAR or the PTV. Then, these patient anatomy features are concatenated with the desired DVH before they are fed into the decoder for individualized dose prediction. It should be noted that, in this work, the DVH is a vector representation, i.e., the dose volume histograms of each OAR and PTV are stacked into a 1D vector. For the inner architecture, in the encoder part, we use consecutive convolutional layers to extract the features (Fig. 3).
Each layer contains three operators: a convolutional operator, a batch normalization (BN) operator, and a rectified linear unit (ReLU). All the convolutional operators have a kernel size of 3 × 3 × 3, except the last two layers, which use a kernel size of 1 × 1 × 1 instead because of the limitation of the feature map size. Zero padding is applied to keep the feature size invariant during the convolution process. Six max-pooling operations with a 2 × 2 × 2 pooling size are employed to reduce the input size from 128 × 128 × 64 to 2 × 2 × 1, then one max-pooling with a 2 × 2 × 1 pooling size is utilized to obtain the feature maps with a size of 1 × 1 × 1. In the encoder part, we doubled the number of channels as the depth increased to capture more global feature information, and we capped the number of channels at 128 to speed up training. To prevent overfitting, we apply dropout after each convolutional layer. For the decoder, we use the double-channel strategy to construct the convolutional layers. All other layer details are the same as in the encoder.

Figure 3. Modified 3D U-Net architecture with two inputs and one output. The green boxes denote multi-channel 3D feature maps, and each white box indicates a copied 3D feature map. The number at the top of the box represents the channel number for each feature map, and the map size is at the lower left corner of the box.
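The pooling arithmetic described above can be checked with a few lines of code. The sketch below traces the feature map size and channel count through the encoder and then adds the 4 × 32 = 128-element DVH vector reported with the dataset at the 1 × 1 × 1 bottleneck. The starting channel count of 16 and the choice to double the channels right after each pooling step are assumptions made for illustration; `encoder_levels` is a hypothetical helper, not part of the authors' code.

```python
def encoder_levels(input_size=(128, 128, 64), base_channels=16, max_channels=128):
    """Trace (spatial size, channel count) at every encoder level.

    base_channels=16 and doubling immediately after each pooling step are
    illustrative assumptions; the cap of 128 channels comes from the text.
    """
    # Six 2x2x2 poolings followed by one 2x2x1 pooling, as described in the text.
    pooling = [(2, 2, 2)] * 6 + [(2, 2, 1)]
    size, channels = tuple(input_size), base_channels
    levels = [(size, channels)]
    for pool in pooling:
        size = tuple(s // p for s, p in zip(size, pool))
        channels = min(2 * channels, max_channels)
        levels.append((size, channels))
    return levels


levels = encoder_levels()
for size, channels in levels:
    print(size, channels)

# DVH vector length: 4 structures x 32 elements each.
dvh_length = 4 * 32
bottleneck_size, bottleneck_channels = levels[-1]
# Channels entering the decoder after concatenating the DVH vector.
decoder_input_channels = bottleneck_channels + dvh_length
print(bottleneck_size, decoder_input_channels)
```

Under these assumptions the spatial size reaches 2 × 2 × 1 after the sixth pooling and 1 × 1 × 1 after the seventh, and concatenating the DVH vector doubles the bottleneck channel count.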

[Figure 3 legend: conv 3 × 3 × 3 with batch normalization, ReLU, and dropout; max-pooling; conv 1 × 1 × 1 with batch normalization, ReLU, and dropout; deconv; copy; concatenate.]

To demonstrate the feasibility of our proposed method, we used data from 97 patients with prostate cancer. We generated ten Pareto optimal IMRT treatment plans for each patient. Four critical structures (rectum, bladder, body, and PTV) for each patient were used in IMRT planning with a standard 7-beam protocol. We split the dataset into a training dataset of 77 patients and a testing dataset of 20 patients. The dimensions of contours and dose distributions were 128 × 128 × 64, and each DVH vector contained 32 elements, so the total number of DVH elements was 128, which is also the channel number of contours at the bottom of the encoder. All dose distributions were normalized by PTV mean dose to generate a uniform dataset to stabilize the training process.

Both model training and testing require physician-tuned desired DVHs $\mathcal{D}_{ik}$ as model inputs. The corresponding dose distributions $\mathcal{Y}_{ik}$ are required for training as labels and for testing for performance evaluation. Here, $\mathcal{Y}_{ik}$ should be approximately Pareto optimal and should have DVHs close to $\mathcal{D}_{ik}$ in the $l$-norm sense (Equation (4)). Therefore, we augmented the training dataset from $\{X_i, D_{ij}, Y_{ij} \mid i \in 1, \cdots, 77;\ j \in 1, \cdots, 10\}$ to $\{X_i, \mathcal{D}_{ik}, \mathcal{Y}_{ik} \mid i \in 1, \cdots, 77;\ k \in 1, \cdots, K\}$ and the testing dataset from $\{X_i, D_{ij}, Y_{ij} \mid i \in 78, \cdots, 97;\ j \in 1, \cdots, 10\}$ to $\{X_i, \mathcal{D}_{ik}, \mathcal{Y}_{ik} \mid i \in 78, \cdots, 97;\ k \in 1, \cdots, K'\}$.

In this feasibility study, we generated $\mathcal{D}_{ik}$ by randomly choosing a patient and a plan for patient $i$ from $\{X_i, D_{ij}, Y_{ij} \mid i \in 1, \cdots, 77;\ j \in 1, \cdots, 10\}$ for training and from $\{X_i, D_{ij}, Y_{ij} \mid i \in 78, \cdots, 97;\ j \in 1, \cdots, 10\}$ for testing to mimic the physician’s desired DVH tuned on the fly during the training and testing process. The corresponding $D_{ij}$ (and then $Y_{ij}$) was selected through Equation (4). Here, $K$ is about 770 for training, and $K'$ is about 200 for testing.

We adopted the Adam optimizer to minimize the loss function (5). The batch size was 3, and the model was trained for 300 epochs.
The learning rate decayed as the number of iterations increased, which can be defined as:

$$lr = \frac{lr_{initial}}{1 + decay \times iterations} \tag{6}$$

where the initial learning rate $lr_{initial}$ is set as 0.001, the decay factor $decay$ is 0.002, and $iterations$ denotes the number of weight updates performed. Since deeper layers can more easily be overfitted due to the greater number of weights, we gradually increased the dropout rate based on the following equation:

$$dropout = rate_{initial} \times \left( \frac{layer_{current}}{layer_{max}} \right) \tag{7}$$

where the initial rate $rate_{initial}$ was set to 0.2 and the max layer $layer_{max}$ was equal to 7. In this work, we utilized a workstation equipped with 12 NVIDIA Tesla K80 GPUs to implement our network in the Keras library with a TensorFlow back end.
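The two schedules can be illustrated with a short sketch. It assumes that Equation (6) takes the inverse-time form lr_initial / (1 + decay × iterations) and that Equation (7) scales the initial dropout rate linearly with the layer index; both function names are hypothetical, and the defaults are the values quoted in the text.

```python
def learning_rate(iterations, lr_initial=0.001, decay=0.002):
    """Inverse-time learning rate decay (assumed form of Equation (6))."""
    return lr_initial / (1.0 + decay * iterations)


def dropout_rate(layer_current, rate_initial=0.2, layer_max=7):
    """Dropout rate that grows linearly with depth (assumed form of Equation (7))."""
    return rate_initial * (layer_current / layer_max)


# Learning rate after 0, 500 and 5000 weight updates.
for step in (0, 500, 5000):
    print(step, learning_rate(step))

# Dropout rate at each depth from 1 (shallowest) to 7 (deepest).
for layer in range(1, 8):
    print(layer, round(dropout_rate(layer), 4))
```

With these settings the learning rate halves after 500 weight updates, and the deepest layer uses the full dropout rate of 0.2.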

3. Results

Figure 4 shows the losses over epochs, where both the training loss and the validation loss follow a convergence trend, though the validation loss has some oscillations. This suggests that the model gradually approaches a stable solution as the number of epochs increases.

Figure 4. Loss values as a function of epochs in the training phase. Although the validation loss (orange line) has some oscillations, both the training loss (blue line) and the validation loss follow a convergence trend.

Figure 5 shows the individualized dose distributions predicted for a test patient, where each row represents a Pareto optimal plan of a different PTV/OAR trade-off for this patient. We can see that the predicted dose distributions are quite close to the corresponding true dose distributions for different PTV/OAR trade-offs. Although the dose distributions in the first and second rows are based on the same patient’s anatomy, they are very different due to the different desired trade-offs. The predicted plan in the second row has much better rectum sparing with a slightly higher bladder dose, which might be preferable clinically.

Figure 5. Individualized dose distributions predicted for a testing patient. From left column to right column: contours, true dose, predicted dose, difference map (true − prediction), and DVH comparison between true (solid line) and predicted (dashed line) dose distributions. The input desired DVH curves are also shown as dotted lines. All dose values are normalized to the prescription dose. Each row represents a Pareto optimal plan of a different PTV/OAR trade-off for this patient.

For each OAR and PTV of the 20 test patients, we computed the differences between the predicted and the true mean and maximum doses. For all calculations, we subtracted the predicted dose from the true dose, then normalized the value to the prescription dose. The results are presented via violin plots in Figure 6 and via means and standard deviations in Table 1. We can see that the overall performance of the developed model is quite good. Specifically, the difference between the predicted and the true maximum OAR or PTV dose has both mean values and standard deviations within 2% of the prescription dose. The difference between the predicted and the true mean PTV doses is very small, with about 0.5% mean value and 0.2% standard deviation, because of the normalization to the prescription dose. The difference for body dose is also very small. However, the standard deviations of the differences between the predicted and the true mean rectum and bladder doses are quite large, close to 5%, though the mean values are still small (0.5% for rectum and 1.6% for bladder). The relatively large standard deviations for the mean dose differences in those two OARs may be explained by the small number of plans for each patient in the training and testing datasets ($N = 10$). This will be discussed in the next section.

Figure 6. Violin plot of the differences, normalized to the prescription dose, between the predicted and true (a) mean and (b) maximum doses in each OAR and PTV for all test patients.

Table 1. Mean and maximum dose differences between the true and predicted doses in each OAR and PTV (avg. ± std., in % of the prescription dose).

Structure   Mean dose difference   Max dose difference
Body        0.33 ± 0.73            1.21 ± 1.90
Bladder     1.63 ± 4.48            0.02 ± 1.17
Rectum      0.54 ± 4.83            0.64 ± 1.46
PTV         0.52 ± 0.23            1.83 ± 1.09
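The evaluation described above can be sketched as follows: for one structure, take the true-minus-predicted difference in mean and maximum dose, express it as a percentage of the prescription dose, and then summarize the per-plan values as average ± standard deviation. This is an illustrative sketch rather than the authors' evaluation code; the function names, the flat-list voxel representation, and the use of the sample standard deviation are assumptions.

```python
from statistics import mean, stdev


def dose_differences(true_dose, predicted_dose, mask, prescription_dose):
    """Return (mean, max) dose differences for one structure.

    Differences are true minus predicted, in % of the prescription dose.
    true_dose and predicted_dose are flat lists of voxel doses; mask marks
    the voxels that belong to the structure (assumed representation).
    """
    true_in = [t for t, m in zip(true_dose, mask) if m]
    pred_in = [p for p, m in zip(predicted_dose, mask) if m]
    if not true_in:
        raise ValueError("mask selects no voxels")
    scale = 100.0 / prescription_dose
    mean_diff = (mean(true_in) - mean(pred_in)) * scale
    max_diff = (max(true_in) - max(pred_in)) * scale
    return mean_diff, max_diff


def summarize(differences):
    """Average and sample standard deviation over plans (assumed convention)."""
    return mean(differences), stdev(differences)


# Toy example: three voxels, of which the first two belong to the structure.
mean_diff, max_diff = dose_differences(
    true_dose=[60.0, 62.0, 20.0],
    predicted_dose=[59.0, 61.0, 25.0],
    mask=[1, 1, 0],
    prescription_dose=50.0,
)
print(mean_diff, max_diff)
print(summarize([1.0, 2.0, 3.0]))
```

A signed difference is kept here because the text states that the predicted dose is subtracted from the true dose; whether the reported averages use signed or absolute values is not stated.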

4. Discussion and Conclusions

Nguyen et al. discovered in 2017 that the relationship between the clinically optimal dose distribution and patient anatomy can be learned through supervised learning and then demonstrated that the trained DL model can predict the dose distribution given the PTV and OAR masks without going through the traditional treatment planning process [6-7]. Since then, many studies have exploited this idea [10-13]. In our clinic, we have implemented this work in routine clinical practice to assist physicians and treatment planners in producing treatment plans with higher efficiency and consistently higher quality. During the clinical implementation process, we realized that the predicted dose distributions represent a population average of the previously delivered treatment plans and often require further tuning to suit the treatment goal for a particular patient. When the physician tunes the predicted dose distribution to generate dose volume constraints as directives to guide the treatment planner, the tuned dose volume constraints may not correspond to a Pareto optimal plan and may not be achievable. Then, the clinical significance of using DL-based dose prediction to guide treatment planning will be greatly diminished.

To solve this problem, we developed another DL model that takes a desired DVH as input in addition to the patient’s anatomy. We propose a new clinical workflow based on this DL model: 1) an anatomy-based DL model will be used to predict a population-averaged dose distribution and the corresponding DVH; 2) the physician or planner will tune the predicted DVH to reflect the individualized planning goals for a particular patient; and 3) the proposed DL model will be used to predict the refined dose distribution and DVH using the patient anatomy and the tuned DVH as inputs. Steps 2) and 3) can be repeated until the desired dose distribution and DVH are achieved.

This paper presents a feasibility study of the proposed DL model.
We used 77 patients treated with IMRT for prostate cancer for model training and 20 patients for testing. Each patient has 10 plans corresponding to different PTV/OAR trade-offs. The results shown in Figures 5 and 6 and Table 1 demonstrate the feasibility of the proposed model. The relatively large standard deviations (~5% of the prescription dose) for the mean dose differences in rectum and bladder are likely due to the small number of plans for each patient in the training and testing datasets. With only 10 plans, the Pareto surface is sampled at a very low resolution, which led to the sub-optimal model performance in the testing phase. When the proposed method is used clinically, many more Pareto optimal plans should be generated for each training patient. This could be very time-consuming, but it could be done automatically by scripting the planning systems overnight when the systems are not in use clinically.

Although we have demonstrated the feasibility of the proposed method, a much more comprehensive and clinically realistic performance evaluation study should be conducted before it can be used clinically. Ideally, the trained DL model needs to be integrated with other clinical software and tested by physicians and treatment planners on real patient cases. An important evaluation metric should be the efficiency gain of this method compared with the traditional iterative communication between physicians and planners.

In conclusion, we have developed and demonstrated the feasibility of a DL model that can predict individualized and approximately Pareto optimal 3D dose distributions by using the desired DVH as input in addition to the patient’s anatomy. This model can facilitate a new clinical workflow to guide treatment planning based on DL-based dose prediction.

Acknowledgements

This work was supported by an NIH R01 grant (1R01CA237269-01). We would like to thank Dr. Jonathan Feinberg for editing the manuscript.

References

1. Das IJ, Cheng C-W, Chopra KL, Mitra RK, Srivastava SP, Glatstein E. Intensity-modulated radiation therapy dose prescription, recording, and delivery: patterns of variability among institutions and treatment planning systems. Journal of the National Cancer Institute.

[...] Practical Radiation Oncology.

[...] Medical Physics.

[...]-volume histograms for organs-at-risk in IMRT planning. Medical Physics.

[...]-and-neck case study. Medical Physics.

[...] Scientific Reports.

[...] Physics in Medicine & Biology.

Barragán-Montero AM, Nguyen D, Lu W, et al. Three-dimensional dose prediction for lung IMRT patients with deep neural networks: robust learning from heterogeneous beam configurations. Medical Physics.

[...]-dimensional dose distribution predicted from deep learning technique. Medical Physics.

[...]-specific dose distributions for radiotherapy using deep learning. Medical Physics.

[...]-tunable Pareto optimal dose distribution for intensity-modulated radiation therapy. Medical Physics.

[...] Radiotherapy and Oncology.

[...]-volume histogram and adversarial inspired framework for generating Pareto optimal dose distributions in radiation therapy. Medical Physics.

[...] The Journal of Machine Learning Research.

[...] arXiv preprint arXiv:1412.6980.

[...] arXiv preprint arXiv:1603.04467. 2016.