Improving Error Detection in Deep Learning Based Radiotherapy Autocontouring Using Bayesian Uncertainty

Conference paper (2022)

Authors

Prerak Mody Leiden University Medical Center

Nicolas F. Chaves-de-Plaza Computer Graphics and Visualisation -

K.A. Hildebrandt Computer Graphics and Visualisation -

M. Staring Leiden University Medical Center, Pattern Recognition and Bioinformatics -

Research Group

Computer Graphics and Visualisation () (TU Delft)

DOI: https://doi.org/10.1007/978-3-031-16749-2_7

Deep learning Radiotherapy Bayesian uncertainty Quality assessment FlipOut AvU loss Organs-at-Risk Uncertainty-ROC

To reference this document use:

http://resolver.tudelft.nl/uuid:12f1b627-18e9-4e00-a42f-81adad5b946f

More Info

expand_more

Published Date

2022

Language

English

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Faculty

Electrical Engineering, Mathematics and Computer Science

Department

Intelligent Systems

Research Group

Computer Graphics and Visualisation

Abstract

Bayesian Neural Nets (BNN) are increasingly used for robust organ auto-contouring. Uncertainty heatmaps extracted from BNNs have been shown to correspond to inaccurate regions. To help speed up the mandatory quality assessment (QA) of contours in radiotherapy, these heatmaps could be used as stimuli to direct visual attention of clinicians to potential inaccuracies. In practice, this is non-trivial to achieve since many accurate regions also exhibit uncertainty. To influence the output uncertainty of a BNN, we propose a modified accuracy-versus-uncertainty (AvU) metric as an additional objective during model training that penalizes both accurate regions exhibiting uncertainty as well as inaccurate regions exhibiting certainty. For evaluation, we use an uncertainty-ROC curve that can help differentiate between Bayesian models by comparing the probability of uncertainty in inaccurate versus accurate regions. We train and evaluate a FlipOut BNN model on the MICCAI2015 Head and Neck Segmentation challenge dataset and on the DeepMind-TCIA dataset, and observed an increase in the AUC of uncertainty-ROC curves by 5.6% and 5.9%, respectively, when using the AvU objective. The AvU objective primarily reduced false positives regions (uncertain and accurate), drawing less visual attention to these regions, thereby potentially improving the speed of error detection.

Files

978_3_031_16749_2_7.pdf

(pdf | 2.83 Mb)

- Embargo expired in 01-07-2023

Unknown license