TY - JOUR
T1 - Evaluation of algorithms for Multi-Modality Whole Heart Segmentation
T2 - An open-access grand challenge
AU - Zhuang, Xiahai
AU - Li, Lei
AU - Payer, Christian
AU - Štern, Darko
AU - Urschler, Martin
AU - Heinrich, Mattias P
AU - Oster, Julien
AU - Wang, Chunliang
AU - Smedby, Örjan
AU - Bian, Cheng
AU - Yang, Xin
AU - Heng, Pheng-Ann
AU - Mortazi, Aliasghar
AU - Bagci, Ulas
AU - Yang, Guanyu
AU - Sun, Chenchen
AU - Galisot, Gaetan
AU - Ramel, Jean-Yves
AU - Brouard, Thierry
AU - Tong, Qianqian
AU - Si, Weixin
AU - Liao, Xiangyun
AU - Zeng, Guodong
AU - Shi, Zenglin
AU - Zheng, Guoyan
AU - Wang, Chengjia
AU - MacGillivray, Tom
AU - Newby, David
AU - Rhode, Kawal
AU - Ourselin, Sebastien
AU - Mohiaddin, Raad
AU - Keegan, Jennifer
AU - Firmin, David
AU - Yang, Guang
N1 - Copyright © 2019. Published by Elsevier B.V.
PY - 2019/12
Y1 - 2019/12
N2 - Knowledge of whole heart anatomy is a prerequisite for many clinical applications. Whole heart segmentation (WHS), which delineates substructures of the heart, can be very valuable for modeling and analysis of the anatomy and functions of the heart. However, automating this segmentation can be challenging due to the large variation of the heart shape, and different image qualities of the clinical data. To achieve this goal, an initial set of training data is generally needed for constructing priors or for training. Furthermore, it is difficult to perform comparisons between different methods, largely due to differences in the datasets and evaluation metrics used. This manuscript presents the methodologies and evaluation results for the WHS algorithms selected from the submissions to the Multi-Modality Whole Heart Segmentation (MM-WHS) challenge, in conjunction with MICCAI 2017. The challenge provided 120 three-dimensional cardiac images covering the whole heart, including 60 CT and 60 MRI volumes, all acquired in clinical environments with manual delineation. Ten algorithms for CT data and eleven algorithms for MRI data, submitted from twelve groups, have been evaluated. The results showed that the performance of CT WHS was generally better than that of MRI WHS. The segmentation of the substructures for different categories of patients could present different levels of challenge due to the difference in imaging and variations of heart shapes. The deep learning (DL)-based methods demonstrated great potential, though several of them reported poor results in the blinded evaluation. Their performance could vary greatly across different network structures and training strategies. The conventional algorithms, mainly based on multi-atlas segmentation, demonstrated good performance, though the accuracy and computational efficiency could be limited. The challenge, including provision of the annotated training data and the blinded evaluation for submitted algorithms on the test data, continues as an ongoing benchmarking resource via its homepage (www.sdspeople.fudan.edu.cn/zhuangxiahai/0/mmwhs/).
AB - Knowledge of whole heart anatomy is a prerequisite for many clinical applications. Whole heart segmentation (WHS), which delineates substructures of the heart, can be very valuable for modeling and analysis of the anatomy and functions of the heart. However, automating this segmentation can be challenging due to the large variation of the heart shape, and different image qualities of the clinical data. To achieve this goal, an initial set of training data is generally needed for constructing priors or for training. Furthermore, it is difficult to perform comparisons between different methods, largely due to differences in the datasets and evaluation metrics used. This manuscript presents the methodologies and evaluation results for the WHS algorithms selected from the submissions to the Multi-Modality Whole Heart Segmentation (MM-WHS) challenge, in conjunction with MICCAI 2017. The challenge provided 120 three-dimensional cardiac images covering the whole heart, including 60 CT and 60 MRI volumes, all acquired in clinical environments with manual delineation. Ten algorithms for CT data and eleven algorithms for MRI data, submitted from twelve groups, have been evaluated. The results showed that the performance of CT WHS was generally better than that of MRI WHS. The segmentation of the substructures for different categories of patients could present different levels of challenge due to the difference in imaging and variations of heart shapes. The deep learning (DL)-based methods demonstrated great potential, though several of them reported poor results in the blinded evaluation. Their performance could vary greatly across different network structures and training strategies. The conventional algorithms, mainly based on multi-atlas segmentation, demonstrated good performance, though the accuracy and computational efficiency could be limited. The challenge, including provision of the annotated training data and the blinded evaluation for submitted algorithms on the test data, continues as an ongoing benchmarking resource via its homepage (www.sdspeople.fudan.edu.cn/zhuangxiahai/0/mmwhs/).
U2 - 10.1016/j.media.2019.101537
DO - 10.1016/j.media.2019.101537
M3 - Article
C2 - 31446280
SN - 1361-8415
VL - 58
SP - 101537
JO - Medical Image Analysis
JF - Medical Image Analysis
ER -