TY - JOUR
T1 - Explainable Deep Learning Improves Physician Interpretation of Myocardial Perfusion Imaging
AU - Miller, Robert J.H.
AU - Kuronuma, Keiichiro
AU - Singh, Ananya
AU - Otaki, Yuka
AU - Hayes, Sean
AU - Chareonthaitawee, Panithaya
AU - Kavanagh, Paul
AU - Parekh, Tejas
AU - Tamarappoo, Balaji K.
AU - Sharir, Tali
AU - Einstein, Andrew J.
AU - Fish, Mathews B.
AU - Ruddy, Terrence D.
AU - Kaufmann, Philipp A.
AU - Sinusas, Albert J.
AU - Miller, Edward J.
AU - Bateman, Timothy M.
AU - Dorbala, Sharmila
AU - Carli, Marcelo Di
AU - Cadet, Sebastien
AU - Liang, Joanna X.
AU - Dey, Damini
AU - Berman, Daniel S.
AU - Slomka, Piotr J.
N1 - Publisher Copyright:
© 2022 by the Society of Nuclear Medicine and Molecular Imaging.
PY - 2022/11/1
Y1 - 2022/11/1
N2 - Artificial intelligence may improve accuracy of myocardial perfusion imaging (MPI) but will likely be implemented as an aid to physician interpretation rather than an autonomous tool. Deep learning (DL) has high standalone diagnostic accuracy for obstructive coronary artery disease (CAD), but its influence on physician interpretation is unknown. We assessed whether access to explainable DL predictions improves physician interpretation of MPI. Methods: We selected a representative cohort of patients who underwent MPI with reference invasive coronary angiography. Obstructive CAD, defined as stenosis ≥50% in the left main artery or ≥70% in other coronary segments, was present in half of the patients. We used an explainable DL model (CAD-DL), which was previously developed in a separate population from different sites. Three physicians interpreted studies first with clinical history, stress, and quantitative perfusion, then with all the data plus the DL results. Diagnostic accuracy was assessed using area under the receiver-operating-characteristic curve (AUC). Results: In total, 240 patients with a median age of 65 y (interquartile range 58-73) were included. The diagnostic accuracy of physician interpretation with CAD-DL (AUC 0.779) was significantly higher than that of physician interpretation without CAD-DL (AUC 0.747, P = 0.003) and stress total perfusion deficit (AUC 0.718, P < 0.001). With matched specificity, CAD-DL had higher sensitivity when operating autonomously compared with readers without DL results (P < 0.001), but not compared with readers interpreting with DL results (P = 0.122). All readers had numerically higher accuracy with CAD-DL, with AUC improvement 0.02-0.05, and interpretation with DL resulted in overall net reclassification improvement of 17.2% (95% CI 9.2%-24.4%, P < 0.001). Conclusion: Explainable DL predictions lead to meaningful improvements in physician interpretation; however, the improvement varied across the readers, reflecting the acceptance of this new technology. This technique could be implemented as an aid to physician diagnosis, improving the diagnostic accuracy of MPI.
AB - Artificial intelligence may improve accuracy of myocardial perfusion imaging (MPI) but will likely be implemented as an aid to physician interpretation rather than an autonomous tool. Deep learning (DL) has high standalone diagnostic accuracy for obstructive coronary artery disease (CAD), but its influence on physician interpretation is unknown. We assessed whether access to explainable DL predictions improves physician interpretation of MPI. Methods: We selected a representative cohort of patients who underwent MPI with reference invasive coronary angiography. Obstructive CAD, defined as stenosis ≥50% in the left main artery or ≥70% in other coronary segments, was present in half of the patients. We used an explainable DL model (CAD-DL), which was previously developed in a separate population from different sites. Three physicians interpreted studies first with clinical history, stress, and quantitative perfusion, then with all the data plus the DL results. Diagnostic accuracy was assessed using area under the receiver-operating-characteristic curve (AUC). Results: In total, 240 patients with a median age of 65 y (interquartile range 58-73) were included. The diagnostic accuracy of physician interpretation with CAD-DL (AUC 0.779) was significantly higher than that of physician interpretation without CAD-DL (AUC 0.747, P = 0.003) and stress total perfusion deficit (AUC 0.718, P < 0.001). With matched specificity, CAD-DL had higher sensitivity when operating autonomously compared with readers without DL results (P < 0.001), but not compared with readers interpreting with DL results (P = 0.122). All readers had numerically higher accuracy with CAD-DL, with AUC improvement 0.02-0.05, and interpretation with DL resulted in overall net reclassification improvement of 17.2% (95% CI 9.2%-24.4%, P < 0.001). Conclusion: Explainable DL predictions lead to meaningful improvements in physician interpretation; however, the improvement varied across the readers, reflecting the acceptance of this new technology. This technique could be implemented as an aid to physician diagnosis, improving the diagnostic accuracy of MPI.
KW - artificial intelligence
KW - deep learning
KW - implementation
UR - http://www.scopus.com/inward/record.url?scp=85139396800&partnerID=8YFLogxK
U2 - 10.2967/jnumed.121.263686
DO - 10.2967/jnumed.121.263686
M3 - Article
C2 - 35512997
AN - SCOPUS:85139396800
SN - 1535-5667
VL - 63
SP - 1768
EP - 1774
JO - Journal of nuclear medicine : official publication, Society of Nuclear Medicine
JF - Journal of nuclear medicine : official publication, Society of Nuclear Medicine
IS - 11
ER -