TY - JOUR
T1 - Automated Differentiation between Osteoporotic Vertebral Fracture and Malignant Vertebral Fracture on MRI Using a Deep Convolutional Neural Network
AU - Yoda, Takafumi
AU - Maki, Satoshi
AU - Furuya, Takeo
AU - Yokota, Hajime
AU - Matsumoto, Koji
AU - Takaoka, Hiromitsu
AU - Miyamoto, Takuya
AU - Okimatsu, Sho
AU - Shiga, Yasuhiro
AU - Inage, Kazuhide
AU - Orita, Sumihisa
AU - Eguchi, Yawara
AU - Yamashita, Takeshi
AU - Masuda, Yoshitada
AU - Uno, Takashi
AU - Ohtori, Seiji
N1 - Publisher Copyright:
© 2022 Lippincott Williams and Wilkins. All rights reserved.
PY - 2022/4/15
Y1 - 2022/4/15
N2 - Study Design.Retrospective study of magnetic resonance imaging (MRI).Objectives.To assess the ability of a convolutional neural network (CNN) model to differentiate osteoporotic vertebral fractures (OVFs) and malignant vertebral compression fractures (MVFs) using short-TI inversion recovery (STIR) and T1-weighted images (T1WI) and to compare it to the performance of three spine surgeons.Summary of Background Data.Differentiating between OVFs and MVFs is crucial for appropriate clinical staging and treatment planning. However, an accurate diagnosis is sometimes difficult. Recently, CNN modeling-an artificial intelligence technique-has gained popularity in the radiology field.Methods.We enrolled 50 patients with OVFs and 47 patients with MVFs who underwent thoracolumbar MRI. Sagittal STIR images and sagittal T1WI were used to train and validate the CNN models. To assess the performance of the CNN, the receiver operating characteristic curve was plotted and the area under the curve was calculated. We also compared the accuracy, sensitivity, and specificity of the diagnosis made by the CNN and three spine surgeons.Results.The area under the curve of receiver operating characteristic curves of the CNN based on STIR images and T1WI were 0.967 and 0.984, respectively. The CNN model based on STIR images showed a performance of 93.8% accuracy, 92.5% sensitivity, and 94.9% specificity. On the other hand, the CNN model based on T1WI showed a performance of 96.4% accuracy, 98.1% sensitivity, and 94.9% specificity. The accuracy and specificity of the CNN using both STIR and T1WI were statistically equal to or better than that of three spine surgeons. There were no significant differences in sensitivity based on both STIR images and T1WI between the CNN and spine surgeons.Conclusion.We successfully differentiated OVFs and MVFs based on MRI with high accuracy using the CNN model, which was statistically equal or superior to that of the spine surgeons.Level of Evidence: 4.
AB - Study Design.Retrospective study of magnetic resonance imaging (MRI).Objectives.To assess the ability of a convolutional neural network (CNN) model to differentiate osteoporotic vertebral fractures (OVFs) and malignant vertebral compression fractures (MVFs) using short-TI inversion recovery (STIR) and T1-weighted images (T1WI) and to compare it to the performance of three spine surgeons.Summary of Background Data.Differentiating between OVFs and MVFs is crucial for appropriate clinical staging and treatment planning. However, an accurate diagnosis is sometimes difficult. Recently, CNN modeling-an artificial intelligence technique-has gained popularity in the radiology field.Methods.We enrolled 50 patients with OVFs and 47 patients with MVFs who underwent thoracolumbar MRI. Sagittal STIR images and sagittal T1WI were used to train and validate the CNN models. To assess the performance of the CNN, the receiver operating characteristic curve was plotted and the area under the curve was calculated. We also compared the accuracy, sensitivity, and specificity of the diagnosis made by the CNN and three spine surgeons.Results.The area under the curve of receiver operating characteristic curves of the CNN based on STIR images and T1WI were 0.967 and 0.984, respectively. The CNN model based on STIR images showed a performance of 93.8% accuracy, 92.5% sensitivity, and 94.9% specificity. On the other hand, the CNN model based on T1WI showed a performance of 96.4% accuracy, 98.1% sensitivity, and 94.9% specificity. The accuracy and specificity of the CNN using both STIR and T1WI were statistically equal to or better than that of three spine surgeons. There were no significant differences in sensitivity based on both STIR images and T1WI between the CNN and spine surgeons.Conclusion.We successfully differentiated OVFs and MVFs based on MRI with high accuracy using the CNN model, which was statistically equal or superior to that of the spine surgeons.Level of Evidence: 4.
KW - artificial intelligence
KW - deep convolutional neural network
KW - deep learning
KW - malignant vertebral compression fractures
KW - osteoporotic vertebral fractures
KW - spinal metastasis
UR - http://www.scopus.com/inward/record.url?scp=85129997074&partnerID=8YFLogxK
U2 - 10.1097/BRS.0000000000004307
DO - 10.1097/BRS.0000000000004307
M3 - Article
C2 - 34919075
AN - SCOPUS:85129997074
SN - 0362-2436
VL - 47
SP - E347-E352
JO - Spine
JF - Spine
IS - 8
ER -