Evaluation Challenges for Computer-Aided Diagnostic Characterization

Data Mining ◽  
2013 ◽  
pp. 1794-1818
Author(s):  
William H. Horsthemke ◽  
Daniela S. Raicu ◽  
Jacob D. Furst ◽  
Samuel G. Armato

Evaluating the success of computer-aided decision support systems depends upon a reliable reference standard, a ground truth. The ideal gold standard is expected to result from the marking, labeling, and rating by domain experts of the image of interest. However experts often disagree, and this lack of agreement challenges the development and evaluation of image-based feature prediction of expert-defined “truth.” The following discussion addresses the success and limitation of developing computer-aided models to characterize suspicious pulmonary nodules based upon ratings provided by multiple expert radiologists. These prediction models attempt to bridge the semantic gap between images and medically-meaningful, descriptive opinions about visual characteristics of nodules. The resultant computer-aided diagnostic characterizations (CADc) are directly usable for indexing and retrieving in content-based medical image retrieval and supporting computer-aided diagnosis. The predictive performance of CADc models are directly related to the extent of agreement between radiologists; the models better predict radiologists’ opinions when radiologists agree more with each other about the characteristics of nodules.

Author(s):  
William H. Horsthemke ◽  
Daniela S. Raicu ◽  
Jacob D. Furst ◽  
Samuel G. Armato

Evaluating the success of computer-aided decision support systems depends upon a reliable reference standard, a ground truth. The ideal gold standard is expected to result from the marking, labeling, and rating by domain experts of the image of interest. However experts often disagree, and this lack of agreement challenges the development and evaluation of image-based feature prediction of expert-defined “truth.” The following discussion addresses the success and limitation of developing computer-aided models to characterize suspicious pulmonary nodules based upon ratings provided by multiple expert radiologists. These prediction models attempt to bridge the semantic gap between images and medically-meaningful, descriptive opinions about visual characteristics of nodules. The resultant computer-aided diagnostic characterizations (CADc) are directly usable for indexing and retrieving in content-based medical image retrieval and supporting computer-aided diagnosis. The predictive performance of CADc models are directly related to the extent of agreement between radiologists; the models better predict radiologists’ opinions when radiologists agree more with each other about the characteristics of nodules.


2010 ◽  
Vol 50 (1) ◽  
pp. 43-53 ◽  
Author(s):  
Michael C. Lee ◽  
Lilla Boroczky ◽  
Kivilcim Sungur-Stasik ◽  
Aaron D. Cann ◽  
Alain C. Borczuk ◽  
...  

2009 ◽  
Vol 36 (7) ◽  
pp. 3086-3098 ◽  
Author(s):  
Ted W. Way ◽  
Berkman Sahiner ◽  
Heang-Ping Chan ◽  
Lubomir Hadjiiski ◽  
Philip N. Cascade ◽  
...  

2020 ◽  
Vol 10 (5) ◽  
pp. 1033-1039
Author(s):  
Huihong Duan ◽  
Xu Wang ◽  
Xingyi He ◽  
Yonggang He ◽  
Litao Song ◽  
...  

Background: In the pulmonary nodules computer aided diagnosis systems (CAD), feature selection plays an important role in reducing the false positive rate and improving the system accuracy. To solve the problem of feature selection techniques by which the diversity of features was damaged in the process of distinguishing malignant pulmonary nodules from benign pulmonary nodules, this study developed a novel feature selection algorithm for improving the accuracy of traditional computer-aided differential diagnosis for benign and malignant classification of pulmonary nodules. Method: Firstly, we divided the extracted features of nodules into several groups by using Gaussian mixture model (GMM). Secondly, we applied Relief and sequential forward selection (SFS) algorithm to find local optimum features dataset for each group. Afterwards, we used the optimumpath forest (OPF) classifier with the found features dataset to obtain the classification results. Finally, the local optimum features dataset with the highest area under curve AUC in all groups were added into the final selected set. Results: According to collected pulmonary nodules on computed tomography (CT) scans, tested with two set of samples, we achieved an average accuracy of 89.5%, sensitivity of 87.1% and specificity of 90.9% on the first set of samples, and 90.1%, 88.7% and 92.1% on the second set of samples. The areas under the receiver operating characteristic (ROC) curves based on these two sample sets were 95.2%, and 96.3% respectively. Conclusions: This study shows that the proposed method was promising for improving the pulmonary nodules computer aided diagnosis systems performance of benign and malignant pulmonary nodules.


Radiology ◽  
1999 ◽  
Vol 213 (3) ◽  
pp. 723-726 ◽  
Author(s):  
Heber MacMahon ◽  
Roger Engelmann ◽  
Fred M. Behlen ◽  
Kenneth R. Hoffmann ◽  
Takayuki Ishida ◽  
...  

Radiology ◽  
2004 ◽  
Vol 230 (2) ◽  
pp. 347-352 ◽  
Author(s):  
Kazuo Awai ◽  
Kohei Murao ◽  
Akio Ozawa ◽  
Masanori Komi ◽  
Haruo Hayakawa ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document