Can incorrect artificial intelligence (AI) results impact radiologists, and if so, what can we do about it? A multi-reader pilot study of lung cancer detection with chest radiography - Report - MDSpire
Advertisement
Can incorrect artificial intelligence (AI) results impact radiologists, and if so, what can we do about it? A multi-reader pilot study of lung cancer detection with chest radiography
Clinical Report: Impact of Erroneous AI on Radiologists in Lung Cancer Chest X-rays
Overview
This pilot study evaluated how incorrect AI outputs affect radiologist performance in lung cancer detection on chest radiographs and whether human factors modifications can mitigate these effects. Findings indicate that false AI results can mislead radiologists, but strategies such as deleting AI feedback or visually outlining suspicious areas may reduce errors.
Background
Artificial intelligence (AI) use in radiology has expanded rapidly, with many systems approved for clinical use and increasing adoption by radiologists. AI algorithms for lung cancer detection on chest imaging demonstrate good but imperfect accuracy, with notable false positive and false negative rates. While AI can improve radiologist performance overall, erroneous AI feedback may adversely influence diagnostic decisions. Understanding and optimizing human factors in AI implementation is critical to minimizing these negative impacts and improving patient outcomes.
Data Highlights
Condition
AI Feedback
False Positives (FP)
False Negatives (FN)
ROC-AUC
No AI
None
0
0
NA
AI Keep (No Box)
Keep AI result, no box
8
4
0.87
AI Delete (No Box)
Delete AI result, no box
8
4
0.87
AI Keep (Box)
Keep AI result with box outlining abnormality
8
4
0.87
Key Findings
AI systems for lung cancer detection on chest X-rays had a ROC-AUC of 0.87 with 8 false positives and 4 false negatives among 90 cases.
Radiologists’ diagnostic performance was negatively impacted by incorrect AI feedback, leading to potential misdiagnoses.
Deleting AI feedback from patient files reduced the deleterious influence of false AI results on radiologist decisions.
Providing a visual box outlining the AI-identified suspicious area improved radiologist accuracy compared to AI feedback without a box.
The order of AI feedback presentation was controlled to prevent contamination of radiologist interpretation across conditions.
Clinical Implications
Clinicians should be aware that erroneous AI outputs can bias radiologist interpretations, potentially leading to missed or incorrect diagnoses. Implementing human factors strategies such as removing incorrect AI feedback from patient records or visually highlighting AI-flagged abnormalities may mitigate these risks. Careful integration of AI tools with attention to feedback presentation is essential to optimize diagnostic accuracy and patient care.
Conclusion
Erroneous AI results can adversely affect radiologist performance in lung cancer detection on chest radiographs, but optimizing human factors in AI implementation—such as deleting AI feedback or using visual cues—can reduce this impact. These findings underscore the importance of thoughtful AI integration to enhance rather than hinder clinical decision-making.
References
ACR Data Science Institute and related studies (2020-2022) -- AI in Radiology Approvals and Usage
Alberdi et al. (Mammography CAD Study) -- Impact of False Negative AI Feedback on Radiologist Decisions
Review of 503 AI Studies (2020) -- AI Accuracy in Lung Nodule Detection
Human Factors Psychology in Radiology -- Optimizing AI Implementation
by Michael H. Bernstein, Michael K. Atalay, Elizabeth H. Dibble, Aaron W. P. Maxwell, Adib R. Karam, Saurabh Agarwal, Robert C. Ward, Terrance T. Healey, Grayson L. Baird