Diagnostic performance of artificial intelligence models for pulmonary nodule classification: a multi-model evaluation - Scorecard - MDSpire

Diagnostic performance of artificial intelligence models for pulmonary nodule classification: a multi-model evaluation

  • By

  • Sarah K. Herber

  • Lukas Müller

  • Daniel Pinto dos Santos

  • Tobias Jorg

  • Fabio Souschek

  • Tobias Bäuerle

  • Sebastian Foersch

  • Christian Galata

  • Peter Mildenberger

  • Moritz C. Halfmann

  • July 25, 2025

  • 0 min

Share

Clinical Scorecard: Evaluation of Artificial Intelligence Models in Classifying Pulmonary Nodules: A Comprehensive Diagnostic Assessment

At a Glance

CategoryDetail
ConditionPulmonary nodules and lung cancer
Key MechanismsAI models detect, measure, and classify pulmonary nodules on CT scans to predict malignancy risk
Target PopulationPatients aged >18 years with pulmonary nodules sized 4-30 mm undergoing CT-guided biopsy or surgical resection
Care SettingTertiary care hospital radiology and pathology departments

Key Highlights

  • Lung cancer mortality is high due to late-stage diagnosis; early detection of malignant pulmonary nodules improves prognosis.
  • High-resolution CT increases nodule detection but reduces specificity, increasing the need for accurate classification tools.
  • Three commercial AI models were evaluated for diagnostic accuracy against histopathology, showing potential to automate nodule classification.

Guideline-Based Recommendations

Diagnosis

  • Use high-resolution thoracic CT scans with slice thickness ≤4 mm to detect pulmonary nodules sized 4-30 mm.
  • Confirm nodule presence and size by experienced radiologists before AI assessment.
  • Apply AI models to classify nodules as benign, intermediate, or malignant based on proprietary malignancy risk scores.

Management

  • Integrate AI malignancy risk predictions to support clinical decision-making for follow-up, biopsy, or surgical resection.
  • Consider nodule size and histopathological subgrouping when interpreting AI results, especially for nodules 5-8 mm.
  • Recognize AI models are primarily trained on primary lung cancers; metastases may require additional clinical correlation.

Monitoring & Follow-up

  • Monitor changes in pulmonary nodules via follow-up CT scans, particularly for nodules classified as intermediate risk by AI.
  • Evaluate AI model performance continuously against histopathological outcomes to ensure diagnostic accuracy.

Risks

  • Limited generalizability and transparency of AI decision-making may impact clinical adoption.
  • AI models may have variable thresholds and scoring systems, requiring cautious interpretation.
  • Potential for false positives and negatives necessitates radiologist oversight and correlation with clinical data.

Patient & Prescribing Data

Adults with pulmonary nodules detected on thoracic CT scans undergoing biopsy or resection

AI models provide malignancy risk stratification to guide clinical management but require validation against histopathology and clinical context.

Clinical Best Practices

  • Ensure CT scans meet technical criteria (slice thickness ≤4 mm) for optimal AI model performance.
  • Use experienced radiologists to confirm nodule detection prior to AI analysis.
  • Interpret AI malignancy risk scores within the context of nodule size, morphology, and clinical history.
  • Maintain multidisciplinary collaboration between radiologists, pathologists, and clinicians when integrating AI results.
  • Recognize limitations of AI models, including restricted nodule size ranges and training biases toward primary lung cancers.

References

Original Source(s)

Related Content