Data-driven refinements for voice disorder classification: improving accuracy and generalisability - Report - MDSpire

Data-driven refinements for voice disorder classification: improving accuracy and generalisability

  • By

  • Rijul Gupta

  • Catherine Madill

  • Craig Jin

  • June 23, 2026

  • 0 min

Share

Clinical Report: Enhancing Voice Disorder Classification Through Data-Driven Improvements

Background

Voice disorders are critical indicators of impaired vocal function and serve as a valuable testbed for developing AI-driven acoustic analysis methods. Accurate classification of these disorders is essential for leveraging voice as a biomarker in healthcare. Current multi-class classification systems face significant challenges, achieving only 50%-60% balanced accuracy, which limits their clinical applicability. [Citation needed]

Data Highlights

FrameworkBalanced Accuracy
CarLab 202567.20%
Best-performing Clinical Framework61.03%

Key Findings

  • CarLab 2025 achieved superior in-domain classification accuracy compared to established clinical taxonomies.
  • Models trained with structured taxonomies outperformed those with narrow, single-disorder labels for out-of-domain generalisation.
  • Training on diverse vocal tasks improved cross-database performance more effectively than single-task training.
  • Injecting a small amount of data from target domains boosted binary detection accuracy but did not consistently enhance multi-class recall.
  • Robust multi-class generalisation requires diverse multi-source training data.

Clinical Implications

The findings suggest that aligning classification frameworks with acoustic manifestations of voice disorders may enhance diagnostic accuracy.

Conclusion

The development of CarLab 2025 provides an approach to improving voice disorder classification, addressing existing performance gaps in multi-class detection tasks.

Related Resources & Content

  1. Frontiers in Digital Health, 2026 -- Voice disorders classification using machine learning: a scoping review
  2. conexiant, 2026 -- Accuracy of AI Laryngeal Disorder Detection
  3. BMC Psychiatry (Springer), 2025 -- Utilizing Voice-Activated Machine Learning for Efficient Screening of Bipolar Disorder and Major Depressive Disorder in Youth: A Reliable and Simple Diagnostic Approach
  4. JMIR Medical Informatics, 2026 -- A Machine Learning Approach to Voice-Based Parkinson Disease Screening Using Multiview Spectrogram and Speech Recognition Features: Diagnostic Study
  5. Clinical Practice Guideline: Hoarseness (Dysphonia) (Update) - Stachler, 2018 - Otolaryngology–Head and Neck Surgery
  6. Voice Disorders, ASHA Practice Portal
  7. Artificial Intelligence to Detect Voice Disorders: An AI-Supported Systematic Review of Accuracy Outcomes - ScienceDirect
  8. Clinical Practice Guideline: Hoarseness (Dysphonia) (Update) - Stachler - 2018 - Otolaryngology–Head and Neck Surgery - Wiley Online Library
  9. Voice Disorders
  10. Artificial Intelligence to Detect Voice Disorders: An AI-Supported Systematic Review of Accuracy Outcomes - ScienceDirect

Original Source(s)

Related Content