A device-invariant multi-modal learning framework for respiratory disease classification - Scorecard - MDSpire

A device-invariant multi-modal learning framework for respiratory disease classification

  • By

  • Mo Yang

  • Xuefei Liu

  • Wei Du

  • Yang Liu

  • Wenyu Zhu

  • Zhaoyang Bu

  • Jiaxuan Mao

  • Qian Wang

  • Si Chen

  • Min Zhou

  • Jie-ming Qu

  • February 26, 2026

  • 0 min

Share

Clinical Scorecard: A Multi-Modal Deep Learning Approach for Classifying Respiratory Diseases Across Diverse Devices

At a Glance

CategoryDetail
ConditionAdult respiratory diseases including COPD, lower respiratory tract infection, pulmonary shadows, and comorbidities
Key MechanismsDevice-invariant multimodal deep learning integrating cough acoustics, demographics, and symptom descriptions with adversarial and invariant risk minimization techniques
Target PopulationAdults with suspected respiratory diseases across diverse devices and populations
Care SettingHome-based self-management and multi-center clinical screening

Key Highlights

  • Proposed method achieves high AUROC for COPD (0.9698), LRTI (0.8483), and pulmonary shadows (0.8720)
  • Multimodal fusion and device-invariant feature learning mitigate device heterogeneity and improve cross-device generalization
  • Demonstrates scalable, transferable AI-based cough-driven respiratory screening suitable for diverse real-world settings

Guideline-Based Recommendations

Diagnosis

  • Utilize multimodal deep learning models combining cough sound analysis with demographic and symptom data for respiratory disease classification
  • Incorporate device-invariant feature learning to reduce variability from different recording devices

Management

  • Apply smartphone-based respiratory screening tools to support self-management and early detection in home settings
  • Consider multi-label classification approaches to identify comorbid respiratory conditions

Monitoring & Follow-up

  • Leverage AI-driven cough sound analysis for ongoing respiratory disease monitoring across diverse devices
  • Ensure robustness of models to non-structural shifts and device effects for reliable longitudinal assessment

Risks

  • Be aware of limitations due to proprietary data and code restricting external validation
  • Consider potential variability in cough sound quality and recording environments impacting model performance

Patient & Prescribing Data

Adults with suspected or confirmed respiratory diseases across multiple centers and device types

AI-based cough sound classification can aid early diagnosis and monitoring, potentially improving timely intervention and management

Clinical Best Practices

  • Integrate multimodal data inputs (acoustic, demographic, symptom) for comprehensive respiratory disease assessment
  • Employ adversarial training and invariant risk minimization to enhance model generalizability across devices
  • Validate AI models on large, diverse, multi-center datasets to ensure clinical applicability
  • Use smartphone-based tools to facilitate accessible respiratory screening and self-management
  • Maintain data privacy and regulatory compliance when handling sensitive clinical information

References

Original Source(s)

Related Content