Multi-modal dataset creation for federated learning with DICOM-structured reports

By
Malte Tölle
Lukas Burger
Halvar Kelm
Florian André
Peter Bannas
Gerhard Diller
Norbert Frey
Philipp Garthe
Stefan Groß
Anja Hennemuth
Lars Kaderali
Nina Krüger
Andreas Leha
Simon Martin
Alexander Meyer
Eike Nagel
Stefan Orwat
Clemens Scherer
Moritz Seiffert
Jan Moritz Seliger
Stefan Simm
Tim Friede
Tim Seidler
Sandy Engelhardt
February 3, 2025

International Journal Of Computer Assisted Radiology And Surgery

Overview

This study demonstrates a novel approach to harmonize multi-modal medical data for federated learning by leveraging the DICOM standard and its structured reports. The authors developed publicly available tools and an intuitive dashboard to enable multi-center cohort creation while respecting privacy regulations.

Background

Multi-modal deep learning in medicine requires large, harmonized datasets, but privacy regulations like GDPR limit centralized data sharing. Federated learning offers a solution by training models locally and aggregating weights centrally, yet data harmonization across institutions remains challenging, especially for multi-modal data. Standardized data formats such as DICOM structured reports can facilitate consistent data representation and interoperability across centers. An intuitive, non-programmatic dashboard for filtering and cohort creation can further streamline multi-center studies in a federated setting.

Data Highlights

The study introduces a workflow converting diverse medical data into DICOM format, uploading it into a research PACS, and enabling filtering via a graphical dashboard. The dashboard supports iterative filtering with linked views across images, waveforms, segmentations, and metadata. The approach was implemented across multiple consortium sites, allowing federated cohort selection and subsequent federated training. The source code is publicly available at https://github.com/Cardio-AI/fl-multi-modal-dataset-creation.

Key Findings

Leveraging DICOM standards enables harmonization of imaging, waveform, and metadata through structured reports and inherent matching capabilities.
Publicly available Python scripts were developed to create and convert data into DICOM SR templates suitable for deep learning pipelines.
An intuitive, customizable dashboard was created for iterative filtering and cohort creation of multi-modal DICOM data residing in PACS systems.
The filtering dashboard supports linked graphical views and instant updates, enhancing usability without programming knowledge.
DICOM-based data ensures interoperability, supporting standard SQL queries and potential conversion to other standards like FHIR.
The federated infrastructure was successfully deployed across multiple sites, enabling local cohort selection and federated model training.

Clinical Implications

This approach facilitates multi-center federated learning by overcoming privacy and data heterogeneity barriers through standardized DICOM data representation. Clinicians and researchers can intuitively create multi-modal cohorts without programming expertise, accelerating collaborative studies. The interoperability with existing clinical systems supports integration into clinical workflows and future scalability.

Conclusion

Utilizing DICOM structured reports for multi-modal data harmonization enables effective federated learning across institutions while respecting privacy constraints. The provided tools and dashboard offer a practical solution for scalable, interoperable cohort creation in medical imaging and related modalities.

References

Cardio-AI Consortium 2024 -- Creation of Multi-Modal Datasets for Federated Learning Utilizing DICOM-Formatted Reports

Multi-modal dataset creation for federated learning with DICOM-structured reports

Creation of Multi-Modal Datasets for Federated Learning Using DICOM-Formatted Reports

Overview

Background

Data Highlights

Key Findings

Clinical Implications

Conclusion

References

Original Source(s)

Multi-modal dataset creation for federated learning with DICOM-structured reports

Related Content

Minimally invasive vs. open esophagectomy in patients with a BMI > 35: a NSQIP analysis

Development and validation of a prognostic model for stage IV breast cancer based on primary tumor resection with machine learning methods: retrospective cohort study

A comparative study on percutaneous vertebroplasty for osteoporotic vertebral compression fractures under day surgery and inpatient management models