Large language models for extraction of OPS-codes from operative reports in meningioma surgery

By
Sebastian Lehmann
Florian Wilhelmy
Nikolaus von Dercks
Erdem Güresir
Johannes Wach
July 31, 2025
0 min

Acta Neurochirurgica

Objective:

To analyze the capability of GPT in extracting OPS codes from surgical reports of meningioma resection conducted between January 2023 and December 2024.

Key Findings:

Sufficient coding achieved 99-100% by surgeons and professional coders, compared to 78-89% by LLMs.
GPT CodeMedic outperformed GPT-4o by over 11% in sufficient coding.
Professional coders achieved the highest optimal coding performance (94%), while surgeons had the highest error rate (69% of optimal coding).
GPT CodeMedic significantly outperformed surgeons in optimal coding.

Interpretation:

While human coders performed better overall, GPT CodeMedic showed promise in OPS coding, particularly excelling in optimal coding scenarios.

Limitations:

Study limited to specific tumor types and a single institution, which may not represent broader coding practices.
Potential biases in report selection and coding accuracy assessment could influence the results.

Conclusion:

GPT CodeMedic demonstrates potential for improving OPS coding accuracy, but human coders remain superior in overall performance.

Large language models for extraction of OPS-codes from operative reports in meningioma surgery

Objective:

Key Findings:

Interpretation:

Limitations:

Conclusion:

Original Source(s)

Large language models for extraction of OPS-codes from operative reports in meningioma surgery

Related Content

Clinical Features and Risk Factors for Outcomes of Postoperative Bacterial Meningitis in Pediatric Neurosurgery: A Retrospective Study from 2013 to 2023

Use of HIFU for Tremor-Dominant Parkinson’s Disease: A Patient Success Story

External ventricular drainage in modern neurosurgical practice: optimization, standardization, and emerging guidance technologies