Automated Prediction of Glasgow Coma Scale Scores From Unstructured Electronic Health Records Using Natural Language Processing: Development and Validation Study

By
Marta Fernandes
Niels Turley
Haoqi Sun
Shibani S Mukerji
Lidia M V R Moura
M Brandon Westover
Sahar F Zafar
June 29, 2026
0 min

Journal Of Medical Internet Research (Jmir)

Objective:

To develop an automated natural language processing (NLP) model capable of estimating Glasgow Coma Scale (GCS) scores from unstructured clinical text.

Approach:

Study Population: Included adult patients (aged ≥18 years) with inpatient hospital admissions, utilizing retrospective data analysis.
Datasets: Cohort derived from inpatient admissions at Mass General Brigham and the MIMIC-III database, including daily clinical notes and GCS assessments.
Model Design: Developed an ordinal regression model with elastic net penalty to predict GCS score classes and a linear regression model for full GCS score range.
Model Evaluation: Used metrics such as AUROC, AUPRC, sensitivity, specificity, and F1-score for model performance assessment.

Key Findings:

The NLP model can estimate GCS scores from unstructured clinical notes, addressing the issue of missing GCS data in EHRs.
The model demonstrated generalizability across different institutions, with performance metrics indicating its effectiveness.

Interpretation:

Limitations:

The study is based on retrospective data and may not account for all clinical variables.
Generalizability may be limited to similar healthcare settings, and results may not apply universally.

Automated Prediction of Glasgow Coma Scale Scores From Unstructured Electronic Health Records Using Natural Language Processing: Development and Validation Study

Objective:

Approach:

Key Findings:

Interpretation:

Limitations:

Conclusion:

Original Source(s)

Automated Prediction of Glasgow Coma Scale Scores From Unstructured Electronic Health Records Using Natural Language Processing: Development and Validation Study

Related Content

The Fellow Eye Is Never Just a Bystander

Cocaine- and amphetamine-regulated transcripts in two percomorphs: evolutionary conservation and energy-status dependent responses

Parkinson Mortality Rate Was 72 per 100,000 in 2024