Data-driven simulations to assess the impact of study imperfections in time-to-event analyses - Report - MDSpire

Data-driven simulations to assess the impact of study imperfections in time-to-event analyses

  • By

  • Michal Abrahamowicz

  • Marie-Eve Beauchamp

  • Anne-Laure Boulesteix

  • Tim P Morris

  • Willi Sauerbrei

  • Jay S Kaufman

  • on behalf of the STRATOS Simulation Panel

  • May 6, 2024

  • 0 min

Share

Data-Driven Simulations to Assess Study Limitations in Time-to-Event Analysis

Overview

This article presents a novel data-driven simulation approach to quantitatively evaluate the impact of data imperfections on multivariable time-to-event analyses, including those with time-varying exposures. Two real-world examples illustrate how this method can separate confounding bias from noncollapsibility and assess the effects of imprecise event timing on estimated associations.

Background

Epidemiologic studies often face limitations such as unmeasured confounders, measurement errors, and sparse data collection, which can bias results and affect conclusions. Quantitative bias analysis (QBA) methods have been developed to formally assess these impacts, but traditional approaches may not quantify bias, mean squared error, or coverage rate effectively in time-to-event settings. The proposed data-driven simulation method complements existing QBA by preserving key features of real-world data while allowing flexible control over assumptions and parameters, enhancing the evaluation of bias in survival analyses with right-censored endpoints and time-varying covariates.

Data Highlights

The approach involves seven key steps: (1) identifying data imperfections, (2) fitting initial regression models, (3) formulating assumptions for simulations, (4) generating oracle datasets free of imperfections, (5) modifying datasets to reintroduce imperfections, (6) analyzing both datasets, and (7) summarizing results across multiple repetitions (m=1000). This framework enables assessment of bias, mean squared error, and coverage rates by comparing oracle and imperfect data analyses.

Key Findings

  • Data-driven simulations can isolate the expected impact of unmeasured confounding from noncollapsibility in prognostic cancer mortality studies.
  • Imprecise timing of interval-censored events, such as those only observed at clinic visits, can bias associations with time-varying drug exposures.
  • The simulation framework allows comparison of alternative imputation strategies for unknown event times in interval censoring scenarios.
  • Using real-world data ensures relevance and applicability of simulation results to specific empirical studies.
  • Performing 1000 simulation repetitions provides sufficient power to detect bias corresponding to 10% of the empirical standard error.

Clinical Implications

Clinicians and researchers can apply this data-driven simulation approach to better understand and quantify the impact of common data imperfections on survival analysis results. This method facilitates more informed interpretation of time-to-event study findings, particularly when dealing with unmeasured confounders or imprecise event timing, ultimately supporting improved study design and analytic strategies.

Conclusion

Data-driven simulations offer a flexible and rigorous tool to evaluate and quantify the effects of study limitations in time-to-event analyses, enhancing the robustness and interpretability of real-world epidemiologic research findings.

References

  1. Original Article 2024 -- Utilizing Data-Driven Simulations to Evaluate the Effects of Study Limitations in Time-to-Event Analysis

Original Source(s)

Related Content