Advertisement
Original Article|Articles in Press

An integrated approach to geographic validation helped scrutinize prediction model performance and its variability

  • Tsvetan R. Yordanov
    Correspondence
    Corresponding author at: department of medical informatics, Amsterdam UMC - Location AMC, University of Amsterdam, P.O. Box 22660, 1105 AZ Amsterdam, The Netherlands
    Affiliations
    Amsterdam UMC location University of Amsterdam, Medical Informatics, Meibergdreef 9, Amsterdam, The Netherlands

    Amsterdam Public Health Research Institute, Amsterdam, The Netherlands
    Search for articles by this author
  • Ricardo R. Lopes
    Affiliations
    Amsterdam UMC location University of Amsterdam, Biomedical Engineering and Physics, Meibergdreef 9, Amsterdam, The Netherlands

    Amsterdam UMC location University of Amsterdam, Radiology and Nuclear Medicine, Meibergdreef 9, Amsterdam, The Netherlands

    Amsterdam Cardiovascular Sciences, Heart Failure and Arrhythmias, Amsterdam, The Netherlands
    Search for articles by this author
  • Anita C.J. Ravelli
    Affiliations
    Amsterdam UMC location University of Amsterdam, Medical Informatics, Meibergdreef 9, Amsterdam, The Netherlands

    Amsterdam Public Health Research Institute, Amsterdam, The Netherlands
    Search for articles by this author
  • Marije Vis
    Affiliations
    Amsterdam Public Health Research Institute, Amsterdam, The Netherlands

    Amsterdam UMC location University of Amsterdam, Cardiology, Meibergdreef 9, Amsterdam, The Netherlands

    Amsterdam Cardiovascular Sciences, Atherosclerosis and Ischemic Syndromes, Amsterdam, The Netherlands
    Search for articles by this author
  • Saskia Houterman
    Affiliations
    Netherlands Heart Registration, Utrecht, The Netherlands
    Search for articles by this author
  • Henk Marquering
    Affiliations
    Amsterdam UMC location University of Amsterdam, Biomedical Engineering and Physics, Meibergdreef 9, Amsterdam, The Netherlands

    Amsterdam UMC location University of Amsterdam, Radiology and Nuclear Medicine, Meibergdreef 9, Amsterdam, The Netherlands

    Amsterdam Cardiovascular Sciences, Atherosclerosis and Ischemic Syndromes, Amsterdam, The Netherlands
    Search for articles by this author
  • Ameen Abu-Hanna
    Affiliations
    Amsterdam UMC location University of Amsterdam, Medical Informatics, Meibergdreef 9, Amsterdam, The Netherlands

    Amsterdam Public Health Research Institute, Amsterdam, The Netherlands
    Search for articles by this author
  • on behalf of theNHR THI Registration committee
Open AccessPublished:February 21, 2023DOI:https://doi.org/10.1016/j.jclinepi.2023.02.021
      This paper is only available as a PDF. To read, Please Download here.

      Abstract

      Objective

      To illustrate in-depth validation of prediction models developed on multicenter data.

      Methods

      For each hospital in a multicenter registry, we evaluated predictive performance of a 30-day mortality prediction model for transcatheter aortic valve implantation (TAVI) using the Netherlands heart registration (NHR) dataset. We measured discrimination and calibration per hospital in a leave-center-out analysis (LCOA). Meta-analysis was used to calculate I2 values per performance metric from the LCOA and to compute mean and confidence interval (CI) estimates. Case-mix differences between studies were inspected using the framework of Debray et al. for understanding external validation. We also aimed to discover subgroups with high model prediction error and their distribution over the centers.

      Results

      We studied 16 hospitals with 11,599 TAVI patients with an early mortality of 3.7%. The models’ AUCs had a wide range between hospitals from 0.59 to 0.79 and miscalibration occurred in seven hospitals. Mean AUC from meta-analysis was 0.68 (95%CI 0.65-0.70). I 2 values were 0%, 74%, and 0% for AUC, calibration intercept and slope, respectively. Between-hospital case-mix differences were substantial and model transportability was low. One subgroup was discovered with marked global prediction error and was associated with poor performance on validation centers.

      Conclusion

      The illustrated combination of approaches provides useful insights to inspect multicenter-based prediction models, and exposes their limitations in transportability and performance variability when applied to different populations.

      Graphical abstract

      Keywords