PTB-XL+, a comprehensive electrocardiographic feature dataset

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningfagfællebedømt

Dokumenter

  • Fulltext

    Forlagets udgivne version, 1,46 MB, PDF-dokument

  • Nils Strodthoff
  • Temesgen Mehari
  • Claudia Nagel
  • Philip J. Aston
  • Ashish Sundar
  • Claus Graff
  • Kanters, Jørgen K.
  • Wilhelm Haverkamp
  • Olaf Dössel
  • Axel Loewe
  • Markus Bär
  • Tobias Schaeffter

Machine learning (ML) methods for the analysis of electrocardiography (ECG) data are gaining importance, substantially supported by the release of large public datasets. However, these current datasets miss important derived descriptors such as ECG features that have been devised in the past hundred years and still form the basis of most automatic ECG analysis algorithms and are critical for cardiologists’ decision processes. ECG features are available from sophisticated commercial software but are not accessible to the general public. To alleviate this issue, we add ECG features from two leading commercial algorithms and an open-source implementation supplemented by a set of automatic diagnostic statements from a commercial ECG analysis software in preprocessed format. This allows the comparison of ML models trained on clinically versus automatically generated label sets. We provide an extensive technical validation of features and diagnostic statements for ML applications. We believe this release crucially enhances the usability of the PTB-XL dataset as a reference dataset for ML methods in the context of ECG data.

OriginalsprogEngelsk
Artikelnummer279
TidsskriftScientific Data
Vol/bind10
Antal sider11
ISSN2052-4463
DOI
StatusUdgivet - 2023

Bibliografisk note

Funding Information:
This work was supported by the EMPIR project 18HLT07 MedalCare. The EMPIR initiative is cofunded by the European Union’s Horizon 2020 research and innovation program and the EMPIR Participating States.

Publisher Copyright:
© 2023, The Author(s).

ID: 370737193