Self-Supervised Transformer for Sparse and Irregularly Sampled Multivariate Clinical Time-Series

doi:10.1145/3516367

Open AccessJournal Article10.1145/3516367

Self-Supervised Transformer for Sparse and Irregularly Sampled Multivariate Clinical Time-Series

30 Jul 2022

- ACM Transactions on Knowledge Discovery ...

- Vol. 16, Iss: 6, pp 1-17

53

TL;DR: In this paper , the authors proposed a transformer-based model for multivariate clinical time-series, which is composed of a Transformer component with multi-head attention layers to learn contextual triplet embeddings without recurrence and vanishing gradients.

Abstract: Multivariate time-series data are frequently observed in critical care settings and are typically characterized by sparsity (missing information) and irregular time intervals. Existing approaches for learning representations in this domain handle these challenges by either aggregation or imputation of values, which in-turn suppresses the fine-grained information and adds undesirable noise/overhead into the machine learning model. To tackle this problem, we propose a S elf-supervised Tra nsformer for T ime- S eries (STraTS) model, which overcomes these pitfalls by treating time-series as a set of observation triplets instead of using the standard dense matrix representation. It employs a novel Continuous Value Embedding technique to encode continuous time and variable values without the need for discretization. It is composed of a Transformer component with multi-head attention layers, which enable it to learn contextual triplet embeddings while avoiding the problems of recurrence and vanishing gradients that occur in recurrent architectures. In addition, to tackle the problem of limited availability of labeled data (which is typically observed in many healthcare applications), STraTS utilizes self-supervision by leveraging unlabeled data to learn better representations by using time-series forecasting as an auxiliary proxy task. Experiments on real-world multivariate clinical time-series benchmark datasets demonstrate that STraTS has better prediction performance than state-of-the-art methods for mortality prediction, especially when labeled data is limited. Finally, we also present an interpretable version of STraTS, which can identify important measurements in the time-series data. Our data preprocessing and model implementation codes are available at https://github.com/sindhura97/STraTS .

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Journal Article•10.1109/tpami.2024.3387317

Self-Supervised Learning for Time Series Analysis: Taxonomy, Progress, and Prospects

Kexin Zhang, +10 more

- 01 Jan 2024

- IEEE Transactions on Pattern Analysis an...

TL;DR: A comprehensive survey of self-supervised learning methods for time series analysis is missing. This article fills this gap by reviewing existing methods and proposing a new taxonomy. The survey covers generative-based, contrastive-based, and adversarial-based methods, and includes detailed discussions about their key intuitions, frameworks, advantages and disadvantages.

...read moreread less

32

•Journal Article•10.1007/s11633-022-1382-8

Pre-training in Medical Data: A Survey

Yixuan Qiu, +3 more

- 21 Feb 2023

- Machine Intelligence Research

TL;DR: A comprehensive survey of recent advances for pre-training on several major types of medical data is provided in this article , where the authors summarize a large number of related publications and the existing benchmarking in the medical domain.

...read moreread less

20

•Posted Content•10.1145/3580305.3599543

Warpformer: A Multi-scale Modeling Approach for Irregular Clinical Time Series

Jiawen Zhang, +4 more

- 14 Jun 2023

- arXiv.org

TL;DR: Warpformer as discussed by the authors proposes a warping module that adaptively unifies irregular time series in a given scale, and stack multiple warping and attention modules to learn at different scales.

...read moreread less

19

•Journal Article•10.1109/rbme.2022.3216531

Systematic Review of Advanced AI Methods for Improving Healthcare Data Quality in Post COVID-19 Era

01 Jan 2023

- IEEE Reviews in Biomedical Engineering

TL;DR: Wang et al. as mentioned in this paper summarized the important health data quality challenges during COVID-19 pandemic such as lack of data standardization, missing data, tabulation errors, and noise and artifact.

...read moreread less

17

•Journal Article•10.1109/access.2022.3206449

A Survey on Attention Mechanisms for Medical Applications: are we Moving Toward Better Algorithms?

01 Jan 2022

- IEEE Access

TL;DR: In this article , the authors extensively review the use of attention mechanisms in machine learning methods (including Transformers) for several medical applications based on the types of tasks that may integrate several works pipelines of the medical domain.

...read moreread less

16

...

Expand

References

•Journal Article•10.1161/01.CIR.101.23.E215

PhysioBank, PhysioToolkit, and PhysioNet: components of a new research resource for complex physiologic signals.

Ary L. Goldberger, +9 more

- 13 Jun 2000

- Circulation

TL;DR: The newly inaugurated Research Resource for Complex Physiologic Signals (RRSPS) as mentioned in this paper was created under the auspices of the National Center for Research Resources (NCR Resources).

...read moreread less

14.3K

•Book Chapter•10.1007/978-3-540-28650-9_4

Gaussian processes in machine learning

Carl Edward Rasmussen

- 02 Feb 2003

- Lecture Notes in Computer Science

TL;DR: In this paper, the authors give a basic introduction to Gaussian Process regression models and present the simple equations for incorporating training data and examine how to learn the hyperparameters using the marginal likelihood.

...read moreread less

8.4K

•Journal Article•10.1038/SDATA.2016.35

MIMIC-III, a freely accessible critical care database

Alistair E. W. Johnson, +12 more

- 24 May 2016

- Scientific Data

TL;DR: The Medical Information Mart for Intensive Care (MIMIC-III) as discussed by the authors is a large, single-center database comprising information relating to patients admitted to critical care units at a large tertiary care hospital.

...read moreread less

6.5K

•Proceedings Article

Proceedings of the 28th International Conference on Machine Learning, ICML 2011

Darío García-García, +2 more

- 07 Oct 2011

4.9K

•Journal Article•10.1038/S41598-018-24271-9

Recurrent Neural Networks for Multivariate Time Series with Missing Values.

Zhengping Che, +4 more

- 17 Apr 2018

- Scientific Reports

TL;DR: In this article, a deep learning model based on Gated Recurrent Unit (GRU) is proposed to exploit the missing values and their missing patterns for effective imputation and improving prediction performance.

...read moreread less

1.7K