Self-Supervised Transformer for Sparse and Irregularly Sampled Multivariate Clinical Time-Series
TL;DR: In this paper , the authors proposed a transformer-based model for multivariate clinical time-series, which is composed of a Transformer component with multi-head attention layers to learn contextual triplet embeddings without recurrence and vanishing gradients.
read more
Abstract: Multivariate time-series data are frequently observed in critical care settings and are typically characterized by sparsity (missing information) and irregular time intervals. Existing approaches for learning representations in this domain handle these challenges by either aggregation or imputation of values, which in-turn suppresses the fine-grained information and adds undesirable noise/overhead into the machine learning model. To tackle this problem, we propose a S elf-supervised Tra nsformer for T ime- S eries (STraTS) model, which overcomes these pitfalls by treating time-series as a set of observation triplets instead of using the standard dense matrix representation. It employs a novel Continuous Value Embedding technique to encode continuous time and variable values without the need for discretization. It is composed of a Transformer component with multi-head attention layers, which enable it to learn contextual triplet embeddings while avoiding the problems of recurrence and vanishing gradients that occur in recurrent architectures. In addition, to tackle the problem of limited availability of labeled data (which is typically observed in many healthcare applications), STraTS utilizes self-supervision by leveraging unlabeled data to learn better representations by using time-series forecasting as an auxiliary proxy task. Experiments on real-world multivariate clinical time-series benchmark datasets demonstrate that STraTS has better prediction performance than state-of-the-art methods for mortality prediction, especially when labeled data is limited. Finally, we also present an interpretable version of STraTS, which can identify important measurements in the time-series data. Our data preprocessing and model implementation codes are available at https://github.com/sindhura97/STraTS .
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Self-Supervised Learning for Time Series Analysis: Taxonomy, Progress, and Prospects
Kexin Zhang,Qingsong Wen,Chaoli Zhang,Rainbow Cai,Ming Jin,Yong Liu,James Y. Zhang,Yuxuan Liang,Guansong Pang,Dongjin Song,Shirui Pan +10 more
TL;DR: A comprehensive survey of self-supervised learning methods for time series analysis is missing. This article fills this gap by reviewing existing methods and proposing a new taxonomy. The survey covers generative-based, contrastive-based, and adversarial-based methods, and includes detailed discussions about their key intuitions, frameworks, advantages and disadvantages.
32
Pre-training in Medical Data: A Survey
TL;DR: A comprehensive survey of recent advances for pre-training on several major types of medical data is provided in this article , where the authors summarize a large number of related publications and the existing benchmarking in the medical domain.
Warpformer: A Multi-scale Modeling Approach for Irregular Clinical Time Series
TL;DR: Warpformer as discussed by the authors proposes a warping module that adaptively unifies irregular time series in a given scale, and stack multiple warping and attention modules to learn at different scales.
Systematic Review of Advanced AI Methods for Improving Healthcare Data Quality in Post COVID-19 Era
TL;DR: Wang et al. as mentioned in this paper summarized the important health data quality challenges during COVID-19 pandemic such as lack of data standardization, missing data, tabulation errors, and noise and artifact.
A Survey on Attention Mechanisms for Medical Applications: are we Moving Toward Better Algorithms?
01 Jan 2022
TL;DR: In this article , the authors extensively review the use of attention mechanisms in machine learning methods (including Transformers) for several medical applications based on the types of tasks that may integrate several works pipelines of the medical domain.
16
References
PhysioBank, PhysioToolkit, and PhysioNet: components of a new research resource for complex physiologic signals.
Ary L. Goldberger,Luís A. Nunes Amaral,Leon Glass,Jeffrey M. Hausdorff,Plamen Ch. Ivanov,Roger G. Mark,Joseph E. Mietus,George B. Moody,Chung-Kang Peng,H. Eugene Stanley +9 more
TL;DR: The newly inaugurated Research Resource for Complex Physiologic Signals (RRSPS) as mentioned in this paper was created under the auspices of the National Center for Research Resources (NCR Resources).
14.3K
Gaussian processes in machine learning
TL;DR: In this paper, the authors give a basic introduction to Gaussian Process regression models and present the simple equations for incorporating training data and examine how to learn the hyperparameters using the marginal likelihood.
MIMIC-III, a freely accessible critical care database
Alistair E. W. Johnson,Tom J. Pollard,Lu Shen,Li-wei H. Lehman,Mengling Feng,Mengling Feng,Mohammad M. Ghassemi,Benjamin Moody,Peter Szolovits,Leo Anthony Celi,Leo Anthony Celi,Roger G. Mark,Roger G. Mark +12 more
TL;DR: The Medical Information Mart for Intensive Care (MIMIC-III) as discussed by the authors is a large, single-center database comprising information relating to patients admitted to critical care units at a large tertiary care hospital.
•Proceedings Article
Proceedings of the 28th International Conference on Machine Learning, ICML 2011
Darío García-García,Ulrike von Luxburg,Raul Santos-Rodriguez +2 more
- 07 Oct 2011
4.9K
Recurrent Neural Networks for Multivariate Time Series with Missing Values.
TL;DR: In this article, a deep learning model based on Gated Recurrent Unit (GRU) is proposed to exploit the missing values and their missing patterns for effective imputation and improving prediction performance.