Journal Article10.48550/arxiv.2407.02911
Non-Adversarial Learning: Vector-Quantized Common Latent Space for Multi-Sequence MRI
Luyi Han,Tao Tan,Tianyu Zhang,X. Wang,Yuan Gao,Chunyao Lu,Xinglong Liang,Haoran Dou,Yunzhi Huang,Ritse M. Mann +9 more
- 03 Jul 2024
TL;DR: This study proposes a non-adversarial generative model for multi-sequence MRI reconstruction, leveraging vector-quantized common latent space and contrastive learning to improve stability and consistency, outperforming GAN-based methods on BraTS2021 dataset.
read more
Abstract: Adversarial learning helps generative models translate MRI from source to target sequence when lacking paired samples. However, implementing MRI synthesis with adversarial learning in clinical settings is challenging due to training instability and mode collapse. To address this issue, we leverage intermediate sequences to estimate the common latent space among multi-sequence MRI, enabling the reconstruction of distinct sequences from the common latent space. We propose a generative model that compresses discrete representations of each sequence to estimate the Gaussian distribution of vector-quantized common (VQC) latent space between multiple sequences. Moreover, we improve the latent space consistency with contrastive learning and increase model stability by domain augmentation. Experiments using BraTS2021 dataset show that our non-adversarial model outperforms other GAN-based methods, and VQC latent space aids our model to achieve (1) anti-interference ability, which can eliminate the effects of noise, bias fields, and artifacts, and (2) solid semantic representation ability, with the potential of one-shot segmentation. Our code is publicly available.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Figures

Table 1. The quantitative results of translating T1 to T1Gd, T2, and Flair with a single step or multiple steps. The best result is in bold, and the second best is underlined. 
Table 2. The quantitative results for comparisons of reconstructing images based on noise and bias field data. The best result is in bold, and the second best is underlined. 
Fig. 3. Visualization of translating T1 to T1Gd, T2, and Flair with a single step. 
Fig. 1. Overview of the proposed VQ-Seq2Seq framework. 
Fig. 4. Visualization of reconstruction from input images with artifacts, noise, and bias field. Artifacts exist in the original images, therefore, the target image is unavailable. 
Table 3. The quantitative one-shot segmentation results for using latent space from comparisons. The best result is in bold. ET: enhanced tumor, TC: tumor core, WT: whole tumor.
References
Image quality assessment: from error visibility to structural similarity
TL;DR: In this article, a structural similarity index is proposed for image quality assessment based on the degradation of structural information, which can be applied to both subjective ratings and objective methods on a database of images compressed with JPEG and JPEG2000.
•Posted Content
The Unreasonable Effectiveness of Deep Features as a Perceptual Metric
TL;DR: A new dataset of human perceptual similarity judgments is introduced and it is found that deep features outperform all previous metrics by large margins on this dataset, and suggests that perceptual similarity is an emergent property shared across deep visual representations.
7.5K
nnU-Net: a self-configuring method for deep learning-based biomedical image segmentation
Fabian Isensee,Fabian Isensee,Paul F. Jaeger,Simon A. A. Kohl,Jens Petersen,Jens Petersen,Klaus H. Maier-Hein,Klaus H. Maier-Hein +7 more
TL;DR: nnU-Net as mentioned in this paper is a deep learning-based segmentation method that automatically configures itself, including preprocessing, network architecture, training and post-processing for any new task.
5.8K
•Proceedings Article
Neural Discrete Representation Learning
Aaron van den Oord,Oriol Vinyals,Koray Kavukcuoglu +2 more
- 02 Nov 2017
TL;DR: The Vector Quantised-Variational AutoEncoder (VQ-VAE) as discussed by the authors is a generative model that learns a discrete latent representation by using vector quantization.
The Multimodal Brain TumorImage Segmentation Benchmark (BRATS)
Bjoern H. Menze,Mauricio Reyes,Koen Van Leemput,N Porz,Roland Wiest +4 more
- 01 Jan 2015
TL;DR: The set-up and results of the Multimodal Brain Tumor Image Segmentation Benchmark (BRATS) organized in conjunction with the MICCAI 2012 and 2013 conferences are reported, finding that different algorithms worked best for different sub-regions, but that no single algorithm ranked in the top for all sub-Regions simultaneously.
3.9K