Non-Adversarial Learning: Vector-Quantized Common Latent Space for
  Multi-Sequence MRI

doi:10.48550/arxiv.2407.02911

Luyi Han, +9 more

- 03 Jul 2024

TL;DR: This study proposes a non-adversarial generative model for multi-sequence MRI reconstruction. It uses a vector-quantized common latent space, with contrastive learning to improve latent consistency and domain augmentation to improve stability, and it outperforms GAN-based methods on the BraTS2021 dataset.

Abstract: Adversarial learning helps generative models translate MRI from a source sequence to a target sequence when paired samples are lacking. However, implementing MRI synthesis with adversarial learning in clinical settings is challenging due to training instability and mode collapse. To address this issue, we leverage intermediate sequences to estimate the common latent space among multi-sequence MRI, enabling the reconstruction of distinct sequences from the common latent space. We propose a generative model that compresses discrete representations of each sequence to estimate the Gaussian distribution of the vector-quantized common (VQC) latent space between multiple sequences. Moreover, we improve the latent space consistency with contrastive learning and increase model stability by domain augmentation. Experiments using the BraTS2021 dataset show that our non-adversarial model outperforms GAN-based methods, and that the VQC latent space helps our model achieve (1) anti-interference ability, which can eliminate the effects of noise, bias fields, and artifacts, and (2) solid semantic representation ability, with the potential for one-shot segmentation. Our code is publicly available.
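
The abstract relies on vector quantization, where continuous latent vectors are replaced by entries from a learned codebook. The following is a minimal sketch of that generic nearest-codebook lookup only, not the authors' VQ-Seq2Seq implementation. The function name `quantize`, the toy codebook, and the latent values are all hypothetical and chosen for illustration.

```python
import numpy as np


def quantize(latents, codebook):
    """Replace each latent vector with its nearest codebook entry.

    latents:  array of shape (n, d), continuous encoder outputs (assumed)
    codebook: array of shape (k, d), discrete code vectors (assumed)
    Returns the quantized vectors (n, d) and the chosen code indices (n,).
    """
    # Pairwise squared Euclidean distances via broadcasting: shape (n, k).
    diffs = latents[:, None, :] - codebook[None, :, :]
    dists = np.sum(diffs ** 2, axis=-1)
    # Pick the closest code for every latent vector.
    indices = np.argmin(dists, axis=1)
    return codebook[indices], indices


# Toy example: three 2-D codes and three latent vectors near them.
codebook = np.array([[0.0, 0.0], [1.0, 1.0], [-1.0, 2.0]])
latents = np.array([[0.1, -0.2], [0.9, 1.2], [-0.8, 1.7]])
quantized, indices = quantize(latents, codebook)
print(indices)
```

In a trained model the codebook would be learned jointly with the encoder and decoder; here it is fixed so that the lookup step can be read in isolation.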

Figures

Table 1. The quantitative results of translating T1 to T1Gd, T2, and Flair with a single step or multiple steps. The best result is in bold, and the second best is underlined.

Table 2. The quantitative results for comparisons of reconstructing images based on noise and bias field data. The best result is in bold, and the second best is underlined.

Fig. 3. Visualization of translating T1 to T1Gd, T2, and Flair with a single step.

Fig. 1. Overview of the proposed VQ-Seq2Seq framework.

Fig. 4. Visualization of reconstruction from input images with artifacts, noise, and bias field. Artifacts exist in the original images; therefore, the target image is unavailable.

Table 3. The quantitative one-shot segmentation results using the latent space from each compared method. The best result is in bold. ET: enhanced tumor, TC: tumor core, WT: whole tumor.
