Journal Article10.1109/isbi53787.2023.10230796
Inter-Modal Conditional-Guided Fusion Network with Transformer for Grading Hepatocellular Carcinoma
Shangxuan Li,Yanshu Fang,Guangyi Wang,Lijuan Zhang,Wu Zhou +4 more
- 18 Apr 2023
pp 1-5
1
TL;DR: The experimental results of the clinical hepatocellular carcinoma (HCC) dataset show that the proposed inter-modal conditional-guided fusion network with Transformer (ICF-Former) is superior to the previously reported multimodal fusion methods for HCC grading.
read more
Abstract: Multimodal medical imaging plays an important role in the diagnosis and characterization of lesions. Transformer pays more attention to global relationship modeling in data, which has obtained promising performance in lesion characterization. However, there are still challenges in transformer-based multimodal feature fusion. First, simple concatenation of information from other modalities, global and local information within the modality cannot balance the importance of them, so it is necessary to consider how to adaptively fuse to optimize the feature extraction of the modality. Second, inter- and intra- modality information are complementary, which has been ignored by reported feature fusion methods. It is necessary to consider how to use the complementary inter-modal information to restrict the conditional learning of intra-modal information. In this work, we propose an inter-modal conditional-guided fusion network with Transformer (ICF-Former) to realize adaptive fusion of intra- and inter-modal information and intra-modal feature learning constrained by other modality joint condition information. The experimental results of the clinical hepatocellular carcinoma (HCC) dataset show that the proposed method is superior to the previously reported multimodal fusion methods for HCC grading.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Temporal Neighboring Multi-modal Transformer with Missingness-Aware Prompt for Hepatocellular Carcinoma Prediction
Jingwen Xu,Ye Zhu,Fei Lyu,Grace Lai‐Hung Wong,Pong C. Yuen +4 more
References
•Posted Content
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
TL;DR: A new language representation model, BERT, designed to pre-train deep bidirectional representations from unlabeled text by jointly conditioning on both left and right context in all layers, which can be fine-tuned with just one additional output layer to create state-of-the-art models for a wide range of tasks.
81.7K
Immunotherapies for hepatocellular carcinoma.
Josep M. Llovet,Josep M. Llovet,Josep M. Llovet,Florian Castet,Mathias Heikenwalder,Mala K. Maini,Vincenzo Mazzaferro,David J. Pinato,Eli Pikarsky,Andrew X. Zhu,Richard S. Finn +10 more
TL;DR: A review of the immune microenvironments underlying the response or resistance of hepatocellular carcinoma (HCC) to immunotherapies is presented in this paper, where current evidence from phase III trials on the efficacy, immune-related adverse events and aetiology-dependent mechanisms of response are described.
1K
CT and MR Imaging Diagnosis and Staging of Hepatocellular Carcinoma: Part II. Extracellular Agents, Hepatobiliary Agents, and Ancillary Imaging Features
TL;DR: The second article of this two-part review discusses basic concepts of diagnosis and staging, reviews the diagnostic performance of CT and MR Imaging with extracellular contrast agents and of MR imaging with hepatobiliary contrast agents, and examines in depth the major and ancillary imaging features used in the diagnosis and characterization of HCC.
Dynamic Fusion With Intra- and Inter-Modality Attention Flow for Visual Question Answering
Peng Gao,Zhengkai Jiang,Haoxuan You,Pan Lu,Steven C. H. Hoi,Xiaogang Wang,Hongsheng Li +6 more
- 15 Jun 2019
TL;DR: Zhang et al. as discussed by the authors propose a novel method of dynamically fuse multi-modal features with intra- and inter-modality information flow, which alternatively pass dynamic information between and across the visual and language modalities.
Multimodal Learning with Transformers: A Survey
TL;DR: A comprehensive survey of Transformer techniques oriented at multimodal data and a discussion of open problems and potential research directions for the community are presented.
337