ConvAttenMixer: Brain Tumor Detection and Type Classification using Convolutional Mixer with External and Self-Attention Mechanisms

Journal Article • doi:10.1016/j.jksuci.2023.101810

Salha Alzahrani

- 01 Oct 2023

- Journal of King Saud University - Comput...

20

TL;DR: ConvAttenMixer, a transformer model, combines convolutional layers with self-attention and external attention mechanisms to enhance brain tumor detection and classification in MRI images, outperforming state-of-the-art baselines with higher precision, recall, and accuracy (0.9794).

Abstract: Attention-based methods have recently demonstrated notable advancements in brain tumor classification. To strengthen this line of work, we developed ConvAttenMixer, a transformer model that incorporates convolutional layers along with two attention mechanisms: self-attention and external attention. The proposed model uses two convolution mixer blocks to process and blend information across patches, enhancing its ability to capture spatial and channel-wise dependencies in MRI brain images. The self-attention block enables the model to prioritize important regions within an image and establish dependencies by weighting each part according to its relevance to the task. This allows the model to emphasize crucial local features, disregard irrelevant ones, and capture interactions between different patches. The external attention block, in contrast, focuses on significant global features and captures interactions among different images, enabling the model to establish dependencies and correlations across all samples. The classification head is a simple yet effective block that processes the output feature maps using a squeeze-and-excitation mechanism, which assigns higher weights to important channels and suppresses less relevant ones. For experimentation, ConvAttenMixer was trained on a dataset of 5712 MRI scans and tested on 1311 scans for classification into glioma, meningioma, pituitary tumor, and no-tumor images. Different variants of the proposed model were tested and evaluated. The best-performing architecture was compared against state-of-the-art baselines, namely self-attention MLP, external attention MLP, attention-based pooling convolutional net, and convolutional mixer net.
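To make the contrast between the two attention mechanisms more concrete, the following pure-Python sketch computes both on a toy set of patch tokens. It is an illustration under stated assumptions, not the authors' implementation: the function names, the toy inputs, the identity query/key/value projections in `self_attention`, and the double normalization in `external_attention` (softmax over tokens, then L1 normalization over memory slots) are choices made here for brevity. The key difference it tries to show is that self-attention weights are derived from the tokens of one image, whereas external attention compares tokens against small memories (`mem_k`, `mem_v`) that would be learned and shared across all samples.

```python
import math

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def transpose(m):
    return [list(col) for col in zip(*m)]

def softmax(v):
    """Numerically stable softmax over a list of floats."""
    peak = max(v)
    exps = [math.exp(x - peak) for x in v]
    total = sum(exps)
    return [e / total for e in exps]

def self_attention(tokens):
    """Scaled dot-product attention within one sample.

    Assumption: queries, keys, and values are the tokens themselves
    (identity projections), to keep the sketch short.
    """
    d = len(tokens[0])
    scores = matmul(tokens, transpose(tokens))            # N x N
    weights = [softmax([s / math.sqrt(d) for s in row]) for row in scores]
    return matmul(weights, tokens), weights

def external_attention(tokens, mem_k, mem_v):
    """Attention against external memories shared by all samples.

    mem_k and mem_v are S x d matrices (hypothetical values here; in a
    trained model they would be learned parameters).
    """
    scores = matmul(tokens, transpose(mem_k))             # N x S
    # Double normalization: softmax over tokens, then L1 over memory slots.
    per_slot = [softmax(list(col)) for col in zip(*scores)]
    attn = transpose(per_slot)                            # back to N x S
    attn = [[a / sum(row) for a in row] for row in attn]
    return matmul(attn, mem_v), attn

# Toy input: 3 patch tokens with 2 features each, and 2 memory slots.
tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
mem_k = [[1.0, 0.0], [0.0, 1.0]]
mem_v = [[0.5, 0.5], [1.0, -1.0]]

sa_out, sa_w = self_attention(tokens)
ea_out, ea_w = external_attention(tokens, mem_k, mem_v)
```

Note that the self-attention weight matrix grows with the square of the number of tokens, while the external attention matrix grows linearly with it for a fixed number of memory slots.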
Extensive experiments demonstrated that ConvAttenMixer outperformed the baselines, which employed either self-attention or external attention mechanisms, while requiring significantly less memory. The proposed model achieved higher precision, recall, and F-measure, and the highest accuracy of 0.9794, compared with baseline accuracies ranging from 0.87 to 0.93. ConvAttenMixer operates locally at the patch level using self-attention and globally at the sample level using external attention, and it prioritizes important information at the spatial and channel levels using convolution mixers and the squeeze-and-excitation mechanism.
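For readers unfamiliar with how the reported metrics relate to one another, the sketch below derives accuracy and per-class precision, recall, and F-measure from a four-class confusion matrix. The counts in `confusion` are hypothetical and chosen only for illustration; they are not the paper's results, and the function names are assumptions of this sketch.

```python
CLASSES = ["glioma", "meningioma", "pituitary", "no-tumor"]

# Hypothetical counts: rows are true classes, columns are predicted classes.
confusion = [
    [8, 1, 0, 1],
    [1, 7, 1, 1],
    [0, 0, 10, 0],
    [0, 1, 0, 9],
]

def accuracy(cm):
    """Fraction of all samples that lie on the diagonal."""
    correct = sum(cm[k][k] for k in range(len(cm)))
    total = sum(sum(row) for row in cm)
    return correct / total

def per_class_metrics(cm):
    """Return a (precision, recall, f_measure) tuple for each class."""
    n = len(cm)
    results = []
    for k in range(n):
        tp = cm[k][k]
        fp = sum(cm[i][k] for i in range(n)) - tp   # predicted k, true class differs
        fn = sum(cm[k]) - tp                        # true k, predicted otherwise
        precision = tp / (tp + fp) if tp + fp else 0.0
        recall = tp / (tp + fn) if tp + fn else 0.0
        denom = precision + recall
        f_measure = 2 * precision * recall / denom if denom else 0.0
        results.append((precision, recall, f_measure))
    return results

acc = accuracy(confusion)
metrics = per_class_metrics(confusion)
for name, (p, r, f) in zip(CLASSES, metrics):
    print(f"{name}: precision={p:.3f} recall={r:.3f} f_measure={f:.3f}")
```

As a sanity check on scale, an accuracy of 0.9794 on 1311 test scans corresponds to about 1284 correctly classified scans (1284 / 1311 ≈ 0.9794).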

Citations

Journal Article•10.1016/j.bspc.2024.107221

An attention-fused architecture for brain tumor diagnosis

Arash Hekmat, +4 more

- 20 Nov 2024

- Biomedical Signal Processing and Control

7

Journal Article•10.1007/s10278-024-01283-8

Deep Learning Approaches for Brain Tumor Detection and Classification Using MRI Images (2020 to 2024): A Systematic Review

Sara Bouhafra, +1 more

- 30 Sep 2024

- Journal of Imaging Informatics in Medicine

TL;DR: This systematic review (2020-2024) examines 60 deep learning-based studies on brain tumor detection and classification using MRI images, highlighting transfer learning, autoencoders, and attention mechanisms, and provides future directions for professionals and academic communities.

6

Journal Article•10.1038/s41598-024-73803-z

Enhancing brain tumor classification through ensemble attention mechanism

Fatih Çelik, +2 more

- 27 Sep 2024

- Scientific Reports

TL;DR: This study introduces the ensemble attention mechanism to enhance brain tumor classification in MRI images, achieving 98.94% and 98.48% accuracy on Figshare and BraTS 2019 datasets, respectively, outperforming other methods through attention-based feature extraction and ensemble learning.

4

Journal Article•10.1016/j.jksuci.2024.101960

Amyotrophic lateral sclerosis prediction framework using a multi-level encoders-decoders-based ensemble architecture technology

A. Khuzaim Alzahrani, +4 more

- 01 Feb 2024

- Journal of King Saud University - Comput...

TL;DR: This study proposes PMLEDBEAT, a multi-level encoders-decoders-based ensemble architecture, to predict Amyotrophic Lateral Sclerosis (ALS) and estimate its development rate, achieving 92% accuracy, outperforming existing methods, and demonstrating potential for real-world healthcare applications.

1

Journal Article•10.1109/iceccc61767.2024.10593966

Benign vs. Malignant Brain Tumors: An In-Depth Review Using Deep Learning Techniques

Kirti Rattan, +2 more

- 02 May 2024

TL;DR: This study offers a detailed overview of recent advances in brain tumor identification techniques and provides observations and recommendations to help future researchers improve the precision and speed of tumor diagnosis methods, potentially leading to quicker and more accurate detection of brain tumors.
