Journal Article10.1016/j.jksuci.2023.101810
ConvAttenMixer: Brain Tumor Detection and Type Classification using Convolutional Mixer with External and Self-Attention Mechanisms
20
TL;DR: ConvAttenMixer, a transformer model, combines convolutional layers with self-attention and external attention mechanisms to enhance brain tumor detection and classification in MRI images, outperforming state-of-the-art baselines with higher precision, recall, and accuracy (0.9794).
read more
Abstract: Attention-based methods have recently demonstrated notable advancements in brain tumor classification. To further advance and strengthen this development, we have developed ConvAttenMixer, a transformer model that incorporates convolutional layers along with two attention mechanisms: self-attention and external attention. The proposed model utilizes two blocks of convolution mixers to effectively process and blend across patches, thereby enhancing the model's ability to capture spatial and channel-wise dependencies in MRI brain images. The self-attention block enables the model to prioritize important regions within the image and establish dependencies by assigning weights to each part based on their relevance to the task. This allows the model to emphasize crucial local features, disregard irrelevant ones, and capture interactions between different patches. On the other hand, the external attention block focuses more on significant global features and captures interactions among different images, enabling the model to establish dependencies and correlations across all samples. The classification head in the proposed model is a simple yet effective block designed to process the output feature maps using a squeeze-and-excitation mechanism, which in turn assigns higher weights to important channels and suppresses less-relevant channels. For experimentation, our ConvAttenMixer model was trained on a dataset consisting of 5712 MRI scans and subsequently tested on 1311 scans for classification into glioma, meningioma, pituitary tumor, and no-tumor images. Different variants of the proposed model were tested and evaluated. The optimally performing architecture was evaluated against the state-of-the-art baselines, namely self-attention MLP, external attention MLP, attention-based pooling convolutional net, and convolutional mixer net. Extensive experiments demonstrated that ConvAttenMixer outperformed the other baselines, which employed either self-attention or external attention mechanisms, while requiring significantly less computational memory. The suggested model exhibited higher precision, recall, and f-measure, achieving the highest accuracy of 0.9794 compared with the baselines' accuracy, which ranged from 0.87 to 0.93. The ConvAttenMixer model demonstrates the ability to operate locally on the patch level using self-attention and globally on the sample level using external attention, as well as prioritize important information on the spatial level and channel level using convolution mixers and the squeeze-and-excitation mechanism.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
An attention-fused architecture for brain tumor diagnosis
Arash Hekmat,Zuping Zhang,Saif Ur Rehman Khan,Ifza Shad,Omair Bilal +4 more
7
Deep Learning Approaches for Brain Tumor Detection and Classification Using MRI Images (2020 to 2024): A Systematic Review
Sara Bouhafra,Hassan El Bahi +1 more
TL;DR: This systematic review (2020-2024) examines 60 deep learning-based studies on brain tumor detection and classification using MRI images, highlighting transfer learning, autoencoders, and attention mechanisms, and provides future directions for professionals and academic communities.
6
Enhancing brain tumor classification through ensemble attention mechanism
Fatih Çelik,Kemal Çelik,Ayse Celik +2 more
TL;DR: This study introduces the ensemble attention mechanism to enhance brain tumor classification in MRI images, achieving 98.94% and 98.48% accuracy on Figshare and BraTS 2019 datasets, respectively, outperforming other methods through attention-based feature extraction and ensemble learning.
4
Amyotrophic lateral sclerosis prediction framework using a multi-level encoders-decoders-based ensemble architecture technology
A. Khuzaim Alzahrani,Ahmed Alsheikhy,Tawfeeq Shawly,Ahmad S. Azzahrani,Aws I. AbuEid +4 more
TL;DR: This study proposes PMLEDBEAT, a multi-level encoders-decoders-based ensemble architecture, to predict Amyotrophic Lateral Sclerosis (ALS) and estimate its development rate, achieving 92% accuracy, outperforming existing methods, and demonstrating potential for real-world healthcare applications.
1
Benign vs. Malignant Brain Tumors: An In-Depth Review Using Deep Learning Techniques
Kirti Rattan,Gaurav Bathla,Vikas Wasson +2 more
- 02 May 2024
TL;DR: This study aims to offer a detailed overview of the latest advancements in brain tumor identification techniques, and provides insightful observations and recommendations for future researchers to improve the precision and make the rate at which tumor diagnosis methods work faster, potentially leading to a quicker and more accurate detection of brain tumors, thereby saving lives.
References
Deep Residual Learning for Image Recognition
Kaiming He,Xiangyu Zhang,Shaoqing Ren,Jian Sun +3 more
- 27 Jun 2016
TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won the 1st place on the ILSVRC 2015 classification task.
Densely Connected Convolutional Networks
Gao Huang,Zhuang Liu,Laurens van der Maaten,Kilian Q. Weinberger +3 more
- 21 Jul 2017
TL;DR: DenseNet as mentioned in this paper proposes to connect each layer to every other layer in a feed-forward fashion, which can alleviate the vanishing gradient problem, strengthen feature propagation, encourage feature reuse, and substantially reduce the number of parameters.
Rethinking the Inception Architecture for Computer Vision
Christian Szegedy,Vincent Vanhoucke,Sergey Ioffe,Jonathon Shlens,Zbigniew Wojna +4 more
- 27 Jun 2016
TL;DR: In this article, the authors explore ways to scale up networks in ways that aim at utilizing the added computation as efficiently as possible by suitably factorized convolutions and aggressive regularization.
27.9K
Squeeze-and-Excitation Networks
Jie Hu,Li Shen,Samuel Albanie,Gang Sun,Enhua Wu +4 more
- 18 Jun 2018
TL;DR: This work proposes a novel architectural unit, which is term the "Squeeze-and-Excitation" (SE) block, that adaptively recalibrates channel-wise feature responses by explicitly modelling interdependencies between channels and finds that SE blocks produce significant performance improvements for existing state-of-the-art deep architectures at minimal additional computational cost.
MobileNetV2: Inverted Residuals and Linear Bottlenecks
Mark Sandler,Andrew Howard,Menglong Zhu,Andrey Zhmoginov,Liang-Chieh Chen +4 more
- 18 Jun 2018
TL;DR: MobileNetV2 as mentioned in this paper is based on an inverted residual structure where the shortcut connections are between the thin bottleneck layers and intermediate expansion layer uses lightweight depthwise convolutions to filter features as a source of non-linearity.