Preprint / Version 1

A Comparative Study of EfficientNetB0 and Vision Transformer (ViT-B16) Architectures for Brain Tumor Classification Using MRI Scans

##article.authors##

  • Om Sahu Independent

DOI:

https://doi.org/10.58445/rars.2649

Keywords:

MRI, CNN, ViT, Classification, AI, Machine Learning, Deep Learning, Computer Vision

Abstract

Accurate detection and classification of brain tumors from MRI scans is critical for effective clinical diagnosis and treatment planning. While Convolutional Neural Networks (CNNs) like EfficientNetB0 have been widely used for medical image analysis due to their strong feature extraction capabilities, their performance is often hindered by limited spatial context awareness. Vision Transformers (ViTs), by contrast, leverage self-attention mechanisms to capture global contextual relationships, potentially overcoming these limitations. This study presents a rigorous comparative analysis of EfficientNetB0 and ViT-B16 architectures on the Brain Tumor MRI Dataset, which includes four classes: Glioma, Meningioma, Pituitary Tumor, and No Tumor. Both models were trained under identical preprocessing, augmentation, and hyperparameter settings using Kaggle’s cloud infrastructure. Evaluation based on accuracy, precision, recall, F1-score, AUC-ROC, and interpretability (via Grad-CAM and Attention Maps) revealed stark differences. EfficientNetB0 exhibited severe overfitting, achieving high training but poor test performance (30.89% accuracy), misclassifying most tumor types. In contrast, ViT-B16 achieved superior generalization with a test accuracy of 71.62% and balanced performance across tumor categories. Interpretability analyses confirmed ViT-B16's ability to localize tumors more effectively. These results highlight the promise of transformer-based architectures for robust and clinically viable brain tumor classification.

References

Balaji, G., Sen, R., & Kirty, H. (2022). Detection and classification of brain tumors using deep convolutional neural networks. arXiv. https://arxiv.org/abs/2208.13264

Filatov, D., & Yar, G. N. A. H. (2022). Brain tumor diagnosis and classification via pre-trained convolutional neural networks. PubMed. https://pubmed.ncbi.nlm.nih.gov/39038716/

Liu, H., Dowdell, B., Engelder, T., Pulmano, Z., Osa, N., & Barman, A. (2023). Glioblastoma tumor segmentation using an ensemble of vision transformers. arXiv. https://arxiv.org/abs/2312.11467

M MM, T. R. M., V VK, & Guluwadi, S. (2024). An XAI-enhanced EfficientNetB0 framework for precision brain tumor detection in MRI imaging. PubMed. https://pubmed.ncbi.nlm.nih.gov/39038716/

Minaee, S., Kafieh, R., Sonka, M., Yazdani, S., & Soufi, G. J. (2022). Classification of brain tumor from magnetic resonance imaging using vision transformers ensembling. PubMed. https://pubmed.ncbi.nlm.nih.gov/36290867/

Nickparvar, M. (2021). Brain tumor MRI dataset. Kaggle. https://www.kaggle.com/datasets/masoudnickparvar/brain-tumor-mri-dataset

Downloads

Posted

2025-06-22