DefinePK

DefinePK hosts the largest index of Pakistani journals, research articles, news headlines, and videos. It also offers chapter-level book search.

A Convolutional Neural Network and Vision Transformer Based Framework for Effective Detection of Liver Cancer


Article Information

Title: A Convolutional Neural Network and Vision Transformer Based Framework for Effective Detection of Liver Cancer

Authors: Asma Zahoor, Erssa Arif, Naila Nawaz, Muhammad Amjad, Shahrukh Hamayoun, Arslan Baig

Journal: Journal of Computing & Biomedical Informatics

HEC Recognition History
Category From To
Y 2023-07-01 2024-09-30
Y 2022-07-01 2023-06-30

Publisher: Research Center of Computing & Biomedical Informatics

Country: Pakistan

Year: 2025

Volume: 9

Issue: 02

Language: en

Keywords: Hepatocellular carcinoma (HCC)Clinical Decision Support SystemsMedical Image AnalysisLiver cancer detectionVision Transformers (ViTs)Computed Tomography (CT) ImagingEfficientNet-B0TinyViTMobileViTv2Early Cancer Diagnosis

Categories

Abstract

Liver cancer, particularly hepatocellular carcinoma (HCC), remains one of the most prevalent and lethal malignancies worldwide, underscoring the urgent need for early and reliable diagnostic solutions. Conventional diagnostic methods using computed tomography (CT) imaging are often limited by inter-observer variability and the high cognitive burden on radiologists. To address these challenges, this study proposes a hybrid deep learning framework that leverages Convolutional Neural Networks (CNNs) and Vision Transformers (ViTs) for effective liver cancer detection. The research employs the publicly available 3D-IRCADb1 dataset of contrast-enhanced CT scans, with preprocessing and augmentation techniques applied to enhance model generalization. Three state-of-the-art architectures, EfficientNet-B0, TinyViT, and MobileViT v2, were trained and evaluated to assess their diagnostic performance. Among these, MobileViT v2 demonstrated superior performance and efficiency in classification tasks. To enhance clinical trust, Gradient-weighted Class Activation Mapping (Grad-CAM) was integrated to provide visual explanations of model predictions, highlighting regions of interest corresponding to tumor areas. The findings indicate that the proposed framework not only ensures robust diagnostic capability but also introduces interpretability and efficiency, making it suitable for deployment in clinical and resource-constrained environments. This research contributes to advancing AI-driven liver cancer diagnostics by bridging the gap between performance and transparency, ultimately supporting earlier detection and improved patient outcomes.


Paper summary is not available for this article yet.

Loading PDF...

Loading Statistics...