A Comparative Analysis of Deep Learning Models
for Detection of Lumpy Skin Disease with emphasis
on Shifted Window Transformers
Dublin Core
Title
A Comparative Analysis of Deep Learning Models
for Detection of Lumpy Skin Disease with emphasis
on Shifted Window Transformers
for Detection of Lumpy Skin Disease with emphasis
on Shifted Window Transformers
Subject
Deep learning, Lumpy Skin Disease (LSD), Vision
Transformer, CNN, livestock diagnosis, image analysis, model
explainability, dataset imbalance, veterinary AI
Transformer, CNN, livestock diagnosis, image analysis, model
explainability, dataset imbalance, veterinary AI
Description
Lumpy Skin Disease (LSD) in cattle is an increasingly
prevalent viral infection with significant economic impact.
Traditional detection methods are often labor-intensive and
delayed. In this study, five state-of-the-art deep learning (DL)
architectures—ResNet50, EfficientNetB0, MobileNetV2, Vision
Transformer (ViT-B16), and Swin Transformer Tiny (Swin-T)—
were evaluated and compared for image-based LSD classification.
Publicly available Kaggle datasets of infected and healthy cattle
were used. All models were fine-tuned using transfer learning and
tested for classification accuracy, F1-score, inference time,
explainability (via Grad-CAM), and real-world deployability.
Results show that Swin-T achieved the highest classification
accuracy of 95.3%, while MobileNetV2 emerged as the most
deployment-friendly model. Grad-CAM visualizations confirmed
that transformer-based models captured relevant lesion features
with greater spatial sensitivity than CNNs. The study highlights
the promise of hybrid transformer-CNN models for practical
livestock diagnostics, especially in resource-constrained
environments
prevalent viral infection with significant economic impact.
Traditional detection methods are often labor-intensive and
delayed. In this study, five state-of-the-art deep learning (DL)
architectures—ResNet50, EfficientNetB0, MobileNetV2, Vision
Transformer (ViT-B16), and Swin Transformer Tiny (Swin-T)—
were evaluated and compared for image-based LSD classification.
Publicly available Kaggle datasets of infected and healthy cattle
were used. All models were fine-tuned using transfer learning and
tested for classification accuracy, F1-score, inference time,
explainability (via Grad-CAM), and real-world deployability.
Results show that Swin-T achieved the highest classification
accuracy of 95.3%, while MobileNetV2 emerged as the most
deployment-friendly model. Grad-CAM visualizations confirmed
that transformer-based models captured relevant lesion features
with greater spatial sensitivity than CNNs. The study highlights
the promise of hybrid transformer-CNN models for practical
livestock diagnostics, especially in resource-constrained
environments
Creator
George Mwangi Muhindi
Source
https://ijcit.com/index.php/ijcit/article/view/533
Publisher
Department of Information Technology
Murang’a University of Technology
Murang’a, Kenya.
Murang’a University of Technology
Murang’a, Kenya.
Date
september 2025
Contributor
Fajar bagus W
Format
PDF
Language
English
Type
Text
Files
Collection
Citation
George Mwangi Muhindi, “A Comparative Analysis of Deep Learning Models
for Detection of Lumpy Skin Disease with emphasis
on Shifted Window Transformers,” Repository Horizon University Indonesia, accessed January 1, 2026, https://repository.horizon.ac.id/items/show/9747.
for Detection of Lumpy Skin Disease with emphasis
on Shifted Window Transformers,” Repository Horizon University Indonesia, accessed January 1, 2026, https://repository.horizon.ac.id/items/show/9747.