Early detection of age-related macular degeneration using Vision Transformer-based Architectures – A comparative study with offline metrics and data augmenting

Authors

DOI:

https://doi.org/10.61467/2007.1558.2024.v15i4.522

Keywords:

Multiclass classification, Age-Related Macular Degeneration (AMD), Early Detection, Vision Transformers

Abstract

Age-related macular degeneration (AMD) is one of the leading causes of vision loss in elderly adults around the world and is among the main visual impairments in Mexico. The difficulty of diagnosing AMD in its early stages motivates the use of advanced deep-learning methods that offer significant potential to improve diagnostic accuracy in retinal image analysis. In recent years, Transformer architectures for computer vision, such as Vision Transformer (ViT), Swin Transformer and BERT Pre-training of Image Transformers (BEiT) have provided a novel perspective for image analysis. This study presents a comparative analysis of these architectures, applied to AMD detection, focusing on each model's capability to classify the early stages of the disease. Although the small size of medical image datasets represented a challenge, our results suggest that ViT-based architectures and their derivatives achieve significant performance in AMD detection. BEiT is particularly notable for its consistently superior performance.

Downloads

Published

2024-11-04

How to Cite

Gonzalez Diaz, J. E., Reyes Delgado, A. J., Sánchez Cervantes, J. L., Alor Hernández, G., Rodríguez Mazahua , L., Rodríguez Parada, A., & Jiménez Nieto, Y. A. (2024). Early detection of age-related macular degeneration using Vision Transformer-based Architectures – A comparative study with offline metrics and data augmenting. International Journal of Combinatorial Optimization Problems and Informatics, 15(4), 72–84. https://doi.org/10.61467/2007.1558.2024.v15i4.522

Issue

Section

COMIA