Early detection of age-related macular degeneration using Vision Transformer-based Architectures – A comparative study with offline metrics and data augmenting

Jorge Ernesto Gonzalez Diaz; Augusto Javier  Reyes Delgado; José Luis Sánchez Cervantes; Giner Alor Hernández; Lisbeth Rodríguez Mazahua; Adolfo Rodríguez Parada; Yara Anahí Jiménez Nieto

doi:10.61467/2007.1558.2024.v15i4.522

Early detection of age-related macular degeneration using Vision Transformer-based Architectures – A comparative study with offline metrics and data augmenting

Authors

Jorge Ernesto Gonzalez Diaz Tecnologico Nacional de Mexico/Instituto Tecnologico de Orizaba
Augusto Javier Reyes Delgado Tecnológico Nacional de México/I. T. Orizaba https://orcid.org/0009-0000-6863-3507
José Luis Sánchez Cervantes Tecnológico Nacional de México/I. T. Orizaba https://orcid.org/0000-0001-5194-1263
Giner Alor Hernández Tecnológico Nacional de México/I. T. Orizaba https://orcid.org/0000-0003-3296-0981
Lisbeth Rodríguez Mazahua Tecnológico Nacional de México/I. T. Orizaba https://orcid.org/0000-0002-9861-3993
Adolfo Rodríguez Parada Universidad Veracruzana https://orcid.org/0000-0001-8216-9202
Yara Anahí Jiménez Nieto Universidad Veracruzana https://orcid.org/0000-0002-1604-7087

DOI:

https://doi.org/10.61467/2007.1558.2024.v15i4.522

Keywords:

Multiclass classification, Age-Related Macular Degeneration (AMD), Early Detection, Vision Transformers

Abstract

Age-related macular degeneration (AMD) is one of the leading causes of vision loss in elderly adults around the world and is among the main visual impairments in Mexico. The difficulty of diagnosing AMD in its early stages motivates the use of advanced deep-learning methods that offer significant potential to improve diagnostic accuracy in retinal image analysis. In recent years, Transformer architectures for computer vision, such as Vision Transformer (ViT), Swin Transformer and BERT Pre-training of Image Transformers (BEiT) have provided a novel perspective for image analysis. This study presents a comparative analysis of these architectures, applied to AMD detection, focusing on each model's capability to classify the early stages of the disease. Although the small size of medical image datasets represented a challenge, our results suggest that ViT-based architectures and their derivatives achieve significant performance in AMD detection. BEiT is particularly notable for its consistently superior performance.

Downloads

Published

2024-11-04

How to Cite

Gonzalez Diaz, J. E., Reyes Delgado, A. J., Sánchez Cervantes, J. L., Alor Hernández, G., Rodríguez Mazahua , L., Rodríguez Parada, A., & Jiménez Nieto, Y. A. (2024). Early detection of age-related macular degeneration using Vision Transformer-based Architectures – A comparative study with offline metrics and data augmenting. International Journal of Combinatorial Optimization Problems and Informatics, 15(4), 72–84. https://doi.org/10.61467/2007.1558.2024.v15i4.522

Download Citation

Issue

Vol. 15 No. 4 (2024)

Section

COMIA

License

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.

Early detection of age-related macular degeneration using Vision Transformer-based Architectures – A comparative study with offline metrics and data augmenting

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

License

Information

Current Issue