Spectral analysis of distorted and autoencoder-reconstructed MNIST images in a kernel-induced feature space

Authors

  • Rafael Castañeda Díaz Instituto Politécnico Nacional, Unidad Profesional Interdisciplinaria de Ingeniería Campus Zacatecas (UPIIZ) https://orcid.org/0009-0008-2867-1734
  • Daniela López-Betancur Universidad Autónoma de Zacatecas. Unidad Académica de Ciencia y Tecnología de la Luz y la Materia
  • Carlos Guerrero-Méndez Universidad Autónoma de Zacatecas, Unidad Académica de Ciencia y Tecnología de la Luz y la Materia. https://orcid.org/0000-0003-0721-8515
  • Efrén González-Ramírez Universidad Autónoma de Zacatecas, Unidad Académica de Ciencia y Tecnología de la Luz y la Materia
  • Flossi Puma-Ttito Instituto Politécnico Nacional, Unidad Profesional Interdisciplinaria de Ingeniería Campus Zacatecas (UPIIZ) https://orcid.org/0000-0001-6692-0991
  • V.H. Carrera-Escobedo Instituto Politécnico Nacional, Unidad Profesional Interdisciplinaria de Ingeniería Campus Zacatecas (UPIIZ)

DOI:

https://doi.org/10.61467/2007.1558.2026.v17i3.1457

Keywords:

Convolutional Neural Networks, principal components analysis, Redes neuronales convolucionales, análisis de componentes principales

Abstract

In this paper, the unsupervised learning spectral methods Principal Component Analysis (PCA) and Kernel Principal Component Analysis (KPCA) were explored. PCA captured the maximum variance of MNIST images, including their distorted and autoencoder-reconstructed versions, and KPCA, that captured the maximum variance and nonlinear structures of the feature space where the same images were mapped. Accordingly, for KPCA the kernel functions: a polynomial of degree 5, a radial basis function (RBF), and a cosine kernel, were evaluated. A correlation analysis between the distortions level, the digit inclination and the anisotropic of pixel values of the same images and the PCA-derived components from KPCA was calculated.  The distortion level and anisotropy parameters appear to have strong nonlinear correlation with PCA-derived components, especially for the polynomial and the RBF kernels. Conversely, for cosine kernel weaker correlations were observed. Moreover, the digit inclination parameter appears unrelated to the PCA-derived components for all three kernels.

 

Spanish-language metadata / Metadatos en español
Título en español:

Análisis espectral de imágenes MNIST distorsionadas y reconstruidas mediante autoencoders en un espacio de características inducido por kernel


Resumen:

En este artículo se analizaron los métodos espectrales de aprendizaje no supervisado: el Análisis de Componentes Principales (PCA) y el Análisis de Componentes Principales con Kernel (KPCA). El PCA capturó la máxima varianza de las imágenes MNIST, incluidas sus versiones distorsionadas y las reconstruidas mediante autoencodificadores, mientras que el KPCA capturó la máxima varianza y las estructuras no lineales del espacio de características en el que se mapearon las mismas imágenes. En consecuencia, para el KPCA se evaluaron las siguientes funciones del núcleo: un polinomio de grado 5, una función de base radial (RBF) y un núcleo coseno. Se calculó un análisis de correlación entre el nivel de distorsión, la inclinación de los dígitos y la anisotropía de los valores de los píxeles de las mismas imágenes, y los componentes derivados del PCA a partir del KPCA. El nivel de distorsión y los parámetros de anisotropía parecen presentar una fuerte correlación no lineal con los componentes derivados del PCA, especialmente en el caso de los núcleos polinómicos y RBF. Por el contrario, en el caso del núcleo coseno se observaron correlaciones más débiles. Además, el parámetro de inclinación de los dígitos no parece estar relacionado con los componentes derivados del PCA en ninguno de los tres núcleos.

Palabras Claves:

Redes neuronales convolucionales, análisis de componentes principales

 

Smart citations:

https://scite.ai/reports/10.61467/2007.1558.2026.v17i3.1457
Dimensions.
Open Alex.

References

Bengio, Y., Courville, A., & Vincent, P. (2013). Representation learning: A review and new perspectives. IEEE Transactions on Pattern Analysis and Machine Intelligence, 35(8), 1798–1828. https://doi.org/10.1109/TPAMI.2013.50

Bishop, C. M., & Bishop, H. (2024). Autoencoders. In C. M. Bishop & H. Bishop (Eds.), Deep Learning: Foundations and Concepts (pp. 563–579). Springer. https://doi.org/10.1007/978-3-031-45468-4_19

Burger, W., & Burge, M. J. (2022). Digital image processing: An algorithmic introduction (3rd ed.). Springer. https://doi.org/10.1007/978-3-031-05744-1

Castaneda-Diaz, R., Lopez-Betancur, D., Guerrero-Mendez, C., Gonzalez-Ramirez, E., Puma-Ttito, F., & Troncoso-Pacheco, R. (2026). Visual analysis of MNIST convolutional autoencoder reconstructions via linear dimensionality reduction. In L. Martínez-Villaseñor, R. A. Vázquez, G. Ochoa-Ruiz, M. Montes Rivera, S. Zapotecas-Martínez, M. L. Barrón-Estrada, E. Mezura-Montes, & A. Gómez-Chávez (Eds.), Advances in Computational Intelligence. MICAI 2025 International Workshops (Vol. 16264, pp. 352–367). Springer. https://doi.org/10.1007/978-3-032-17930-2_25

Castaneda-Diaz, R., Lopez-Betancur, D., Guerrero-Mendez, C., González-Ramírez, E., Gómez-Jiménez, S., & Puma-Ttito, F. (2025). Cleaning binary distortion on MNIST dataset. In L. Martínez-Villaseñor, G. Ochoa-Ruiz, M. Montes Rivera, M. L. Barrón-Estrada, & H. G. Acosta-Mesa (Eds.), Advances in Computational Intelligence. MICAI 2024 International Workshops (pp. 133–142). Springer. https://doi.org/10.1007/978-3-031-83879-8_11

Cayton, L. (2008). Algorithms for manifold learning (Technical Report CS2008-0923). University of California, San Diego.

de Winter, J. C. F., Gosling, S. D., & Potter, J. (2016). Comparing the Pearson and Spearman correlation coefficients across distributions and sample sizes: A tutorial using simulations and empirical data. Psychological Methods, 21(3), 273–290. https://doi.org/10.1037/met0000079

Delete this duplicate. Keep only: van der Maaten, L. J. P., & Hinton, G. E. (2008).

Deng, X., Yuan, M., & Sudjianto, A. (2007). A note on robust kernel principal component analysis. In J. S. Verducci, X. Shen, & J. Lafferty (Eds.), Prediction and Discovery (Contemporary Mathematics, Vol. 443, pp. 21–34). American Mathematical Society. https://doi.org/10.1090/conm/443/08552

Fefferman, C., Mitter, S., & Narayanan, H. (2016). Testing the manifold hypothesis. Journal of the American Mathematical Society, 29(4), 983–1049. https://doi.org/10.1090/jams/852

Fisher, R. A. (1919). XV.—The correlation between relatives on the supposition of Mendelian inheritance. Earth and Environmental Science Transactions of the Royal Society of Edinburgh, 52(2), 399–433. https://doi.org/10.1017/S0080456800012163

Ghojogh, B. (2021). Data reduction algorithms in machine learning and data science [Doctoral dissertation, University of Waterloo]. UWSpace. https://uwspace.uwaterloo.ca/items/b30d8515-09d1-4063-98dd-1d13a4dc4e56

Ghojogh, B., Crowley, M., Karray, F., & Ghodsi, A. (2023). Elements of Dimensionality Reduction and Manifold Learning. Springer. https://doi.org/10.1007/978-3-031-10602-6

Ghojogh, B., Samad, M. N., Mashhadi, S. A., Kapoor, T., Ali, W., Karray, F., & Crowley, M. (2019). Feature selection and feature extraction in pattern analysis: A literature review [Preprint]. arXiv. https://doi.org/10.48550/arXiv.1905.02845

Gonzalez, R. C., & Woods, R. E. (2018). Digital image processing (4th ed., global ed.). Pearson.

Gorban, A. N., Kégl, B., Wunsch, D. C., & Zinovyev, A. Y. (Eds.). (2008). Principal Manifolds for Data Visualization and Dimension Reduction (Vol. 58). Springer. https://doi.org/10.1007/978-3-540-73750-6

He, X., Yan, S., Hu, Y., Niyogi, P., & Zhang, H.-J. (2005). Face recognition using Laplacianfaces. IEEE Transactions on Pattern Analysis and Machine Intelligence, 27(3), 328–340. https://doi.org/10.1109/TPAMI.2005.55

Hira, Z. M., & Gillies, D. F. (2015). A review of feature selection and feature extraction methods applied on microarray data. Advances in Bioinformatics, 2015, Article 198363. https://doi.org/10.1155/2015/198363

Jansen, A., & Niyogi, P. (2013). Intrinsic spectral analysis. IEEE Transactions on Signal Processing, 61(7), 1698–1710. https://doi.org/10.1109/TSP.2013.2238931

Jolliffe, I. T. (2014). Principal component analysis. In Wiley StatsRef: Statistics Reference Online. Wiley. https://doi.org/10.1002/9781118445112.stat06472

LeCun, Y., Bengio, Y., & Hinton, G. (2015). Deep learning. Nature, 521(7553), 436–444. https://doi.org/10.1038/nature14539

McInnes, L., Healy, J., & Melville, J. (2018). UMAP: Uniform Manifold Approximation and Projection for Dimension Reduction [Preprint]. arXiv. https://doi.org/10.48550/arXiv.1802.03426

Mittal, M., J, P. G., M S, G. P., Devadas, R. M., Ambreen, L., & Kumar, V. (2024). Dimensionality reduction using UMAP and TSNE technique. In 2024 Second International Conference on Advances in Information Technology (ICAIT) (Vol. 1, pp. 1–5). IEEE. https://doi.org/10.1109/ICAIT61638.2024.10690797

Rebekić, A., Lončarić, Z., Petrović, S., & Marić, S. (2015). Pearson’s or Spearman’s correlation coefficient—Which one to use? Poljoprivreda, 21(2), 47–54. https://doi.org/10.18047/poljo.21.2.8

Reverter, F., Vegas, E., & Oller, J. M. (2014). Kernel-PCA data integration with enhanced interpretability. BMC Systems Biology, 8(Suppl. 2), Article S6. https://doi.org/10.1186/1752-0509-8-S2-S6

Saul, L. K., Weinberger, K. Q., Sha, F., Ham, J., & Lee, D. D. (2006). Spectral methods for dimensionality reduction. In O. Chapelle, B. Schölkopf, & A. Zien (Eds.), Semi-supervised learning. MIT Press. https://doi.org/10.7551/mitpress/9780262033589.003.0016

Schölkopf, B., Mika, S., Burges, C. J. C., Knirsch, P., Müller, K.-R., Rätsch, G., & Smola, A. J. (1999). Input space versus feature space in kernel-based methods. IEEE Transactions on Neural Networks, 10(5), 1000–1017. https://doi.org/10.1109/72.788641

Schölkopf, B., Smola, A. J., & Müller, K.-R. (1999). Kernel principal component analysis. In B. Schölkopf, C. J. C. Burges, & A. J. Smola (Eds.), Advances in kernel methods—Support vector learning (pp. 327–352). MIT Press.

Schölkopf, B., Smola, A., & Müller, K.-R. (1997). Kernel principal component analysis. In W. Gerstner, A. Germond, M. Hasler, & J.-D. Nicoud (Eds.), Artificial Neural Networks—ICANN’97 (pp. 583–588). Springer. https://doi.org/10.1007/BFb0020217

Schölkopf, B., Smola, A., & Müller, K.-R. (1998). Nonlinear component analysis as a kernel eigenvalue problem. Neural Computation, 10(5), 1299–1319. https://doi.org/10.1162/089976698300017467

Shah, F. P., & Patel, V. (2016). A review on feature selection and feature extraction for text classification. In 2016 International Conference on Wireless Communications, Signal Processing and Networking (WiSPNET) (pp. 2264–2268). IEEE. https://doi.org/10.1109/WiSPNET.2016.7566545

Shawe-Taylor, J., & Cristianini, N. (2004). Kernel Methods for Pattern Analysis. Cambridge University Press. https://doi.org/10.1017/CBO9780511809682

Strange, H., & Zwiggelaar, R. (2014). Spectral dimensionality reduction. In H. Strange & R. Zwiggelaar (Eds.), Open Problems in Spectral Dimensionality Reduction (pp. 7–22). Springer. https://doi.org/10.1007/978-3-319-03943-5_2

van der Maaten, L. J. P., & Hinton, G. E. (2008). Visualizing data using t-SNE. Journal of Machine Learning Research, 9, 2579–2605. https://www.jmlr.org/papers/v9/vandermaaten08a.html

van Vliet, L. J., & Verbeek, P. W. (1995). Estimators for orientation and anisotropy in digitized images. In Proceedings of the First Annual Conference of the Advanced School for Computing and Imaging (pp. 442–450). ASCI.

Walpole, R. E., Myers, R. H., Myers, S. L., & Ye, K. (2012). Probabilidad y estadística para ingeniería y ciencias (9.ª ed.). Pearson.

Wang, Q. (2012). Kernel Principal Component Analysis and its Applications in Face Recognition and Active Shape Models [Preprint]. arXiv. https://doi.org/10.48550/arXiv.1207.3538

Yun, H. S., Jargal, A., Hyun, C. M., & Seo, J. K. (2023). Nonlinear representation and dimensionality reduction. In J. K. Seo (Ed.), Deep Learning and Medical Applications (pp. 1–49). Springer. https://doi.org/10.1007/978-981-99-1839-3_1

Downloads

Published

2026-06-12

How to Cite

Castañeda Díaz, R., López-Betancur, D., Guerrero-Méndez, C., González-Ramírez, E., Puma-Ttito, F., & Carrera-Escobedo, V. (2026). Spectral analysis of distorted and autoencoder-reconstructed MNIST images in a kernel-induced feature space. International Journal of Combinatorial Optimization Problems and Informatics, 17(3), 287–301. https://doi.org/10.61467/2007.1558.2026.v17i3.1457

Issue

Section

Articles

Most read articles by the same author(s)