Bimodal biometric recognition system using Convolutional Neural Networks and fusion of deep audiovisual feature vectors
DOI:
https://doi.org/10.61467/2007.1558.2024.v15i2.289Keywords:
Multimodal biometrics, Speaker recognition, Face recognition, CNN, Audiovisual biometricsAbstract
In recent years, interest has grown in the use biometric systems for identity authentication tasks in digital services, forensic and security applications. A unimodal system (employing a single biometric trait) with high performance is still vulnerable to falsification attacks such as spoofing. For this reason, research on multimodal biometrics (employing various biometric traits) has increased to reinforce security, increase recognition performance, and make false identity authentication more difficult. In this paper, we propose a bimodal system that combines speech and face modalities by concatenating their feature vectors, these vectors are extracted from two convolutional neural networks (CNN) and used for identity verification. The performance of unimodal CNNs was evaluated individually and compared to the bimodal system of concatenated vectors. A data augmentation scheme is used for both modalities to evaluate different operation conditions. Results were measured in terms of Equal Error Rate (EER).
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2024 International Journal of Combinatorial Optimization Problems and Informatics
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.