Method for the Unification and Reduction of the Search Space of V Gene Segments in Sequence Alignments
DOI:
https://doi.org/10.61467/2007.1558.2025.v16i4.1028Keywords:
Clustering, data integration, reduction of search spaceAbstract
The identification and characterisation of V genes poses a significant challenge due to the substantial number of alignments generated by diverse sequencing systems. This study proposes a method for the unification and reduction of the search space, with the objective of optimising the identification of V genes. The method integrates preprocessing, normalisation, and clustering using Gaussian Mixture Models. This approach facilitates data consolidation and reduces redundancy, thereby enhancing the efficiency and accuracy of the subsequent analysis. The elbow method was employed to determine the optimal number of groups, achieving a 98% reduction in the search space. The findings were validated through the use of metrics such as mean absolute error, mean squared error, and root mean squared error, thereby confirming the effectiveness of the method in improving the precision of gene identification.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2025 International Journal of Combinatorial Optimization Problems and Informatics

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.