Method for the Evaluation of Similarity Measures using short Texts

Authors

  • Maricela Bravo, Universidad Autónoma Metropolitana
  • Luis Fernando Hoyos Reyes, Universidad Autónoma Metropolitana
  • Domingo Rodríguez Benavides, Universidad Autónoma Metropolitana
  • Leonardo D. Sánchez-Martínez, Universidad Autónoma Metropolitana

Keywords:

Similarity measures, short texts comparison, scientific publishing, evaluation of similarity measures

Abstract

Multiple online collections and databases of scientific articles are publicly available. To take full advantage of these resources, it is necessary to process, arrange, and correlate texts with respect to a classification or ontology. To achieve an efficient organization and a more relevant correlation between texts, a similarity measure for short texts is needed. However, determining the best method to calculate the similarity between texts is an arduous task, since many similarity measures are reported in the literature. Additionally, the collection of texts to which the similarity measures are applied should be considered: while some measures are useful for some types of information sources, they fail when the collection of data changes. Therefore, it is necessary to have a method to evaluate the performance of similarity measures from a statistical perspective and in terms of the accuracy achieved by each measure.
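
As a rough illustration of what comparing similarity measures by accuracy can look like, the following minimal Python sketch scores two common measures (Jaccard and cosine over term frequencies) against a handful of labeled short-text pairs. It is not the authors' method: the choice of measures, the whitespace tokenizer, the 0.5 decision threshold, the function names, and the example pairs are all assumptions made here for illustration, and the statistical side of the evaluation mentioned in the abstract is not covered.

```python
from collections import Counter
from math import sqrt


def tokens(text):
    """Lowercase and split on whitespace; a deliberately simple tokenizer."""
    return text.lower().split()


def jaccard(a, b):
    """Jaccard similarity between the token sets of two texts."""
    sa, sb = set(tokens(a)), set(tokens(b))
    if not sa and not sb:
        return 1.0  # two empty texts are treated as identical
    return len(sa & sb) / len(sa | sb)


def cosine(a, b):
    """Cosine similarity between the term-frequency vectors of two texts."""
    ca, cb = Counter(tokens(a)), Counter(tokens(b))
    dot = sum(ca[t] * cb[t] for t in ca)
    norm = sqrt(sum(v * v for v in ca.values())) * sqrt(sum(v * v for v in cb.values()))
    return dot / norm if norm else 0.0


def accuracy(measure, labeled_pairs, threshold=0.5):
    """Fraction of pairs whose thresholded score agrees with the label."""
    hits = sum((measure(a, b) >= threshold) == label for a, b, label in labeled_pairs)
    return hits / len(labeled_pairs)


# Hypothetical labeled pairs (True = related), invented for illustration only.
PAIRS = [
    ("graph coloring heuristics", "heuristics for graph coloring", True),
    ("short text similarity", "similarity of short text", True),
    ("neural machine translation", "vehicle routing problem", False),
    ("ontology based classification", "integer linear programming", False),
]

for name, measure in [("jaccard", jaccard), ("cosine", cosine)]:
    print(name, accuracy(measure, PAIRS))
```

Swapping in a different collection for PAIRS is the simplest way to see the point made in the abstract: a measure that scores well on one collection may score poorly on another.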

Published

2023-02-22

How to Cite

Bravo, M., Hoyos Reyes, L. F., Rodríguez Benavides, D., & Sánchez-Martínez, L. D. (2023). Method for the Evaluation of Similarity Measures using short Texts. International Journal of Combinatorial Optimization Problems and Informatics, 14(1), 2–10. Retrieved from https://ijcopi.org/ojs/article/view/333

Section

Articles