Semantic Similarity Measures for Malay-English Ambiguous Words

Authors

  • Nurul Husna Mahadzir, School of Quantitative Sciences, University Utara Malaysia, 06100 Sintok, Kedah.
  • Mohd Faizal Omar, School of Quantitative Sciences, University Utara Malaysia, 06100 Sintok, Kedah.
  • Mohd Nasrun Mohd Nawi, School of Technology Management and Logistic, University Utara Malaysia, 06100 Sintok, Kedah.

Keywords:

Ambiguous Words, Information Content Based, Path-Based, Semantic Similarity Measures

Abstract

The measurement of semantic similarity between words or concepts has been widely applied in fields such as Artificial Intelligence, Information Processing and Natural Language Processing (NLP). In this paper, we demonstrate how several semantic similarity measures can be adapted to disambiguate words that belong to two different languages. The measures include two path-based and three information content (IC)-based measures. These five measures are evaluated against a test bed of 40 word pairs containing eight Malay-English ambiguous words. Experimental results on a common benchmark created from human judgments show that Wu and Palmer's measure performs best among the five semantic similarity measures.
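As a rough illustration of what a path-based measure computes, the sketch below implements the standard Wu and Palmer formula, sim(a, b) = 2 * depth(lcs) / (depth(a) + depth(b)), together with a simple shortest-path measure, 1 / (1 + number of edges). The taxonomy, the concept names and the function names are hypothetical and are not taken from the paper; the abstract does not say which second path-based measure was used, so the shortest-path variant here is only an assumption. Depth is counted in nodes, so the root has depth 1.

```python
# Toy is-a taxonomy (hypothetical, not from the paper): child -> parent.
PARENT = {
    "substance": "entity",
    "liquid": "substance",
    "gas": "substance",
    "water": "liquid",
    "air": "gas",
}


def path_to_root(concept):
    """Return the chain of concepts from `concept` up to the root."""
    path = [concept]
    while path[-1] in PARENT:
        path.append(PARENT[path[-1]])
    return path


def depth(concept):
    """Depth counted in nodes, so the root has depth 1."""
    return len(path_to_root(concept))


def lowest_common_subsumer(a, b):
    """Deepest concept that is an ancestor of (or equal to) both a and b."""
    ancestors_a = set(path_to_root(a))
    for node in path_to_root(b):  # walks upward, so the first hit is the deepest
        if node in ancestors_a:
            return node
    raise ValueError("concepts do not share a root")


def wu_palmer(a, b):
    """Wu and Palmer similarity: 2 * depth(lcs) / (depth(a) + depth(b))."""
    lcs = lowest_common_subsumer(a, b)
    return 2 * depth(lcs) / (depth(a) + depth(b))


def path_similarity(a, b):
    """Shortest-path similarity: 1 / (1 + number of edges between a and b)."""
    lcs = lowest_common_subsumer(a, b)
    edges = (depth(a) - depth(lcs)) + (depth(b) - depth(lcs))
    return 1 / (1 + edges)


if __name__ == "__main__":
    print(wu_palmer("water", "air"))
    print(path_similarity("water", "air"))
```

In a bilingual setting, one plausible use is to score each candidate sense of an ambiguous word against a context word and keep the sense with the highest similarity; the information content-based measures would additionally need concept frequencies from a corpus, which this sketch does not model.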

Published

2018-02-26

How to Cite

Mahadzir, N. H., Omar, M. F., & Mohd Nawi, M. N. (2018). Semantic Similarity Measures for Malay-English Ambiguous Words. Journal of Telecommunication, Electronic and Computer Engineering (JTEC), 10(1-11), 109–112. Retrieved from https://jtec.utem.edu.my/jtec/article/view/3860