Word Sense Disambiguation for Arabic Text Categorization
Meryeme Hadni1, Said El Alaoui1, Abdelmonaime Lachkar2
1Department of Computer Science, Sidi Mohamed Ben Abdellah University, Morocco
2Department of Electrical and Computer Engineering, Sidi Mohamed Ben Abdellah University, Morocco
Abstract: In this paper, we present two contributions for Arabic Word Sense Disambiguation. In the first one, we propose to use both two external resources Arabic WordNet (AWN) and WN based on term to term Machine Translation System (MTS). The second contribution consists of choosing the nearest concept for the ambiguous terms, based on more relationships with different concepts in the same local context. To evaluate the accuracy of our proposed method, several experiments have been conducted using Feature Selection methods; Chi-Square and CHIR, two machine learning techniques; the Naïve Bayesian (NB) and Support Vector Machine (SVM).The obtained results illustrate that using the proposed method increases greatly the performance of our Arabic Text Categorization System.
Keywords: WSD, arabic text categorization system, AWN, MTS.
Received September 1, 2015; accepted October 18, 2015