Morpho-Syntactic Tagging System Based on the Patterns Words for Arabic Texts

Morpho-Syntactic Tagging System Based on  the Patterns Words for Arabic Texts

Abdelhamid El-Jihad1, Abdellah Yousfi2, and Aouragh Si-Lhoussain 3
1Institute for Studies and Research on arabization, Rabat, Morocco
2University Mohamad V Suissi, Rabat, Morocco
3University Mohamad I-Oujda, Morocco
 
Abstract: Text tagging is a very important tool for various applications in natural language processing, namely the morphological and syntactic analysis of texts, indexation and information retrieval, "vocalization" of Arabic texts, and probabilistic language model (n-class model). However, these systems based on the lexemes of limited size, are unable to treat unknown words consequently. To overcome this problem, we developed in this paper, a new system based on the patterns of unknown words and the hidden Markov model. The experiments are carried out in the set of labeled texts, the set of 3800 patterns, and the 52 tags of morpho-syntactic nature, to estimate the parameters of the new model HMM.

Keywords: Hidden markov model, morpho-syntactic tagging, Arabic text, and pattern.

  Received September 22, 2008; accepted May 17, 2009

  

Full Text

Read 2629 times Last modified on Wednesday, 13 July 2011 08:30
Share
Top
We use cookies to improve our website. By continuing to use this website, you are giving consent to cookies being used. More details…