Morpho-Syntactic Tagging System Based on the Patterns Words for Arabic Texts

Morpho-Syntactic Tagging System Based on  the Patterns Words for Arabic Texts

Abdelhamid El-Jihad1, Abdellah Yousfi2, and Aouragh Si-Lhoussain 3
1Institute for Studies and Research on arabization, Rabat, Morocco
2University Mohamad V Suissi, Rabat, Morocco
3University Mohamad I-Oujda, Morocco
 
Abstract: Text tagging is a very important tool for various applications in natural language processing, namely the morphological and syntactic analysis of texts, indexation and information retrieval, "vocalization" of Arabic texts, and probabilistic language model (n-class model). However, these systems based on the lexemes of limited size, are unable to treat unknown words consequently. To overcome this problem, we developed in this paper, a new system based on the patterns of unknown words and the hidden Markov model. The experiments are carried out in the set of labeled texts, the set of 3800 patterns, and the 52 tags of morpho-syntactic nature, to estimate the parameters of the new model HMM.

Keywords: Hidden markov model, morpho-syntactic tagging, Arabic text, and pattern.

  Received September 22, 2008; accepted May 17, 2009

  

Full Text

Read 2924 times Last modified on Wednesday, 13 July 2011 08:30
Share

Upcoming courses

  • Diploma Courses
  • Business and Enterprise
  • Digital Literacy & IT
  • Health Literacy
  • Business Literacy

Free courses

Starting from Jun. 14 2016

the degree finder

in 3 easy steps
Top
We use cookies to improve our website. By continuing to use this website, you are giving consent to cookies being used. More details…