Arabic Phonemes Transcription Using Data Driven Approach

Arabic Phonemes Transcription Using Data Driven Approach

Khalid Nahar1, Husni Al-Muhtaseb1, Wasfi Al-Khatib1, Moustafa Elshafei1 and Mansour Alghamdi2

1College of Computers and Information Technology, Tabuk University, Saudi Arabia

2Computer Research Institute, King Abdulaziz City for Science and Technology (KACST),

Saudi Arabia

Abstract: The efficiency and correctness of continuous Arabic Speech Recognition Systems (ARS) hinge on the accuracy of the language phoneme set. The main goal of this research is to recognize and transcribe Arabic phonemes using a data-driven approach. We used the Hidden Markov Toolkit (HTK) to develop a phoneme recognizer, carrying out several experiments with different parameters, such as varying number of Hidden Markov Model (HMM) states and Gaussian mixtures to model the Arabic phonemes and find the best configuration. We used a corpus consisting of about 4000 files, representing 5 recorded hours of modern standard Arabic of TV - News. A statistical analysis for the phonemes length, frequency and mode was carried out, in order to determine the best number of states necessary to represent each phoneme. Phoneme recognition accuracy of 56.79% was reached without using a language model. The recognition accuracy increased to 96.3% upon using a bigram language model.

 

Keywords: Phoneme transcription, data - driven, speech recognition, network lattices, Arabic speech corpus.

Received November 20, 2012; accept March 21, 2014

Full Text

 

 

Read 2342 times Last modified on Sunday, 19 August 2018 04:51
Share
Top
We use cookies to improve our website. By continuing to use this website, you are giving consent to cookies being used. More details…