Towards A Distributed Arabic OCR Based on the DTW Algorithm: Performance Analysis

Towards A Distributed Arabic OCR Based on the DTW Algorithm: Performance Analysis

Maher Khemakhem1 and Abdelfettah Belghith2
1Miracl Lab Fsegs, University of Sfax, Tunisia
 2Rim-Cristal Lab, Ensi University of Manouba, Tunisia

Abstract: In spite of the diversity of printed Arabic optical character recognition products and proposals, the problem seems to be not yet well solved. The complex morphology and calligraphy of the Arabic writing on one hand and the use of some light approaches on the other hand are behind the poorness of these products. However, some strong proposed approaches didn’t find the opportunity to be commercialised because of generally their corresponding complex computing.  The dynamic time warping algorithm is considered as one among these strong approaches. In fact, several studies and experiments have shown and confirmed that the printed Arabic optical character recognition based on dynamic time warping algorithm provides a very interesting recognition rate especially for large and huge vocabularies. One of the attractive sides of the dynamic time warping algorithm is its ability to recognize properly connected or cursive characters (words or sub words) without prior segmentation. Furthermore, this algorithm performs the recognition process from within a reference library of isolated characters and owns a very good immunity against noises. Unfortunately, the big amount of its computing during the recognition process makes its execution time very slow and, hence, restricts its utilization. Many researchers attempted to speedup the execution time of this algorithm. Unfortunately, the corresponding proposed solutions require generally specific high cost architectures. Loosely coupled architectures such as grapes or grid computing can provide enough power without additional cost to distribute the complexity of some greedy applications. Consequently, we report in this paper the performance analysis of an analytical and an experimental study of a distributed Arabic optical character recognition based on the dynamic time warping algorithm within loosely coupled architectures. Obtained results confirm that loosely coupled architectures and more specifically grid computing present a very interesting framework to speedup the Arabic optical character recognition based on the dynamic time warping algorithm.

Keywords: Arabic OCR, DTW algorithm, loosely coupled architectures, grapes, grid computing, performance analysis.

Received September 11, 2007; accepted December 22, 2007

 

Full Text

 

Read 9944 times Last modified on Wednesday, 20 January 2010 01:33
Share
Top
We use cookies to improve our website. By continuing to use this website, you are giving consent to cookies being used. More details…