ARABASE: A Relational Database for Arabic OCR Systems
Najoua Ben Amara 1, Omar Mazhoud 1, Noura Bouzrara 2, and Noureddine Ellouze 2
1 National School of Engineers of Monastir, Tunisia
2 National School of Engineers of Tunis, Tunisia
Abstract: In this paper we present a database for research on optical recognition of Arabic off-line and on-line handwriting, as well as of machine-printed text. Digital images of documents, text phrases, words/sub-words, isolated characters, digits, signatures, and so on are included in ARABASE. The data correspond to a variety of lexicons (city names, literal amounts, isolated characters, digits, free texts, etc.). The organization of the database offers convenient facilities that an Arabic writing recognition system can exploit. A tool with a graphical interface lets the user experiment with various classical image processing tasks.
Keywords: Databases, Arabic writing recognition, on-line and off-line handwriting, printed documents, multi-fonts, multi-writers.
Received July 2, 2004; accepted September 17, 2004