|
ARABASE: A Relational
Database for Arabic OCR Systems
Najoua Ben Amara1,
Omar Mazhoud1, Noura Bouzrara2, and Noureddine Ellouze2
1National
School of Engineer of Monastir, Tunisia
2National
School of Engineer of Tunis, Tunisia
Abstract:
In this paper we
present a database for the research of Arabic off-line and on-line handwriting
optical recognition as well as for machine printed text optical recognition.
Digital images of documents, text phrases, words/sub-words, isolated characters,
digits, signatures, soon are and included in ARABASE. Data corresponds to a
variety of lexes (cities names, literal amounts, isolated characters, digits,
free texts, etc.). The database organization offers interesting commodities to
be explored via an Arabic writing recognition system. A useful tool enables the
user, via a graphical interface to experiment different classical tasks of image
processing.
Keywords:
Databases,
Arabic writing recognition, on-line and off-line handwriting, printed documents,
multi-fonts, multi-writers.
Received July 2, 2004;
accepted September 17, 2004
|