Arabic Anaphora Resolution: Corpora Annotation with Coreferential Links
Souha Hammami, Lamia Belguith, and Abdelmajid Ben Hamadou
LARIS-MIRACL Laboratory, University of Sfax, Tunisia
Abstract: Annotated resources are much needed for training and evaluating anaphora resolution systems. Annotating coreferential chains is a difficult task that cannot be carried out without an appropriate tool. In this paper, we present our work on annotating Arabic corpora with anaphoric links (i.e., the identity relation between anaphors and their antecedents). In particular, we propose an anaphoric annotation tool for Arabic. The tool detects Arabic pronouns automatically and allows the human annotator to link several anaphoric pronouns to the same antecedent. Our aim is to build a real corpus that can be used for anaphora resolution, either for system training or for evaluation.
Keywords: Anaphora resolution, Arabic language, corpus annotation tool, pronominal anaphora, lexical anaphora.
Received December 18, 2008; accepted June 24, 2009