Translation Rules for English to Hindi Machine Translation System: Homoeopathy Domain
Sanjay Dwivedi and Pramod Sukhadeve
Department of Computer Science, Babasaheb Bhimrao Ambedkar University (a Central University), India
Abstract: Rule based machine translation system embraces a set of grammar rules which are mandatory for the mapping of syntactic representations of a source language, on the target language. The system necessitates good linguistic knowledge to write rules and require of acquaintance source such as corpus and bilingual dictionary. In this paper, we have described the grammar rules intended for our English to Hindi machine translation system to translate the homoeopathic literatures, medical reports, prescription etc. The rules which have been written follow the transfer based approach for reordering of rules between two languages. The paper first discusses about our developed stemmer and its rules, further we discuss the Part of Speech tagging (PoS) rules for categorizing each word of the sentence grammatically and our developed homoeopathy corpus in English and Hindi of size 20085 and 20072 words respectively and at the last we discuss the agreement/translation rules for translating various homoeopathic sentences.
Keywords: Machine translation, stemmer, PoS tagging, grammar rules, homoeopathy, corpus.
Received June 14, 2013; Accepted Mrch 17, 2014