A Rule Based Annotation System to Extract Tajweed Rules

Conference Paper
Alfaries, Auhood. 2013
Publication Work Type: 
Conference paper
Tags: 
NLP, Arabic text analysis, Information extraction, GATE, Tajweed Rules, Quran
Conference Name: 
Taibah University International Conference on Advances in Information Technology for the Holy Quran and Its Sciences
Conference Location: 
Al-Madinah Al-Munawwarah, Saudi Arabia
Conference Date: 
Sunday, December 22, 2013
Sponsoring Organization: 
IEEE
Publication Abstract: 

Quran recitation relies on identifying and applying Tajweed rules [قواعد التجويد], such as Muddud [مدود] and Tanween [تنوين], in the Quran text. This research aims to provide a tool that automatically finds and annotates the letters that embody Tajweed rules in Quran text. The field remains an open research area because of the lack of open source NLP tools that support the Arabic language. Applying Natural Language Processing (NLP) techniques to Quran text to extract Tajweed letters is considered an important Information Extraction (IE) step, and this research explores the application of IE techniques to Quran text. Rule based IE techniques are well known to achieve optimal results. The work uses GATE, a flexible open source NLP environment, to build an application that processes an unannotated Quranic text corpus. The developed application is evaluated with the well known IE evaluation metrics precision and recall, by comparing the system’s automatically annotated text with a gold standard (i.e. Quran text). The system proved effective, achieving 100% precision and recall on the implemented Tajweed rules.
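
The evaluation described in the abstract can be illustrated with a minimal sketch. It is written in Python rather than GATE, and it is not the paper's implementation: the regular expression rule, the function names `annotate_tanween` and `precision_recall`, the sample word, and the gold offset are all assumptions made for illustration. The only rule shown is a simple one that marks the three Unicode tanween diacritics; the resulting annotations are then compared with a hand-labelled gold set to compute precision and recall.

```python
import re

# Tanween diacritics: fathatan (U+064B), dammatan (U+064C), kasratan (U+064D).
# Assumption: a single character-class rule stands in for the paper's GATE rules.
TANWEEN = re.compile("[\u064B\u064C\u064D]")


def annotate_tanween(text):
    """Return the set of character offsets where a tanween mark occurs."""
    return {match.start() for match in TANWEEN.finditer(text)}


def precision_recall(predicted, gold):
    """Compare predicted annotation offsets with gold-standard offsets."""
    true_positives = len(predicted & gold)
    # Convention chosen here: an empty set scores 1.0 to avoid dividing by zero.
    precision = true_positives / len(predicted) if predicted else 1.0
    recall = true_positives / len(gold) if gold else 1.0
    return precision, recall


# Illustrative sample (not taken from the paper's corpus): a seven-character
# word whose final character is dammatan, written with escapes for portability.
sample = "\u0643\u0650\u062A\u064E\u0627\u0628\u064C"
gold = {6}  # hypothetical hand-labelled offset of the tanween mark

predicted = annotate_tanween(sample)
precision, recall = precision_recall(predicted, gold)
print(precision, recall)
```

Precision is the share of the system's annotations that also appear in the gold standard, and recall is the share of gold annotations that the system found, so a score of 100% on both means the two sets of annotations are identical for the rules tested.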