A framework for Quranic verses authenticity detection in online forum
واصفات البياناتعرض سجل المادة الكامل
Quran is the holy book for all Muslims around the world. Since more than 1400 years, it was preserved in all possible ways from distortion. The huge increment and spread of digital media and internet usage, leaded to many organizational and individual websites, services, and applications are being introduced to spread the knowledge related to Quran as well as Quranic Verses, Translations, Explanations with the Tafseer and other Quranic sciences in its digital formats, some of these services are less authentic. In this paper we introduce a framework to detect and authenticate Quranic verses in a text extracted from online source especially forums posts. The proposed methodology of detection is based on the assumption that Quranic Verses are the parts of the text that contain more diacritics (Harakat). Other assumptions were also established to increase the accuracy of detection in case of less diacritic text. Authentication methodology is based on computing numerical Identifiers of words in the detected text then comparing these identifiers with Identifiers of original Quranic manuscript. Experiments show acceptable results on the detections rate of the highly and less diacritic text. The accuracy was 62% in average while the precision and recall were 75% and 78%, respectively. Future works will focus on authentication side as well as incorporating computational intelligence methods, that involved the sound of the words pronounce during the reading of Quranic verses, image processing and others, to improve the detection.