【24h】

A Rule-Based Annotation System to Extract Tajweed Rules from Quran

机译:从古兰经中提取塔吉威规则的基于规则的注释系统

获取原文
获取原文并翻译 | 示例

摘要

Quran Recitation relies on identifying and applying different Tajweed rules [ÞæÇÚÏ ÇáÊÌæíÏ] such as Muddud [ãÏæÏ] and Tanween [Êäæíä] in the Quran text. This research is aimed at providing a tool that automatically finds and annotates letters that embody Tajweed rules in Quran text. This field remains an open research area due to the lack of open source NLP tools that support the Arabic language. Applying Natural Language Processing (NLP) techniques on Quran text to extract Tajweed letters is considered an important Information Extraction (IE) step. This research explores the field of applying IE techniques on Quran text. Rule based IE techniques are well known to achieve optimal results. This research explores NLP techniques on Quranic text using GATE, an open source flexible NLP environment. GATE is employed for this research to build the application that processes un-annotated Quranic text corpus. The developed application is evaluated using the well known IE evaluation metrics precision and recall. By comparing the system's automatically annotated text with a gold standard (i.e. Quran text). The system proved to be efficient by achieving 100% precision and recall of the implemented Tajweed rules.
机译:《古兰经》诵读依赖于识别和应用不同的Tajweed规则[ÞæÇÚÏÇáÊÌæíÏ],例如古兰经文本中的Muddud [ãÏæÏ]和Tanween [Êäæíä]。这项研究旨在提供一种工具,该工具可以自动查找和注释体现古兰经文本中Tajweed规则的字母。由于缺少支持阿拉伯语的开源NLP工具,因此该领域仍然是一个开放的研究领域。在古兰经文本上应用自然语言处理(NLP)技术来提取Tajweed字母被认为是重要的信息提取(IE)步骤。这项研究探索了将IE技术应用于古兰经文本的领域。众所周知,基于规则的IE技术可以达到最佳效果。这项研究使用开源灵活的NLP环境GATE探索了古兰经文本的NLP技术。 GATE用于这项研究,以构建处理未注释的古兰经文本语料库的应用程序。使用众所周知的IE评估指标精度和召回率对开发的应用程序进行评估。通过比较系统的自动注释文本和黄金标准(即古兰经文本)。该系统通过实现100%的精度并调用已实施的Tajweed规则而被证明是有效的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号