【24h】

IDENTIFYING SEMANTICALLY SIMILAR ARABIC WORDS USING A LARGE VOCABULARY SPEECH RECOGNITION SYSTEM

机译:使用大型语音识别系统识别相似的阿拉伯语单词

获取原文
获取原文并翻译 | 示例

摘要

Users search digital libraries for book references using one or more attributes such as keywords, subject and author name. Some book titles might contain the keyword that the user specified and thus these titles will directly qualify as candidate results. On the other hand there are other titles that are relevant but do not contain the same exact search keyword. A user expects to retrieve all titles that are relevant to a specified keyword. Similarly when searching for an author name, the system should be able to retrieve the different forms of the name. The library science community developed a mechanism called authority control that allows the user to do a comprehensive search and retrieve all the records that are relevant to the query keyword. In this paper we propose an approach that allows the user to query an Arabic audio library using voice. We use a combination of class-based language models and robust interpretation to recognize and identify the spoken keywords. The mechanism uses a Large Vocabulary Recognition System (LVCSR) to implement the functionality of the authority control system. A series of experiments were performed to assess the accuracy and the robustness of the proposed approach: restricted grammar recognition with semantic interpretation, class-based statistical language models (CB_SLM) with robust interpretation, and generalized CB-SLM. The results have shown that the combination of CB-SLM and robust interpretation provides better accuracy and robustness than the traditional grammar-based parsing.
机译:用户使用一个或多个属性(例如关键字,主题和作者姓名)在数字图书馆中搜索书籍参考。一些书名可能包含用户指定的关键字,因此这些书名将直接符合候选结果的资格。另一方面,还有其他相关标题,但不包含相同的确切搜索关键字。用户希望检索与指定关键字相关的所有标题。类似地,当搜索作者姓名时,系统应该能够检索姓名的不同形式。图书馆科学界开发了一种称为权限控制的机制,该机制允许用户进行全面的搜索并检索与查询关键字相关的所有记录。在本文中,我们提出了一种允许用户使用语音查询阿拉伯音频库的方法。我们结合使用基于类的语言模型和强大的解释能力来识别和识别口头关键词。该机制使用大型词汇识别系统(LVCSR)来实现权限控制系统的功能。进行了一系列实验以评估所提出方法的准确性和鲁棒性:具有语义解释的受限语法识别,具有鲁棒解释的基于类的统计语言模型(CB_SLM)和广义CB-SLM。结果表明,与传统的基于语法的分析相比,CB-SLM和鲁棒性解释的结合提供了更好的准确性和鲁棒性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号