首页> 外文会议>IASTED International Conference on Internet and Multimedia Systems and Applications >IDENTIFYING SEMANTICALLY SIMILAR ARABIC WORDS USING A LARGE VOCABULARY SPEECH RECOGNITION SYSTEM
【24h】

IDENTIFYING SEMANTICALLY SIMILAR ARABIC WORDS USING A LARGE VOCABULARY SPEECH RECOGNITION SYSTEM

机译:使用大词汇语音识别系统识别语义上类似的阿拉伯语单词

获取原文
获取外文期刊封面目录资料

摘要

Users search digital libraries for book references using one or more attributes such as keywords, subject and author name. Some book titles might contain the keyword that the user specified and thus these titles will directly qualify as candidate results. On the other hand there are other titles that are relevant but do not contain the same exact search keyword. A user expects to retrieve all titles that are relevant to a specified keyword. Similarly when searching for an author name, the system should be able to retrieve the different forms of the name. The library science community developed a mechanism called authority control that allows the user to do a comprehensive search and retrieve all the records that are relevant to the query keyword. In this paper we propose an approach that allows the user to query an Arabic audio library using voice. We use a combination of class-based language models and robust interpretation to recognize and identify the spoken keywords. The mechanism uses a Large Vocabulary Recognition System (LVCSR) to implement the functionality of the authority control system. A series of experiments were performed to assess the accuracy and the robustness of the proposed approach: restricted grammar recognition with semantic interpretation, class-based statistical language models (CB-SLM) with robust interpretation, and generalized CB-SLM. The results have shown that the combination of CB-SLM and robust interpretation provides better accuracy and robustness than the traditional grammar-based parsing.
机译:用户使用一个或多个属性(如关键字,主题和作者名称)搜索数字库的图书引用。一些书籍标题可能包含用户指定的关键字,因此这些标题将直接限定为候选结果。另一方面,还有其他相关的标题,但不包含相同的精确搜索关键字。用户希望检索与指定关键字相关的所有标题。同样在搜索作者名称时,系统应该能够检索名称的不同形式。图书馆科学界开发了一种称为权限控制的机制,允许用户全面搜索并检索与查询关键字相关的所有记录。在本文中,我们提出了一种方法,允许用户使用语音查询阿拉伯音频库。我们使用基于类的语言模型的组合和强大的解释来识别和识别口头关键字。该机制使用大型词汇识别系统(LVCSR)来实现权限控制系统的功能。进行了一系列实验以评估所提出的方法的准确性和鲁棒性:限制语法识别与语义解释,基于类的统计语言模型(CB-SLM)具有鲁棒解释和广义CB-SLM。结果表明,CB-SLM和鲁棒解释的组合提供了比传统的基于语法的解析更好的准确性和鲁棒性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号