IDENTIFYING SEMANTICALLY SIMILAR ARABIC WORDS USING A LARGE VOCABULARY SPEECH RECOGNITION SYSTEM

机译：使用大词汇语音识别系统识别语义上类似的阿拉伯语单词

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Users search digital libraries for book references using one or more attributes such as keywords, subject and author name. Some book titles might contain the keyword that the user specified and thus these titles will directly qualify as candidate results. On the other hand there are other titles that are relevant but do not contain the same exact search keyword. A user expects to retrieve all titles that are relevant to a specified keyword. Similarly when searching for an author name, the system should be able to retrieve the different forms of the name. The library science community developed a mechanism called authority control that allows the user to do a comprehensive search and retrieve all the records that are relevant to the query keyword. In this paper we propose an approach that allows the user to query an Arabic audio library using voice. We use a combination of class-based language models and robust interpretation to recognize and identify the spoken keywords. The mechanism uses a Large Vocabulary Recognition System (LVCSR) to implement the functionality of the authority control system. A series of experiments were performed to assess the accuracy and the robustness of the proposed approach: restricted grammar recognition with semantic interpretation, class-based statistical language models (CB-SLM) with robust interpretation, and generalized CB-SLM. The results have shown that the combination of CB-SLM and robust interpretation provides better accuracy and robustness than the traditional grammar-based parsing.

机译：用户使用一个或多个属性（如关键字，主题和作者名称）搜索数字库的图书引用。一些书籍标题可能包含用户指定的关键字，因此这些标题将直接限定为候选结果。另一方面，还有其他相关的标题，但不包含相同的精确搜索关键字。用户希望检索与指定关键字相关的所有标题。同样在搜索作者名称时，系统应该能够检索名称的不同形式。图书馆科学界开发了一种称为权限控制的机制，允许用户全面搜索并检索与查询关键字相关的所有记录。在本文中，我们提出了一种方法，允许用户使用语音查询阿拉伯音频库。我们使用基于类的语言模型的组合和强大的解释来识别和识别口头关键字。该机制使用大型词汇识别系统（LVCSR）来实现权限控制系统的功能。进行了一系列实验以评估所提出的方法的准确性和鲁棒性：限制语法识别与语义解释，基于类的统计语言模型（CB-SLM）具有鲁棒解释和广义CB-SLM。结果表明，CB-SLM和鲁棒解释的组合提供了比传统的基于语法的解析更好的准确性和鲁棒性。

著录项

来源
《IASTED International Conference on Internet and Multimedia Systems and Applications》|2005年||共6页
会议地点
作者
Habib Talhami; Ibrahim Kamel;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP393-53;
关键词
Arabic; Indexing; Speech recognition; Language processing;

机译：阿拉伯语;索引;语音识别;语言处理;

相似文献

外文文献
中文文献
专利

1. Modelling Semantic Context of OOV Words in Large Vocabulary Continuous Speech Recognition [J] . Imran Sheikh, Dominique Fohr, Irina Illina, Audio, Speech, and Language Processing, IEEE/ACM Transactions on . 2017,第3期

机译：大词汇量连续语音识别中OOV词的语义上下文建模
2. Dealing with Out-of vocabulary Words and Filled Pauses in Word N-gram Based Speech Recognition System [J] . ATSUHIKO KAI, YOSHIFUMI HIROSE, SEIICHI NAKAGAWA 情報処理学会論文誌 . 1999,第4期

机译：基于单词N-gram的语音识别系统处理词汇外单词和填充的暂停
3. An improved two-stage mixed language model approach for handling out-of-vocabulary words in large vocabulary continuous speech recognition [J] . Bert Reveil, Kris Demuynck, Jean-Pierre Martens Computer speech and language . 2014,第1期

机译：一种改进的两阶段混合语言模型方法，用于处理大词汇量连续语音识别中的词汇外单词
4. IDENTIFYING SEMANTICALLY SIMILAR ARABIC WORDS USING A LARGE VOCABULARY SPEECH RECOGNITION SYSTEM [C] . Habib Talhami, Ibrahim Kamel, Ibrahim Kamel IASTED(International Association of Science and Technology for Development) International Conference on Internet and Multimedia Systems and Applications; 20050221-23; Grindelwald(CH) . 2005

机译：使用大型语音识别系统识别相似的阿拉伯语单词
5. Learning Out-of-Vocabulary Words in Automatic Speech Recognition. [D] . Qin, Long. 2013

机译：在自动语音识别中学习词汇外单词。
6. Age-related Effects on Word Recognition: Reliance on Cognitive Control Systems with Structural Declines in Speech-responsive Cortex [O] . Mark A. Eckert, Adam Walczak, Jayne Ahlstrom, 2008

机译：与年龄相关的单词识别影响：依赖语音控制皮层结构下降的认知控制系统
7. Word based off-line handwritten Arabic classification and recognition. Design of automatic recognition system for large vocabulary offline handwritten Arabic words using machine learning approaches. [O] . AlKhateeb Jawad Hasan Yasin 2010

机译：基于单词的离线手写阿拉伯语分类和识别。利用机器学习方法设计大词汇量离线阿拉伯语手写单词自动识别系统。

IDENTIFYING SEMANTICALLY SIMILAR ARABIC WORDS USING A LARGE VOCABULARY SPEECH RECOGNITION SYSTEM

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅