IDENTIFYING SEMANTICALLY SIMILAR ARABIC WORDS USING A LARGE VOCABULARY SPEECH RECOGNITION SYSTEM



Abstract

Users search digital libraries for book references using one or more attributes such as keywords, subject, and author name. Titles that contain the keyword the user specified qualify directly as candidate results. Other titles are relevant but do not contain the exact search keyword, and a user expects to retrieve every title that is relevant to the specified keyword. Similarly, when searching for an author name, the system should be able to retrieve the different forms of that name. The library science community developed a mechanism called authority control that allows the user to run a comprehensive search and retrieve all the records that are relevant to the query keyword. In this paper we propose an approach that allows the user to query an Arabic audio library by voice. We use a combination of class-based language models and robust interpretation to recognize and identify the spoken keywords. The mechanism uses a Large Vocabulary Continuous Speech Recognition (LVCSR) system to implement the functionality of the authority control system. A series of experiments was performed to assess the accuracy and robustness of the proposed approach: restricted grammar recognition with semantic interpretation, class-based statistical language models (CB-SLM) with robust interpretation, and generalized CB-SLM. The results show that the combination of CB-SLM and robust interpretation provides better accuracy and robustness than traditional grammar-based parsing.
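The idea of authority control combined with robust interpretation can be illustrated with a small sketch. It is not the authors' implementation: the authority table, the transliterated name variants, and the function name `interpret` are assumptions made for illustration, and the speech recognizer itself is replaced by a plain text hypothesis. The sketch maps every variant form found anywhere in a hypothesis to one canonical heading and skips words it cannot match, instead of rejecting the whole utterance as a strict grammar would.

```python
# Hypothetical authority file: each variant form maps to one canonical heading.
# The entries are illustrative and are not taken from the paper.
AUTHORITY = {
    "ibn sina": "Ibn Sina",
    "avicenna": "Ibn Sina",
    "abu ali sina": "Ibn Sina",
    "ibn rushd": "Ibn Rushd",
    "averroes": "Ibn Rushd",
}

# Longest variant, in words, so the scan knows how far to look ahead.
MAX_VARIANT_WORDS = max(len(variant.split()) for variant in AUTHORITY)


def interpret(hypothesis: str) -> list[str]:
    """Return the canonical headings found anywhere in a recognizer hypothesis.

    Words that match no variant (fillers, carrier phrases) are skipped
    rather than causing the whole utterance to be rejected.
    """
    words = hypothesis.lower().split()
    found: list[str] = []
    i = 0
    while i < len(words):
        # Prefer the longest variant that starts at position i.
        for n in range(min(MAX_VARIANT_WORDS, len(words) - i), 0, -1):
            phrase = " ".join(words[i:i + n])
            if phrase in AUTHORITY:
                heading = AUTHORITY[phrase]
                if heading not in found:
                    found.append(heading)
                i += n
                break
        else:
            i += 1  # unmatched word: skip it and keep scanning
    return found
```

For example, `interpret("please find books by avicenna")` and `interpret("ibn sina")` both resolve to the same heading, so a catalogue lookup keyed on that heading would return the same records for either spoken form.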


Bibliographic details

Source
IASTED (International Association of Science and Technology for Development) International Conference on Internet and Multimedia Systems and Applications; 21-23 February 2005; Grindelwald (CH) | 2005 | pp. 293-298 | 6 pages
Conference location: Grindelwald (CH)
Authors
Habib Talhami; Ibrahim Kamel
Author affiliation

The Institute of Informatics, The British University in Dubai, P.O. Box 502216, Dubai, United Arab Emirates

Original format: PDF
Language: English
Chinese Library Classification: Computer networks; Multimedia technology and multimedia computers
Keywords
Arabic; indexing; speech recognition; language processing


Similar literature


1. Modelling Semantic Context of OOV Words in Large Vocabulary Continuous Speech Recognition [J]. Imran Sheikh, Dominique Fohr, Irina Illina. IEEE/ACM Transactions on Audio, Speech, and Language Processing. 2017, No. 3
2. Dealing with Out-of-Vocabulary Words and Filled Pauses in Word N-gram Based Speech Recognition System [J]. Atsuhiko Kai, Yoshifumi Hirose, Seiichi Nakagawa. 情報処理学会論文誌. 1999, No. 4
3. An improved two-stage mixed language model approach for handling out-of-vocabulary words in large vocabulary continuous speech recognition [J]. Bert Reveil, Kris Demuynck, Jean-Pierre Martens. Computer Speech and Language. 2014, No. 1
4. Identifying Semantically Similar Arabic Words Using a Large Vocabulary Speech Recognition System [C]. Habib Talhami, Ibrahim Kamel. IASTED International Conference on Internet and Multimedia Systems and Applications. 2005
5. Learning Out-of-Vocabulary Words in Automatic Speech Recognition [D]. Qin, Long. 2013
6. Age-related Effects on Word Recognition: Reliance on Cognitive Control Systems with Structural Declines in Speech-responsive Cortex [O]. Mark A. Eckert, Adam Walczak, Jayne Ahlstrom. 2008
7. Word based off-line handwritten Arabic classification and recognition. Design of automatic recognition system for large vocabulary offline handwritten Arabic words using machine learning approaches [O]. AlKhateeb Jawad Hasan Yasin. 2010
