
Speech data retrieval apparatus, speech data retrieval method, speech data retrieval program, and computer-readable medium storing the speech data retrieval program


Abstract

A speech data retrieval apparatus (10) includes a speech database (1), a speech recognition unit (2), a confusion network creation unit (3), an inverted index table creation unit (4), a query input unit (6), a query conversion unit (7), and a label string check unit (8). The speech recognition unit (2) reads speech data from the speech database (1), performs speech recognition on it, and outputs the recognition result as a lattice whose base unit is a phoneme, a syllable, or a word. The confusion network creation unit (3) creates a confusion network from that lattice and outputs the recognition result as the confusion network. The inverted index table creation unit (4) creates an inverted index table from the confusion network. The query input unit (6) receives a query input by a user, performs speech recognition on the query, and outputs the recognition result as a character string. The query conversion unit (7) converts the character string into a label string whose base unit is a phoneme, a syllable, or a word. The label string check unit (8) checks the label string against the inverted index table and retrieves the speech data in the speech database (1) that contains the label string.
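
The indexing and checking steps described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. It assumes a confusion network is a list of slots, each mapping candidate labels to posterior probabilities, and it assumes a query matches only when its labels occur in consecutive slots (skip arcs and other refinements are ignored). The function names `build_inverted_index` and `search`, the scoring by product of posteriors, and the sample data are all hypothetical.

```python
from collections import defaultdict


def build_inverted_index(networks):
    """Map each label to the places it occurs in the confusion networks.

    networks: dict of utterance id -> list of slots, where each slot is a
    dict of label -> posterior probability (an assumed representation).
    Returns: dict of label -> list of (utterance id, slot position, posterior).
    """
    index = defaultdict(list)
    for utt_id, slots in networks.items():
        for pos, slot in enumerate(slots):
            for label, prob in slot.items():
                index[label].append((utt_id, pos, prob))
    return index


def search(index, labels):
    """Find utterances whose networks contain the label string.

    A match requires the labels to appear in consecutive slots (a
    simplifying assumption). Each match is scored by the product of the
    posteriors; the best score per utterance is returned.
    """
    if not labels:
        return {}
    # Candidate matches keyed by (utterance id, start slot) -> running score.
    current = {(u, p): prob for u, p, prob in index.get(labels[0], [])}
    for offset, label in enumerate(labels[1:], start=1):
        postings = {(u, p): prob for u, p, prob in index.get(label, [])}
        extended = {}
        for (u, start), score in current.items():
            prob = postings.get((u, start + offset))
            if prob is not None:
                extended[(u, start)] = score * prob
        current = extended
    best = {}
    for (u, _start), score in current.items():
        best[u] = max(best.get(u, 0.0), score)
    return best


# Hypothetical phoneme-level confusion networks for two utterances.
networks = {
    "utt1": [{"k": 0.9, "g": 0.1}, {"a": 1.0}, {"t": 0.6, "d": 0.4}],
    "utt2": [{"d": 0.7, "t": 0.3}, {"o": 1.0}, {"g": 1.0}],
}
index = build_inverted_index(networks)
hits = search(index, ["k", "a", "t"])  # only utt1 contains this label string
```

Because the index is keyed by label, the check only visits slots that contain a query label and never scans the full networks, which is the usual reason for building an inverted index first.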
