首页> 外文会议> >Fast latent semantic indexing of spoken documents by using self-organizing maps
【24h】

Fast latent semantic indexing of spoken documents by using self-organizing maps

机译:通过自组织映射快速对语音文档进行潜在语义索引

获取原文

摘要

This paper describes a new latent semantic indexing (LSI) method for spoken audio documents. The framework is indexing broadcast news from radio and TV as a combination of large vocabulary continuous speech recognition (LVCSR), natural language processing (NLP) and information retrieval (IR). For indexing, the documents are presented as vectors of word counts, whose dimensionality is rapidly reduced by random mapping (RM). The obtained vectors are projected into the latent semantic subspace determined by SVD, where the vectors are then smoothed by a self-organizing map (SOM). The smoothing by the closest document clusters is important here, because the documents are often short and have a high word error rate (WER). As the clusters in the semantic subspace reflect the news topics, the SOMs provide an easy way to visualize the index and query results and to explore the database. Test results are reported for TREC's spoken document retrieval databases (www.idiap.ch/kurimo/thisl.html).
机译:本文介绍了一种新的语音语音文档潜在语义索引(LSI)方法。该框架将广播和广播中的广播新闻编入索引,这是大词汇量连续语音识别(LVCSR),自然语言处理(NLP)和信息检索(IR)的组合。为了建立索引,文档以单词计数的向量表示,其维数通过随机映射(RM)迅速降低。将获得的向量投影到由SVD确定的潜在语义子空间中,然后通过自组织映射(SOM)对向量进行平滑处理。在这里,最接近的文档簇进行平滑处理很重要,因为文档通常很短并且具有较高的误码率(WER)。由于语义子空间中的聚类反映了新闻主题,因此SOM提供了一种简便的方法来可视化索引和查询结果以及浏览数据库。测试结果报告给TREC的口头文档检索数据库(www.idiap.ch/kurimo/thisl.html)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号