Topic indexing of spoken documents based on optimized N-best approach


Abstract

For topic indexing of spoken documents, the goal is to reduce the word error rate rather than the whole-sentence error rate, so the speech recognition system selects the center hypothesis among the N-best results as its final output. Each spoken document is then represented as a high-dimensional vector in the vector space model, and non-negative matrix factorization (NMF) or singular value decomposition (SVD) is used to map the vector space into a semantic space. Experimental results show that the optimized N-best approach is better suited to topic indexing than the one-best method. Combined with NMF, the optimized N-best approach reaches a correct topic indexing rate of 98.1%, which is 0.9% higher than the one-best approach under the same conditions. When the dimension of the semantic space is reduced to 10, the difference between the two approaches is about 11.1%. Furthermore, compared with SVD, NMF offers better performance, faster computation, and lower storage requirements.
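
The center-hypothesis selection described in the abstract can be sketched as follows. This is a minimal illustration and not the authors' implementation: it assumes the "center" is the hypothesis with the smallest total word-level edit distance to the other N-best hypotheses, with all hypotheses weighted equally. The function names and the sample sentences are hypothetical.

```python
from typing import List, Sequence


def word_edit_distance(a: Sequence[str], b: Sequence[str]) -> int:
    """Levenshtein distance computed over words rather than characters."""
    prev = list(range(len(b) + 1))
    for i, word_a in enumerate(a, start=1):
        curr = [i]
        for j, word_b in enumerate(b, start=1):
            cost = 0 if word_a == word_b else 1
            curr.append(min(prev[j] + 1,           # delete word_a
                            curr[j - 1] + 1,       # insert word_b
                            prev[j - 1] + cost))   # substitute or match
        prev = curr
    return prev[-1]


def center_hypothesis(nbest: List[str]) -> str:
    """Return the hypothesis closest, in total word edits, to all the others.

    Assumption: hypotheses are weighted equally. Ties are broken in favor
    of the hypothesis that appears earliest in the N-best list.
    """
    if not nbest:
        raise ValueError("nbest must not be empty")
    tokens = [hyp.split() for hyp in nbest]
    totals = [sum(word_edit_distance(t, other) for other in tokens)
              for t in tokens]
    return nbest[totals.index(min(totals))]


# Hypothetical N-best list, ordered from best to worst recognizer score.
nbest = [
    "the stock market fell today",
    "the stock market fell to day",
    "a stock market fell to day",
    "the stock market fell to pay",
]
print(center_hypothesis(nbest))
```

In this toy list the selected hypothesis is the second entry rather than the top-ranked one, which shows how the center criterion can differ from one-best decoding. A real system might weight each hypothesis by its posterior probability instead of treating them equally.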


Bibliographic details

Source
Proceedings of the 2009 IEEE International Conference on Intelligent Computing and Intelligent Systems | 2009 | pp. 2831-2834 | 4 pages
Authors
Lei Zhang; Jingxin Chang; Xuezhi Xiang; Xiaosen Feng
Original format: PDF
Keywords
Topic indexing; optimized N-best; NMF; SVD

