首页> 外文会议>INTERSPEECH 2012 >Spoken Document Clustering Using Word Confusion Networks

【24h】

Spoken Document Clustering Using Word Confusion Networks

机译：使用Word混淆网络的口头文档聚类

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, we propose a word confusion network (WCN) based approach to perform clustering of the spoken documents and analyze its ability to handle the influence of speech recognition errors. WCN compactly represents multiple confidence weighted recognition hypotheses. Thus it provides scope for improving the clustering accuracy as a result of the likely presence of the correct transcription in the alternative hypotheses for those cases where l-best transcripts are erroneous. On the other hand, several of the remaining hypotheses are incorrect and hence could pose a challenge during the clustering. In our approach, we extract TF-IDF vectors from the WCNs to perform clustering using K-Means algorithm. The components of TF-IDF vectors are further weighted with the word posterior probabilities. This is to potentially down-weight those vector components that are contributed by the incorrect hypotheses of low posterior probabilities. The experimental results obtained using switchboard data illustrate the usefulness of rich information in the WCN for clustering, showing upto 4% absolute improvement in normalized mutual information metric.

机译：在本文中，我们提出了一种基于混淆网络（WCN）的方法来执行口头文档的聚类，并分析其处理语音识别错误影响的能力。 WCN紧凑地表示多个置信度加权识别假设。因此，由于L-BEST转录物错误的这种情况，因此提供了改善聚类精度的范围，以改善替代假设中的正确转录。另一方面，一些剩余的假设是不正确的，因此可能在聚类期间构成挑战。在我们的方法中，我们从WCN中提取TF-IDF向量，以使用K-means算法执行群集。 TF-IDF向量的组件与单词后验概率进一步加权。这是潜在的削减那些由低后验概率的错误假设贡献的传染媒介成分。使用交换机数据获得的实验结果说明了用于聚类的WCN中丰富的信息的有用性，显示标准化的相互信息度量的绝对改善高达4％。

著录项

来源
《INTERSPEECH 2012》|2012年||共4页
会议地点
作者
Shajith Ikbal; Sachindra Joshi; Ashish Verma; Om D Deshmukh;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 73.4136083;
关键词
spoken document clustering; word confusion network; posterior weighted TF-IDF vector; k-means clustering;

机译：口语文件集群;词混淆网络;后加权TF-IDF矢量;k均值聚类;

相似文献

外文文献
中文文献
专利

1. Beyond ASR 1-best: Using word confusion networks in spoken language understanding [J] . Dilek Hakkani-Tur, Frederic Bechet, Giuseppe Riccardi, Computer speech and language . 2006,第4期

机译：超越ASR 1-最佳：在单词理解中使用单词混淆网络
2. Spoken Document Retrieval Based on Confusion Network with Syllable Fragments [J] . Zhang Lei, Yoshihiko Gotoh, Muhammad Usman Ghani Khan International Journal of Advanced Robotic Systems . 2017,第5期

机译：基于音节片段的混淆网络的语音文档检索
3. Spoken Document Retrieval Based on Confusion Network with Syllable Fragments [J] . Lei Zhang, Gotoh Yoshihiko, Khan Muhammad Usman Ghani International Journal of Advanced Robotic Systems . 2012,第期

机译：用音节碎片的混淆网络进行语音文档检索
4. Spoken Document Clustering Using Word Confusion Networks [C] . Shajith Ikbal, Sachindra Joshi, Ashish Verma, Annual conference of the International Speech Communication Association . 2012

机译：使用单词混淆网络的语音文档聚类
5. Spoken word recognition and serial recall of words from the giant component and words from lexical islands in the phonological network. [D] . Siew, Cynthia S. Q. 2014

机译：语音网络中来自巨型成分的单词的口语单词识别和连续回想以及来自词汇岛的单词。
6. Simulating Retrieval from a Highly Clustered Network: Implications for Spoken Word Recognition [O] . Michael S. Vitevitch, Gunes Ercal, Bhargav Adagarla 2011

机译：模拟从高度群集的网络中检索：语音识别的含义
7. Analytical comparison between position specific posterior lattices and confusion networks based on words and subword units for spoken document indexing [O] . Yi-cheng Pan, Hung-lin Chang, Lin-shan Lee 2007

机译：基于单词和子单元的语音文档索引的位置特定后格与混淆网络的分析比较

Spoken Document Clustering Using Word Confusion Networks

摘要

著录项

相似文献

相关主题

期刊订阅