Spoken Document Clustering Using Word Confusion Networks

机译：使用单词混淆网络的语音文档聚类

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, we propose a word contusion network (WCN) based approach to perform clustering of the spoken documents and analyze its ability to handle the influence of speech recognition errors. WCN compactly represents multiple confidence weighted recognition hypotheses. Thus it provides scope for improving the clustering accuracy as a result of the likely presence of the correct transcription in the alternative hypotheses for those cases where 1-best transcripts are erroneous. On the other hand, several of the remaining hypotheses are incorrect and hence could pose a challenge during the clustering. In our approach, we extract TF-IDF vectors from the WCNs to perform clustering using K-Means algorithm. The components of TF-IDF vectors are further weighted with the word posterior probabilities. This is to potentially down-weight those vector components that are contributed by the incorrect hypotheses of low posterior probabilities. The experimental results obtained using switchboard data illustrate the usefulness of rich information in the WCN for clustering, showing upto 4% absolute improvement in normalized mutual information metric.

机译：在本文中，我们提出了一种基于词挫伤网络（WCN）的方法来对语音文档进行聚类，并分析其处理语音识别错误影响的能力。 WCN紧凑地表示多个置信度加权识别假设。因此，它为在最好的1个转录本错误的情况下的替代假设中可能存在正确的转录提供了提高聚类准确性的范围。另一方面，剩余的一些假设是不正确的，因此在聚类期间可能构成挑战。在我们的方法中，我们从WCN中提取TF-IDF向量，以使用K-Means算法进行聚类。 TF-IDF向量的分量进一步用单词后验概率加权。这是为了潜在地权衡由低后验概率的不正确假设引起的那些向量分量。使用总机数据获得的实验结果说明了WCN中的丰富信息对于聚类的有用性，显示出归一化互信息度量中的绝对值提高了4％。

著录项

来源
《Annual conference of the International Speech Communication Association》|2012年|1378-1381|共4页
会议地点
作者
Shajith Ikbal; Sachindra Joshi; Ashish Verma; Om D Deshmukh;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
spoken document clustering; word confusion network; posterior weighted TF-IDF vector; k-means clustering;

机译：语音文件聚类;词混淆网络;后加权TF-IDF向量; k均值聚类;

相似文献

外文文献
中文文献
专利

1. Beyond ASR 1-best: Using word confusion networks in spoken language understanding [J] . Dilek Hakkani-Tur, Frederic Bechet, Giuseppe Riccardi, Computer speech and language . 2006,第4期

机译：超越ASR 1-最佳：在单词理解中使用单词混淆网络
2. Spoken Document Retrieval Based on Confusion Network with Syllable Fragments [J] . Zhang Lei, Yoshihiko Gotoh, Muhammad Usman Ghani Khan International Journal of Advanced Robotic Systems . 2017,第5期

机译：基于音节片段的混淆网络的语音文档检索
3. Spoken Document Retrieval Based on Confusion Network with Syllable Fragments [J] . Lei Zhang, Gotoh Yoshihiko, Khan Muhammad Usman Ghani International Journal of Advanced Robotic Systems . 2012,第期

机译：用音节碎片的混淆网络进行语音文档检索
4. Spoken Document Clustering Using Word Confusion Networks [C] . Shajith Ikbal, Sachindra Joshi, Ashish Verma, INTERSPEECH 2012 . 2012

机译：使用Word混淆网络的口头文档聚类
5. Spoken word recognition and serial recall of words from the giant component and words from lexical islands in the phonological network. [D] . Siew, Cynthia S. Q. 2014

机译：语音网络中来自巨型成分的单词的口语单词识别和连续回想以及来自词汇岛的单词。
6. Simulating Retrieval from a Highly Clustered Network: Implications for Spoken Word Recognition [O] . Michael S. Vitevitch, Gunes Ercal, Bhargav Adagarla 2011

机译：模拟从高度群集的网络中检索：语音识别的含义
7. Analytical comparison between position specific posterior lattices and confusion networks based on words and subword units for spoken document indexing [O] . Yi-cheng Pan, Hung-lin Chang, Lin-shan Lee 2007

机译：基于单词和子单元的语音文档索引的位置特定后格与混淆网络的分析比较

Spoken Document Clustering Using Word Confusion Networks

摘要

著录项

相似文献

相关主题

期刊订阅