首页> 外文会议>International Conference on speech and computer >Automatic Stop List Generation for Clustering Recognition Results of Call Center Recordings
【24h】

Automatic Stop List Generation for Clustering Recognition Results of Call Center Recordings

机译:自动停止列表生成,用于呼叫中心记录的聚类识别结果

获取原文

摘要

The paper deals with the problem of automatic stop list generation for processing recognition results of call center recordings, in particular for the purpose of clustering. We propose and test a supervised domain dependent method of automatic stop list generation. The method is based on finding words whose removal increases the dissimilarity between documents in different clusters, and decreases dissimilarity between documents within the same cluster. This approach is shown to be efficient for clustering recognition results of recordings with different quality, both on datasets that contain the same topics as the training dataset, and on datasets containing other topics.
机译:本文讨论了用于处理呼叫中心记录的识别结果的自动停止列表生成的问题,特别是出于聚类目的。我们提出并测试了自动停止列表生成的受监督域相关方法。该方法基于发现单词的去除会增加不同聚类中文档之间的相似度,并减少同一聚类中文档之间的相异性。在包含与训练数据集相同的主题的数据集以及包含其他主题的数据集上,这种方法对于将具有不同质量的记录的识别结果进行聚类是有效的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号