Using Conditional Random Fields For Sentence Boundary Detection In Speech

机译：使用条件随机字段进行语音中句子边界检测

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Sentence boundary detection in speech is important for enriching speech recognition output, making it easier for humans to read and downstream modules to process. In previous work, we have developed hidden Markov model (HMM) and maximum entropy (Maxent) classifiers that integrate textual and prosodic knowledge sources for detecting sentence boundaries. In this paper, we evaluate the use of a conditional random field (CRF) for this task and relate results with this model to our prior work. We evaluate across two corpora (conversational telephone speech and broadcast news speech) on both human transcriptions and speech recognition output. In general, our CRF model yields a lower error rate than the HMM and Maxent models on the NIST sentence boundary detection task in speech, although it is interesting to note that the best results are achieved by three-way voting among the classifiers. This probably occurs because each model has different strengths and weaknesses for modeling the knowledge sources.

机译：语句中的句子边界检测对于丰富语音识别输出非常重要，使人类更容易读取和下游模块来处理。在以前的工作中，我们开发了隐藏的Markov模型（HMM）和最大熵（MaxEnt）分类器，可集成文本和韵律知识来源以检测句子边界。在本文中，我们评估了对此任务的条件随机字段（CRF）的使用，并将该模型与此模型相关联的结果。我们在人类转录和语音识别输出中评估了两种Corpora（会话电话语音和广播新闻语音）。通常，我们的CRF模型比在语音中的NIST句子边界检测任务上的HMM和MaxEnt模型产生较低的误差率，尽管有趣的是，请注意，通过分类器之间的三通投票来实现最佳结果。这可能发生，因为每个模型具有不同的优点和缺点，可以对知识来源进行建模。

著录项

来源
《Association for Computational Linguistics Annual Meeting》|2005年||共8页
会议地点
作者
Yang Liu; Andreas Stolcke; Elizabeth Shriberg; Mary Harper; Association for Computational Linguistics(ACL); ACL-05;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类计算机软件;
关键词

相似文献

外文文献
中文文献
专利

1. Sentence boundary detection without speech recognition: A case of an under-resourced language [J] . Nursuriati Jamil, Muhammad Izzad Ramli, Noraini Seman Journal of Electrical Systems . 2015,第5期

机译：没有语音识别的句子边界检测：资源不足的情况
2. Sentence boundary detection in conversational speech transcripts using noisily labeled examples [J] . Hironori Takeuchi, L. Venkata Subramaniam, Shourya Roy, International Journal on Document Analysis and Recognition . 2007,第3a4期

机译：带有语音标签的示例在会话语音记录中的句边界检测
3. A study in machine learning from imbalanced data for sentence boundary detection in speech [J] . Yang Liu, Nitesh V. Chawla, Mary P. Harper, Computer speech and language . 2006,第4期

机译：基于不平衡数据的机器学习用于语音句子边界检测的研究
4. Using Conditional Random Fields For Sentence Boundary Detection In Speech [C] . Yang Liu, Andreas Stolcke, Elizabeth Shriberg, ACL-05; Association for Computational Linguistics Annual Meeting; 20050625-30; Ann Arbor,MI(US) . 2005

机译：使用条件随机场进行语音中句子边界的检测
5. Model-based Single-microphone Speech Separation Using Conditional Random Fields. [D] . Yeung, Yu Ting. 2014

机译：使用条件随机场的基于模型的单麦克风语音分离。
6. Biomedical negation scope detection with conditional random fields [O] . Shashank Agarwal, Hong Yu 2010

机译：条件随机场的生物医学否定范围检测
7. Using conditional random fields for sentence boundary detection in speech [O] . Yang Liu, Andreas Stolcke, Elizabeth Shriberg, 2014

机译：使用条件随机场进行语音中的句子边界检测

Using Conditional Random Fields For Sentence Boundary Detection In Speech

摘要

著录项

相似文献

相关主题

期刊订阅