
Comparing Evaluation Metrics for Sentence Boundary Detection


Abstract

In recent NIST evaluations on sentence boundary detection, a single error metric was used to describe performance. Additional metrics, however, are available for such tasks, in which a word stream is partitioned into subunits. This paper compares alternative evaluation metrics, including the NIST error rate, the classification error rate per word boundary, precision and recall, ROC curves, DET curves, precision-recall curves, and the area under these curves, and discusses the advantages and disadvantages of each. Unlike many studies in machine learning, we use real data for a real task. We find benefit in using curves in addition to a single metric. Furthermore, we find that data skew has an impact on the metrics, and that differences among system outputs are more visible in precision-recall curves. These results should help us better understand evaluation metrics and should generalize to similar language processing tasks.
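As a rough illustration of how the metrics named in the abstract relate to one another, the Python sketch below computes a NIST-style error rate, a per-boundary classification error rate, precision and recall at a fixed decision threshold, and threshold-free ROC and precision-recall curves with their areas. The boundary labels, posterior scores, and threshold are hypothetical, and the NIST error rate here follows the commonly used definition (insertions plus deletions divided by the number of reference sentence boundaries); the paper itself is not the source of this code.

import numpy as np
from sklearn.metrics import precision_recall_curve, roc_curve, auc

# Hypothetical data: one binary label per inter-word boundary
# (1 = sentence boundary, 0 = no boundary) and a system posterior score.
ref    = np.array([0, 1, 0, 0, 1, 0, 0, 0, 1, 0])
scores = np.array([0.1, 0.8, 0.3, 0.2, 0.4, 0.1, 0.6, 0.2, 0.9, 0.1])
hyp    = (scores >= 0.5).astype(int)   # hard decisions at an assumed threshold

# NIST-style error rate: (insertions + deletions) / number of reference boundaries.
insertions = int(np.sum((hyp == 1) & (ref == 0)))
deletions  = int(np.sum((hyp == 0) & (ref == 1)))
nist_error = (insertions + deletions) / ref.sum()

# Classification error rate per word boundary: misclassified boundaries / all boundaries.
cer = float(np.mean(hyp != ref))

# Precision and recall on the sentence-boundary class.
tp = int(np.sum((hyp == 1) & (ref == 1)))
precision = tp / max(int(hyp.sum()), 1)
recall = tp / ref.sum()

# Threshold-free views: ROC and precision-recall curves, with area under each.
fpr, tpr, _ = roc_curve(ref, scores)
p, r, _ = precision_recall_curve(ref, scores)
roc_auc = auc(fpr, tpr)
pr_auc = auc(r, p)

print(f"NIST error rate: {nist_error:.3f}")
print(f"Per-boundary classification error rate: {cer:.3f}")
print(f"Precision: {precision:.3f}  Recall: {recall:.3f}")
print(f"ROC AUC: {roc_auc:.3f}  PR AUC: {pr_auc:.3f}")

The single-number metrics above depend on the chosen threshold and on class skew (true sentence boundaries are far rarer than non-boundaries), whereas the ROC and precision-recall curves characterize a system across all thresholds, which is consistent with the abstract's observation that curves add value beyond a single metric and that differences between systems show up most clearly in precision-recall curves.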
