METER: MEasuring TExt Reuse

Abstract

In this paper we present results from the METER (MEasuring TExt Reuse) project, whose aim is to explore issues pertaining to text reuse and derivation, especially in the context of newspapers using newswire sources. Although the reuse of text by journalists has been studied in linguistics, we are not aware of any investigation using existing computational methods for this particular task. We investigate the classification of newspaper articles according to their degree of dependence upon, or derivation from, a newswire source using a simple 3-level scheme designed by journalists. Three approaches to measuring text similarity are considered: n-gram overlap, Greedy String Tiling, and sentence alignment. Measured against a manually annotated corpus of source and derived news text, we show that a combined classifier with features automatically selected performs best overall for the ternary classification, achieving an average F1-measure score of 0.664 across all three categories.
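As a rough illustration of the first of the three similarity measures named in the abstract, the sketch below computes a word n-gram overlap score between a derived text and its candidate source. The abstract does not give the exact formula used in the paper, so the containment-style ratio, the whitespace tokenisation, and the names `word_ngrams` and `ngram_containment` are all assumptions made for this example.

```python
from typing import Set, Tuple


def word_ngrams(text: str, n: int) -> Set[Tuple[str, ...]]:
    """Return the set of distinct word n-grams in text (lowercased, whitespace-split)."""
    tokens = text.lower().split()
    # range() is empty when the text has fewer than n tokens, giving an empty set.
    return {tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)}


def ngram_containment(derived: str, source: str, n: int = 3) -> float:
    """Share of the derived text's distinct n-grams that also occur in the source.

    Assumed scoring scheme for illustration only, not necessarily the paper's.
    """
    derived_ngrams = word_ngrams(derived, n)
    if not derived_ngrams:
        return 0.0
    source_ngrams = word_ngrams(source, n)
    return len(derived_ngrams & source_ngrams) / len(derived_ngrams)


if __name__ == "__main__":
    # Invented example sentences, not taken from the METER corpus.
    source = "the minister said the plan would be reviewed next week"
    derived = "the minister said the plan would be scrapped"
    print(ngram_containment(derived, source, n=2))
```

A score near 1 would suggest that most of the newspaper text is drawn from the newswire copy, and a score near 0 that it was written independently. Turning such scores into the three-way classification described in the abstract would still require thresholds or a trained classifier.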


Bibliographic details

Source: 40th Annual Meeting of the Association for Computational Linguistics, Jul 7-12, 2002, Philadelphia, Pennsylvania, USA | 2002 | pp. 152-159 | 8 pages
Conference location: Philadelphia, Pennsylvania, USA
Authors: Paul Clough; Robert Gaizauskas; Scott S.L. Piao; Yorick Wilks
Author affiliation: Department of Computer Science, University of Sheffield, Regent Court, 211 Portobello Street, Sheffield, England, S1 4DP
Original format: PDF
Language: English
Chinese Library Classification: Automation technology, computer technology

