A Corpus-Based Study of Edit Categories in Featured and Non-Featured Wikipedia Articles


Abstract

In this paper, we present a study of the collaborative writing process in Wikipedia. Our work is based on a corpus of 1,995 edits obtained from 891 article revisions in the English Wikipedia. We propose a 21-category classification scheme for edits based on Faigley and Witte's (1981) model. Example edit categories include spelling error corrections and vandalism. In a manual multi-label annotation study with 3 annotators, we obtain an inter-annotator agreement of α = 0.67. We further analyze the distribution of edit categories for distinct stages in the revision history of 10 featured and 10 non-featured articles. Our results show that the information content in featured articles tends to become more stable after their promotion. In contrast, this does not hold for non-featured articles. We make the resulting corpus and the annotation guidelines freely available.


Bibliographic record

Source: International Conference on Computational Linguistics, 2012, 16 pages
Authors: Johannes Daxenberger; Iryna Gurevych
Original format: PDF
Chinese Library Classification: Programming; Software Engineering
Keywords: Wikipedia; Revision History; Collaborative Writing; Quality Assessment


