首页> 外文会议>19th international world wide web conference 2010 >Identifying Featured Articles in Wikipedia Writing Style Matters
【24h】

Identifying Featured Articles in Wikipedia Writing Style Matters

机译:在Wikipedia写作风格问题中识别特色文章

获取原文

摘要

Wikipedia provides an information quality assessment model with criteria for human peer reviewers to identify featured articles. For this classification task "Is an article featured or not?" we present a machine learning approach that exploits an article's character tri-gram distribution. Our approach differs from existing research in that it aims to writing style rather than evaluating meta features like the edit history. The approach is robust, straightforward to implement, and outperforms existing solutions. We underpin these claims by an experiment design where, among others, the domain transferability is analyzed. The achieved performances in terms of the F-measure for featured articles are 0.964 within a single Wikipedia domain and 0.880 in a domain transfer situation.
机译:Wikipedia提供了一种信息质量评估模型,该模型具有供人类同行审阅者识别特色文章的标准。对于此分类任务“文章是否有特色?”我们提出了一种利用文章的字符三元组分布的机器学习方法。我们的方法与现有研究的不同之处在于,该方法旨在编写样式,而不是评估诸如编辑历史记录之类的元功能。该方法功能强大,易于实现,并且性能优于现有解决方案。我们通过实验设计来支持这些主张,在其中分析了域的可移植性。就特色文章的F度量而言,已实现的性能在单个Wikipedia域中为0.964,而在域转移情况下为0.880。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号