首页> 外文会议>International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management >Quality of Wikipedia Articles: Analyzing Features and Building a Ground Truth for Supervised Classification
【24h】

Quality of Wikipedia Articles: Analyzing Features and Building a Ground Truth for Supervised Classification

机译:维基百科文章的质量:分析特点,建立受监督分类的基础事实

获取原文

摘要

Wikipedia is nowadays one of the biggest online resources on which users rely as a source of information. The amount of collaboratively generated content that is sent to the online encyclopedia every day can let to the possible creation of low-quality articles (and, consequently, misinformation) if not properly monitored and revised. For this reason, in this paper, the problem of automatically assessing the quality of Wikipedia articles is considered. In particular, the focus is (i) on the analysis of groups of hand-crafted features that can be employed by supervised machine learning techniques to classify Wikipedia articles on qualitative bases, and (ii) on the analysis of some issues behind the construction of a suitable ground truth. Evaluations are performed, on the analyzed features and on a specifically built labeled dataset, by implementing different supervised classifiers based on distinct machine learning algorithms, which produced promising results.
机译:Wikipedia现在是用户依赖信息来源的最大在线资源之一。如果未正确监测和修订,每天发送到在线百科全书的协同生成内容的数量可以让您创造出低质量的文章(以及,因此,错误信息)。出于这个原因,考虑了自动评估维基百科文章的质量的问题。特别地,重点是(i)对可以通过监督机器学习技术采用的手工制作特征组的分析,以对定性基地进行分类的维基百科文章,并在分析建设后面的一些问题一个合适的理论。通过基于不同的机器学习算法实现不同的监督分类器,在分析的特征和专门建立标有数据集上进行评估。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号