首页> 外文会议>International DELOS Conference on Digital Libraries: Research and Development >Searching for Ground Truth: A Stepping Stone in Automating Genre Classification
【24h】

Searching for Ground Truth: A Stepping Stone in Automating Genre Classification

机译:寻找地面真相:踏脚石在自动化类型分类中

获取原文

摘要

This paper examines genre classification of documents and its role in enabling the effective automated management of digital documents by digital libraries and other repositories. We have previously presented genre classification as a valuable step toward achieving automated extraction of descriptive metadata for digital material. Here, we present results from experiments using human labellers, conducted to assist in genre characterisation and the prediction of obstacles which need to be overcome by an automated system, and to contribute to the process of creating a solid testbed corpus for extending automated genre classification and testing metadata extraction tools across genres. We also describe the performance of two classifiers based on image and stylistic modeling features in labelling the data resulting from the agreement of three human labellers across fifteen genre classes.
机译:本文审查了文档的流派分类及其作用,使数字图书馆和其他存储库能够有效自动化数字文件。我们之前将流派分类呈现为实现数字材料的描述性元数据的自动提取的有价值的步骤。在这里,我们使用人类贴标程序的实验存在结果,以协助自动系统需要克服的类型表征和预测需要克服的障碍物,并有助于创建用于扩展自动化类型分类的固体测试用睾丸标记的过程测试跨流域的元数据提取工具。我们还基于图像和风格建模特征描述了两个分类器的性能,在标记了三个人兰布斯跨越十五种类型课程中产生的数据。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号