首页> 外文会议>Joint workshop on automatic knowledge base construction and web-scale 2012 >Relabeling Distantly Supervised Training Data for Temporal Knowledge Base Population
【24h】

Relabeling Distantly Supervised Training Data for Temporal Knowledge Base Population

机译:重新标记针对时间知识库的远程监督训练数据

获取原文
获取原文并翻译 | 示例

摘要

We enhance a temporal knowledge base population system to improve the quality of distantly supervised training data and identify a minimal feature set for classification. The approach uses multi-class logistic regression to eliminate individual features based on the strength of their association with a temporal label followed by semi-supervised relabeling using a subset of human annotations and lasso regression. As implemented in this work, our technique improves performance and results in notably less computational cost than a parallel system trained on the full feature set.
机译:我们增强了时间知识库系统,以改善远程监督训练数据的质量并确定用于分类的最小特征集。该方法使用多类逻辑回归以基于单个特征与时间标签的关联强度来消除单个特征,然后使用人类注释子集和套索回归对它们进行半监督重新标记。正如在这项工作中实现的那样,与在完整功能集上训练的并行系统相比,我们的技术可提高性能并显着降低计算成本。

著录项

  • 来源
  • 会议地点 Montreal(CA)
  • 作者

    Suzanne Tamang; Heng Ji;

  • 作者单位

    Computer Science Department and Linguistics Department Graduate Center and Queens College, City University of New York New York, NY 10016, USA;

    Computer Science Department and Linguistics Department Graduate Center and Queens College, City University of New York New York, NY 10016, USA;

  • 会议组织
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号