Venue: Annual Meeting of the Association for Computational Linguistics

Labeling Documents with Timestamps: Learning from their Time Expressions


Abstract

Temporal reasoners for document understanding typically assume that a document's creation date is known. Algorithms to ground relative time expressions and order events often rely on this timestamp to assist the learner. Unfortunately, the timestamp is not always known, particularly on the Web. This paper addresses the task of automatic document timestamping, presenting two new models that incorporate rich linguistic features about time. The first is a discriminative classifier with new features extracted from the text's time expressions (e.g., 'since 1999'). This model alone improves on previous generative models by 77%. The second model learns probabilistic constraints between time expressions and the unknown document time. Imposing these learned constraints on the discriminative model further improves its accuracy. Finally, we present a new experiment design that facilitates easier comparison by future work.
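The two ideas in the abstract, features drawn from time expressions and constraints between those expressions and the unknown document time, can be illustrated with a toy sketch. This is not the paper's implementation: the regular expression, the feature names, the `rescore` function, and the hand-set `penalty` are all assumptions made for illustration, and the paper learns its constraints probabilistically rather than fixing them by hand.

```python
import re
from collections import Counter

# Hypothetical pattern: a four-digit year, optionally preceded by a preposition.
YEAR_RE = re.compile(r"\b(?:(since|in|until|by)\s+)?((?:19|20)\d{2})\b", re.IGNORECASE)


def time_expression_features(text):
    """Count illustrative features such as 'prep=since|year=1999'."""
    feats = Counter()
    for m in YEAR_RE.finditer(text):
        prep = (m.group(1) or "none").lower()
        year = int(m.group(2))
        feats[f"year={year}"] += 1
        feats[f"prep={prep}|year={year}"] += 1
    return feats


def mentioned_years(text):
    """Return every year matched by YEAR_RE, in order of appearance."""
    return [int(m.group(2)) for m in YEAR_RE.finditer(text)]


def rescore(classifier_scores, text, penalty=0.1):
    """Down-weight candidate document years that precede a mentioned year.

    classifier_scores maps candidate year -> score from some classifier.
    The penalty is a hand-set stand-in for a learned constraint; it is soft
    because documents can legitimately mention future years.
    """
    years = mentioned_years(text)
    rescored = {}
    for cand, score in classifier_scores.items():
        violations = sum(1 for y in years if y > cand)
        rescored[cand] = score * (penalty ** violations)
    total = sum(rescored.values())
    if total == 0:
        return dict(classifier_scores)
    return {c: s / total for c, s in rescored.items()}


text = "He has worked there since 1999 and retired in 2004."
features = time_expression_features(text)
scores = rescore({1998: 0.5, 2005: 0.5}, text)
```

In this example the candidate year 1998 precedes both mentioned years, so it is penalized twice and 2005 ends up with the higher normalized score.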
