首页> 外文会议> >Normalizing Chinese temporal expressions with multi-label classification
【24h】

Normalizing Chinese temporal expressions with multi-label classification

机译:使用多标签分类规范汉语时态表达

获取原文

摘要

Temporal expression contains crucial temporal information in texts. Understanding its temporal semantics is important in many NLP applications, such as information extraction, document summarization and question answering. Temporal expression normalization involves mapping from the different classes of expressions to the values of certain temporal attributes. A temporal expression may belong to one or more classes, but each class of expressions normally shares the same mapping procedure. In this paper, we explore the possibility of applying multi-label classification techniques in the context of temporal expression normalization. More specifically, two models, named independent binary classification model and compared binary classification model, are evaluated, compared and analyzed. Once the possible class(es) of a temporal expression is determined, the corresponding mapping rules are called to transform it into the corresponding attribute(s). Experiments on a substantiate data collection show that, based on the result of machine learning classification, the performance of temporal expression normalization is comparable with that of deliberate rule set.
机译:时间表达在文本中包含关键的时间信息。在许多NLP应用程序中,了解其时间语义很重要,例如信息提取,文档摘要和问题解答。时间表达规范化涉及从不同类型的表达映射到某些时间属性的值。时间表达式可以属于一个或多个类,但是每个表达式类通常共享相同的映射过程。在本文中,我们探讨了在时态表达规范化背景下应用多标签分类技术的可能性。更具体地,评估,比较和分析了两个模型,分别称为独立的二进制分类模型和比较的二进制分类模型。一旦确定了时间表达式的可能类别,就会调用相应的映射规则以将其转换为相应的属性。对大量数据进行的实验表明,基于机器学习分类的结果,时态表达式归一化的性能可与故意规则集相媲美。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号