【24h】

Do Automatic Annotation Techniques Have Any Impact on Supervised Complex Question Answering?

机译:自动注释技术对有监督的复杂问题解答有影响吗?

获取原文

摘要

In this paper, we analyze the impact of different automatic annotation methods on the performance of supervised approaches to the complex question answering problem (denned in the DUC-2007 main task). Huge amount of annotated or labeled data is a prerequisite for supervised training. The task of labeling can be accomplished either by humans or by computer programs. When humans are employed, the whole process becomes time consuming and expensive. So, in order to produce a large set of labeled data we prefer the automatic annotation strategy. We apply five different automatic annotation techniques to produce labeled data using ROUGE similarity measure, Basic Element (BE) overlap, syntactic similarity measure, semantic similarity measure, and Extended String Subsequence Kernel (ESSK). The representative supervised methods we use are Support Vector Machines (SVM), Conditional Random Fields (CRF), Hidden Markov Models (HMM), and Maximum Entropy (Max-Ent). Evaluation results are presented to show the impact.
机译:在本文中,我们分析了不同的自动注释方法对监督方法对复杂问题回答问题(在DUC-2007主要任务中有所说明)的性能的影响。大量带注释或标签的数据是监督训练的前提。标记的任务可以由人类或计算机程序来完成。当雇用人员时,整个过程变得既费时又昂贵。因此,为了产生大量的标记数据,我们更喜欢自动注释策略。我们使用ROUGE相似性度量,基本元素(BE)重叠,句法相似性度量,语义相似性度量和扩展字符串子序列内核(ESSK)应用五种不同的自动注释技术来生成标记数据。我们使用的代表性监督方法是支持向量机(SVM),条件随机字段(CRF),隐马尔可夫模型(HMM)和最大熵(Max-Ent)。提出评估结果以显示影响。

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号