首页> 外文会议>Australasian joint conference on artificial intelligence >To Extend or Not to Extend? Context-Specific Corpus Enrichment
【24h】

To Extend or Not to Extend? Context-Specific Corpus Enrichment

机译:延伸还是不延伸?特定语境的语料库充实

获取原文

摘要

An agent in pursuit of a task may work with a corpus of documents with linked subjective content descriptions. Faced with a new document, an agent has to decide whether to include that document in its corpus or not. Basing the decision on only words, topics, or entities, has shown to not lead to a balanced performance for varying documents. Therefore, this paper presents an approach for an agent to decide if a new document adds value to its existing corpus by combining texts and content descriptions. Furthermore, an agent can use the approach as a starting point for high quality content descriptions for new documents. A case study shows the effectiveness of our approach given varying types of new documents.
机译:从事任务的代理可以处理带有链接的主观内容描述的文档集。面对新文档,代理必须决定是否将该文档包含在其语料库中。结果表明,仅基于单词,主题或实体的决定不会导致不同文档的性能达到平衡。因此,本文提出了一种代理人通过组合文本和内容描述来确定新文档是否为其现有语料库增加价值的方法。此外,代理可以将该方法用作新文档的高质量内容描述的起点。案例研究表明,在不同类型的新文档中,我们的方法是有效的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号