首页> 外国专利> SYSTEM AND METHOD OF ANNOTATING UTTERANCES BASED ON TAGS ASSIGNED BY UNMANAGED CROWDS

SYSTEM AND METHOD OF ANNOTATING UTTERANCES BASED ON TAGS ASSIGNED BY UNMANAGED CROWDS

机译:基于未经管理的人群分配的标签对话语进行注释的系统和方法

摘要

A system and method of tagging utterances with Named Entity Recognition ("NER") labels using unmanaged crowds is provided. The system may generate various annotation jobs in which a user, among a crowd, is asked to tag which parts of an utterance, if any, relate to various entities associated with a domain. For a given domain that is associated with a number of entities that exceeds a threshold N value, multiple batches of jobs (each batch having jobs that have a limited number of entities for tagging) may be used to tag a given utterance from that domain. This reduces the cognitive load imposed on a user, and prevents the user from having to tag more than N entities. As such, a domain with a large number of entities may be tagged efficiently by crowd participants without overloading each crowd participant with too many entities to tag.
机译:提供了一种使用非管理人群用命名实体识别(“ NER”)标签标记话语的系统和方法。该系统可以生成各种注释作业,在该注释作业中,要求人群中的用户标记话语的哪些部分(如果有的话)与与域相关联的各种实体有关。对于与超过阈值N值的实体数量相关联的给定域,可以使用多批作业(每批作业中具有用于标记的实体数量有限的作业)来标记来自该域的给定话语。这减少了施加给用户的认知负担,并防止了用户必须标记多于N个实体。这样,人群参与者可以有效地标记具有大量实体的域,而不会给每个人群参与者过载太多的实体以进行标记。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号