首页> 外文会议>IEEE Asia-Pacific Conference on Computer Science and Data Engineering >Reducing Wrong Labels using Conflict Score in Distant Supervision for Relation Extraction in Bangla Language
【24h】

Reducing Wrong Labels using Conflict Score in Distant Supervision for Relation Extraction in Bangla Language

机译:在邦加拉语言中遥远监督中使用冲突评分减少错误标签

获取原文

摘要

The research area of information extraction (IE) aims to extract structured information such as types of entities and relations between them, from unstructured textual data like newswires, blogs, governmental documents etc. Relation extraction (RE) deals with the automatic detection of relationships between concepts mentioned in free texts. Knowledge-based distant supervision (DS) uses structured data to heuristically label a training corpus. However, this heuristic can generate some noisy labeled data. In this paper, we propose a method using conflict score in DS to reduce the number of wrong labels for Bangla sentences.
机译:信息提取的研究领域(即)旨在提取结构化信息,例如他们之间的实体类型和关系,从非结构化的文本数据,如Newswires,博客,政府文件等关系提取(重新)处理自动检测之间的关系 在自由文本中提到的概念。 基于知识的远程监督(DS)使用结构化数据来启发式标记培训语料库。 但是,这种启发式可以生成一些嘈杂的标记数据。 在本文中,我们提出了一种在DS中使用冲突得分的方法,以减少孟加拉句的错误标签数量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号