首页> 外文会议>IEEE International Congress on Big Data >Reducing Feature Embedding Data for Discovering Relations in Big Text Data
【24h】

Reducing Feature Embedding Data for Discovering Relations in Big Text Data

机译:减少特征嵌入数据以发现大文本数据中的关系

获取原文

摘要

Relation extraction is a critical task in building a knowledge base from unstructured text documents. Most works in automatic relation extraction have applied deep learning techniques such as Convolutional Neural Network (CNN) and Long Short-Term Memory (LSTM) in large text corpora. However, they require a large amount of human labelling data, which is labour intensive and is hardly applied in a new domain of document without human supervision. This paper proposes a novel framework to extract relations in multi-domain texts effectively. In particular, we construct the framework in three phases including preprocessing, feature embedding and relation extraction. We show that a small proportion of training data is sufficient to train our relation extraction framework and achieve a good accuracy in relation extraction works.
机译:关系提取是从非结构化文本文件构建知识库的关键任务。大多数在自动关系提取中的作品应用了大型文本语料库中的深度学习技术,如卷积神经网络(CNN)和长短期内存(LSTM)。然而,它们需要大量的人类标签数据,这是劳动密集型的,并且几乎没有应用于没有人类监督的新文件领域。本文提出了一种新颖的框架,有效地提取多域文本中的关系。特别是,我们在三个阶段构建框架,包括预处理,特征嵌入和关系提取。我们表明,小比例的培训数据足以培训我们的关系提取框架,并在关系提取工程中实现良好的准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号