首页> 外文会议>International Conference on intelligent science and big data engineering >A Convolutional Deep Neural Network for Coreference Resolution via Modeling Hierarchical Features
【24h】

A Convolutional Deep Neural Network for Coreference Resolution via Modeling Hierarchical Features

机译:通过建模分层特征实现共指解析的卷积深度神经网络

获取原文

摘要

Coreference resolution is a major task of natural language processing (NLP) identifying which noun phrases (or mentions) refer to the same real-world entity or concept. The state-of-the-art methods applied to coreference resolution are mainly based on statistical machine learning, and their performance strongly depends on the quality of the extracted features. The extracted features are usually shallow features by artificial selection, which leads to the loss of unknown useful deep semantic information and becomes an obstacle for improving system performance. We explored a convolutional deep neural network (CDNN) to extract discourse level features automatically. Our method utilized all of the word tokens as input without complicated pre-processing. To begin with, the word tokens were transformed to vectors by looking up word embeddings. Secondly, mention-pair level features were extracted according to the given mentions. In the meanwhile, distance features were computed easily. Moreover, discourse level features were learned using a convolutional approach. Finally, these features were fed into a softmax classifier to predict the equivalence between two marked mentions. The experimental results demonstrate that our approach obtains a competitive score of average Fl over MUC, B3, and CEAF, which places it above the mean score of other systems on the dataset of CoNLL-2012 Shared Task.
机译:共指解析是自然语言处理(NLP)的一项主要任务,它确定哪些名词短语(或提及)指代相同的现实世界实体或概念。应用于共指分解的最新方法主要基于统计机器学习,其性能在很大程度上取决于提取特征的质量。提取的特征通常是通过人工选择而形成的浅层特征,这导致丢失了未知的有用的深层语义信息,并成为提高系统性能的障碍。我们探索了卷积深度神经网络(CDNN)以自动提取话语级别特征。我们的方法利用所有单词标记作为输入,而无需进行复杂的预处理。首先,通过查找单词嵌入将单词标记转换为向量。其次,根据给定的提要提取提要对级别特征。同时,距离特征易于计算。此外,使用卷积方法学习了话语级别的功能。最后,将这些功能输入softmax分类器,以预测两个标记提及之间的等效性。实验结果表明,我们的方法获得了超过MUC,B3和CEAF的平均Fl竞争评分,这使其高于CoNLL-2012共享任务数据集上其他系统的平均评分。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号