首页> 外文会议>Annual meeting of the Association for Computational Linguistics >Predicting the relevance of distributional semantic similarity with contextual information
【24h】

Predicting the relevance of distributional semantic similarity with contextual information

机译:预测分布语义相似性与上下文信息的相关性

获取原文

摘要

Using distributional analysis methods to compute semantic proximity links between words has become commonplace in NLP. The resulting relations are often noisy or difficult to interpret in general. This paper focuses on the issues of evaluating a distributional resource and filtering the relations it contains, but instead of considering it in abstracto, we focus on pairs of words in context. In a discourse, we are interested in knowing if the semantic link between two items is a byproduct of textual coherence or is irrelevant. We first set up a human annotation of semantic links with or without contextual information to show the importance of the textual context in evaluating the relevance of semantic similarity, and to assess the prevalence of actual semantic relations between word tokens. We then built an experiment to automatically predict this relevance, evaluated on the reliable reference data set which was the outcome of the first annotation. We show that in-document information greatly improve the prediction made by the similarity level alone.
机译:在NLP中,使用分布分析方法来计算单词之间的语义接近度链接已变得司空见惯。所产生的关系通常是嘈杂的,或者通常难以解释。本文着重于评估分布资源并过滤包含的关系的问题,但是,我们不着眼于抽象性,而是关注上下文中的单词对。在话语中,我们有兴趣知道两个项目之间的语义联系是文本连贯性的副产品还是无关紧要的。我们首先建立一个带有或不带有上下文信息的语义链接的人工注释,以显示文本上下文在评估语义相似性的相关性中的重要性,并评估单词标记之间实际语义关系的普遍性。然后,我们建立了一个实验来自动预测这种相关性,并根据可靠的参考数据集进行了评估,这是第一个注释的结果。我们表明,文档内信息极大地提高了仅由相似度水平得出的预测。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号