Encoding Distributional Semantics into Triple-Based Knowledge Ranking for Document Enrichment

机译：将分布语义编码为基于三重知识的知识排名，以丰富文档

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Document enrichment focuses on retrieving relevant knowledge from external resources, which is essential because text is generally replete with gaps. Since conventional work primarily relies on special resources, we instead use triples of Subject, Predicate, Object as knowledge and incorporate distributional semantics to rank them. Our model first extracts these triples automatically from raw text and converts them into real-valued vectors based on the word semantics captured by Latent Dirich-let Allocation. We then represent these triples, together with the source document that is to be enriched, as a graph of triples, and adopt a global iterative algorithm to propagate relevance weight from source document to these triples so as to select the most relevant ones. Evaluated as a ranking problem, our model significantly outperforms multiple strong baselines. Moreover, we conduct a task-based evaluation by incorporating these triples as additional features into document classification and enhances the performance by 3.02%.

机译：丰富的文档集中于从外部资源中获取相关知识，这是必不可少的，因为文本通常充满空白。由于常规工作主要依赖于特殊资源，因此我们改用主语，谓语，宾语三元组作为知识，并结合分布语义对它们进行排名。我们的模型首先从原始文本中自动提取这些三元组，然后根据Latent Dirich-let Allocation捕获的单词语义将它们转换为实值向量。然后，我们将这些三元组与要丰富的源文档一起表示为三元组图，并采用全局迭代算法将相关权重从源文档传播到这些三元组，以便选择最相关的三元组。作为排名问题进行评估，我们的模型明显优于多个强大的基准。此外，我们通过将这些三元组作为附加功能纳入文档分类来进行基于任务的评估，并将性能提高3.02％。

著录项

来源
《Annual meeting of the Association for Computational Linguistics;International joint conference on natural language processing of the Asian Federation of Natural Languages processing》|2015年|524-533|共10页
会议地点
作者
Muyu Zhang; Bing Qin; Mao Zheng; Graeme Hirst; Ting Liu;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Versioned linking of semantic enrichment of legal documents: Emerald:an implementation of knowledge-based services in a semantic web approach [J] . Akos Szoke, Andras Foerhecz, Gabor Korosi, Artificial Intelligence and Law . 2013,第4期

机译：法律文档的语义丰富化的版本链接：Emerald：在语义Web方法中基于知识的服务的实现
2. Semantic enrichment in knowledge repositories: Annotating semantic relationships between discussion documents [J] . Wei CP, Cheng TH, Pai YC Journal of database management . 2006,第1期

机译：知识库中的语义丰富：注释讨论文档之间的语义关系
3. Ranking Documents Based on the Semantic Relations Using Analytical Hierarchy Process: Query Expansion and Ranking Process [J] . Ali I. El-Dsouky, Hesham A. Ali, Rabab Samy Rashed International journal of information retrieval research . 2017,第3期

机译：使用层次分析法基于语义关系对文档进行排名：查询扩展和排名过程
4. Encoding Distributional Semantics into Triple-Based Knowledge Ranking for Document Enrichment [C] . Muyu Zhang, Bing Qin, Mao Zheng, Annual meeting of the Association for Computational Linguistics . 2015

机译：将分布语义编码为文档富集的三重知识排名
5. Methods of Enriching Domain Knowledge with Universal Semantics for Higher Text Mining Performance [D] . Qazanfari, Kazem . 2020

机译：以普通语义丰富域知识的方法，以获得更高的文本挖掘性能
6. Easing semantically enriched information retrieval—An interactive semi-automatic annotation system for medical documents [O] . Theresia Gschwandtner, Katharina Kaiser, Patrick Martini, -1

机译：在语义上富集的信息检索 - 用于医疗文档的交互式半自动注释系统
7. Semantic retrieval and ranking of Semantic Web documents using free-form queries [O] . Vassilis Spiliopoulos, Konstantinos Kotis, George A. Vouros 2012

机译：使用自由格式查询的语义检索和语义Web文档的排名

Encoding Distributional Semantics into Triple-Based Knowledge Ranking for Document Enrichment

摘要

著录项

相似文献

相关主题

期刊订阅