Large scale similarity-based relation expansion

机译：基于大规模的相似关系扩展

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Recent advances in automatic knowledge acquisition methods make it possible to construct massive knowledge bases of semantic relations, containing information potentially unknown to their users. However for certain data mining tasks like finding potential causes of a disease or side-effects of a drug, where missing a small piece of information can have grave consequences, the coverage of automatically acquired knowledge bases is often insufficient. This paper explores the use of automatic hypothesis generation for expanding a knowledge base of semantic relations, using distributional word similarities obtained from a large Web corpus. If successful, such a method can drastically improve the coverage of automatically acquired semantic relations, at the expense of a slight reduction in accuracy. We show that large scale similarity-based relation expansion works quite well for this purpose. Using a 100 million Japanese Web page corpus as input, we could generate a substantial amount of new semantic relations that were not found in the input corpus but whose validity was confirmed in a much larger Web corpus, i.e., by using a commercial Web search engine.

机译：最近的自动知识获取方法的进步使得可以构建大规模知识库的语义关系，其中包含其用户可能未知的信息。然而，对于像发现疾病或药物，其中缺一小块的信息可以有严重后果的副作用的潜在原因，某些数据挖掘任务，自动获得的知识基础的覆盖面往往是不够的。本文探讨了使用自动假设生成来扩展语义关系的知识库，使用从大型Web语料库获得的分布词相似之处。如果成功，这种方法可以大大提高自动获得的语义关系的覆盖范围，以少于准确性降低。我们表明，基于大规模的相似性的关系扩展非常适用于此目的。使用100万日语网页语料库作为输入，我们可以生成大量的新语义关系，这些语义未在输入语料库中找到，但其有效性在更大的Web语料库中确认，即，通过使用商业网络搜索引擎。

著录项

来源
《International University Communication Symposium》|2010年||共8页
会议地点
作者
{missing};
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TN91-53;
关键词

相似文献

外文文献
中文文献
专利

1. A shape similarity-based ranking method of hesitant fuzzy linguistic preference relations using discrete fuzzy number for group decision making [J] . Zhao Meng, Liu Meng-Ying, Su Jia, Soft computing: A fusion of foundations, methodologies and applications . 2019,第24期

机译：基于若干模糊语言偏好关系的基于形状相似性的排名方法，采用分立模糊数进行组决策
2. SIMILARITY-BASED RELATIONS IN DATALOG PROGRAMS [J] . MELITA HAJDINJAK, ANDREJ BAUER International Journal of Uncertainty, Fuzziness, and Knowledge-based Systems . 2012,第5期

机译：数据记录程序中基于相似关系
3. A Similarity-Based Approach for Audiovisual Document Classification Using Temporal Relation Analysis [J] . Zein Al Abidin Ibrahim, Isabelle Ferrane, Philippe Joly EURASIP journal on image and video processing . 2011,第1期

机译：基于时间关系分析的基于相似度的视听文档分类方法
4. Large scale similarity-based relation expansion [C] . Proceedings of 2010 4th International Universal Communication Symposium . 2010

机译：基于大规模相似度的关系扩展
5. Russia's relations with the CIS states in the context of NATO expansion, 1991--1998: A complex relations approach [D] . Yeremian, T. Rosemary. 1999

机译：1991--1998年北约扩张中俄罗斯与独联体国家的关系：复杂的关系方法
6. Similarity-based modeling in large-scale prediction of drug-drug interactions [O] . Santiago Vilar, Eugenio Uriarte, Lourdes Santana, -1

机译：大规模预测药物相互作用的基于相似性的建模
7. Term Similarity-Based Query Expansion for Cross-Language Information Retrieval [O] . Mirna Adriani, C. J. Van Rijsbergen 1999

机译：基于术语相似度的跨语言信息检索查询扩展

Large scale similarity-based relation expansion

摘要

著录项

相似文献

相关主题

期刊订阅