Large scale similarity-based relation expansion

机译：基于大规模相似度的关系扩展

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Recent advances in automatic knowledge acquisition methods make it possible to construct massive knowledge bases of semantic relations, containing information potentially unknown to their users. However for certain data mining tasks like finding potential causes of a disease or side-effects of a drug, where missing a small piece of information can have grave consequences, the coverage of automatically acquired knowledge bases is often insufficient. This paper explores the use of automatic hypothesis generation for expanding a knowledge base of semantic relations, using distributional word similarities obtained from a large Web corpus. If successful, such a method can drastically improve the coverage of automatically acquired semantic relations, at the expense of a slight reduction in accuracy. We show that large scale similarity-based relation expansion works quite well for this purpose. Using a 100 million Japanese Web page corpus as input, we could generate a substantial amount of new semantic relations that were not found in the input corpus but whose validity was confirmed in a much larger Web corpus, i.e., by using a commercial Web search engine.

机译：自动知识获取方法的最新进展使构建大量语义关系知识库成为可能，其中包含用户可能不知道的信息。但是，对于某些数据挖掘任务（例如查找疾病的潜在原因或药物的副作用），如果丢失一小部分信息可能会造成严重后果，则自动获取的知识库的覆盖范围通常不足。本文探讨了使用自动假设生成来扩展语义关系的知识库的方法，该方法利用了从大型Web语料库获得的分布词相似性。如果成功的话，这种方法可以大大提高自动获取的语义关系的覆盖范围，但会以略微降低准确性为代价。我们证明，基于大规模相似度的关系扩展可以很好地实现此目的。使用1亿个日语网页语料库作为输入，我们可以生成大量新的语义关系，这些语义关系在输入语料库中找不到，但其有效性已在更大的Web语料库中得到确认，即通过使用商业Web搜索引擎。

著录项

来源
《Proceedings of 2010 4th International Universal Communication Symposium》|2010年|p.141-148|共8页
会议地点
作者

展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类通信系统（传输系统）;
关键词

相似文献

外文文献
中文文献
专利

1. A shape similarity-based ranking method of hesitant fuzzy linguistic preference relations using discrete fuzzy number for group decision making [J] . Zhao Meng, Liu Meng-Ying, Su Jia, Soft computing: A fusion of foundations, methodologies and applications . 2019,第24期

机译：基于若干模糊语言偏好关系的基于形状相似性的排名方法，采用分立模糊数进行组决策
2. SIMILARITY-BASED RELATIONS IN DATALOG PROGRAMS [J] . MELITA HAJDINJAK, ANDREJ BAUER International Journal of Uncertainty, Fuzziness, and Knowledge-based Systems . 2012,第5期

机译：数据记录程序中基于相似关系
3. A Similarity-Based Approach for Audiovisual Document Classification Using Temporal Relation Analysis [J] . Zein Al Abidin Ibrahim, Isabelle Ferrane, Philippe Joly EURASIP journal on image and video processing . 2011,第1期

机译：基于时间关系分析的基于相似度的视听文档分类方法
4. Large scale similarity-based relation expansion [C] . {missing} International University Communication Symposium . 2010

机译：基于大规模的相似关系扩展
5. Russia's relations with the CIS states in the context of NATO expansion, 1991--1998: A complex relations approach [D] . Yeremian, T. Rosemary. 1999

机译：1991--1998年北约扩张中俄罗斯与独联体国家的关系：复杂的关系方法
6. Similarity-based modeling in large-scale prediction of drug-drug interactions [O] . Santiago Vilar, Eugenio Uriarte, Lourdes Santana, -1

机译：大规模预测药物相互作用的基于相似性的建模
7. Term Similarity-Based Query Expansion for Cross-Language Information Retrieval [O] . Mirna Adriani, C. J. Van Rijsbergen 1999

机译：基于术语相似度的跨语言信息检索查询扩展

Large scale similarity-based relation expansion

摘要

著录项

相似文献

相关主题

期刊订阅