首页> 外文期刊>Journal of software >High-speed Detection of Ontological Knowledge and Bi-directional Lexico-Syntactic Patterns from the Web
【24h】

High-speed Detection of Ontological Knowledge and Bi-directional Lexico-Syntactic Patterns from the Web

机译:从网络高速检测本体论知识和双向词汇句法模式

获取原文
           

摘要

We propose a high-speed method of detectingontological knowledge from the Web. Ontological knowledgein this paper means a term related to a given term. Forexample, hypernyms and hyponyms are basic related termsthat are treated in dictionaries. Synonyms and coordinateterms are also well-defined related terms. Topic terms anddescription terms represent topics of the given term and theyare vaguely defined. There are other related terms such asabbreviations and nicknames. The proposed method can beused for detecting many kinds of related terms. It extractsrelated terms from text resources only from Web searchresults, which consist of the titles, snippets, and URLs ofWeb pages. We use two different kinds of lexico-syntacticpatterns to extract related terms from the search results,and these are called bi-directional lexico-syntactic patterns.The proposed method can be applied to both languageswhere words are separated by a space such as English andKorean and ones where words are not separated by a spacesuch as Japanese and Chinese. The proposed method doesnot need any advanced natural language processing suchas morphological analysis or syntactic parsing. It worksrelatively fast and has excellent precision. We also propose amethod of automatically discovering superior bi-directionallexico-syntactic patterns using Web search engines becauseit is sometimes difficult to find appropriate patterns to detectrelated terms in a certain relationship.
机译:我们提出了一种从Web检测本体知识的高速方法。本文中的本体知识是指与给定术语相关的术语。例如,上位词和下位词是在字典中处理的基本相关术语。同义词和坐标项也是定义明确的相关术语。主题词和描述词代表给定术语的主题,并且定义模糊。还有其他相关术语,例如缩写和昵称。所提出的方法可用于检测多种相关项。它仅从Web搜索结果中的文本资源中提取相关术语,该结果由Web页面的标题,摘要和URL组成。我们使用两种不同的词汇-句法模式从搜索结果中提取相关术语,这些术语称为双向词汇-句法模式。该方法可以应用于单词之间用英语,韩语和单词之间用空格隔开的单词,例如日语和中文。所提出的方法不需要任何高级自然语言处理,例如形态分析或句法解析。它工作相对较快,并且精度极高。我们还提出了一种使用Web搜索引擎自动发现高级双向词汇语法模式的方法,因为有时很难找到合适的模式来检测某种关系中的相关术语。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号