
METHODS AND SYSTEMS FOR EXTRACTING KEYPHRASES FROM NATURAL TEXT FOR SEARCH ENGINE INDEXING


Abstract

The present invention is a method and system for extracting keyphrases from natural text. For the purposes of this document, keyphrases are text segments that represent the main topic of a text. The method may facilitate keyphrase extraction from text of any length, for example a sentence, a paragraph, a document, or a collection of documents. Phrase separator methods may be applied to the text to extract phrases from it. From these phrases, the invention may identify the one or more phrases that are integral to the meaning of the text and designate them as the keyphrases of the text. The text may then be indexed using the keyphrases, so that a search based on any of the keyphrases will cause search engines and/or text retrieval means to retrieve the text.
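
The pipeline the abstract describes (separate the text into phrases, pick the phrases most central to it, then index the text under them) can be illustrated with a minimal Python sketch. The abstract does not say which separators or which ranking criterion the invention uses, so everything specific here is an assumption made for illustration: the `STOPWORDS` set, the rule of splitting at punctuation and stopwords, the frequency-based ranking, and the function names `split_phrases`, `extract_keyphrases`, and `build_index` are all hypothetical.

```python
import re
from collections import Counter, defaultdict

# Assumed separator set: punctuation plus a tiny stopword list.
# The patent abstract does not specify its phrase separators.
STOPWORDS = {"a", "an", "the", "of", "and", "or", "is", "are", "in",
             "on", "for", "to", "from", "with", "by", "be", "may"}


def split_phrases(text):
    """Split text into candidate phrases at punctuation and stopwords."""
    phrases = []
    # Punctuation acts as a hard phrase boundary.
    for chunk in re.split(r"[^\w\s]+", text.lower()):
        current = []
        for word in chunk.split():
            if word in STOPWORDS:
                # A stopword closes the phrase collected so far.
                if current:
                    phrases.append(" ".join(current))
                current = []
            else:
                current.append(word)
        if current:
            phrases.append(" ".join(current))
    return phrases


def extract_keyphrases(text, top_k=3):
    """Rank candidate phrases; this scoring is an assumed stand-in.

    Order: higher frequency first, then more words, then alphabetical.
    """
    counts = Counter(split_phrases(text))
    ranked = sorted(counts, key=lambda p: (-counts[p], -len(p.split()), p))
    return ranked[:top_k]


def build_index(docs, top_k=3):
    """Map each keyphrase to the set of document ids it was extracted from."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for phrase in extract_keyphrases(text, top_k):
            index[phrase].add(doc_id)
    return index


docs = {
    "d1": "Search engine indexing is hard. "
          "Keyphrases are useful for search engine indexing.",
    "d2": "The cat sat on the mat.",
}
index = build_index(docs, top_k=1)
print(sorted(index["search engine indexing"]))
```

Looking up a keyphrase in `index` returns the ids of the texts indexed under it, which mirrors the retrieval behaviour the abstract describes. A real system would likely replace the frequency ranking with a criterion that better captures how integral a phrase is to the meaning of the text.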
