首页> 外国专利> METHOD AND SYSTEM FOR EXTRACTING ASSOCIATED WORDS BY USING BIG DATA PROCESSING TECHNOLOGY

METHOD AND SYSTEM FOR EXTRACTING ASSOCIATED WORDS BY USING BIG DATA PROCESSING TECHNOLOGY

机译:大数据处理技术提取联想词的方法和系统

摘要

Disclosed are a method and system for extracting associated words by using a big data processing technology, which can extract associated words having high semantic associations with a search word by using a big data processing technology. A method for extracting associated words by using a big data processing technology according to an embodiment of the present invention comprises the steps of: receiving a search word; collecting data retrieved by using the received search word; extracting candidate associated words associated with the search word by analyzing morphemes of the collected data; calculating frequencies for the respective extracted candidate associated words; performing the calculation step to a storage step in a recursive manner by using each of the candidate associated words as an additional search word; and calculating associations between the search word and the candidate associated words based on the calculated frequencies, and extracting associated words for the search word based on the calculated associations.;COPYRIGHT KIPO 2016
机译:公开了一种利用大数据处理技术提取关联词的方法和系统,其可以利用大数据处理技术提取与搜索词具有较高语义关联的关联词。根据本发明实施例的利用大数据处理技术提取关联词的方法,包括以下步骤:接收搜索词;收集使用接收到的搜索词检索到的数据;通过分析收集到的数据的词素,提取与搜索词关联的候选关联词;计算各个提取的候选关联词的频率;通过使用每个候选关联词作为附加搜索词,以递归方式执行计算步骤到存储步骤;并基于计算出的频率计算搜索词与候选关联词之间的关联,并基于计算出的关联来提取搜索词的关联词。; COPYRIGHT KIPO 2016

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号