首页> 外国专利> SELECTION OF RELIABLE KEY WORDS FROM UNRELIABLE SOURCES IN A SYSTEM AND METHOD FOR CONDUCTING A SEARCH

SELECTION OF RELIABLE KEY WORDS FROM UNRELIABLE SOURCES IN A SYSTEM AND METHOD FOR CONDUCTING A SEARCH

机译:在进行搜索的系统和方法中从不可靠的来源中选择可靠的关键词

摘要

The invention provides for a system to select data including a reception component that receives at least one data entry from at least one data source, a processor component to determine the entropy of a word extracted from the at least one data entry, a filtering component to select reliable words, wherein reliable words are words with low entropy values, the filtering component further excluding words with high entropy values, and a transmission component to output a set of reliable words, wherein the set of reliable words is associated with the at least one data entry from which the reliable words were extracted.
机译:本发明提供了一种用于选择数据的系统,该系统包括:接收组件,其从至少一个数据源接收至少一个数据条目;处理器组件,用于确定从至少一个数据条目中提取的单词的熵;过滤组件,用于从所述至少一个数据条目中提取单词。选择可靠的词,其中可靠的词是具有低熵值的词,过滤组件进一步排除具有高的熵值的词,以及传输组件以输出一组可靠的词,其中该组可靠的词与至少一个相关联从中提取可靠单词的数据条目。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号