首页> 外国专利> Category-based data analysis system for processing stored data-units and calculating their relevance to a subject domain with exemplary precision, and a computer-implemented method for identifying from a broad range of data sources, social entities that perform the function of Social Influencers

Category-based data analysis system for processing stored data-units and calculating their relevance to a subject domain with exemplary precision, and a computer-implemented method for identifying from a broad range of data sources, social entities that perform the function of Social Influencers

机译:基于类别的数据分析系统,用于处理存储的数据单元并以示例性的精度计算其与主题领域的相关性,以及一种计算机实现的方法,用于从广泛的数据源中识别执行社交影响者功能的社交实体

摘要

A category-based data analysis system for processing stored data-units and calculating their relevance to a subject domain with exemplary precision over Web and electronic document searches. A central processor parses a search definition comprising queries against target data sources, and a Boolean expression before launching a plurality of search engines. The Boolean expression and subexpressions comprise individual key-phrases and categories of key-phrases. Fine control of natural language matching behavior is controlled by parameters at the category and key-phrase level. The search engine reads data-units from a plurality of data sources, evaluates relevance, and stores metadata with the data-unit comprising relevance data by key-phrase and category. These results can be further analyzed by SQL query engines, spreadsheets, and Business Intelligence tools.;A computer-implemented method for identifying from a broad range of data sources, social entities that perform the function of Social Influencers. The method aggregates relevant results to provide a more comprehensive analysis of a subject domain than can be achieved with a manual search. Search results are presented in the form of web-presences that are logically related webpages, disaggregated and categorized from websites. Web-presences can be clustered by association with a social entity and are ranked to determine their function as Social Influencers. These results can be further analyzed by SQL query engines, spreadsheets, and Business Intelligence tools.
机译:基于类别的数据分析系统,用于处理存储的数据单元并通过Web和电子文档搜索以示例性的精度计算它们与主题领域的相关性。中央处理器在启动多个搜索引擎之前解析包括对目标数据源的查询和布尔表达式在内的搜索定义。布尔表达式和子表达式包含各个关键短语和关键短语类别。对自然语言匹配行为的精细控制是由类别和关键字短语级别的参数控制的。搜索引擎从多个数据源读取数据单元,评估相关性,并通过关键短语和类别将元数据与包含相关性数据的数据单元一起存储。这些结果可以通过SQL查询引擎,电子表格和商业智能工具进一步分析。一种计算机实现的方法,用于从广泛的数据源中识别执行社交影响者功能的社交实体。与手动搜索相比,该方法汇总了相关结果以提供对主题域的更全面的分析。搜索结果以存在于网络中的形式呈现,它们是逻辑上相关的网页,可以从网站中进行分类和分类。可以通过与社会实体的关联来对网络存在进行聚类,并对网络存在进行排名以确定其作为社会影响者的功能。这些结果可以通过SQL查询引擎,电子表格和商业智能工具进一步分析。

著录项

  • 公开/公告号US2018089193A1

    专利类型

  • 公开/公告日2018-03-29

    原文格式PDF

  • 申请/专利权人 SIMON BRUCE KNIGHT;

    申请/专利号US201615276694

  • 发明设计人 SIMON BRUCE KNIGHT;

    申请日2016-09-26

  • 分类号G06F17/30;

  • 国家 US

  • 入库时间 2022-08-21 13:01:32

相似文献

  • 专利
  • 外文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号