首页> 外国专利> AUTOMATED METHOD AND SYSTEM FOR CLUSTERING ENRICHED COMPANY SEEDS INTO A CLUSTER AND SELECTING BEST VALUES FOR EACH ATTRIBUTE WITHIN THE CLUSTER TO GENERATE A COMPANY PROFILE

AUTOMATED METHOD AND SYSTEM FOR CLUSTERING ENRICHED COMPANY SEEDS INTO A CLUSTER AND SELECTING BEST VALUES FOR EACH ATTRIBUTE WITHIN THE CLUSTER TO GENERATE A COMPANY PROFILE

机译:将丰富的公司种子进行自动化方法和系统将群集中的群集,并为群集中的每个属性选择最佳值以生成公司配置文件

摘要

Methods and systems are provided for automatically generating company profiles. Independent seed source services each crawl web pages to collect seeds from different web-based sources. A seed enricher module can then fetch additional information for each of the collected seeds from other different web-based sources and generate an enriched company seed for each collected seed. The enriched company seeds can then be automatically clustered into different clusters that each represent a particular company. A particular value for each attribute of each cluster that is determined to have the highest score can then be selected for inclusion in a corresponding company profile for that cluster.
机译:提供了用于自动生成公司配置文件的方法和系统。独立的种子源服务每个爬网网页收集来自基于Web的源的种子。然后,种子鼻子模块可以从其他不同的网状源中获取每个收集的种子的其他信息,并为每个收集的种子产生富集的公司种子。然后,富集的公司种子可以自动聚集在不同的群集中,每个集群都代表特定公司。然后,可以选择确定具有最高分数的每个群集的每个属性的特定值,以便包含在该群集的相应公司简档中。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号