首页> 外国专利> Identifying information related to a particular entity from electronic sources, using dimensional reduction and quantum clustering

Identifying information related to a particular entity from electronic sources, using dimensional reduction and quantum clustering

机译:使用降维和量子聚类从电子资源中识别与特定实体有关的信息

摘要

Presented are systems and methods for identifying information about a particular entity including acquiring electronic documents having unstructured text, that are selected based on one or more search terms from a plurality of terms related to the particular entity. Tokenizing the acquired documents to form a data matrix and then calculating a plurality of eigenvectors, using the data matrix and the transpose of the data matrix. The variance is then acquired for determining the amount of intra-clustering between the documents and then the acquired documents are clustered using some of the eigenvectors and the variance.
机译:提出了用于识别关于特定实体的信息的系统和方法,包括获取具有非结构化文本的电子文档,该电子文档是基于从与该特定实体有关的多个术语中的一个或多个搜索术语来选择的。标记获取的文档以形成数据矩阵,然后使用数据矩阵和数据矩阵的转置来计算多个特征向量。然后获取方差,以确定文档之间的集群内数量,然后使用某些特征向量和方差对获取的文档进行聚类。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号