
SYSTEMS AND METHODS FOR IDENTIFYING USER TYPES USING MULTI-MODAL CLUSTERING AND INFORMATION SCENT


Abstract

Techniques for determining user types based on multi-modal clustering are provided. The topology, content and usage of a document collection or web site are determined. User paths are identified using longest repeating subsequence techniques, and a multi-modal information need vector is determined for each significant user path. For each document in a significant path, content, uniform resource locator (URL), inlink and outlink multi-modal vectors are determined and combined based on path position and access frequency. Multi-modal clustering is performed based on a multi-modal similarity function and a specified measure of similarity, using a type of multi-modal clustering such as K-means or wavefront clustering. The identified clusters may be further analyzed based on changes to the weighting of the corresponding content, URL, inlink and outlink multi-modal feature vectors.
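The combination and similarity steps named in the abstract can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the patent: the function names `path_profile` and `multimodal_similarity`, the sparse-dictionary representation of feature vectors, the linear position weighting that favors later documents in a path, and the use of a weighted average of per-modality cosine similarities as the multi-modal similarity function.

```python
import math

# The four modalities named in the abstract.
MODALITIES = ("content", "url", "inlinks", "outlinks")


def cosine(a, b):
    """Cosine similarity of two sparse vectors stored as dicts."""
    dot = sum(v * b.get(k, 0.0) for k, v in a.items())
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0


def path_profile(path, doc_vectors, access_freq=None):
    """Combine per-document modality vectors along one user path.

    Assumed weighting: document i of n gets weight (i + 1) / n, multiplied
    by its access frequency (1.0 when no frequency is supplied).
    """
    access_freq = access_freq or {}
    profile = {m: {} for m in MODALITIES}
    n = len(path)
    for i, doc in enumerate(path):
        w = (i + 1) / n * access_freq.get(doc, 1.0)
        for m in MODALITIES:
            for feat, val in doc_vectors[doc].get(m, {}).items():
                profile[m][feat] = profile[m].get(feat, 0.0) + w * val
    return profile


def multimodal_similarity(p, q, weights):
    """Weighted average of per-modality cosine similarities.

    `weights` maps modality name to a non-negative weight; at least one
    weight is assumed to be positive.
    """
    total = sum(weights.get(m, 0.0) for m in MODALITIES)
    score = sum(weights.get(m, 0.0) * cosine(p[m], q[m]) for m in MODALITIES)
    return score / total
```

A similarity function of this shape could be passed to a K-means style procedure, and re-running the clustering with different `weights` would correspond loosely to the re-weighting analysis the abstract mentions for the content, URL, inlink and outlink vectors.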

Bibliographic data

  • Publication number: CA2378765C

  • Publication date: 2010-07-06

  • Original format: PDF

  • Applicant/patentee: XEROX CORPORATION

  • Application number: CA20022378765

  • Inventors: HEER JEFFERY; CHI ED H.; PIROLLI PETER L.

  • Filing date: 2002-03-25

  • Classification: H04L12/16; G06F17/30

  • Country: CA

  • Date indexed: 2022-08-21 18:42:53
