首页> 外国专利> METHOD AND APPARATUS FOR UNSUPERVISED LEARNING OF MULTI-RESOLUTION USER PROFILE FROM TEXT ANALYSIS

METHOD AND APPARATUS FOR UNSUPERVISED LEARNING OF MULTI-RESOLUTION USER PROFILE FROM TEXT ANALYSIS

机译:文本分析中无监督学习多分辨率用户配置文件的方法和装置

摘要

A method and apparatus for retrieving information from a massive amount of user-written businesses reviews are described. From the bag of words of a given review set, a graph based on mutual information between the words is built. Spectral analysis on this graph enables creation of a Euclidean space specific to those reviews where the distance corresponds to semantic proximity. Applying a cover-tree based divisive hierarchical clustering in this space yields therefore a semantic tag tree. Such a taxonomy is specific of the review set used, which could be all the reviews about a product or written by a user, and can be used for profiling. These taxonomies are used to build profiles. Also described is a tool to summarize and browse the review set based on the obtained trees.
机译:描述了一种用于从大量用户撰写的企业评论中检索信息的方法和设备。从给定评论集的单词袋中,构建基于单词之间相互信息的图表。通过对该图进行频谱分析,可以创建特定于那些距离对应于语义接近度的评论的欧几里得空间。因此,在此空间中应用基于覆盖树的区分层次聚类可产生语义标记树。这种分类法特定于所使用的评论集,该评论集可以是有关产品的所有评论,也可以是用户撰写的,并且可以用于概要分析。这些分类法用于构建配置文件。还描述了一种工具,用于基于获得的树来汇总和浏览审阅集。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号