首页> 外国专利> Automated Document Cluster Merging for Topic-Based Digital Assistant Interpretation

Automated Document Cluster Merging for Topic-Based Digital Assistant Interpretation

机译:自动文档集群合并,用于基于主题的数字助理解释

摘要

Disclosed are techniques for automatically extracting discovered topics and/or from determined discourse clusters for the generation of a language model that is applicable to interpreting commands received from a digital assistant device. An electronic document corpus can be generated having a plurality of documents that are clustered based on entropy, among other things. The clusters can be associated with a corresponding plurality of cluster attractors that are generally representative of a context of the documents included therein. The documents within the cluster for each of the document clusters can be analyzed, so that clusters determined representative of a hierarchical discourse community can be determined and logically merged. The merged clusters can be analyzed, such that topics and/or sub-topics can be determined and extracted therefrom, for indexing and storage, among other things. In this way, a more efficient searching of the electronic document corpus to interpret received inputs, such as commands received via a digital assistant device, can be facilitated.
机译:公开了用于自动提取发现的主题和/或从确定的话语簇中提取用于生成语言模型的技术,该语言模型适用于解释从数字助理设备接收的命令。可以生成具有多个文档的电子文档语料库,其中多个文档基于熵而聚类。这些簇可以与通常代表其中包含的文档的上下文的对应的多个簇吸引器相关联。可以对每个文档集群的集群中的文档进行分析,以便可以确定代表逻辑分层的社区的集群并进行逻辑合并。可以分析合并的群集,以便可以确定主题和/或子主题并从中提取主题,以便进行索引和存储等。以此方式,可以促进对电子文档语料库的更有效搜索以解释所接收的输入,诸如经由数字助理设备所接收的命令。

著录项

  • 公开/公告号US2019205391A1

    专利类型

  • 公开/公告日2019-07-04

    原文格式PDF

  • 申请/专利权人 AIQUDO INC.;

    申请/专利号US201816234219

  • 申请日2018-12-27

  • 分类号G06F17/27;G06F16/93;G06F16/22;G06F16/245;G06F16/28;

  • 国家 US

  • 入库时间 2022-08-21 12:06:22

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号