首页> 外国专利> AUTOMATED DOCUMENT CLUSTER MERGING FOR TOPIC-BASED DIGITAL ASSISTANT INTERPRETATION

AUTOMATED DOCUMENT CLUSTER MERGING FOR TOPIC-BASED DIGITAL ASSISTANT INTERPRETATION

机译:基于文档的数字辅助解释的自动文档聚类合并

摘要

Disclosed are systems and techniques for extracting discovered topics from determined discourse clusters for the generation of a language model for interpreting commands received from a digital assistant device. An electronic document corpus can be generated having documents clustered based on entropy, among other things. The clusters can be associated with a corresponding plurality of cluster attractors that are representative of a context of the documents included therein. The documents within the clusters can be analyzed, so that clusters determined representative of a hierarchical discourse community can be determined and merged. The merged clusters can be analyzed, such that topics and/or sub-topics can be determined and extracted therefrom, indexed, and stored, among other things. In this way, a more efficient searching of the electronic document corpus to interpret received inputs, such as commands received via a digital assistant device, can be facilitated.
机译:公开了用于从确定的话语簇中提取发现的主题以生成用于解释从数字助理设备接收的命令的语言模型的系统和技术。可以生成具有基于熵聚类的文档的电子文档语料库。聚类可以与代表其中包括的文档的上下文的对应的多个聚类吸引子相关联。可以分析群集中的文档,以便可以确定并合并代表分层话语社区的确定群集。可以分析合并的群集,从而可以确定主题和/或子主题,并从中提取,索引和存储主题。以这种方式,可以促进对电子文档语料库的更有效搜索以解释所接收的输入,诸如经由数字助理设备所接收的命令。

著录项

  • 公开/公告号WO2019133895A2

    专利类型

  • 公开/公告日2019-07-04

    原文格式PDF

  • 申请/专利权人 AIQUDO INC.;

    申请/专利号WO2018US67992

  • 申请日2018-12-28

  • 分类号G06F17/28;

  • 国家 WO

  • 入库时间 2022-08-21 11:54:01

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号