首页> 外国专利> DOCUMENT ANALYSIS SYSTEM THAT USES PROCESS MINING TECHNIQUES TO CLASSIFY CONVERSATIONS

DOCUMENT ANALYSIS SYSTEM THAT USES PROCESS MINING TECHNIQUES TO CLASSIFY CONVERSATIONS

机译:使用过程挖掘技术对会话进行分类的文档分析系统

摘要

A method includes performing, by a processor: receiving a first document, the first document comprising a first plurality of sub-documents that are related to one another in a first time sequence; converting the first plurality of sub-documents to a vector format to generate a vectorized document that encodes a probability distribution of words in the document and transition probabilities between words; detecting a plurality of topics within the vectorized document, the plurality of topics being related to one another in the first time sequence; applying a process discovery algorithm to the plurality of topics to generate a model that is representative of relationships between the plurality of topics; receiving a second document containing subject matter related to a course of action, the second document comprising a second plurality of sub-documents that are related to one another in a second time sequence; using the model to generate a classification for the second document; and adjusting the course of action based on the classification for the second document.
机译:一种方法,包括:由处理器执行:接收第一文档,所述第一文档包括在第一时间序列中彼此相关的第一多个子文档;以及将第一批多个子文档转换为矢量格式,以生成矢量化文档,该文档对文档中单词的概率分布和单词之间的转换概率进行编码;在所述矢量化文档中检测多个主题,所述多个主题在第一时间序列中彼此相关;将过程发现算法应用于多个主题以生成表示多个主题之间的关系的模型;接收包含与动作过程相关的主题的第二文档,该第二文档包括在第二时间序列中彼此相关的第二多个子文档;使用该模型为第二个文档生成分类;并根据第二份文件的分类调整行动方案。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号