System and method for automatically mining corpus of communications and identifying actions

Abstract

Exemplary embodiments of the present disclosure are directed towards a system and method for automatically mining a corpus of communications and identifying actions. The method comprises collecting a corpus of communications from communication modalities, determining the importance of the senders of those communications, and filtering the corpus to remove promotional or marketing content as well as header and footer content, including signatures. The method further includes segmenting the filtered content to define phrases, creating a canonical form for each phrase, and extracting a feature vector for each canonical form. The feature vectors are processed by a classification algorithm to determine whether the corresponding phrase requires the user's attention.
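The steps named in the abstract can be sketched as a small pipeline. This is only an illustrative sketch, not the method claimed in the patent: the abstract does not say how sender importance is computed, which features are extracted, or which classification algorithm is used. Everything below is therefore an assumption made for illustration, including the cue lists, the `--` signature delimiter, the frequency-based `sender_importance`, and the hand-set linear scorer standing in for the classifier.

```python
import re
from dataclasses import dataclass

# All cue lists, weights, and the threshold are hypothetical placeholders.
PROMO_MARKERS = ("unsubscribe", "limited offer", "% off")
REQUEST_CUES = ("please", "can you", "could you", "need you to")
DEADLINE_CUES = ("today", "tomorrow", "by friday", "asap")
WEIGHTS = [1.0, 1.0, 0.5]
THRESHOLD = 1.0


@dataclass
class Message:
    sender: str
    body: str


def sender_importance(sender, history):
    """Toy heuristic: share of past messages that came from this sender."""
    total = sum(history.values())
    return history.get(sender, 0) / total if total else 0.0


def filter_body(body):
    """Drop promotional lines and everything after a '--' signature line."""
    kept = []
    for line in body.splitlines():
        if line.rstrip() == "--":
            break  # signature/footer starts here (assumed convention)
        if any(marker in line.lower() for marker in PROMO_MARKERS):
            continue
        kept.append(line)
    return "\n".join(kept)


def segment(text):
    """Split text into phrases at sentence-ending punctuation or newlines."""
    parts = re.split(r"(?<=[.!?])\s+|\n+", text)
    return [p.strip() for p in parts if p.strip()]


def canonical(phrase):
    """Canonical form: lowercase, no punctuation, single spaces."""
    cleaned = re.sub(r"[^a-z0-9\s]", "", phrase.lower())
    return " ".join(cleaned.split())


def features(canon, importance):
    """Feature vector: [request cue present, deadline cue present, importance]."""
    return [
        float(any(cue in canon for cue in REQUEST_CUES)),
        float(any(cue in canon for cue in DEADLINE_CUES)),
        importance,
    ]


def needs_attention(vec):
    """Stand-in classifier: weighted sum compared against a fixed threshold."""
    return sum(w * x for w, x in zip(WEIGHTS, vec)) >= THRESHOLD


def find_actions(msg, history):
    """Run the full pipeline and return the phrases flagged for attention."""
    importance = sender_importance(msg.sender, history)
    return [
        phrase
        for phrase in segment(filter_body(msg.body))
        if needs_attention(features(canonical(phrase), importance))
    ]
```

In a real system the fixed weights would presumably be replaced by a classifier trained on labelled phrases, and the filtering rules would need to be far more robust than keyword matching.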

Bibliographic data

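  • Publication number: IN2014CH00401A

  • Patent type:

  • Publication date: 2016-08-31

  • Original format: PDF

  • Applicant/patentee:

  • Application number: IN401/CHE/2014

  • Filing date: 2014-01-29

  • Classification:

  • Country: IN

  • Database entry time: 2022-08-21 14:25:33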
