首页> 外国专利> Automated message attachment labeling using feature selection in message content

Automated message attachment labeling using feature selection in message content

机译:使用邮件内容中的功能选择自动标记邮件附件

摘要

Embodiments are directed towards an automated machine learning framework to extract keywords within a message that are relevant to an attachment to the message. The machine learning model finds a set of relevant sentences within the message determined to be relevant to the one or more attachments based on identification of one or more sentence level features within a given sentence. The sentence level features include, for example, anchor features, noisy sentence features, short message features, threading features, anaphora detections, and lexicon features. From the set of relevant sentences, useful keywords may be extracted using a sequence of heuristics to convert the sentence set into the set of useful keywords. The set of useful keywords may then be associated to at least one attachment such that the keywords may subsequently be used to perform various indexing, searching, sorting, and to provide further context to the attachment.
机译:实施例针对自动机器学习框架,以提取消息内与消息附件相关的关键字。机器学习模型基于给定句子中一个或多个句子级别特征的标识,在确定为与一个或多个附件相关的消息中找到一组相关句子。句子级别的功能包括,例如,锚定功能,嘈杂的句子功能,短消息功能,线程功能,回指检测和词典功能。可以使用一系列启发式方法从相关句子集中提取有用的关键词,以将句子集合转换为有用的关键词集合。然后,可以将一组有用的关键字与至少一个附件相关联,以使得这些关键字随后可以用于执行各种索引,搜索,排序以及向附件提供进一步的上下文。

著录项

  • 公开/公告号US8825472B2

    专利类型

  • 公开/公告日2014-09-02

    原文格式PDF

  • 申请/专利权人 ARAVINDAM RAGHUVEER;

    申请/专利号US20100790536

  • 发明设计人 ARAVINDAM RAGHUVEER;

    申请日2010-05-28

  • 分类号G06F17/27;

  • 国家 US

  • 入库时间 2022-08-21 16:02:28

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号