首页> 外文会议>European conference on IR research >Bringing Back Structure to Free Text Email Conversations with Recurrent Neural Networks
【24h】

Bringing Back Structure to Free Text Email Conversations with Recurrent Neural Networks

机译:通过递归神经网络将结构带回免费文本电子邮件会话

获取原文

摘要

Email communication plays an integral part of everybody's life nowadays. Especially for business emails, extracting and analysing these communication networks can reveal interesting patterns of processes and decision making within a company. Fraud detection is another application area where precise detection of communication networks is essential. In this paper we present an approach based on recurrent neural networks to untangle email threads originating from forward and reply behaviour. We further classify parts of emails into 2 or 5 zones to capture not only header and body information but also greetings and signatures. We show that our deep learning approach outperforms state-of-the-art systems based on traditional machine learning and hand-crafted rules. Besides using the well-known Enron email corpus for our experiments, we additionally created a new annotated email benchmark corpus from Apache mailing lists.
机译:电子邮件通信已成为当今每个人生活中不可或缺的一部分。特别是对于商务电子邮件,提取和分析这些通信网络可以揭示公司内部有趣的流程和决策模式。欺诈检测是另一个需要精确检测通信网络的应用领域。在本文中,我们提出了一种基于递归神经网络的方法,以解开源自转发和回复行为的电子邮件线程。我们进一步将电子邮件的各个部分分为2个或5个区域,以不仅捕获标题和正文信息,还捕获问候和签名。我们证明了我们的深度学习方法优于基于传统机器学习和手工制定的规则的最新系统。除了将著名的Enron电子邮件语料库用于我们的实验之外,我们还从Apache邮件列表中创建了一个新的带注释的电子邮件基准语料库。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号