首页> 外文期刊>Systems and Computers in Japan >An Automatic Email Distribution by Using Text Mining and Reinforcement Learning
【24h】

An Automatic Email Distribution by Using Text Mining and Reinforcement Learning

机译:使用文本挖掘和强化学习的自动电子邮件分发

获取原文
获取原文并翻译 | 示例
           

摘要

The authors created a system to automatically distribute inquiry email from electronic mail systems or the Web to the appropriate supervisor. The proposed method gathers document data created by the supervisors, calculates the tf · idf value and the idf/conf value for the words that appear in the documents, and then creates two types of dictionaries for each supervisor. Moreover, the method features the use of Profit Sharing instead of conventional inductive learning for two weights of terms. Profit Sharing is one method of reinforcement learning. The system compares the inquiry email and the dictionaries, then calculates a score for each supervisor based on the word weights and match rates, and identifies a supervisor with a high score as the respondent. The authors performed evaluation experiments using real inquiry emails in order to evaluate the effectiveness of their method, and found the following. (1) Based on the distribution accuracy of specialists who distribute inquiry email, the accuracy necessary for practical use was obtained. (2) In distribution using only the tf · idf value and the idf/conf value, distribution accuracy sufficient for practical purposes was not obtained. (3) A practical level of distribution accuracy, roughly equivalent to that of the distribution specialists, was obtained through reinforcement learning of the word weights in (2). Finally, the authors evaluated the number of document files and noise necessary to obtain the practical level of accuracy in (3) and compared the accuracy in their method with that of a conventional text categorization method.
机译:作者创建了一个系统,用于自动将来自电子邮件系统或Web的查询电子邮件分发给适当的主管。所提出的方法收集由主管创建的文档数据,计算文档中出现的单词的tf·idf值和idf / conf值,然后为每个主管创建两种字典。此外,该方法的特征在于,对于两个权重,使用了利润共享,而不是传统的归纳学习。利益共享是强化学习的一种方法。系统将查询电子邮件和词典进行比较,然后根据词的权重和匹配率为每个主管计算分数,然后将得分较高的主管识别为受访者。作者使用真实的查询电子邮件进行了评估实验,以评估其方法的有效性,并发现了以下内容。 (1)根据分发查询电子邮件的专家的分发准确性,获得了实际使用所需的准确性。 (2)仅使用tf·idf值和idf / conf值进行分配时,未获得足够实用的分配精度。 (3)通过加强学习(2)中的单词权重,可以获得与发行专家相当的实际发行水平。最后,作者评估了在(3)中获得实际准确性水平所需的文档文件数量和噪声,并将他们的方法与常规文本分类方法的准确性进行了比较。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号