首页> 外文期刊>International Journal of Computer Processing of Oriental Languages >Evaluating Rule-Based and Statistical Filters to Detecting Arabic E-Mail Alert Messages
【24h】

Evaluating Rule-Based and Statistical Filters to Detecting Arabic E-Mail Alert Messages

机译:评估基于规则的统计过滤器以检测阿拉伯语电子邮件警报消息

获取原文
获取原文并翻译 | 示例
           

摘要

Detecting and filtering e-mail alerts that are related to criminal or terrorist activities is of great interest for both security agencies and people. This paper evaluates and compares the performance of both the rule-based filter and Paul Graham statistical filter for detecting alerts in Arabic e-mail messages. To evaluate the two filters, a set of 1500 Arabic messages related to criminal activities were collected manually from some news websites such as Al-Jazeera Net and BBC Arabic news. The e-mails have been preprocessed, normalized, and then the relevant features were extracted from the collected e-mails by involving categorical proportional difference (CPD) and term frequency variance (TFV) as features weighting methods for the rule-based filter. To test the performance of the two filters, several experiments have been conducted and the result show that the Paul Graham statistical filter was more accurate. It was able to detect about 85% of the e-mail alerts used in the experiments. The rule-based filter has achieved 80% accuracy using the CPD method and 70% accuracy using the TFV method.
机译:对于安全机构和人员来说,检测和过滤与犯罪或恐怖活动有关的电子邮件警报非常重要。本文评估并比较了基于规则的过滤器和Paul Graham统计过滤器在检测阿拉伯电子邮件中的警报方面的性能。为了评估这两个过滤器,从Al-Jazeera Net和BBC阿拉伯新闻等一些新闻网站手动收集了1500条与犯罪活动有关的阿拉伯语消息。电子邮件已经过预处理,标准化,然后通过将分类比例差异(CPD)和词频差异(TFV)作为基于规则的过滤器的特征加权方法,从收集的电子邮件中提取相关特征。为了测试这两个过滤器的性能,已进行了几次实验,结果表明,Paul Graham统计过滤器更为准确。它能够检测到实验中使用的大约85%的电子邮件警报。基于规则的过滤器使用CPD方法的精度达到80%,使用TFV方法的精度达到70%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号