首页> 外文会议>International Conference on Computer Science Education >A Vector Space Model based spam SMS filter
【24h】

A Vector Space Model based spam SMS filter

机译:基于向量空间模型的垃圾短信过滤器

获取原文

摘要

Along with the popularity of telecommunication and mobile phone, short message (SMS) enters almost every human life. Meanwhile, each mobile phone client suffers from the harass of spam SMS. As the SMS service provider who is in charge of all industry SMS in east China, Dahan Tricom Corporation always invest much in anti spam SMS research. Recent years, we upgrade our anti-spam filter to semantic level. The core technology is described in this paper. Unlike other anti spam filter, such as anti spam emails, the anti spam SMS filter must face many difficulties oriented by SMS itself. Since SMS contains only 70 Chinese characters or 140 English letters at the most, it is always lack of semantic information. Also, vocal expressions often appear in SMS. In addition, the industry SMS often contains proper terms related to specific industry field. The anti spam SMS filter in this paper first leverages Vector Space Model (VSM) as its foundation technologies. Then, many modifications are made in the process in VSM method to address the difficulties of spam SMS filtering issue. Finally, the experiment result turns out to be acceptable in our commercial production environment.
机译:随着电信和移动电话的普及,短消息(SMS)进入了几乎每个人的生活。同时,每个手机客户都遭受垃圾短信的骚扰。作为负责华东地区所有行业SMS的SMS服务提供商,大韩Tricom Corporation始终在反垃圾邮件SMS研究方面投入大量资金。近年来,我们将反垃圾邮件过滤器升级到语义级别。本文介绍了核心技术。与其他反垃圾邮件过滤器(例如反垃圾邮件)不同,反垃圾短信SMS过滤器必须面对许多SMS自身面临的困难。由于SMS最多仅包含70个汉字或140个英文字母,因此始终缺少语义信息。此外,语音表达通常出现在SMS中。此外,行业SMS通常包含与特定行业领域相关的专有术语。本文中的反垃圾邮件SMS过滤器首先利用向量空间模型(VSM)作为基础技术。然后,在VSM方法的过程中进行了许多修改,以解决垃圾邮件SMS筛选问题的难题。最后,实验结果证明在我们的商业生产环境中是可以接受的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号