首页> 外文期刊>Journal of machine learning research >Detecting Click Fraud in Online Advertising: A Data Mining Approach
【24h】

Detecting Click Fraud in Online Advertising: A Data Mining Approach

机译:检测在线广告中的点击欺诈:一种数据挖掘方法

获取原文
       

摘要

Click fraud--the deliberate clicking on advertisements with noreal interest on the product or service offered--is one of themost daunting problems in online advertising. Building aneffective fraud detection method is thus pivotal for onlineadvertising businesses. We organized a Fraud Detection inMobile Advertising (FDMA) 2012 Competition, opening theopportunity for participants to work on real-world fraud datafrom BuzzCity Pte. Ltd., a global mobile advertising companybased in Singapore. In particular, the task is to identifyfraudulent publishers who generate illegitimate clicks, anddistinguish them from normal publishers. The competition washeld from September 1 to September 30, 2012, attracting 127teams from more than 15 countries. The mobile advertising dataare unique and complex, involving heterogeneous information,noisy patterns with missing values, and highly imbalanced classdistribution. The competition results provide a comprehensivestudy on the usability of data mining-based fraud detectionapproaches in practical setting. Our principal findings are thatfeatures derived from fine-grained time-series analysis arecrucial for accurate fraud detection, and that ensemble methodsoffer promising solutions to highly-imbalanced nonlinearclassification tasks with mixed variable types and noisy/missingpatterns. The competition data remain available for furtherstudies at href="http://palanteer.sis.smu.edu.sg/fdma2012/">palanteer.sis.smu.edu.sg/fdma2012. color="gray">
机译:点击欺诈-故意点击对所提供产品或服务没有兴趣的广告-是在线广告中最艰巨的问题之一。因此,构建有效的欺诈检测方法对于在线广告业务至关重要。我们举办了 2012年移动广告欺诈检测(iDMA)(FDMA)竞赛,为参与者提供了使用BuzzCity Pte进行真实欺诈数据的机会。 Ltd.是一家总部位于新加坡的全球移动广告公司。特别是,任务是识别产生非法点击的欺诈性发布者,并将其与普通发布者区分开。比赛于2012年9月1日至9月30日举行,吸引了来自15个以上国家的127个团队。移动广告数据是独特且复杂的,涉及异构信息,带有缺失值的嘈杂模式以及高度不平衡的类别分布。竞赛结果为基于数据挖掘的欺诈检测方法在实际环境中的可用性提供了全面的研究。我们的主要发现是,从细粒度的时间序列分析得出的功能对于准确的欺诈检测至关重要,而集成方法为混合变量类型和噪声/丢失模式的高度不平衡的非线性分类任务提供了有希望的解决方案。有关比赛数据,请访问href="http://palanteer.sis.smu.edu.sg/fdma2012/"> palanteer.sis.smu.edu.sg/fdma2012 。 color =“ gray”>

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号