首页> 外文会议>2017 Intelligent Systems Conference >Classification of imbalanced data in E-commerce
【24h】

Classification of imbalanced data in E-commerce

机译:电子商务中不平衡数据的分类

获取原文
获取原文并翻译 | 示例

摘要

Applications for machine learning algorithms are beginning to dominate the world of online commerce with their seemingly endless potential for supplementing fully customizable shopping experiences. From socially impactful event predictions to smarter ways of shopping online, big fast data is streaming in and being utilized constantly. Unfortunately, unusual instances of data, called imbalanced data, are still being ignored at large because of the inadequacies of analytical methods that are designed to handle homogenized data sets and to “smooth out” outliers. Consequently, rare use cases of significant importance remain neglected and lead to high-cost loses or even tragedies. In the past decade, a myriad of approaches handling this problem that range from data modifications to alterations of existing algorithms have appeared with varying success. Yet, the majority of them have major drawbacks when applied to different application domains because of the non-uniform nature of the applicable data. Within the vast domain of e-Commerce, we are proposing a new approach for handling imbalanced data, which is a hybrid classification method that will consist of a mixed solution of multi-modal data formats and algorithmic adaptations for an optimal balance between prediction accuracy, precision and specificity. Our solution improves data usability, classification accuracy and resulting costs of analyzing massive data sets used in personalizing customer experiences in e-Commerce.
机译:机器学习算法的应用程序似乎以无穷无尽的潜力来补充完全可定制的购物体验,从而开始统治在线商务领域。从具有社会影响力的事件预测到更智能的在线购物方式,大数据快速流入并被不断利用。不幸的是,由于设计用于处理均质化数据集和“消除”异常值的分析方法的不足,仍被忽略了称为不平衡数据的异常数据实例。因此,非常重要的稀有用例仍然被忽略,并导致高成本损失甚至悲剧。在过去的十年中,出现了无数种处理此问题的方法,从数据修改到现有算法的更改,都取得了不同的成功。但是,由于适用数据的不统一特性,当将它们应用于不同的应用程序域时,它们中的大多数都有主要缺点。在电子商务的广阔领域内,我们正在提出一种新的方法来处理不平衡数据,该方法是一种混合分类方法,它将由多模式数据格式和算法调整的混合解决方案组成,以实现预测精度之间的最佳平衡,精确度和特异性。我们的解决方案提高了数据可用性,分类准确性以及分析用于个性化电子商务客户体验的海量数据集的成本。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号