首页> 外文会议>International conference on very large data bases >Aggregate Profile Clustering for Telco Analytics
【24h】

Aggregate Profile Clustering for Telco Analytics

机译:Telco Analytics的聚合简介群集

获取原文
获取外文期刊封面目录资料

摘要

Many telco analytics require maintaining call profiles based on recent customer call patterns. Such call profiles are typically organized as aggregations computed at different time scales over the recent customer interactions. Customer call profiles are key inputs for analytics targeted at improving operations, marketing, and sales of telco providers. Many of these analytics require clustering customer call profiles, so that customers with similar calling patterns can be modeled as a group. Example applications include optimizing tariffs, customer segmentation, and usage forecasting. In this demo, we present our system for scalable aggregate profile clustering in a streaming setting. We focus on managing anonymized segments of customers for tariff optimization. Due to the large number of customers, maintaining profile clusters have high processing and memory resource requirements. In order to tackle this problem, we apply distributed stream processing. However, in the presence of distributed state, it is a major challenge to partition the profiles over machines (nodes) such that memory and computation balance is maintained, while keeping the clustering accuracy high. Furthermore, to adapt to potentially changing customer calling patterns, the partitioning of profiles to machines should be continuously revised, yet one should minimize the migration of profiles so as not to disturb the online processing of updates. We provide a re-partitioning technique that achieves all these goals. We keep micro-cluster summaries at each node, collect these summaries at a centralize node, and use a greedy algorithm with novel affinity heuristics to revise the partitioning. We present a demo that showcases our Storm and Hbase based implementation of the proposed solution in the context of a customer segmentation application.
机译:许多电信分析需要基于最近的客户呼叫模式维护呼叫配置文件。这种呼叫配置文件通常被组织为在近期客户交互的不同时间尺度上计算的聚合。客户呼叫档案是用于改善电信提供商的运营,营销和销售的分析的主要输入。这些分析中的许多都需要群集客户呼叫配置文件,以便具有类似调用模式的客户可以作为组建模。示例应用包括优化关税,客户分割和使用预测。在此演示中,我们在流设置中展示了我们的系统中可伸缩的聚合配置文件群集。我们专注于管理客户的匿名细分,以获得关税优化。由于客户数量大,维护型材集群具有高处理和内存资源要求。为了解决这个问题,我们应用分布式流处理。然而,在分布式状态存在下,将配置文件分区(节点)分区,使得维护存储器和计算余额是一项重大挑战,同时保持聚类精度高。此外,为了适应潜在的客户调用模式,应该连续修改对机器的简档的分区,但是一个应该最小化轮廓的迁移,以免干扰更新的在线处理。我们提供了一种重新分区技术,实现了所有这些目标。我们在每个节点上保留微集群摘要,在集中节点上收集这些摘要,并使用具有新颖性亲和力启发式的贪婪算法来修改分区。我们展示了一个演示,展示了我们在客户细分申请的上下文中展示了我们的风暴和基于HBase的实施情况。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号