首页> 外文学位 >Essays in Machine Learning, Social Networks and Marketing
【24h】

Essays in Machine Learning, Social Networks and Marketing

机译:机器学习,社交网络和营销方面的论文

获取原文
获取原文并翻译 | 示例

摘要

This thesis is organized into two distinct parts; however, both explore different facets of large-scale machine learning. The first chapter is concerned with methodological aspects of machine learning, namely the design of new distributed optimization algorithms, whereas the last two chapters consider applications of machine learning that lie at the interface of social network analysis and marketing.;The motivation for the first chapter is that in recent years, the size of data sets has exceeded the disk and memory capacities of a single computer. This has created the need to employ parallel and distributed computing in order to perform critical machine learning tasks. Given that optimization is one of the pillars of machine learning and predictive modeling, distributed optimization methods have recently garnered ample attention in the literature. Although previous research has mostly focused on settings in which either the observations, or features of the problem at hand are stored in distributed fashion, the situation where both are partitioned across the nodes of a computer cluster (doubly distributed) has not been studied extensively. We propose two doubly distributed optimization algorithms. The first falls under the umbrella of distributed dual coordinate ascent methods, while the second belongs to the class of stochastic gradient/coordinate descent hybrid methods. We conduct numerical experiments in Spark using real-world and simulated data sets and study the scaling properties of our methods. Our empirical evaluation of the proposed algorithms demonstrates the outperformance of a block distributed ADMM method, which, to the best of our knowledge is the only other existing doubly distributed optimization algorithm.;The second and third chapters address two fundamental problems in marketing: customer retention, and product adoption, respectively. In the second chapter, we explore the power of social networks in predicting non-contractual customer behavior. The last decade has seen a rapid emergence of non-subscription-based services. Predicting spending patterns in such settings is particularly challenging due to the capricious purchasing behavior that customers often exhibit. We study the extent to which knowledge of a customer's social network can enhance the accuracy of forecasting their behavior in terms of future: (1) activity, (2) transaction levels and (3) membership to the group of best customers. We conduct a dynamic analysis on a sample of approximately one million users from the most popular peer-to-peer (P2P) payment application, Venmo. Our models produce high quality forecasts and demonstrate that social networks lead to a significant boost in predictive performance primarily during the first month of a customer's lifetime, thus providing a remedy to the "cold-start" problem. Finally, we characterize significant structural differences with regard to network centrality, density and connectivity between the top 10% and bottom 90% of users immediately after joining the service. We discuss how these structural dissimilarities provide a path towards proactive marketing and improved customer acquisition efforts.;In the third chapter, using the same data set as in the second chapter, we investigate the structural aspects behind the adoption of Venmo. The most distinct quality of our data set is that Venmo transactions reflect offline activities combined with the scale of the online world. We use the framework of structural virality, which has primarily been utilized in the spread of online content, as a vehicle to investigate the diffusion of an offline product. Structural virality serves as a tool for differentiating between the two main forms of diffusion, namely a single broadcast that reaches directly a large number of people, and a viral mechanism that follows a more organic, P2P growth. We explore the nature of adopting Venmo from two perspectives: the traditional approach that considers adoption to be a binary event, and a holistic approach that takes into account the behavior of users in the post-adoption period. Our key finding indicates that not all adoptions are equal. More specifically, our results show that broadcast and viral mechanisms are positively correlated with the speed of diffusion and user engagement, respectively. In the face of this trade-off, we build a model to predict broadcast and viral structures, which can help stakeholders proactively devise network interventions that best suit their objectives.
机译:本论文分为两个不同的部分:但是,两者都探索了大规模机器学习的不同方面。第一章涉及机器学习的方法学方面,即新的分布式优化算法的设计,而后两章则探讨了机器学习在社交网络分析和营销界面上的应用。就是说,近年来,数据集的大小已经超过了单台计算机的磁盘和内存容量。这就需要采用并行和分布式计算来执行关键的机器学习任务。鉴于优化是机器学习和预测建模的支柱之一,因此分布式优化方法最近在文献中引起了足够的关注。尽管以前的研究主要集中在将观察结果或问题的特征以分布式方式存储的环境中,但尚未对跨计算机集群的节点(双重分布)进行分区的情况进行广泛研究。我们提出了两种双重分布的优化算法。第一种属于分布式双坐标上升方法,而第二种属于随机梯度/坐标下降混合方法。我们使用真实世界和模拟数据集在Spark中进行数值实验,并研究我们方法的缩放性质。我们对所提出算法的实证评估证明了块分布式ADMM方法的出色表现,据我们所知,该方法是仅有的其他现有的双分布优化算法。;第二章和第三章解决了营销中的两个基本问题:客户保留和产品采用率。在第二章中,我们探讨了社交网络在预测非合同客户行为方面的功能。在过去的十年中,非订阅式服务迅速兴起。由于客户经常表现出反复无常的购买行为,因此在这种情况下预测支出模式尤其具有挑战性。我们研究客户社交网络的知识可以在多大程度上提高未来预测其行为的准确性:(1)活动,(2)交易级别和(3)最佳客户组的成员资格。我们对来自最流行的对等(P2P)付款应用程序Venmo的大约一百万用户的样本进行动态分析。我们的模型产生高质量的预测,并证明社交网络主要在客户生命周期的头一个月内大大提高了预测性能,从而为“冷启动”问题提供了解决方法。最后,我们在加入服务后立即就最高级的10%和最底层的90%用户之间的网络集中度,密度和连接性,描述了重大的结构差异。我们讨论了这些结构上的差异如何为主动营销和改善客户获取工作提供途径。在第三章中,使用与第二章相同的数据集,我们研究了采用Venmo背后的结构方面。我们数据集最独特的质量是Venmo交易反映了线下活动和在线世界的规模。我们使用主要在在线内容传播中使用的结构病毒式传播框架作为研究离线产品传播的工具。结构病毒传播是区分两种主要传播形式的工具,即直接传播到大量人群的单一广播,以及随着有机传播的P2P增长而传播的病毒机制。我们从两个角度探讨采用Venmo的性质:将采用视为二元事件的传统方法,以及考虑采用后阶段用户行为的整体方法。我们的主要发现表明,并非所有采用者都是平等的。更具体地说,我们的结果表明,广播和病毒机制分别与传播速度和用户参与度呈正相关。面对这种权衡,我们建立了一个模型来预测广播和病毒结构,这可以帮助利益相关者主动设计最适合其目标的网络干预措施。

著录项

  • 作者

    Nathan, Alexandros.;

  • 作者单位

    Northwestern University.;

  • 授予单位 Northwestern University.;
  • 学科 Industrial engineering.;Artificial intelligence.;Marketing.
  • 学位 Ph.D.
  • 年度 2018
  • 页码 125 p.
  • 总页数 125
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号