首页> 外文期刊>Statistical science >Moving from Data-Constrained to Data-Enabled Research: Experiences and Challenges in Collecting, Validating and Analyzing Large-Scale e-Commerce Data
【24h】

Moving from Data-Constrained to Data-Enabled Research: Experiences and Challenges in Collecting, Validating and Analyzing Large-Scale e-Commerce Data

机译:从数据受限研究转向数据启用研究:收集,验证和分析大规模电子商务数据的经验和挑战

获取原文
获取原文并翻译 | 示例
           

摘要

Widespread e-commerce activity on the Internet has led to new opportunities to collect vast amounts of micro-level market and nonmarket data. In this paper we share our experiences in collecting, validating, storing and analyzing large Internet-based data sets in the area of online auctions, music file sharing and online retailer pricing. We demonstrate how such data can advance knowledge by facilitating sharper and more extensive tests of existing theories and by offering observational underpinnings for the development of new theories. Just as experimental economics pushed the frontiers of economic thought by enabling the testing of numerous theories of economic behavior in the environment of a controlled laboratory, we believe that observing, often over extended periods of time, real-world agents participating in market and nonmarket activity on the Internet can lead us to develop and test a variety of new theories. Internet data gathering is not controlled experimentation. We cannot randomly assign participants to treatments or determine event orderings. Internet data gathering does offer potentially large data sets with repeated observation of individual choices and action. In addition, the automated data collection holds promise for greatly reduced cost per observation. Our methods rely on technological advances in automated data collection agents. Significant challenges remain in developing appropriate sampling techniques integrating data from heterogeneous sources in a variety of formats, constructing generalizable processes and understanding legal constraints. Despite these challenges, the early evidence from those who have harvested and analyzed large amounts of e-commerce data points toward a significant leap in our ability to understand the functioning of electronic commerce.
机译:互联网上广泛的电子商务活动为收集大量的微观市场和非市场数据提供了新的机会。在本文中,我们分享了我们在在线拍卖,音乐文件共享和在线零售商定价方面收集,验证,存储和分析基于Internet的大型数据集的经验。我们通过促进对现有理论的更尖锐和更广泛的测试,并为新理论的发展提供观察基础,证明了这些数据如何提高知识。正如实验经济学通过在受控实验室环境中对众多经济行为理论进行测试来推动经济学思想前沿一样,我们认为,通常会在很长一段时间内观察参与市场和非市场活动的现实世界代理商。在互联网上可以引导我们开发和测试各种新理论。 Internet数据收集不是受控实验。我们无法将参与者随机分配给治疗或确定事件的顺序。互联网数据收集确实提供了潜在的大型数据集,并且可以反复观察各个选择和行为。此外,自动数据收集有望大大降低每次观察的成本。我们的方法依靠自动化数据收集代理中的技术进步。开发适当的采样技术以各种格式集成来自异构源的数据,构建可概括的过程并理解法律约束条件方面仍然存在重大挑战。尽管存在这些挑战,但是那些收集和分析了大量电子商务数据的人的早期证据表明,我们在理解电子商务功能方面的能力有了重大飞跃。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号