首页> 外文期刊>Marketing Science >Efficient Methods for Sampling Responses from Large-Scale Qualitative Data
【24h】

Efficient Methods for Sampling Responses from Large-Scale Qualitative Data

机译:从大规模定性数据采样响应的有效方法

获取原文
获取原文并翻译 | 示例
       

摘要

The World Wide Web contains a vast corpus of consumer-generated content that holds invaluable insights for improving the product and service offerings of firms. Yet the typical method for extracting diagnostic information from online content-text mining-has limitations. As a starting point, we propose analyzing a sample of comments before initiating text mining. Using a combination of real data and simulations, we demonstrate that a sampling procedure that selects respondents whose comments contain a large amount of information is superior to the two most popular sampling methods-simple random sampling and stratified random sampling-in gaining insights from the data. In addition, we derive a method that determines the probability of observing diagnostic information repeated a specific number of times in the population, which will enable managers to base sample size decisions on the trade-off between obtaining additional diagnostic information and the added expense of a larger sample. We provide an illustration of one of the methods using a real data set from a website containing qualitative comments about staying at a hotel and demonstrate how sampling qualitative comments can be a useful first step in text mining.
机译:万维网包含大量由用户生成的内容,这些内容对改善公司的产品和服务提供了宝贵的见解。然而,从在线内容文本挖掘中提取诊断信息的典型方法有局限性。作为起点,我们建议在启动文本挖掘之前分析评论样本。结合实际数据和模拟结果,我们证明,选择评论中包含大量信息的受访者的抽样程序要优于两种最流行的抽样方法-简单随机抽样和分层随机抽样-从数据中获取见解。此外,我们推导了一种方法,该方法确定了在总体中重复观察特定次数的诊断信息的可能性,这将使管理人员能够根据获得额外诊断信息和增加医疗费用之间的权衡取舍来确定样本量。大样本。我们提供了其中一种方法的示例,该方法使用了来自网站的真实数据集,其中包含有关住宿的定性注释,并演示了如何对定性注释进行采样是文本挖掘中有用的第一步。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号