首页> 外文会议>Workshop on Algorithm Engineering and Experiments >Budgeted Maximum Coverage with Overlapping Costs: Monitoring the Emerging Infections Network
【24h】

Budgeted Maximum Coverage with Overlapping Costs: Monitoring the Emerging Infections Network

机译:预算的最大覆盖率具有重叠成本:监控新兴感染网络

获取原文

摘要

The Emerging Infections Network (EIN) (http://ein.idsociety.org/) is a CDC supported "sentinel" network of over 1400 members (currently), designed to connect clinical infectious disease specialists and public health officials. Members primarily communicate through an EIN managed listserv and discuss disease outbreaks, treatment protocols, effectiveness of vaccinations and other disease-control and prevention mechanisms, etc. Recently, researchers at Google and Yahoo! Research have used search engine query logs to tap into the online "wisdom of crowds" and produce disease outbreak trends for flu. Following this work, there is now interest in trying to monitor EIN discussions more carefully to disseminate timely and accurate information on clinical events of possible interest to health officials. We model the problem of monitoring a listserv, such as the EIN, as a type of budgeted maximum coverage problem that we call Budgeted Maximization with Overlapping Costs (BMOC). Even though BMOC seems superficially similar to the budgeted maximum coverage problem considered by Khuller et al. (Inf. Process. Lett., 1999), our problem is fundamentally different from an algorithmic point of view, due to its cost structure. We observe that the greedy algorithm that provides a constant-factor approximation to the budgeted maximum coverage problem can be arbitrarily bad for BMOC. We also present a reduction to BMOC from the k-densest subgraph problem that provides evidence indicating that obtaining a constant-factor approximation for our problem might be quite challenging. Nevertheless, experimental runs of the greedy algorithm on the EIN data show that greedy performs remarkably well relative to OPT. We identify a feature of our EIN data, that we call the overlap condition, and show that the greedy algorithm does indeed yield a constant-factor approximation guarantee if the overlap condition is satisfied. Using an implementation of the greedy algorithm for BMOC on the EIN data, we identify small sets of "bellwether" users who are good predictors of important discussions. We provide evidence to show that tracking just these users reduces the cost of monitoring the EIN significantly without causing any important discussions to be missed.
机译:新兴传染病网络(EIN)(http://ein.idsociety.org/)是CDC支持超过1400个成员(目前)的“哨兵”的网络,用于连接临床传染病专家和公共卫生官员。成员主要通过EIN管理的Listserv和讨论疾病爆发,治疗方案,疫苗养的有效性以及其他疾病控制和预防机制等。最近,谷歌和雅虎的研究人员研究使用了搜索引擎查询日志,从线“人群智慧”,产生疾病爆发流感趋势。在这项工作之后,现在有兴趣努力更加仔细地监测EIN讨论,以便在卫生官员可能兴趣的临床活动中传播及时和准确的信息。我们模拟监控ListServ的问题,例如EIN,作为我们通过重叠成本(BMOC)进行预算最大化的预算最大覆盖问题的类型。尽管BMOC似乎与Khuller等人所考虑的预算最大覆盖问题一样。 (inf。过程。Lett。,1999),由于其成本结构,我们的问题与算法的角度根本不同。我们观察到为预算的最大覆盖问题提供恒因子近似的贪婪算法可以是任意的BMOC不良的。我们还从K-DESEST子画面问题上提出了BMOC的减少,这提供了证据表明获得了我们问题的恒因因素近似可能是非常具有挑战性的。尽管如此,贪婪算法在EIN数据上的实验运行表明,贪婪相对于opt来表现出显着良好。我们识别我们的EIN数据的特征,我们称之为重叠条件,并且如果满足重叠条件,则贪婪算法确实产生了恒因子近似保证。使用对EIN数据的BMOC贪婪算法的实现,我们识别了少量的“Bellwether”用户,这些用户是重要讨论的良好预测因素。我们提供证据表明,跟踪只是这些用户降低了显着监测EIN的成本,而不会导致任何重要讨论。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号