首页> 美国卫生研究院文献>other >A novel web informatics approach for automated surveillance of cancer mortality trends
【2h】

A novel web informatics approach for automated surveillance of cancer mortality trends

机译:自动监测癌症死亡率趋势的新型网络信息学方法

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Cancer surveillance data are collected every year in the United States via the National Program of Cancer Registries (NPCR) and the Surveillance, Epidemiology and End Results (SEER) Program of the National Cancer Institute (NCI). General trends are closely monitored to measure the nation's progress against cancer. The objective of this study was to apply a novel web informatics approach for enabling fully automated monitoring of cancer mortality trends. The approach involves automated collection and text mining of online obituaries to derive the age distribution, geospatial, and temporal trends of cancer deaths in the US. Using breast and lung cancer as examples, we mined 23,850 cancer-related and 413,024 general online obituaries spanning the timeframe 2008–2012. There was high correlation between the web-derived mortality trends and the official surveillance statistics reported by NCI with respect to the age distribution (ρ = 0.981 for breast; ρ = 0.994 for lung), the geospatial distribution (ρ = 0.939 for breast; ρ = 0.881 for lung), and the annual rates of cancer deaths (ρ = 0.661 for breast; ρ = 0.839 for lung). Additional experiments investigated the effect of sample size on the consistency of the web-based findings. Overall, our study findings support web informatics as a promising, cost-effective way to dynamically monitor spatiotemporal cancer mortality trends.
机译:在美国,每年都会通过国家癌症注册计划(NPCR)和国家癌症研究所(NCI)的监测,流行病学和最终结果(SEER)计划来收集癌症监测数据。严密监控总体趋势,以衡量该国抗癌的进展。这项研究的目的是应用一种新颖的网络信息学方法来实现对癌症死亡率趋势的全自动监测。该方法涉及在线收集aries书的自动收集和文本挖掘,以得出美国癌症死亡的年龄分布,地理空间和时间趋势。以乳腺癌和肺癌为例,我们在2008-2012年的时间范围内,开采了23,850例与癌症有关的信息和413,024项一般在线online告。网络得出的死亡率趋势与NCI报告的官方监测统计数据之间存在高度相关性,其中包括年龄分布(乳腺癌=ρ= 0.981;肺部ρ= 0.994),地理空间分布(乳腺癌=ρ= 0.939;ρ =肺脏= 0.881)和每年的癌症死亡率(乳腺癌=ρ= 0.661;肺部=ρ0.839)。其他实验研究了样本量对基于Web的发现的一致性的影响。总的来说,我们的研究发现支持网络信息学,是一种有前景的,具有成本效益的方法,可以动态监测时空癌症死亡率趋势。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号