首页> 外文期刊>International Journal of Computational Medicine and Healthcare. >Extraction of breast cancer biomarker status using natural language processing
【24h】

Extraction of breast cancer biomarker status using natural language processing

机译:提取乳腺癌生物标记状态使用自然语言处理

获取原文
获取原文并翻译 | 示例
       

摘要

We employed natural language processing (NLP) algorithms to extract estrogen receptor (ER), progesterone receptor (PR), and human epidermal growth factor 2 (HER2) receptor status for females with breast cancer using unstructured (free text) EMR data, and to determine the prevalence of triple negative breast cancer in the Indiana network for patient care (INPC) population. We identified female patients in INPC with a history of breast cancer over a ten year period who had at least five oncology notes or one related pathology document. Based on manual chart review, our NLP algorithms for extracting ER, PR, and HER2 receptor status performed well with sensitivity 87.5% to 92.6%, specificity 88.6% to 95.8%, positive predictive values (PPV) 82.4% to 99.0%, and negative predictive values (NPV) 85.2% to 97.7%. This study confirmed our primary hypothesis that NLP algorithms are effective in identifying important breast cancer biomarkers in patients with breast cancer using unstructured data.
机译:我们使用自然语言处理(NLP)算法来提取雌激素受体(ER),孕激素受体(PR),和人类的表皮生长因子受体2 (HER2)状态患有乳腺癌的女性使用非结构化(自由文本)EMR数据,来确定三阴性乳腺癌的患病率印第安纳州的病人护理网络(INPC)人口。与乳腺癌的历史超过一百一十年至少有五肿瘤学笔记或时期一个相关的病理学文档。图检查,我们的NLP的提取算法ER、PR和HER2受体状态表现良好敏感性为87.5%至92.6%,特异性88.6%至95.8%,阳性预测值(PPV)82.4%到99.0%,阴性预测值(NPV) 85.2%到97.7%。主要假设NLP算法有效地识别重要的乳腺癌生物标志物在乳腺癌患者使用非组织性数据

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号