Clinical natural language processing workshop

On the diminishing return of labeling clinical reports

Abstract

Ample evidence suggests that steadily better machine learning models can be obtained by training on increasingly large datasets for natural language processing (NLP) problems in non-medical domains. Whether the same holds true for medical NLP has so far not been thoroughly investigated. This work shows that it does not always hold. We present the somewhat counter-intuitive observation that, contrary to common belief, performant medical NLP models can be obtained with a small amount of labeled data, most likely owing to the domain specificity of the problem. We quantify the effect of training-data size on the task of abnormality classification, using a fixed test set composed of two of the largest public chest x-ray radiology report datasets. The trained models not only use the training data efficiently but also outperform current state-of-the-art rule-based systems by a significant margin.
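The data-efficiency experiment the abstract describes (train on increasing amounts of labeled reports, evaluate on one fixed held-out test set, and trace the resulting learning curve) can be sketched as follows. The tiny bag-of-words Naive Bayes classifier and the toy report snippets below are illustrative stand-ins, not the authors' actual model or data.

```python
import random
from collections import Counter
from math import log

def train_nb(docs, labels):
    """Bag-of-words Naive Bayes: per-class word counts plus class priors."""
    counts = {0: Counter(), 1: Counter()}
    priors = Counter(labels)
    for doc, y in zip(docs, labels):
        counts[y].update(doc.split())
    vocab = set(counts[0]) | set(counts[1])
    return counts, priors, vocab

def predict(model, doc):
    counts, priors, vocab = model
    n = sum(priors.values())
    best, best_lp = 0, float("-inf")
    for y in (0, 1):
        if priors[y] == 0:  # class absent from a small training subset
            continue
        total = sum(counts[y].values())
        lp = log(priors[y] / n)
        for w in doc.split():
            # Laplace smoothing so unseen words do not zero out the score
            lp += log((counts[y][w] + 1) / (total + len(vocab)))
        if lp > best_lp:
            best, best_lp = y, lp
    return best

# Toy stand-ins for radiology report sentences (hypothetical data):
# label 0 = normal, label 1 = abnormal.
normal = ["lungs are clear", "no acute abnormality", "heart size normal",
          "clear lungs no effusion"]
abnormal = ["right pleural effusion seen", "patchy opacity in left lobe",
            "effusion and consolidation", "opacity suggests pneumonia"]
random.seed(0)
train = [(d, 0) for d in normal * 4] + [(d, 1) for d in abnormal * 4]
random.shuffle(train)
test = [("no effusion lungs clear", 0), ("large opacity with effusion", 1)]

# Learning curve: accuracy on the FIXED test set vs. training-set size.
for size in (4, 8, len(train)):
    subset = train[:size]
    model = train_nb([d for d, _ in subset], [y for _, y in subset])
    acc = sum(predict(model, d) == y for d, y in test) / len(test)
    print(f"n={size:2d}  test accuracy={acc:.2f}")
```

The key design point is that the test set stays fixed while only the labeled training pool grows, so any flattening of the curve reflects diminishing returns from additional labels rather than a shifting evaluation target.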
