...
首页> 外文期刊>Journal of healthcare engineering. >An Interpretable Classification Framework for Information Extraction from Online Healthcare Forums
【24h】

An Interpretable Classification Framework for Information Extraction from Online Healthcare Forums

机译:来自在线医疗论坛的信息提取的可解释分类框架

获取原文
           

摘要

Online healthcare forums (OHFs) have become increasingly popular for patients to share their health-related experiences. The healthcare-related texts posted in OHFs could help doctors and patients better understand specific diseases and the situations of other patients. To extract the meaning of a post, a commonly used way is to classify the sentences into several predefined categories of different semantics. However, the unstructured form of online posts brings challenges to existing classification algorithms. In addition, though many sophisticated classification models such as deep neural networks may have good predictive power, it is hard to interpret the models and the prediction results, which is, however, critical in healthcare applications. To tackle the challenges above, we propose an effective and interpretable OHF post classification framework. Specifically, we classify sentences into three classes: medication, symptom, and background. Each sentence is projected into an interpretable feature space consisting of labeled sequential patterns, UMLS semantic types, and other heuristic features. A forest-based model is developed for categorizing OHF posts. An interpretation method is also developed, where the decision rules can be explicitly extracted to gain an insight of useful information in texts. Experimental results on real-world OHF data demonstrate the effectiveness of our proposed computational framework.
机译:在线医疗保健论坛(OHFS)对患者越来越受欢迎,以分享其与健康有关的经验。在OHFS发布的医疗保健相关文本可以帮助医生和患者更好地了解特定疾病和其他患者的情况。要提取帖子的含义,通常使用的方式是将句子分为几个预定义的不同语义类别。但是,非结构化形式的在线帖子为现有分类算法带来了挑战。此外,尽管许多复杂的分类模型如深神经网络可能具有良好的预测功率,但很难解释模型和预测结果,然而,这在医疗保健应用中是至关重要的。为了解决上述挑战,我们提出了一个有效和可译文的OHF邮政分类框架。具体来说,我们将句子分为三类:药物,症状和背景。每个句子都被投影到一个由标记的顺序模式,UMLS语义类型和其他启发式特征组成的可解释的特征空间。开发了一种基于森林的模型,用于对OHF帖子进行分类。还开发了一种解释方法,其中可以明确提取决策规则,以在文本中获得有用信息的识别。实验结果对现实世界的OHF数据展示了我们所提出的计算框架的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号