首页> 外文会议>International Conference on Machine Learning and Applications >Event Causality Identification Using Conditional Random Field in Geriatric Care Domain
【24h】

Event Causality Identification Using Conditional Random Field in Geriatric Care Domain

机译:在老年护理领域中使用条件随机场进行事件因果识别

获取原文

摘要

Event extraction is a key step in many text-mining applications such as question-answering, information extraction and summarization systems. In this study we used conditional random field (CRF) to extract causal events from PubMed articles related to Geriatric care. Abstracts of geriatric care domain were manually reviewed and categorized into 42 different sub domains. There are a total of 19, 677 sentences in the collected abstracts from PubMed, out of which 2, 856 sentences were selected and manually annotated with cause and effect events. The data set was then divided into training (2, 520), validation (252) and test (84) sentence sets. Features such as tokens, token categories, affixes, part of speech and shallow parser were used as inputs to the CRF model. A window of features before and after each token was used to determine its causal event label using CRF. A window of four features had the best performance with 84.6% precision, 87% recall, 85% and F-measure.
机译:事件提取是许多文本挖掘应用程序(例如问题解答,信息提取和摘要系统)中的关键步骤。在这项研究中,我们使用条件随机场(CRF)从与老年病护理有关的PubMed文章中提取因果事件。手动审查了老年护理领域的摘要,并将其分类为42个不同的子领域。从PubMed收集的摘要中总共有19,677个句子,从中选择了2,856个句子,并人工注释因果事件。然后将数据集分为训练(2,520),验证(252)和测试(84)句子集。令牌,令牌类别,词缀,词性和浅层解析器等功能被用作CRF模型的输入。使用CRF使用每个标记之前和之后的特征窗口来确定其因果事件标签。具有四个功能的窗口以84.6%的精度,87%的召回率,85%的F-measure表现最佳。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号