首页> 外文会议>The Semantic Web - ASWC 2008 >Extracting Semantic Frames from Thai Medical-Symptom Phrases with Unknown Boundaries
【24h】

Extracting Semantic Frames from Thai Medical-Symptom Phrases with Unknown Boundaries

机译:从具有未知边界的泰国医学症状短语中提取语义框架

获取原文
获取原文并翻译 | 示例

摘要

Due to the limitations of language-processing tools for the Thai language, pattern-based information extraction from Thai documents requires supplementary techniques. Based on sliding-window rule application and extraction filtering, we present a framework for extracting semantic information from medical-symptom phrases with unknown boundaries in Thai free-text information entries. A supervised rule learning algorithm is employed for automatic construction of information extraction rules from hand-tagged training symptom phrases. Two filtering components are introduced: one uses a classification model for predicting rule application across a symptom-phrase boundary, the other uses extraction distances observed during rule learning for resolving conflicts arising from overlapping-frame extractions. In our experimental study, we focus our attention on two basic types of symptom phrasal descriptions: one is concerned with abnormal characteristics of some observable entities and the other with human-body locations at which symptoms appear. The experimental results show that the filtering components improve precision while preserving recall satisfactorily.
机译:由于泰语的语言处理工具的局限性,从泰语文档中提取基于模式的信息需要补充技术。基于滑动窗口规则应用和提取过滤,我们提供了一个框架,用于从泰国自由文本信息条目中具有未知边界的医学症状短语中提取语义信息。监督规则学习算法用于从手标记的训练症状短语中自动构造信息提取规则。引入了两种过滤组件:一种使用分类模型来预测症状短语边界上的规则应用,另一种使用规则学习期间观察到的提取距离来解决由重叠帧提取引起的冲突。在我们的实验研究中,我们将注意力集中在症状短语描述的两种基本类型上:一种关注某些可观察到的实体的异常特征,另一种关注出现​​症状的人体位置。实验结果表明,滤波组件在提高精度的同时,还可以保持令人满意的召回率。

著录项

  • 来源
    《The Semantic Web - ASWC 2008》|2008年|390-404|共15页
  • 会议地点 Bangkok(TH);Bangkok(TH)
  • 作者单位

    School of Information and Computer Technology Sirindhorn International Institute of Technology, Thammasat University Pathumthani, Thailand;

    School of Information and Computer Technology Sirindhorn International Institute of Technology, Thammasat University Pathumthani, Thailand;

    School of Information and Computer Technology Sirindhorn International Institute of Technology, Thammasat University Pathumthani, Thailand;

  • 会议组织
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 计算机网络;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号