首页> 外文会议>IEEE International Conference on Healthcare Informatics >Finding Difficult-to-Disambiguate Words: Towards an Efficient Workflow to Implement Word Sense Disambiguation
【24h】

Finding Difficult-to-Disambiguate Words: Towards an Efficient Workflow to Implement Word Sense Disambiguation

机译:查找难于消除歧义的单词:朝着实现单词义消除歧义的高效工作流发展

获取原文

摘要

In the biomedical and clinical domain, valuable information is frequently represented in free-text documents. Natural language processing (NLP) is a powerful tool that can extract structured information from theses documents. Word sense disambiguation (WSD) is a critical component in an NLP pipeline that increases the accuracy of the extracted information. However, WSD is expensive to apply for all known ambiguous words. Given limited time and resources, one practical strategy is to prioritize easy-to-disambiguate words and efficiently maximize the coverage of disambiguation. To aid prioritization efforts, we studied two quantitative indicators that are associated with how easy/difficult it is to disambiguate any given word.
机译:在生物医学和临床领域,有价值的信息经常以自由文本文档形式表示。自然语言处理(NLP)是一种功能强大的工具,可以从这些文档中提取结构化信息。词义消歧(WSD)是NLP管道中的关键组成部分,可提高提取信息的准确性。但是,WSD申请所有已知的歧义词的成本很高。在有限的时间和资源下,一种实用的策略是优先处理易于区分的单词,并有效地扩大歧义的覆盖范围。为了帮助进行优先排序,我们研究了两个量化指标,这些指标与消除任何给定单词的歧义的难易程度相关。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号