Finding Difficult-to-Disambiguate Words: Towards an Efficient Workflow to Implement Word Sense Disambiguation


Abstract

In the biomedical and clinical domains, valuable information is frequently recorded in free-text documents. Natural language processing (NLP) is a powerful tool that can extract structured information from these documents. Word sense disambiguation (WSD) is a critical component of an NLP pipeline that increases the accuracy of the extracted information. However, applying WSD to all known ambiguous words is expensive. Given limited time and resources, one practical strategy is to prioritize easy-to-disambiguate words and thereby efficiently maximize the coverage of disambiguation. To aid prioritization efforts, we studied two quantitative indicators that are associated with how easy or difficult it is to disambiguate a given word.


Bibliographic record

Source: IEEE International Conference on Healthcare Informatics, 2015, p. 448 (1 page)
Authors: Torii Manabu; Fan Jung-Wei; Zisook Daniel S.
Original format: PDF
Keywords: Benchmark testing; Bioinformatics; Data mining; Informatics; Natural language processing; Training; Vocabulary; Medical Informatics; Natural Language Processing; Word Sense Disambiguation


Similar literature


1. Words Polysemy Analysis: Implementation of the Word Sense Disambiguation Algorithm Based on Magnini Domains [J] . Francis C. Fernández-Reyes, Exiquio C. Leyva Pérez, Rog International Journal of Information Science . 2012, No. 3

2. A Sense Annotated Corpus for All-Words Urdu Word Sense Disambiguation [J] . Saeed Ali, Nawab Rao Muhammad Adeel, Stevenson Mark, ACM transactions on Asian language information processing . 2019, No. 4

3. Bioinformatic Workflow Extraction from Scientific Texts based on Word Sense Disambiguation [J] . Ahmed Halioui, Petko Valtchev, Abdoulaye Baniré Diallo IEEE/ACM transactions on computational biology and bioinformatics . 2018, No. 6

4. Finding Difficult-to-Disambiguate Words: Towards an Efficient Workflow to Implement Word Sense Disambiguation [C] . Torii Manabu, Fan Jung-Wei, Zisook Daniel S. IEEE International Conference on Healthcare Informatics . 2015

5. Finding meaning: Sense inventories for improved word sense disambiguation. [D] . Brown, Susan Windisch. 2010

6. Word sense disambiguation for event trigger word detection in biomedicine [O] . David Martinez, Timothy Baldwin 2011

7. Foundation, Implementation and Evaluation of the MorphoSaurus System : Subword Indexing, Lexical Learning and Word Sense Disambiguation for Medical Cross-Language Information Retrieval [O] . Markó Kornél Géza 2009

8. Word Domain Disambiguation via Word Sense Disambiguation [R] . Sanfilippo, A. 2006

