【24h】

On Some Optimization Heuristics for Lesk-Like WSD Algorithms

机译:Lesk-like WSD算法的一些优化启发式算法

获取原文
获取原文并翻译 | 示例

摘要

For most English words, dictionaries give various senses: e.g., "bank" can stand for a financial institution, shore, set, etc. Automatic selection of the sense intended in a given text has crucial importance in many applications of text processing, such as information retrieval or machine translation: e.g., "(my account in the) bank" is to be translated into Spanish as "(mi cuenta en el) banco" whereas "(on the) bank (of the lake)" as "(en la) orilla (del lago)." To choose the optimal combination of the intended senses of all words, Lesk suggested to consider the global coherence of the text, i.e., which we mean the average relatedness between the chosen senses for all words in the text. Due to high dimensionality of the search space, heuristics are to be used to find a near-optimal configuration. In this paper, we discuss several such heuristics that differ in terms of complexity and quality of the results. In particular, we introduce a dimensionality reduction algorithm that reduces the complexity of computationally expensive approaches such as genetic algorithms.
机译:对于大多数英语单词,词典提供各种含义:例如,“银行”可以代表金融机构,海岸,地点等。自动选择给定文本中所使用的含义在诸如文本处理的许多应用中至关重要。信息检索或机器翻译:例如,“(在我的帐户中)银行”应翻译为西班牙语,“(mi cuenta en el)banco”,而“(在(银行)中的银行)”应翻译为“(en la)orilla(del lago)。”为了选择所有单词的预期含义的最佳组合,Lesk建议考虑文本的全局连贯性,即,我们的意思是指文本中所有单词的所选含义之间的平均相关性。由于搜索空间的维数较高,因此将使用启发式方法来找到接近最佳的配置。在本文中,我们讨论了几种这样的启发式方法,它们在结果的复杂性和质量方面有所不同。特别是,我们引入了降维算法,该算法降低了计算成本高昂的方法(如遗传算法)的复杂性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号