Conference: Workshop on multiword expressions: from parsing and generation to the real world 2011

An N-gram frequency database reference to handle MWE extraction in NLP applications

Abstract

The identification and extraction of Multiword Expressions (MWEs) currently deliver satisfactory results. However, integrating these results into a wider application remains an issue, mainly because the association measures (AMs) used to detect MWEs require a critical amount of data and MWE dictionaries cannot account for all the lexical and syntactic variations inherent in MWEs. In this study, we use an alternative technique to overcome these limitations. It consists of defining an n-gram frequency database that can be used to compute AMs on the fly, allowing the extraction procedure to efficiently process all the MWEs in a text, even if they have not been previously observed.
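
To make the idea of computing an AM on the fly from stored n-gram frequencies concrete, here is a minimal sketch. It is an illustration under stated assumptions, not the authors' implementation: the abstract does not say which AMs are used or how the database is stored, so this example assumes pointwise mutual information (PMI) as the measure and an in-memory `Counter` as a stand-in for the frequency database. The function names `build_ngram_counts` and `pmi`, and the toy corpus, are hypothetical.

```python
import math
from collections import Counter


def build_ngram_counts(tokens, max_n=2):
    """Count every n-gram of length 1..max_n in a list of tokens.

    The resulting Counter stands in for the n-gram frequency database
    (assumption: the real database is far larger and stored on disk).
    """
    counts = Counter()
    for n in range(1, max_n + 1):
        for i in range(len(tokens) - n + 1):
            counts[tuple(tokens[i:i + n])] += 1
    return counts


def pmi(bigram, counts, total_tokens):
    """Pointwise mutual information of a bigram, from stored frequencies only.

    Assumption: probabilities are estimated as frequency / total_tokens for
    both unigrams and bigrams, which is a common simplification.
    Returns None when the bigram or one of its words is not in the table.
    """
    w1, w2 = bigram
    f12 = counts.get((w1, w2), 0)
    f1 = counts.get((w1,), 0)
    f2 = counts.get((w2,), 0)
    if f12 == 0 or f1 == 0 or f2 == 0:
        return None
    return math.log2(f12 * total_tokens / (f1 * f2))


# Toy reference corpus (hypothetical data, for illustration only).
reference = "new york is big new york is old a new car".split()
counts = build_ngram_counts(reference)

# Score the bigrams of an input text against the reference counts.
text = "a new york car".split()
for bigram in zip(text, text[1:]):
    print(bigram, pmi(bigram, counts, len(reference)))
```

Because the score is derived from the frequency table at query time, a candidate needs no entry in a precompiled MWE dictionary: any word sequence whose n-grams appear in the reference counts can be scored, which is the property the abstract highlights.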
