首页> 外文会议>Natural Language Processing and Information Systems >Abbreviation Disambiguation: Experiments with Various Variants of the One Sense per Discourse Hypothesis
【24h】

Abbreviation Disambiguation: Experiments with Various Variants of the One Sense per Discourse Hypothesis

机译:缩写歧义消除:每个语篇假说对一种意义的各种变体进行实验

获取原文
获取原文并翻译 | 示例

摘要

Abbreviations are very common and are widely used in both written and spoken language. However, they are not always explicitly defined and in many cases they are ambiguous. In this research, we present a process that attempts to solve the problem of abbreviation ambiguity. Various features have been explored, including context-related methods and statistical methods. The application domain is Jewish Law documents written in Hebrew, which are known to be rich in ambiguous abbreviations. Various variants of the one sense per discourse hypothesis (by varying the scope of discourse) have been implemented. Several common machine learning methods have been tested to find a successful integration of these variants. The best results have been achieved by SVM, with 96.09% accuracy.
机译:缩写非常普遍,并且广泛用于书面和口头语言中。但是,它们并不总是明确定义的,在许多情况下它们是模棱两可的。在这项研究中,我们提出了一个尝试解决缩写歧义问题的过程。已经探索了各种功能,包括上下文相关方法和统计方法。应用领域是用希伯来语编写的犹太法律文档,众所周知,这些文档含糊不清的缩写词。每个话语假设(通过改变话语范围)已经实现了一种意义的各种变体。已经测试了几种常见的机器学习方法,以找到这些变量的成功集成。 SVM以96.09%的精度实现了最佳结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号