首页> 外文会议>IJCNLP 2011 >Word Sense Disambiguation Corpora Acquisition via Confirmation Code
【24h】

Word Sense Disambiguation Corpora Acquisition via Confirmation Code

机译:通过确认码,词感歧义歧义基层收购

获取原文

摘要

Word Sense Disambiguation (WSD) is one of the fundamental natural language processing tasks. However, lack of training corpora is a bottleneck to construct a high accurate all-words WSD system. Annotating a large-scale corpus by experts costs enormous time and financial resources. Human Computation is a novel idea for integrating human resources behind the Web, which has been wasted, to solve practical problems that are difficult for computers. Based on human computation, we design a confirmation code system, which can not only distinguish between human beings and computers (the function of normal confirmation code system), but also annotate WSD corpora. The preliminary experimental result shows that the proposed method can annotate large-scale and high-quality WSD corpora within a short time. To the best of our knowledge, this is the first attempt to use confirmation code in natural language processing for corpora acquisition.
机译:字感歧义(WSD)是基本的自然语言处理任务之一。然而,缺乏培训语料库是构建高准确的全文WSD系统的瓶颈。专家注释大规模的语料库成本巨大的时间和财务资源。人类计算是将人类资源整合在浪费的网络背后的新想法,以解决对计算机难以解决的实际问题。基于人类计算,我们设计了一个确认码系统,它不仅可以区分人类和计算机(正常确认代码系统的功能),而且还可以注释WSD Corpora。初步实验结果表明,该方法可以在短时间内注释大规模和高质量的WSD Corpora。据我们所知,这是第一次尝试在基层采集中使用自然语言处理中的确认码。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号