...
首页> 外文期刊>Computer speech and language >Simple methods to overcome the limitations of general word representations in natural language processing tasks
【24h】

Simple methods to overcome the limitations of general word representations in natural language processing tasks

机译:克服自然语言处理任务中通用单词表示形式的局限性的简单方法

获取原文
获取原文并翻译 | 示例
           

摘要

Although general word representations (GWRs) by skip-gram or GloVe have been widely used in many natural language processing (NLP) tasks with considerable success, they require further improvement. First, a GWR only represents general information of a word, even though task-oriented information can be more useful in specific tasks. Second, a GWR cannot avoid the out-of-vocabulary (OOV) problem. Thus, some recent studies have proposed methods based on an additional complex model or deep knowledge of resources for each specific task. Although such methods have the potential for improved performance, we believe that the baseline systems of each NLP task are already expensive; hence, making them more complex would be problematic for real-world applications. Therefore, the objective of this study is to overcome the limitations of GWRs by developing simple but effective methods for task-specific word representations (TSWRs) and OOV representations (OOVRs). The proposed methods achieved state-of-the-art performance in four Korean NLP tasks, namely part-of-speech tagging, named entity recognition, dependency parsing, and semantic role labeling. (C) 2019 Elsevier Ltd. All rights reserved.
机译:尽管通过跳跃语法或GloVe表示的通用单词表示(GWR)已在许多自然语言处理(NLP)任务中得到了广泛的成功,但它们仍需要进一步的改进。首先,尽管面向任务的信息在特定任务中可能更有用,但是GWR仅代表单词的常规信息。其次,GWR无法避免语音不足(OOV)问题。因此,最近的一些研究提出了基于附加复杂模型或针对每个特定任务的资源深厚知识的方法。尽管这种方法有可能提高性能,但我们认为每个NLP任务的基准系统已经很昂贵;因此,将它们变得更复杂对于现实世界的应用将是一个问题。因此,本研究的目的是通过为任务特定的单词表示(TSWR)和OOV表示(OOVR)开发简单而有效的方法来克服GWR的局限性。所提出的方法在四个韩国NLP任务中达到了最先进的性能,即词性标记,命名实体识别,依赖项解析和语义角色标记。 (C)2019 Elsevier Ltd.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号