【24h】

Efficient part-of-speech tagging with a min-max modular neural-network model

机译:最小-最大模块化神经网络模型的高效词性标记

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

This paper presents a part-of-speech tagging method based on a min-max modular neural-network model. The method has three main steps. First, a large-scale tagging problem is decomposed into a number of relatively smaller and simpler subproblems according to the class relations among a given training corpus. Secondly, all of the subproblems are learned by smaller network modules in parallel. Finally, following two simple module combination laws, all of the trained network modules are integrated into a modular parallel tagging system that produces solutions to the original tagging problem. The proposed method has several advantages over existing tagging systems based on multilayer perceptrons. (1) Training times can be drastically reduced and desired learning accuracy can be easily achieved; (2) the method can scale up to larger tagging problems; (3) the tagging system has quick response and facilitates hardware implementation. In order to demonstrate the effectiveness of the proposed method, we perform simulations on two different language corpora: a Thai corpus and a Chinese corpus, which have 29,028 and 45,595 ambiguous words, respectively. We also compare our method with several existing tagging models including hidden Markov models, multilayer perceptrons and neuro-taggers. The results show that both the learning accuracy and generalization performance of the proposed tagging model are better than statistical models and multilayer perceptrons, and they are comparable to the most successful tagging models. [References: 29]
机译:本文提出了一种基于最小-最大模块化神经网络模型的词性标注方法。该方法具有三个主要步骤。首先,根据给定训练语料库之间的类关系,将大规模标记问题分解为多个相对较小和较简单的子问题。其次,所有子问题都是由较小的网络模块并行学习的。最后,遵循两个简单的模块组合法则,所有经过训练的网络模块都集成到模块化并行标记系统中,该系统可为原始标记问题提供解决方案。与基于多层感知器的现有标记系统相比,该方法具有多个优势。 (1)可以大大减少培训时间,并可以轻松实现所需的学习准确性; (2)该方法可以扩展到更大的标记问题; (3)标签系统响应速度快,便于硬件实现。为了证明该方法的有效性,我们对两种不同的语言语料库进行了仿真:泰国语料库和中文语料库,它们分别具有29,028和45,595个不明确的单词。我们还将我们的方法与几种现有的标记模型(包括隐藏的马尔可夫模型,多层感知器和神经标记器)进行比较。结果表明,所提出的标签模型的学习准确性和泛化性能均优于统计模型和多层感知器,并且可以与最成功的标签模型相媲美。 [参考:29]

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号