...
首页> 外文期刊>Journal of Machine Learnig Research >Shallow Parsing with PoS Taggers and Linguistic Features
【24h】

Shallow Parsing with PoS Taggers and Linguistic Features

机译:使用PoS Taggers和语言功能进行浅层解析

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Three data-driven publicly available part-of-speech taggers are applied to shallow parsing of Swedish texts. The phrase structure is represented by nine types of phrases in a hierarchical structure containing labels for every constituent type the token belongs to in the parse tree. The encoding is based on the concatenation of the phrase tags on the path from lowest to higher nodes. Various linguistic features are used in learning; the taggers are trained on the basis of lexical information only, part-of-speech only, and a combination of both, to predict the phrase structure of the tokens with or without part-of-speech. Special attention is directed to the taggers' sensitivity to different types of linguistic information included in learning, as well as the taggers' sensitivity to the size and the various types of training data sets. The method can be easily transferred to other languages.
机译:将三种数据驱动的公开语音词性标注器应用于瑞典文本的浅层分析。短语结构由分层结构中的九种类型的短语表示,其中包含令牌在解析树中属于每种构成类型的标签。编码基于从最低节点到较高节点的路径上的短语标签的串联。学习中使用了各种语言功能;仅基于词法信息,仅词性以及二者的组合来训练标记器,以预测具有或不具有词性的标记的短语结构。特别注意的是标记者对学习中包括的不同类型语言信息的敏感性,以及标记者对大小和各种类型的训练数据集的敏感性。该方法可以轻松地转换为其他语言。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号