Conference: International Conference on Computational Linguistics

Exploring Syntactic Features for Native Language Identification: A Variationist Perspective on Feature Encoding and Ensemble Optimization

Abstract

In this paper, we systematically explore lexicalized and non-lexicalized local syntactic features for the task of Native Language Identification (NLI). We investigate different types of feature representations in single- and cross-corpus settings, including two representations inspired by a variationist perspective on the choices made in the linguistic system. To combine the different models, we use a probabilities-based ensemble classifier and propose a technique to optimize and tune it. Combining the best performing syntactic features with four types of n-grams outperforms the best approach of the NLI Shared Task 2013.
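The abstract does not say how the ensemble combines the model probabilities or how it is tuned, so the following is only a minimal sketch of one common probability-based scheme: weighted averaging of per-model class probabilities (soft voting), with the weights chosen by a small grid search on held-out data. The function names, the two toy models, the `DE`/`FR` labels, and all numbers are hypothetical and are not taken from the paper.

```python
from itertools import product


def combine(prob_dists, weights):
    """Return the label with the highest weighted-average probability.

    prob_dists: one {label: probability} dict per base model.
    weights:    one non-negative weight per base model (not all zero).
    """
    total = sum(weights)
    scores = {
        label: sum(w * p[label] for w, p in zip(weights, prob_dists)) / total
        for label in prob_dists[0]
    }
    return max(scores, key=scores.get)


def tune_weights(dev_probs, dev_gold, grid=(0.0, 0.5, 1.0)):
    """Pick the weight vector from a small grid that maximizes dev accuracy.

    This grid search is an illustrative stand-in, not the paper's technique.
    """
    n_models = len(dev_probs[0])
    best_w, best_acc = None, -1.0
    for w in product(grid, repeat=n_models):
        if sum(w) == 0:
            continue  # at least one model must contribute
        correct = sum(combine(p, w) == g for p, g in zip(dev_probs, dev_gold))
        acc = correct / len(dev_gold)
        if acc > best_acc:
            best_w, best_acc = w, acc
    return best_w, best_acc


# Toy held-out data: per text, the output of a (hypothetical) syntactic-feature
# model followed by the output of a (hypothetical) n-gram model.
dev_probs = [
    [{"DE": 0.6, "FR": 0.4}, {"DE": 0.2, "FR": 0.8}],
    [{"DE": 0.9, "FR": 0.1}, {"DE": 0.4, "FR": 0.6}],
]
dev_gold = ["FR", "DE"]

weights, accuracy = tune_weights(dev_probs, dev_gold)
```

In this toy setup neither model alone labels both texts correctly, while an equally weighted combination does, which is the kind of complementarity an ensemble over syntactic and n-gram models is meant to exploit.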
