首页> 外文会议>Mexican international conference on artificial intelligence >Syntactic Dependency-Based N-grams as Classification Features
【24h】

Syntactic Dependency-Based N-grams as Classification Features

机译:基于句法依赖的N-gram作为分类特征

获取原文

摘要

In this paper we introduce a concept of syntactic n-grams (sn-grams). Sn-grams differ from traditional n-grams in the manner of what elements are considered neighbors.In case of sn-grams, the neighbors are taken by following syntactic relations in syntactic trees, and not by taking the words as they appear in the text. Dependency trees fit directly into this idea, while in case of constituency trees some simple additional steps should be made. Sn-grams can be applied in any NLP task where traditional n-grams are used. We describe how sn-grams were applied to authorship attribution. SVM classifier for several profile sizes was used. We used as baseline traditional n-grams of words, POS tags and characters. Obtained results are better when applying sn-grams.
机译:在本文中,我们介绍了语法n-gram(sn-gram)的概念。 Sn-gram与传统n-gram的不同之处在于哪些元素被视为邻居。在使用sn-gram的情况下,邻居是通过遵循语法树中的句法关系来获取的,而不是通过采用文本中出现的单词来获取。依赖树直接适合这个想法,而在选区树的情况下,应采取一些简单的附加步骤。 Sn-gram可以应用于使用传统n-gram的任何NLP任务中。我们描述了sn-grams如何应用于作者身份归属。使用了几种配置文件大小的SVM分类器。我们将传统的n-gram字词,POS标签和字符用作基准。应用sn-gram时获得的结果更好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号