首页> 外文会议>International conference on recent advances in natural language processing >Linguistic Classification: Dealing Jointly with Irrelevance and Inconsistency
【24h】

Linguistic Classification: Dealing Jointly with Irrelevance and Inconsistency

机译:语言分类:共同处理无关紧要和不一致的地方

获取原文

摘要

In this paper we present new methods for language classification which put to good use both syntax and fuzzy tools, and are capable of dealing with irrelevant linguistic features (i.e. features which should not contribute to the classification) and even inconsistent features (which do not make sense for specific languages). We introduce a metric distance, based on the generalized Steinhaus transform, which allows one to deal jointly with irrelevance and inconsistency. To evaluate our methods, we test them on a syntactic data set. due to the linguist G. Longobardi and his school. We obtain phylogenetic trees which sometimes outperform the ones obtained by Atkinson and Gray (Gray and Atkinson, 2003; Bouckaert et al., 2012).
机译:在本文中,我们提出了一种新的语言分类方法,该方法充分利用了语法和模糊工具,并且能够处理不相关的语言特征(即不应有助于分类的特征),甚至是不一致的特征(不会使语言分类)。特定语言的意义)。我们基于广义的Steinhaus变换引入了一种度量距离,该距离允许人们共同处理无关紧要和不一致的情况。为了评估我们的方法,我们在语法数据集上对其进行测试。由于语言学家G. Longobardi和他的学校。我们获得的系统发育树有时会比Atkinson和Gray获得的系统树好(Gray和Atkinson,2003年; Bouckaert等人,2012年)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号