Conference: Joint Workshop on Linguistic Annotation, Multiword Expressions and Constructions

From Chinese Word Segmentation to Extraction of Constructions: Two Sides of the Same Algorithmic Coin

Abstract

This paper presents the results of two experiments carried out within the framework of computational construction grammar. Starting from the constructionist view that language consists only of constructions, including lexical ones, we tested the validity of a clustering algorithm primarily designed for multiword expression (MWE) extraction, the cpr-score (Colson, 2017), on Chinese word segmentation (CWS). Our results indicate a striking recall rate of 75 percent without any special adaptation to Chinese or to the lexicon, which confirms that there is some similarity between extracting MWEs and CWS. Our second experiment also suggests that the same methodology might be used for extracting more schematic or abstract constructions, thereby providing evidence for the statistical foundation of construction grammar.
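
The abstract reports recall for word segmentation but does not say how it was scored. As a rough illustration only, the sketch below uses the conventional word-level definition: a gold word counts as recalled when the predicted segmentation contains a word with exactly the same character span. The function names and the example sentence are hypothetical and are not taken from the paper, and the cpr-score itself is not reproduced here.

```python
def word_spans(words):
    """Map a list of words to the set of (start, end) character spans they occupy."""
    spans = set()
    start = 0
    for word in words:
        end = start + len(word)
        spans.add((start, end))
        start = end
    return spans


def segmentation_recall(gold_words, predicted_words):
    """Word-level recall: share of gold words whose exact span is also predicted.

    Assumption: both segmentations cover the same underlying character string.
    """
    if "".join(gold_words) != "".join(predicted_words):
        raise ValueError("segmentations must cover the same text")
    gold = word_spans(gold_words)
    if not gold:
        return 0.0
    predicted = word_spans(predicted_words)
    return len(gold & predicted) / len(gold)


# Hypothetical example: the predicted segmentation wrongly splits the second word.
gold = ["我们", "喜欢", "语言学"]
predicted = ["我们", "喜", "欢", "语言学"]
recall = segmentation_recall(gold, predicted)
```

In this toy case two of the three gold words are recovered with matching spans, so recall is 2/3. Under this definition, a reported recall of 75 percent would mean that three out of four gold-standard words were recovered with exact boundaries.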