首页> 外文会议>Workshop on multiword expressions: from theory to application. >Varro: An Algorithm and Toolkit for Regular Structure Discovery in Treebanks
【24h】

Varro: An Algorithm and Toolkit for Regular Structure Discovery in Treebanks

机译:Varro:树库中规则结构发现的算法和工具包

获取原文
获取原文并翻译 | 示例

摘要

The Varro toolkit is a system for identi-fying and counting a major class of reg-ularity in treebanks and annotated nat-ural language data in the form of tree-structures: frequently recurring unordered subtrees. This software has been designed for use in linguistics to be maximally applicable to actually existing treebanks and other stores of tree-structurable nat-ural language data. It minimizes mem-ory use so that moderately large treebanks are tractable on commonly available com-puter hardware. This article introduces condensed canonically ordered trees as a data structure for efficiently discovering frequently recurring unordered subtrees.
机译:Varro工具箱是一个用于识别和计算树库和带注释的自然语言数据(以树结构形式)中的主要正则性类别的系统:经常重复出现的无序子树。该软件已设计用于语言学,可最大程度地应用于实际存在的树库和其他可树状结构的自然语言数据存储。它最大程度地减少了内存使用,因此在通常可用的计算机硬件上,中等大小的树库很容易处理。本文介绍了压缩规范化有序树作为一种数据结构,可以有效地发现频繁重复出现的无序子树。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号