...
首页> 外文期刊>Language Resources and Evaluation >Semi-automatic construction of word-formation networks
【24h】

Semi-automatic construction of word-formation networks

机译:半自动构建字形网络

获取原文
获取原文并翻译 | 示例
           

摘要

The article presents a semi-automatic method for the construction of word-formation networks focusing particularly on derivation. The proposed approach applies a sequential pattern mining technique to construct useful morphological features in an unsupervised manner. The features take the form of regular expressions and later they are used to feed a machine-learned ranking model. The network is constructed by applying the learned model to sort the lists of possible base words and selecting the most probable ones. This approach, besides relatively small training set and a lexicon, does not require any additional language resources such as a list of vowel and consonant alternations, part-of-speech tags etc. The proposed approach is evaluated on lexeme sets of four languages, namely Polish, Spanish, Czech, and French. The conducted experiments demonstrate the ability of the proposed method to construct linguistically adequate word-formation networks from small training sets. Furthermore, the performed feasibility study shows that the method can further benefit from the interaction with a human language expert within the active learning framework.
机译:该物品提出了一种半自动方法,用于构建一个专注于衍生的字形成网络。该方法采用顺序模式采矿技术以无监督的方式构建有用的形态特征。这些功能采用正则表达式的形式,后来它们用于馈送机器学习的排名模型。通过应用学习模型来对网络构成网络来对可能的基本单词和选择最可能的模型来构建网络。除了相对较小的训练集和词典之外,这种方法不需要任何额外的语言资源,例如元音和辅音的列表,致辞标签等。所提出的方法是在lexeme组的四种语言中进行评估,即波兰语,西班牙语,捷克语和法语。所进行的实验表明了提出的方法从小型训练集构造语言上足够的单词形成网络的能力。此外,所进行的可行性研究表明,该方法可以进一步受益于与主动学习框架内的人类语言专家的互动。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号