首页> 外文会议>1st Workshop on computational approaches to compound analysis >A Comparative Study of Different Classification Methods for the Identification of Brazilian Portuguese Multiword Expressions
【24h】

A Comparative Study of Different Classification Methods for the Identification of Brazilian Portuguese Multiword Expressions

机译:识别巴西葡萄牙语多词表达的不同分类方法的比较研究

获取原文
获取原文并翻译 | 示例

摘要

This paper presents a comparative study of different methods for the identification of multiword expressions, applied to a Brazilian Portuguese corpus. First, we selected the candidates based on the frequency of bigrams. Second, we used the linguistic information based on the grammatical classes of the words forming the bigrams, together with the frequency information in order to compare the performance of different classification algorithms. The focus of this study is related to different classification techniques such as support-vector machines (SVM), multi-layer perceptron, naieve Bayesian nets, decision trees and random forest. Third, we evaluated three different multi-layer perceptron training functions in the task of classifying different patterns of multiword expressions. Finally, our study compared two different tools, MWEtoolkit and Text-NSP, for the extraction of multiword expression candidates using different association measures.
机译:本文介绍了一种适用于巴西葡萄牙语语料库的多词表达识别方法的比较研究。首先,我们根据二元组的出现频率选择候选人。其次,我们使用了基于构成双字词的语法分类的语言信息以及频率信息,以比较不同分类算法的性能。这项研究的重点与不同的分类技术有关,例如支持向量机(SVM),多层感知器,朴素贝叶斯网络,决策树和随机森林。第三,在对多词表达的不同模式进行分类的任务中,我们评估了三种不同的多层感知器训练功能。最后,我们的研究比较了两种不同的工具MWEtoolkit和Text-NSP,它们使用不同的关联度量来提取多词表达候选对象。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号