首页> 外文会议>Language processing and intelligent information systems >Unsupervised Induction of Persian Semantic Verb Classes Based on Syntactic Information
【24h】

Unsupervised Induction of Persian Semantic Verb Classes Based on Syntactic Information

机译:基于句法信息的波斯语语义动词类的无监督归纳

获取原文
获取原文并翻译 | 示例

摘要

Automatic induction of semantic verb classes is one of the most challenging tasks in computational lexical semantics with a wide variety of applications in natural language processing. The large number of Persian speakers and the lack of such semantic classes for Persian verbs have motivated us to use unsupervised algorithms for Persian verb clustering. In this paper, we have done experiments on inducing the semantic classes of Persian verbs based on Levin's theory for verb classes. Syntactic information extracted from dependency trees is used as base features for clustering the verbs. Since there has been no manual classification of Persian verbs prior to this paper, we have prepared a manual classification of 265 verbs into 43 semantic classes. We show that spectral clustering algorithm outperforms KMeans and improves on the baseline algorithm with about 17% in Fmeasure and 0.13 in Rand index.
机译:语义动词类的自动归纳是计算词汇语义中最具挑战性的任务之一,在自然语言处理中具有广泛的应用。大量的波斯语说话者以及波斯语动词缺乏此类语义类别促使我们使用非监督算法进行波斯语动词聚类。在本文中,我们进行了基于莱文动词分类理论的波斯动词语义分类归纳实验。从依赖关系树中提取的语法信息被用作对动词进行聚类的基本特征。由于在此之前还没有对波斯语动词进行手动分类,因此我们准备了将265个动词分类为43个语义类别的手动分类。我们表明,频谱聚类算法优于KMeans并在基线算法上有所改进,Fmeasure约为17%,Rand指数约为0.13。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号