首页> 外文会议>International Workshop on Computational Processing of the Portuguese Language >Proverb Variation: Experiments on Automatic Detection in Brazilian Portuguese Texts
【24h】

Proverb Variation: Experiments on Automatic Detection in Brazilian Portuguese Texts

机译:谚语变异:巴西葡萄牙文本自动检测实验

获取原文

摘要

This paper describes a methodology for automatically identifying proverbs and their variants in running texts. This methodology is based on existing compilations of proverbs, by exploring the regular syntactic structures that most proverbs present and intersecting syntactic structure with the lexical units of the proverbs. From the syntactic regularities we divided the data into 13 different classes. Finite-state automata is used to represent the regular patterns found in the classes. The results showed a precision rate of 74.68% tested in Brazilian Portuguese journalistic corpus.
机译:本文介绍了在运行文本中自动识别谚语及其变体的方法。该方法是基于谚语的现有汇编,通过探讨大多数谚语以及与谚语的词汇单位存在和与句法结构相交的谚语。从句法规律地,我们将数据划分为13个不同的类。有限状态自动机用于表示类中的常规模式。在巴西葡萄牙语语料库中测试了74.68%的精确率为74.68%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号