首页> 外文会议>Brazilian Symposium in Information and Human Language Technology >Formacao de gentilicos a partir de topônimos: descricao linguistica e aprendizado automatico
【24h】

Formacao de gentilicos a partir de topônimos: descricao linguistica e aprendizado automatico

机译:由地名形成外邦人:语言描述和机器学习

获取原文

摘要

O presente artigo tem como objetivo descrever as regras envolvidas na transformando de toponimos em gentilicos, de modo a identificar regularidades. A partir dessas regularidades, desenvolve-se um algoritmo capaz de gerar gentilicos de forma automatica. Como base teorica, sao considerados conceitos da Morfologia Derivacional e, do ponto de vista metodologico, tomase como fonte toponimos e gentilicos do Instituto Brasileiro de Geografia e Estatistica (IBGE), bem como se criam procedimentos para tornarem os dados manipulaveis. Realizase tambem um processo complementar de aprendizado automatico. Como resultado, obtemse boa acuracia na predicao de gentilicos, revelando regras e atributos novos e relevantes para a tarefa. This paper aims to describe the rules required in the transformation of toponyms into demonyms in order to identify regularities. From these regularities, we developed an algorithm that automatically generates demonyms for toponyms of interest. As a theoretical basis, the concepts of Derivational Morphology are considered, and, concerning the methodology, we used data about cities and demonyms provided by the Brazilian Institute of Geography and Statistics (IBGE) website, for which we produced procedures to make this data tractable. A complementary process of automatic learning was also carried out. As a result, a good accuracy was obtained in the prediction of demonyms, revealing new and relevant rules and features for the task.
机译:本文旨在描述将toponimos转换为gentilicos所涉及的规则,以识别规律性。根据这些规律,开发了一种能够自动生成外差的算法。作为理论基础,考虑了派生形态的概念,并且从方法论的角度出发,我们采用了巴西地理与统计研究所(IBGE)的toponimos和gentilicos,并创建了可操作数据的程序。还有一个补充的自动学习过程。结果,在外邦人的预测中可以预测出良好的准确性,从而揭示了任务的新的和相关的规则和属性。本文旨在描述将地名转换为假名以识别规律性所需的规则。根据这些规律,我们开发了一种算法,该算法可自动为感兴趣的地名生成假名。作为理论基础,考虑了派生形态的概念,关于方法,我们使用了巴西地理与统计研究所(IBGE)网站提供的有关城市和恶魔的数据,我们为此制定了程序以使这些数据易于处理。还进行了自动学习的补充过程。结果,在音韵预测中获得了良好的准确性,揭示了该任务的新的和相关的规则和特征。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号