【24h】

Automatic Extraction of Keywords for the Portuguese Language

机译:自动提取葡萄牙语的关键词

获取原文

摘要

This paper outlines the adaptation of an algorithm for automatic extraction of keywords for the Portuguese Language. Keywords make possible to summarize the contents of documents in a compact form, and may also be used as an efficient measure of similarity between texts. This work is focused on the extraction of keywords for theses on several fields of knowledge. To identify the keywords the KEA algorithm was used, together with a stemming technique specific to Portuguese and a manually created list of stopwords. It is shown that the results obtained are good enough for practical use and similarly match what have been done for the English Language.
机译:本文概述了对葡萄牙语自动提取关键词的算法。关键字可以以紧凑的形式总结文件的内容,并且也可以用作文本之间的有效衡量标准。这项工作专注于提取关于几个知识领域的关键字。为了识别关键字,使用KEA算法,以及特定于葡萄牙语的杆状技术和手动创建的停止列表。结果表明,获得的结果足以进行实际使用,并且同样匹配对英语所做的事情。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号