首页> 外文会议>International Conference on Computer Vision and Graphics >Pattern Recognition Method for Classification of Agricultural Scientific Papers in Polish
【24h】

Pattern Recognition Method for Classification of Agricultural Scientific Papers in Polish

机译:抛光中农业科学论文分类的模式识别方法

获取原文

摘要

Calculation of text similarity is an essential task for the text analysis and classification. It be can based, e.g., on Jaccard, cosine or other similar measures. Such measures consider the text as a bag-of-words and, therefore, lose some syntactic and semantic features of its sentences. This article presents a different measure based on the so-called artificial sentence pattern (ASP) method. This method has been developed to analyze texts in the Polish language which has very rich inflection. Therefore, ASP has utilized syntactic and semantic rules of the Polish language. Nevertheless, we argue that it admits extensions to other languages. As a result of the analysis, we have obtained several hypernodes which contain the most important words. Each hypernode corresponds to one of the examined documents, the latter being published papers from agriculture domain written in Polish. Experimental results obtained from that set of papers have been described and discussed. Those results have been visually illustrated using graphs of hypernodes and compared with Jaccard and cosine measures.
机译:文本相似性的计算是文本分析和分类的重要任务。它可以基于,例如,在Jaccard,余弦或其他类似措施上。这些措施将文本视为一个单词的文本,因此失去了句子的一些句法和语义特征。本文呈现了基于所谓的人工句子模式(ASP)方法的不同措施。已经开发了这种方法来分析波兰语中的文本,这具有非常丰富的拐点。因此,ASP利用了波兰语的句法和语义规则。尽管如此,我们争辩说它承认扩展到其他语言。由于分析,我们已经获得了几种包含最重要词语的超节点。每个超数对应于其中一个审查的文件,后者正在从抛光中写的农业领域发表论文。已经描述并讨论了从该组论文获得的实验结果。使用超节点的图表和jaccard和余弦测量相比,这些结果已经在视觉上说明。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号