【24h】

String Vector based KNN for text categorization

机译:基于字符串向量的KNN用于文本分类

获取原文
获取原文并翻译 | 示例

摘要

In this research, we propose the graph based KNN where a graph is given as input, instead of a numerical vector, as the approach to the text categorization tasks. The ontology which is given as a graph has been used as the popular and standard knowledge representation which is understandable by computers, so it is regarded as more natural scheme to encode texts into graphs, than numerical vectors. In this research, we encode texts into graphs, define the similarity measure between graphs, and modify the K Nearest Neighbor into its graph based version as the text categorization tool. As the benefit from this research, we expect the more compact, graphical, and symbolic representation of texts, than numerical vectors. Therefore, the goal of this research is to implement the text categorization system with the better performance and more user-friendly representations of texts.
机译:在这项研究中,我们提出了一种基于图的KNN,其中将图作为输入而不是数值向量,作为文本分类任务的方法。作为图形给出的本体已被用作计算机可以理解的流行和标准的知识表示,因此,将文本编码成图形比将其编码为图形向量,被认为是更自然的方案。在这项研究中,我们将文本编码为图形,定义图形之间的相似性度量,并将“ K最近邻”修改为基于图形的版本,作为文本分类工具。作为这项研究的收益,我们期望文本比数字矢量更紧凑,图形和符号表示。因此,本研究的目的是实现具有更好性能和更友好的文本表示形式的文本分类系统。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号