首页> 外文会议>International conference on artificial intelligence >Encoding Words into String Vectors for Word Categorization
【24h】

Encoding Words into String Vectors for Word Categorization

机译:将单词编码为字符串向量以进行单词分类

获取原文

摘要

In this research, we propose the string vector based K Nearest Neighbor as the approach to the -word : categorization. In the previous works on the text categoriza-tion, it was successful to encode texts into string vectors, by preventing the demerits from encoding them into numerical vectors; it provides the motivation for doing this research. In this research, we encode words into string vectors instead of texts, define the semantic operation between the string vectors, and modify the K Nearest Neighbor into the string 1 vector based version as the approach to the word categorization. As the benefits from this research, we expect the improved performance by avoiding problems in encoding texts or words into numerical vectors and more compact representations than numerical vectors. Hence, the goal of this research is to implement the word categorization system with its better performance and more compact representation of words.
机译:在这项研究中,我们提出了基于字符串向量的K最近邻作为-word:分类的方法。在先前关于文本分类的工作中,通过防止将缺点编码为数字矢量,成功地将文本编码为字符串矢量。它为进行这项研究提供了动力。在这项研究中,我们将单词编码为字符串向量而不是文本,定义字符串向量之间的语义运算,并将K最近邻居修改为基于字符串1向量的版本,作为单词分类的方法。作为这项研究的收益,我们期望通过避免在将文本或单词编码为数值向量以及比数值向量更紧凑的表示形式方面避免问题来提高性能。因此,本研究的目的是实现具有更好的性能和更紧凑的单词表示的单词分类系统。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号