首页> 外文期刊>系统工程与电子技术(英文版) >A Comparative Study on Two Techniques of Reducing the Dimension of Text Feature Space
【24h】

A Comparative Study on Two Techniques of Reducing the Dimension of Text Feature Space

机译:减少文本特征空间维度的两种技术的比较研究

获取原文
获取原文并翻译 | 示例
       

摘要

With the development of large-scale text processing, the dimension of text feature space has become larger and larger, which has added a lot of difficulties to natural language processing. How to reduce the dimension has become a practical problem in the field. Here we present two clustering methods, i.e. concept association and concept abstract, to achieve the goal. The first refers to the keyword clustering based on the co-occurrence of keywords in the same text, and the second refers to that in the same category. Then we compare the difference between them. Our experiment results show that they are efficient to reduce the dimension of text feature space.
机译:随着大规模文本处理的发展,文本特征空间的维度变得越来越大,而且对自然语言处理增加了很大的困难。如何降低维度已成为该领域的实际问题。在这里,我们提出了两个聚类方法,即概念协会和概念摘要,实现目标。首先是指基于同一文本中的关键字的共同发生的关键字群集,第二个是指在同一类别中的关键字。然后我们比较它们之间的差异。我们的实验结果表明,减少文本特征空间的维度有效。

著录项

  • 来源
    《系统工程与电子技术(英文版)》 |2002年第1期|87-92|共6页
  • 作者

  • 作者单位

    School of Electronic & Information Technology Shanghai Jiaotong University Shanghai 200030 P.R.China;

    School of Electronic & Information Technology Shanghai Jiaotong University Shanghai 200030 P.R.China;

    School of Electronic & Information Technology Shanghai Jiaotong University Shanghai 200030 P.R.China;

    School of Electronic & Information Technology Shanghai Jiaotong University Shanghai 200030 P.R.China;

  • 收录信息 中国科学引文数据库(CSCD);
  • 原文格式 PDF
  • 正文语种 chi
  • 中图分类 无线电电子学、电信技术;
  • 关键词

    Text data mining; Natural language processing; Keyword clustering;

    机译:文本数据挖掘;自然语言处理;关键字聚类;
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号