首页> 外文会议>ICONIP 2008;International conference on advances in neuro-information processing >Sprinkled Latent Semantic Indexing for Text Classification with Background Knowledge
【24h】

Sprinkled Latent Semantic Indexing for Text Classification with Background Knowledge

机译:具有背景知识的文本分类的潜在隐式语义索引

获取原文
获取外文期刊封面目录资料

摘要

In text classification, one key problem is its inherent dichotomy of polysemy and synonym; the other problem is the insufficient usage of abundant useful, but unlabeled text documents. Targeting on solving these problems, we incorporate a sprinkling Latent Semantic Indexing (LSI) with background knowledge for text classification. The motivation comes from: 1) LSI is a popular technique for information retrieval and it also succeeds in text classification solving the problem of polysemy and synonym; 2) By fusing the sprinkling terms and unlabeled terms, our method not only considers the class relationship, but also explores the unlabeled information. Finally, experimental results on text documents demonstrate our proposed method benefits for improving the classification performance.
机译:在文本分类中,一个关键问题是其固有的一词多义和同义词二分法。另一个问题是大量有用但未加标签的文本文档的使用不足。针对解决这些问题,我们结合了具有背景知识的撒播潜在语义索引(LSI),以进行文本分类。动机来自:1)LSI是一种流行的信息检索技术,它在解决文本多义和同义词问题的文本分类中也取得了成功; 2)通过融合散乱词和未标记词,我们的方法不仅考虑了类关系,而且还探索了未标记信息。最后,文本文件上的实验结果证明了我们提出的方法对改进分类性能的好处。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号