Sparse Topical Coding with Sparse Groups

Abstract

Learning latent semantic representations from large corpora of short texts is of great practical significance in research and engineering. However, standard topic models are difficult to apply in microblogging environments: microblogs are short, numerous, noisy, and irregular in form, which prevents topic models from exploiting their full information. In this paper, we propose a novel non-probabilistic topic model called sparse topical coding with sparse groups (STCSG), which can discover sparse latent semantic representations of large short-text corpora. STCSG relaxes the normalization constraint on the inferred representations with sparse group lasso, a sparsity-inducing regularizer, which makes it convenient to directly control the sparsity of document, topic, and word codes. Furthermore, the relaxed non-probabilistic STCSG can be learned effectively with the alternating direction method of multipliers (ADMM). Our experimental results on a Twitter dataset demonstrate that STCSG performs well in finding meaningful latent representations of short documents and can therefore substantially improve the accuracy and efficiency of document classification.
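The abstract names sparse group lasso as the regularizer but gives no formulas. As a rough illustration only, and not the authors' implementation, the sketch below shows the standard proximal operator of a sparse group lasso penalty, lam_l1 * ||x||_1 + lam_group * sum_g ||x_g||_2, for non-overlapping groups. Proximal steps of this kind commonly appear as subproblems in ADMM-style solvers; whether STCSG uses exactly this step is an assumption. The function names, the grouping, and the penalty weights are hypothetical.

```python
import math

def soft_threshold(v, lam):
    """Elementwise soft-thresholding: proximal operator of lam * |x|."""
    return [math.copysign(max(abs(x) - lam, 0.0), x) for x in v]

def prox_sparse_group_lasso(v, groups, lam_l1, lam_group):
    """Proximal operator of lam_l1 * ||x||_1 + lam_group * sum_g ||x_g||_2.

    v      : list of floats (for example, one code vector)
    groups : list of index lists; assumed to be non-overlapping
    """
    # Step 1: the L1 part shrinks each coordinate toward zero.
    u = soft_threshold(v, lam_l1)
    out = [0.0] * len(v)
    # Step 2: the group part shrinks each group's Euclidean norm and
    # zeroes the whole group when that norm is at most lam_group.
    for idx in groups:
        norm = math.sqrt(sum(u[i] ** 2 for i in idx))
        if norm > lam_group:
            scale = 1.0 - lam_group / norm
            for i in idx:
                out[i] = scale * u[i]
    return out

# Hypothetical toy input: two groups of two coordinates each.
code = prox_sparse_group_lasso(
    [3.0, -4.0, 0.5, -0.2],
    groups=[[0, 1], [2, 3]],
    lam_l1=1.0,
    lam_group=1.0,
)
print(code)
```

In this toy input the second group is removed entirely while the first group is only shrunk, which is the two-level sparsity (within groups and across groups) that a sparse group lasso penalty is meant to induce.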

Bibliographic record

Source: International Conference on Web-Age Information Management, 2016, pp. 415-426 (12 pages)
Authors: Min Peng; Qianqian Xie; Jiajia Huang; Jiahui Zhu; Shuang Ouyang; Jimin Huang; Gang Tian
Original format: PDF
Keywords: Document representation; Topic model; Sparse coding; Sparse group lasso

Similar literature

1. Sparse Coding Models Can Exhibit Decreasing Sparseness while Learning Sparse Codes for Natural Images [J]. Joel Zylberberg, Michael Robert DeWeese. PLoS Computational Biology. 2013, no. 8
2. When sparse coding meets ranking: a joint framework for learning sparse codes and ranking scores [J]. Wang Jim Jing-Yan, Cui Xuefeng, Yu Ge. Neural Computing & Applications. 2019, no. 3
3. Image classification based on sparse-coded features using sparse coding technique for aerial imagery: a hybrid dictionary approach [J]. Qayyum Abdul, Malik Aamir Saeed, Saad Naufal M. Neural Computing & Applications. 2019, no. 8
4. Sparse Topical Coding with Sparse Groups [C]. Min Peng, Qianqian Xie, Jiajia Huang. International Conference on Web-Age Information Management. 2016
5. Codes on Graphs and Analysis of Iterative Algorithms for Reconstructing Sparse Signals and Decoding of Check-Hybrid GLDPC Codes [D]. Ravanmehr, Vida. 2015
6. Sparse Coding Models Can Exhibit Decreasing Sparseness while Learning Sparse Codes for Natural Images [O]. Joel Zylberberg, Michael Robert DeWeese. 2013
7. Sparse coding models can exhibit decreasing sparseness while learning sparse codes for natural images [O]. Joel Zylberberg, Michael Robert DeWeese. 2013
