
Compressive Feature Learning



Abstract

This paper addresses the problem of unsupervised feature learning for text data. Our method is grounded in the principle of minimum description length and uses a dictionary-based compression scheme to extract a succinct feature set. Specifically, our method finds a set of word k-grams that minimizes the cost of reconstructing the text losslessly. We formulate document compression as a binary optimization task and show how to solve it approximately via a sequence of reweighted linear programs that are efficient to solve and parallelizable. As our method is unsupervised, features may be extracted once and subsequently used in a variety of tasks. We demonstrate the performance of these features over a range of scenarios including unsupervised exploratory analysis and supervised text categorization. Our compressed feature space is two orders of magnitude smaller than the full k-gram space and matches the text categorization accuracy achieved in the full feature space. This dimensionality reduction not only results in faster training times, but it can also help elucidate structure in unsupervised learning tasks and reduce the amount of training data necessary for supervised learning.
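The trade-off the abstract describes, paying for dictionary entries versus paying for pointers that reconstruct the text, can be illustrated with a toy sketch. The paper solves a binary optimization via a sequence of reweighted linear programs; the greedy search below is only an illustrative stand-in for that relaxation, and the unit `pointer_cost` and the greedy longest-match decoder are simplifying assumptions, not details taken from the paper.

```python
def kgrams(words, max_k):
    """All contiguous word k-grams (1 <= k <= max_k) in the document."""
    return {tuple(words[i:i + k])
            for k in range(1, max_k + 1)
            for i in range(len(words) - k + 1)}

def reconstruction_cost(words, dictionary, pointer_cost=1.0):
    """Description length: dictionary storage plus one pointer per
    greedy longest-match step needed to rebuild `words` losslessly.
    Assumes every unigram of `words` is in `dictionary`."""
    cost = sum(len(g) for g in dictionary)
    i = 0
    while i < len(words):
        # take the longest dictionary entry matching at position i
        k = max(k for k in range(1, len(words) - i + 1)
                if tuple(words[i:i + k]) in dictionary)
        cost += pointer_cost
        i += k
    return cost

def compressive_features(words, max_k, pointer_cost=1.0):
    """Greedily add k-grams that lower the total description length.
    A heuristic stand-in for the paper's reweighted-LP relaxation of
    the binary dictionary-selection program."""
    dictionary = {(w,) for w in words}  # unigrams keep decoding lossless
    candidates = kgrams(words, max_k) - dictionary
    best = reconstruction_cost(words, dictionary, pointer_cost)
    improved = True
    while improved:
        improved = False
        for g in sorted(candidates, key=len, reverse=True):
            c = reconstruction_cost(words, dictionary | {g}, pointer_cost)
            if c < best:
                dictionary.add(g)
                best = c
                improved = True
        candidates -= dictionary
    return dictionary, best
```

On a document with a repeated phrase, e.g. `"the quick brown fox the quick brown fox".split()`, the unigram-only dictionary costs 4 entries plus 8 pointers, while adding the 4-gram `("the", "quick", "brown", "fox")` cuts the pointer count to 2, so the greedy search keeps it: the selected k-grams are exactly the compressed feature set the abstract refers to.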
