首页> 外文会议>SIAM International Conference on Data Mining >Large-Scale Many-Class Learning

【24h】

Large-Scale Many-Class Learning

机译：大规模的多级学习

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

A number of tasks, such as large-scale text categorization and word prediction, can benefit from efficient learning and classification when the number of classes (categories), in addition to instances and features, is large, that is, in the thousands and beyond. We investigate learning of sparse category indices to address this challenge. An index is a weighted bipartite graph mapping features to categories. On presentation of an instance, the index retrieves and scores a small set of candidate categories. The candidates can then be ranked and the ranking or the scores can be used for category assignment. We present novel online index learning algorithms. When compared to other approaches, including one-versusrest and top-down learning and classification using support vector machines, we find that indexing is highly advantageous in terms of space and time efficiency, at both training and classification times, while yielding similar and often better accuracies. On problems with hundreds of thousands of instances and thousands of categories, the index is learned in minutes, while other methods can take orders of magnitude longer. As we explain, the design of the algorithm makes it convenient to maintain a constraint on the number of prediction connections a feature is allowed to make. This constraint is crucial in yielding efficient learning and classification.

机译：许多任务，例如大规模的文本分类和字预测，可以从高效的学习和分类中受益，当类（类别）的数量之外，除了实例和特征之外，很大，即，在数千和超越中。我们调查稀疏类别指数的学习来解决这一挑战。索引是类别的加权二分钟图映射功能。在演示文稿上，索引检索并分数一小组候选类别。然后可以将候选者排列，排名或分数可用于类别分配。我们提出了小说在线指数学习算法。与其他方法相比，包括使用支持向量机的一个Versyustrest和自上而下的学习和分类，我们发现索引在训练和分类时间方面，在空间和时间效率方面是非常有利的，同时产生相似且经常更好精度。关于数十万个实例和数千类的问题，索引在几分钟内学到，而其他方法可以比较长时间的数量级。如我们解释，算法的设计使得维持对允许的预测连接数量的约束方便。这种约束对于产生有效的学习和分类至关重要。

著录项

来源
《SIAM International Conference on Data Mining 》|2008年|869 p.|共12页
会议地点
作者
Omid Madani; Michael Connor;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP274.2-53;
关键词

相似文献

外文文献
中文文献
专利

1. Classifying many-class high-dimensional fingerprint datasets using random forest of oblique decision trees [J] . Thanh-Nghi Do, Philippe Lenca, Stéphane Lallich Vietnam Journal of Computer Science . 2015 ,第1期

机译：使用倾斜决策树的随机森林对多类高维指纹数据集进行分类
2. Using supervised machine learning on large-scale online forums to classify course-related Facebook messages in predicting learning achievement within the personal learning environment [J] . Jiun-Yu Wu, Yi-Cheng Hsiao, Mei-Wen Nian Interactive Learning Environments . 2020 ,第1a4期

机译：在大型在线论坛上使用受监管机器学习来对课程相关的Facebook消息进行分类，以预测个人学习环境中的学习成就
3. Validating the validation: reanalyzing a large-scale comparison of deep learning and machine learning models for bioactivity prediction [J] . Journal of Computer-Aided Molecular Design . 2020 ,第7期

机译：验证验证：重新分析深度学习和机器学习模型的大规模比较生物活性预测
4. Large-Scale Many-Class Learning [C] . Omid Madani, Michael Connor SIAM International Conference on Data Mining . 2008

机译：大规模的多级学习
5. Open Set Classification for Deep Learning in Large-Scale and Continual Learning Models [D] . Roady, Ryne. 2020

机译：在大规模和持续学习模型中开放集分类
6. Academic Emotion Classification and Recognition Method for Large-scale Online Learning Environment—Based on A-CNN and LSTM-ATT Deep Learning Pipeline Method [O] . Xiang Feng, Yaojia Wei, Xianglin Pan, 2020

机译：大规模在线学习环境的学术情感分类与识别方法-基于A-CNN和LSTM-ATT深度学习流水线方法
7. Large-scale many-class learning [O] . Omid Madani, Michael Connor 2009

机译：大规模多班学习

Large-Scale Many-Class Learning

摘要

著录项

相似文献

相关主题

期刊订阅