Graph Based Feature Augmentation for Short and Sparse Text Classification

机译：基于图的特征增强用于短文本和稀疏文本分类

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Short text classification, such as snippets, search queries, micro-blogs and product reviews, is a challenging task mainly because short texts have insufficient co-occurrence information between words and have a very spare document-term representation. To address this problem, we propose a novel multi-view classification method by combining both the original document-term representation and a new graph based feature representation. Our proposed method uses all documents to construct a neighbour graph by using the shared co-occurrence words. Multi-Dimensional Scaling (MDS) is further applied to extract a low-dimensional feature representation from the graph, which is augmented with the original text features for learning. Experiments on several benchmark datasets show that the proposed multi-view classifier, trained from augmented feature representation, obtains significant performance gain compared to the baseline methods.

机译：短文本分类（例如代码片段，搜索查询，微博和产品评论）是一项具有挑战性的任务，主要是因为短文本在单词之间的共现信息不足，并且具有非常多余的文档术语表示形式。为了解决这个问题，我们提出了一种新颖的多视图分类方法，该方法将原始文档项表示和基于新图的特征表示结合在一起。我们提出的方法使用所有文档通过共享共现单词来构造邻居图。多维比例缩放（MDS）进一步应用于从图中提取低维特征表示，并用原始文本特征进行了扩充以供学习。在几个基准数据集上进行的实验表明，从增强特征表示中训练出来的多视图分类器与基线方法相比，获得了明显的性能提升。

著录项

来源
《International conference on advanced data mining and applications》|2013年|456-467|共12页
会议地点
作者
Guodong Long; Jing Jiang;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Short Text; Text Classification; Graph Based Method; Multi-view Learning; Multi-Dimensional Scaling;

机译：短文字;文字分类;基于图的方法;多视图学习;多维缩放;

相似文献

外文文献
中文文献
专利

1. Kernel Sparse Feature Selection Based on Semantics in Text Classification [J] . Zhantao Deng, Guyu Hu, Zhisong Pan, Information Technology Journal . 2012,第3期

机译：基于语义的文本分类中的核稀疏特征选择
2. Kernel Sparse Feature Selection Based on Semantics in Text Classification [J] . Zhantao Deng, Guyu Hu, Zhisong Pan, Information Technology Journal . 2012,第3期

机译：基于语义的文本分类中的核稀疏特征选择
3. Orthographic features for emotion classification in Chinese in informal short texts [J] . Chen I-Hsuan, Long Yunfei, Lu Qin, Language Resources and Evaluation . 2021,第2期

机译：非正式短文中的情感分类的正交特征
4. Graph Based Feature Augmentation for Short and Sparse Text Classification [C] . Guodong Long, Jing Jiang International conference on advanced data mining and applications . 2013

机译：基于图表的短语和稀疏文本分类的功能增强
5. A Data Augmentation Approach to Short Text Classification. [D] . Rosario, Ryan Robert. 2017

机译：短文本分类的数据增强方法。
6. Depression Disorder Classification of fMRI Data Using Sparse Low-Rank Functional Brain Network and Graph-Based Features [O] . Xin Wang, Yanshuang Ren, Wensheng Zhang 2017

机译：使用稀疏低秩功能脑网络和基于图的功能对fMRI数据进行抑郁障碍分类
7. Boosting Text Classification Performance on Sexist Tweets by Text Augmentation and Text Generation Using a Combination of Knowledge Graphs [O] . Sima Sharifirad, Borna Jafarpour, Stan Matwin 2018

机译：通过使用知识图形的组合，通过文本增强和文本生成提升文本分类性能。

Graph Based Feature Augmentation for Short and Sparse Text Classification

摘要

著录项

相似文献

相关主题

期刊订阅