Rule-based Word Clustering for Text Classification

机译：基于规则的词聚类用于文本分类

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper introduces a rule-based, context-dependent word clustering method, with the rules derived from various domain databases and the word text orthographic properties. Besides significant dimensionality reduction, our experiments show that such rule-based word clustering improves by 8% the overall accuracy of extracting bibliographic fields from references, and by 18.32% on average the class-specific performance on the line classification of document headers.

机译：本文介绍了一种基于规则的，上下文相关的词聚类方法，该方法具有从各种域数据库和词文本正字法属性派生的规则。除了大幅减少维度外，我们的实验还表明，这种基于规则的词聚类可将参考书目字段的总体准确度提高8％，平均可提高18.32％的文档标题行分类性能。

著录项

来源
《The Twenty-Sixth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval Jul 28-Aug 1, 2003 Toronto, Canada》|2003年|p.445-446|共2页
会议地点 Toronto(CA);Toronto(CA);Toronto(CA)
作者
Hui Han; Eren Manavoglu; C. Lee Giles; Hongyuan Zha;
展开▼
作者单位

Department of Computer Science and Engineering The Pennsylvania State University University Park, PA, 16802;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类科学、科学研究;
关键词
word clustering; feature dimensionality reduction;

机译：词聚类；特征降维;

相似文献

外文文献
中文文献
专利

1. Rule-based Word Clustering for Text Classification [J] . Hui Han, Eren Manavoglu, C. Lee Giles, ACM SIGIR FORUM . 2003,第Special期

机译：基于规则的词聚类用于文本分类
2. From Image to Text Classification: A Novel Approach based on Clustering Word Embeddings [J] . Andrei M. Butnaru, Radu Tudor Ionescu Procedia Computer Science . 2017,第1期

机译：从图像到文本分类：一种基于聚类词嵌入的新方法
3. Text Sentiment Classification Based on a Genetic Algorithm and Word and Document Co-clustering [J] . E. V. Kotelnikov, M. V. Pletneva Journal of Computer and Systems Sciences International . 2016,第1期

机译：基于遗传算法和词与文档共聚的文本情感分类
4. Rule-based word clustering for text classification [C] . Hui Han, Eren Manavoglu, C. Lee Giles, Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval . 2003

机译：基于规则的词聚类用于文本分类
5. Influence of word sense disambiguation on text classification. [D] . Widlak, Magdalena. 2004

机译：词义歧义化对文本分类的影响。
6. Clinical text classification with rule-based features and knowledge-guided convolutional neural networks [O] . Liang Yao, Chengsheng Mao, Yuan Luo 2019

机译：具有基于规则的功能和知识导向的卷积神经网络的临床文本分类
7. Rule-based Word Clustering for Text Classification [O] . Hui Han, Eren Manavoglu, C. Lee Giles, 2003

机译：基于规则的文本分类词汇聚类

Rule-based Word Clustering for Text Classification

摘要

著录项

相似文献

相关主题

期刊订阅