Classification Based on Specific Vocabulary

机译：基于特定词汇的分类

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Assuming a binomial distribution for word occurrence, we propose computing a standardized Z score to define the specific vocabulary of a subset compared to that of the entire corpus. This approach is applied to weight terms characterizing a document (or a sample of texts). We then show how these Z score values can be used to derive an efficient categorization scheme. To evaluate this proposition we categorize speeches given by B. Obama as either electoral or presidential. The results tend to show that the suggested classification scheme performs better than a Support Vector Machine scheme, and a Naive Bayes classifier (10-fold cross validation).

机译：假设出现单词的二项式分布，我们建议计算标准化的Z分数，以定义子集与整个语料库相比的特定词汇。此方法适用于表征文档（或文本样本）的权重术语。然后，我们说明如何使用这些Z得分值来得出有效的分类方案。为了评估这一主张，我们将B. Obama的演讲归类为选举或总统演讲。结果倾向于表明，建议的分类方案比支持向量机方案和朴素贝叶斯分类器（10倍交叉验证）更好。

著录项

来源
《2011 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology - Workshops》|2011年|p.120-123|共4页
会议地点
作者
Savoy Jacques; Zubaryeva Olena;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类人工智能理论;
关键词
Lexical Analysis; Machine Learning; Natural Language Processing; Political Discourse; Text Categorization;

机译：词法分析;机器学习;自然语言处理;政治话语;文本分类;

相似文献

外文文献
中文文献
专利

1. Simple and efficient classification scheme based on specific vocabulary [J] . Jacques Savoy, Olena Zubaryeva Computational management science . 2012,第3期

机译：基于特定词汇的简单高效分类方案
2. Spatiotemporal variation of rainfall based on random sample classification and English vocabulary translation of imported products [J] . L. Wang Oceanographic Literature Review . 2021,第9期

机译：基于随机样品分类和进口产品的英语词汇平衡的降雨量的时空变化
3. Integrated visual vocabulary in latent Dirichlet allocation-based scene classification for IKONOS image [J] . Kusumaningrum Retno, Wei Hong, Manurung Ruli, Journal of Applied Remote Sensing . 2014,第Null期

机译：基于潜在狄利克雷分配的IKONOS图像场景分类中的集成视觉词汇
4. Classification Based on Specific Vocabulary [C] . Savoy Jacques, Zubaryeva Olena IEEE/WIC/ACM International Conferences on Web Intelligent and Intelligent Agent Technology . 2011

机译：基于特定词汇的分类
5. Validating a theory-based model of L2 reading comprehension: Relative contributions of content -specific schematic knowledge and L2 vocabulary knowledge to comprehending a science text [D] . Oh, Eunjou 2010

机译：验证基于理论的L2阅读理解模型：特定内容的示意图知识和L2词汇知识对理解科学课本的相对贡献
6. An E-Liquid Flavor Wheel: A Shared Vocabulary Based on Systematically Reviewing E-Liquid Flavor Classifications in Literature [O] . Erna J Z Krüsemann, Sanne Boesveldt, Kees de Graaf, -1

机译：电子液体风味轮子：基于系统地复习文献中电子液体风味分类的共享词汇
7. Simple and efficient classification scheme based on specific vocabulary [O] . Savoy, Jacques, Zubaryeva, Olena 2013

机译：基于特定词汇的简单高效分类方案

Classification Based on Specific Vocabulary

摘要

著录项

相似文献

相关主题

期刊订阅