Using Maximum Entropy Model for Chinese Text Categorization

Asia-Pacific Web Conference (APWeb 2004), 14-17 April 2004, Hangzhou, China


Abstract

The Maximum Entropy Model is a probability estimation technique widely used for a variety of natural language tasks. It offers a clean and flexible framework for combining diverse pieces of contextual information to estimate the probability of a given linguistic phenomenon. For many NLP tasks, this approach performs at a near state-of-the-art level, or outperforms other competing probabilistic methods when trained and tested under similar conditions. In this paper, we use the maximum entropy model for text categorization. We compare and analyze its categorization performance using different approaches to text feature generation, different numbers of features, and smoothing techniques. Moreover, in experiments we compare it to Bayes, KNN and SVM, and show that its performance is higher than that of Bayes and comparable with KNN and SVM. We consider it a promising technique for text categorization.
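
The maximum entropy classifier described above estimates P(c|d) proportional to exp(sum_i lambda_i * f_i(d, c)) over contextual features f_i of a document d and class c. The following is a minimal sketch of such a setup, assuming a bag-of-words feature representation and scikit-learn's LogisticRegression (a multinomial maximum entropy model); the feature generation, feature count cap, and smoothing (regularization) choices shown here are illustrative assumptions, not the paper's exact configuration.

# Minimal maximum-entropy text classifier sketch (multinomial logistic regression
# over bag-of-words features). All concrete settings below are illustrative.
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

# Toy corpus; for Chinese text, documents would first be segmented into words
# or character n-grams before feature extraction.
docs = ["stock market rises", "team wins the match", "new phone released"]
labels = ["finance", "sports", "tech"]

model = make_pipeline(
    CountVectorizer(max_features=10000),       # feature generation with a cap on the number of features
    LogisticRegression(C=1.0, max_iter=1000),  # maxent model; L2 penalty C acts as smoothing of feature weights
)
model.fit(docs, labels)
print(model.predict(["the match ended in a win"]))
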
