The maximum entropy model is a probability estimation technique widely used for a variety of natural language tasks. It offers a clean and flexible framework for combining diverse pieces of contextual information to estimate the probability of a linguistic phenomenon. For many NLP tasks this approach performs at near state-of-the-art levels, or outperforms competing probabilistic methods when trained and tested under similar conditions. In this paper, we apply the maximum entropy model to text categorization. We compare and analyze its categorization performance under different approaches to text feature generation, different numbers of features, and different smoothing techniques. In our experiments we also compare it to Bayes, KNN, and SVM, and show that it outperforms Bayes and is comparable with KNN and SVM. We believe it is a promising technique for text categorization.
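To make the setup concrete, the sketch below shows a minimal maximum entropy text categorizer. It is an illustration under assumptions, not the paper's implementation: it uses scikit-learn's LogisticRegression (multinomial logistic regression is the standard parameterization of a conditional maximum entropy model), bag-of-words counts as the contextual features, and a toy corpus with hypothetical labels in place of a real evaluation set.

```python
# Minimal maximum-entropy text categorization sketch, assuming a
# bag-of-words representation. The model estimates
#   p(c | d) = exp(sum_i lambda_i * f_i(d, c)) / Z(d),
# which is exactly multinomial logistic regression.
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

# Toy training data (hypothetical); a real study would use a
# standard categorization corpus.
docs = [
    "the match ended in a draw",
    "the stock price fell sharply",
    "the team won the championship",
    "investors bought shares today",
]
labels = ["sports", "finance", "sports", "finance"]

# Word counts act as the feature functions f_i(d, c); the L2 penalty
# (strength controlled by C) plays a role analogous to the Gaussian-prior
# smoothing discussed for maximum entropy models.
model = make_pipeline(
    CountVectorizer(),
    LogisticRegression(C=1.0),
)
model.fit(docs, labels)

print(model.predict(["the team bought a new player"]))
print(model.predict_proba(["shares fell after the match"]))
```

The same structure extends to the comparisons in the paper: swapping the final estimator for a naive Bayes, KNN, or SVM classifier while keeping the feature pipeline fixed gives the "similar conditions" under which the methods are contrasted.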