首页> 外文会议>Language and Technology Conference >Itemsets-Based Amharic Document Categorization Using an Extended A Priori Algorithm

【24h】

Itemsets-Based Amharic Document Categorization Using an Extended A Priori Algorithm

机译：基于项目的AMHaric文档分类使用扩展的先验算法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Document categorization is gaining importance due to the large volume of electronic information which requires automatic organization and pattern identification. Due to the morphological complexity of the language, automatic categorization of Amharic documents has become a difficult talk to carry out. This paper presents a system that categorizes Amharic documents based on the frequency of itemsets obtained after analyzing the morphology of the language. We selected seven categories into which a given document is to be classified. The task of categorization is achieved by employing an extended version of a priori algorithm which had been traditionally used for the purpose of knowledge mining in the form of association rules. The system is tested with a corpus containing Amharic news documents and experimental results are reported.

机译：由于需要自动组织和模式识别的大量电子信息，文档分类是增益的。由于语言的形态复杂性，Amharic文件的自动分类已成为一个难以执行的谈话。本文介绍了一个系统，该系统根据分析语言形态后获得的项目集的频率进行分类。我们选择了七个类别，给定文件将被分类。通过使用传统上用于以关联规则形式的知识挖掘目的的优先算法来实现分类的任务。该系统用含有Amharic新闻文件的语料库进行测试，并报告了实验结果。

著录项

来源
《Language and Technology Conference》|2016年|422p|共10页
会议地点
作者
Abraham Hailu; Yaregal Assabie;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP18-53;
关键词
Amharic language processing; Text categorization; Document classification; A priori algorithm; Itemsets;

机译：Amharic语言处理;文本分类;文档分类;先验算法;项目集;

相似文献

外文文献
中文文献
专利

1. Text Document Categorization using Machine Learning Algorithm in Agricultural Domain [J] . Sreekumar Biswas, Rajni Jain Journal of the Indian Society of Agricultural Statistics . 2018,第1期

机译：用农业域中机器学习算法进行文本文档分类
2. Semi-supervised fuzzy co-clustering algorithm for document categorization [J] . Yang Yan, Lihui Chen, William-Chandra Tjhi Knowledge and information systems . 2013,第1期

机译：用于文档分类的半监督模糊联合聚类算法
3. AN ALGORITHM FOR IDENTIFICATION OF EXTENDED OBJECTS WITH A PRIORI INDETERMINACY OF SIGNAL AND NOISE STATISTICAL CHARACTERISTICS [J] . Ye. A. Lavrentiev, A. A. Shatalov Radioelectronics and Communications Systems . 2002,第11期

机译：一种先验不确定信号和噪声统计特性的扩展对象识别算法
4. Itemsets-Based Amharic Document Categorization Using an Extended A Priori Algorithm [C] . Abraham Hailu, Yaregal Assabie Language and technology conference . 2016

机译：使用扩展的先验算法的基于项目集的阿姆哈拉语文档分类
5. Adaptive algorithms for channel estimation: Using a priori information for optimal design. [D] . Sohail, Muhammad Saqib. 2008

机译：信道估计的自适应算法：使用先验信息进行最佳设计。
6. Automated Amharic News Categorization Using Deep Learning Models [O] . Demeke Endalie, Getamesay Haile 2021

机译：自动化Amharic新闻分类使用深层学习模型
7. Influence of a priori Knowledge on Medical Document Categorization [O] . Senior Member, John Pestian 2012

机译：先验知识对医学文献分类的影响
8. Concept Indexing: A Fast Dimensionality Reduction Algorithm With Applications to Document Retrieval and Categorization. [R] . Karypis, G., Han, E. 2000

机译：概念索引：一种快速降维算法及其在文档检索和分类中的应用。

Itemsets-Based Amharic Document Categorization Using an Extended A Priori Algorithm

摘要

著录项

相似文献

相关主题

期刊订阅