Classifying crop pest data using C4.5 algorithm

机译：使用C4.5算法对农作物有害生物数据进行分类

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Data mining is a way of exploring large preexisting databases in order to generate new information. It is used to find a relationship between the bulky data set which is very helpful in decision making. In agriculture sector, data mining plays an emerging role. Various data mining techniques can be used to protect crops from vertebrate pests, diseases so as to enhance risk on crop cultivation. This paper comprises data pre-processing to remove noisy data in crop pest data that offers better accuracy. Feature selection takes an essential pre-processing step is to reduce the cost of learning by reducing the number of attributes. In this paper Relief and Random Forest Filters are applied for filtering crop pest data set attributes instead of using full attribute set. Relief carries out a selection of instances randomly for calculating the attribute weights. Random forest retains random selection, but provides two straightforward methods such as mean decrease impurity and mean decrease accuracy. Depending upon weights, splitting attributes have been chosen for generating decision tree. This paper proposed C4.5 algorithm that handles crop pest training data with missing values and eliminates overfitting while construction of the tree, that improves the accuracy of the algorithm.

机译：数据挖掘是一种探索大型现有数据库以生成新信息的方法。它用于查找庞大数据集之间的关系，这对决策非常有帮助。在农业领域，数据挖掘起着新兴的作用。可以使用各种数据挖掘技术来保护农作物免受脊椎动物害虫，疾病的侵害，从而增加农作物种植的风险。本文包括数据预处理，以去除农作物有害生物数据中的噪声数据，从而提供更高的准确性。特征选择需要一个基本的预处理步骤，即通过减少属性数量来减少学习成本。在本文中，救济和随机森林过滤器用于过滤农作物有害生物数据集属性，而不是使用完整属性集。救济会随机选择一个实例来计算属性权重。随机林保留了随机选择，但提供了两种直接方法，例如均值降低杂质和均值降低精度。根据权重，已选择拆分属性以生成决策树。本文提出了一种C4.5算法，该算法可以处理缺少值的农作物病虫害训练数据，并消除了树木构建过程中的过拟合现象，提高了算法的准确性。

著录项

来源
《2017 IEEE International Conference on Intelligent Techniques in Control, Optimization and Signal Processing》|2017年|1-6|共6页
会议地点 Srivilliputhur(IN)
作者
R. Revathy; R. Lawrance;
展开▼
作者单位

Department of Computer Science, Ayya Nadar Janaki Ammal College, Sivakasi, Tamil Nadu, India;

Department of Computer Applications, Ayya Nadar Janaki Ammal College, Sivakasi, Tamil Nadu, India;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类
关键词
Agriculture; Data mining; Signal processing algorithms; Impurities; Classification algorithms; Filtering algorithms; Feature extraction;

机译：农业;数据挖掘;信号处理算法;杂质;分类算法;过滤算法;特征提取;;

相似文献

外文文献
中文文献
专利

1. Effect of Missing Data Treatment on the Predictive Accuracy of C4.5 Classifier [J] . Saeed A. Shurrab, Rehab M. Duwairi International Journal on Communications Antenna and Propagation . 2021,第3期

机译：缺少数据处理对C4.5分类器预测准确度的影响
2. A Hybrid Predictive Model Integrating C4.5 and Decision Table Classifiers for Medical Data Sets [J] . Bikash Kanti Sarkar, Amit Kumar Journal of information technology research . 2018,第2期

机译：集成C4.5和决策表分类器的医疗数据集混合预测模型
3. Classification of complete blood count and haemoglobin typing data by a C4.5 decision tree, a naieve Bayes classifier and a multilayer perceptron for thalassaemia screening [J] . Damrongrit Setsirichok, Theera Piroonratana, Waranyu Wongseree, Biomedical signal processing and control . 2012,第2期

机译：通过C4.5决策树，朴素的贝叶斯分类器和多层感知器对地中海贫血进行筛查，对全血细胞计数和血红蛋白分型数据进行分类
4. Classifying crop pest data using C4.5 algorithm [C] . R. Revathy, R. Lawrance IEEE International Conference on Intelligent Techniques in Control, Optimization and Signal Processing . 2017

机译：使用C4.5算法对作物有害生物数据进行分类
5. Algorithms for non-parametric classifiers in multi-relational data mining. [D] . Encarnacion Rivera, Trilce Marie. 2007

机译：多重关系数据挖掘中非参数分类器的算法。
6. Interpretation of Clinical Data Based on C4.5 Algorithm for the Diagnosis of Coronary Heart Disease [O] . Wiharto Wiharto, Hari Kusnanto, Herianto Herianto 2016

机译：基于C4.5算法的临床数据解释对冠心病的诊断
7. COMPARATIVE ANALYSIS OF NAIVE BAYS CLASSIFIER AND DECISION TREE C4.5 ON CREDIT PAYMENT DATA SET [O] . S.B.Siledar . 2017

机译：Naive Bayes分类器和决策树C4.5对信用支付数据集的比较分析

Classifying crop pest data using C4.5 algorithm

摘要

著录项

相似文献

相关主题

期刊订阅