IMPLICATIONS OF DISCRETIZATION TOWARDS IMPROVING CLASSIFICATION ACCURACY FOR SOFTWARE DEFECT DATA

POOJA KAPOOR; DEEPAK ARORA; ASHWANI KUMAR

首页> 外文期刊>Journal of Theoretical and Applied Information Technology >IMPLICATIONS OF DISCRETIZATION TOWARDS IMPROVING CLASSIFICATION ACCURACY FOR SOFTWARE DEFECT DATA

【24h】

IMPLICATIONS OF DISCRETIZATION TOWARDS IMPROVING CLASSIFICATION ACCURACY FOR SOFTWARE DEFECT DATA

机译：离散化对提高软件缺陷数据的分类准确性的意义

获取原文

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Since the advent of new software architectures, paradigms and technologies the software design and development has developed a cutting edge requirements of being on the right track in terms of software quality and reliability. This leads the prediction of defects in software at its early stages of its development. Implications of machine learning algorithms are now playing a very crucial role in classification and prediction of the possible bugs during the systems design phase. In this research work a discretization method is proposed based on the Object Oriented metrics threshold values in order to gain better classification accuracy on a given data set. For the experimentation purpose, Jedit, Lucene, tomcat, velocity, xalan and xerces software systems from NASA repositories have been considered and classification accuracies have been compared with the existing approaches with the help of open source WEKA tool. For this study, the Object Oriented CK metrics suite has been considered due to its wide applicability in software industry for software quality prediction. After experimentation it is found that Naive Bayes and Voted Perceptron, classifiers are performing well and provide highest accuracy level with the discretized dataset values. The performance of these classifiers are checked and analyzed on different performance measures like ROC, RMSE, Precision, Recall values in this research work. Result shows significant performance improvements towards classification accuracy if used with discrete features of the individual software systems.

机译：自从新的软件体系结构，范例和技术问世以来，软件设计和开发就在软件质量和可靠性方面处于正确的轨道提出了最前沿的要求。这导致了在软件开发早期阶段对软件缺陷的预测。在系统设计阶段，机器学习算法的含义在分类和预测可能的错误中起着至关重要的作用。在这项研究工作中，提出了一种基于面向对象的度量阈值的离散化方法，以便在给定的数据集上获得更好的分类精度。为了进行实验，已经考虑了来自NASA储存库的Jedit，Lucene，tomcat，speed，xalan和xerces软件系统，并借助开源WEKA工具将分类精度与现有方法进行了比较。在本研究中，已考虑了面向对象的CK度量套件，因为它在软件行业中对软件质量预测的广泛适用性。经过实验发现，朴素贝叶斯（Naive Bayes）和投票的感知器（Voted Perceptron）分类器表现良好，并且使用离散化的数据集值可提供最高的准确性。在这项研究工作中，这些分类器的性能是通过不同的性能指标（例如ROC，RMSE，Precision和Recall值）进行检查和分析的。结果表明，如果与单个软件系统的离散功能一起使用，则可以显着提高分类精度。

著录项

来源
《Journal of Theoretical and Applied Information Technology》 |2017年第24期|共1页
作者
POOJA KAPOOR; DEEPAK ARORA; ASHWANI KUMAR;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类计算技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. A Novel Approach for Converting N-Dimensional Dataset into Two Dimensions to Improve Accuracy in Software Defect Prediction [J] . Rayhanul Islam, Abdus Satter, Atish Kumar Dipongkor, Journal of software . 2020,第6期

机译：一种新的方法，用于将n维数据集转换为两个维度，提高软件缺陷预测精度
2. Improving the Accuracy of Land Use and Land Cover Classification of Landsat Data Using Post-Classification Enhancement [J] . Inakwu O. A. Odeh, Ramita Manandhar, Tiho Ancev Remote Sensing . 2009,第3期

机译：使用后分类增强功能提高Landsat数据的土地利用和土地覆盖分类的准确性
3. A multi-objective evolutionary method for learning granularities based on fuzzy discretization to improve the accuracy-complexity trade-off of fuzzy rule-based classification systems: D-MOFARC algorithm [J] . Michela Fazzolari, Rafael Alcala, Francisco Herrera Applied Soft Computing . 2014,第Null期

机译：一种基于模糊离散化的粒度学习多目标进化方法，以提高基于模糊规则的分类系统的精度-复杂度折衷：D-MOFARC算法
4. An Approach for Improving Classification Accuracy Using Discretized Software Defect Data [C] . Pooja Kapoor, Deepak Arora, Ashwani Kumar International Conference on Advanced Computing, Networking, and Informatics . 2018

机译：使用离散软件缺陷数据提高分类精度的方法
5. Improve Software Defect Estimation with Six Sigma Defect Measures: Empirical Studies with Imputation Techniques on ISBSG Data Repository with a High Ratio of Missing Data [D] . Almakadmeh, Mhammed. 2017

机译：提高六种Sigma缺陷措施的软件缺陷估算：具有高比例的ISBSG数据储存中缺货技术的实证研究
6. Use of Diabetes Data Management Software Reports by Health Care Providers Patients With Diabetes and Caregivers Improves Accuracy and Efficiency of Data Analysis and Interpretation Compared With Traditional Logbook Data [O] . Deborah A. Hinnen, Ann Buskirk, Maureen Lyden, 2015

机译：与传统的日志数据相比医疗保健提供者糖尿病患者和护理人员使用糖尿病数据管理软件报告可提高数据分析和解释的准确性和效率
7. Improve software defect estimation with six sigma defect measures : empirical studies imputation techniques on ISBSG data repository with a high ratio of missing data [O] . Almakadmeh Mhammed 2017

机译：使用六个sigma缺陷度量改进软件缺陷估计：IsBsG数据存储库上的缺陷数据比率高的实证研究插补技术
8. Accuracy Assessment of the Discrete Classification of Remotely-Sensed DigitalData for Landcover Mapping [R] . Senseman, G. M., Bagley, C. F., Tweddale, S. A. 1995

机译：用于土地覆盖的遥感数字数据离散分类的精度评估

IMPLICATIONS OF DISCRETIZATION TOWARDS IMPROVING CLASSIFICATION ACCURACY FOR SOFTWARE DEFECT DATA

摘要

著录项

相似文献

相关主题

期刊订阅