A cross training corrective approach for web page classification

Abdelbadie B.; Mohammed B.

Abstract

© Technomathematics Research Foundation. Textual document classification is a challenging area of data mining, and web page classification is one type of it. However, the text in web pages is not homogeneous, since a single page can discuss related but different subjects. As a result, a textual classifier performs worse on web pages than on plain textual documents, and its output needs a technique to correct it. One category of techniques addresses this problem by using the hidden underlying information of the test set to correct the classes assigned by a textual classifier. In this paper, we propose a method in this category: a Cross Training based Corrective approach (CTC) for web page classification, which learns information from the test set in order to fix the classes that a text classifier initially assigned on that test set. This adjustment leads to a significant improvement in classification results. We tested our approach with three traditional classification algorithms, Support Vector Machine (SVM), Naïve Bayes (NB) and K Nearest Neighbors (KNN), on four subsets of the Open Directory Project (ODP). Results show that our collective and corrective approach, when applied after SVM, NB or KNN, enhances their classification results by up to 12.39.
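
The abstract does not spell out the CTC algorithm, so the following is only a minimal sketch of the general idea it describes: a base classifier labels the test set, and those labels are then revised using information learned from the test set itself. Everything here is an assumption made for illustration: the nearest-centroid classifier stands in for SVM, NB or KNN, the two-fold alternating split is one possible reading of "cross training", and the function names (`fit_centroids`, `predict`, `cross_training_correct`) and toy documents are hypothetical.

```python
from collections import Counter, defaultdict
import math

def vectorize(text):
    """Bag-of-words term counts for a lowercased, whitespace-tokenized text."""
    return Counter(text.lower().split())

def cosine(a, b):
    """Cosine similarity between two term-count vectors (0.0 if either is empty)."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0

def fit_centroids(docs, labels):
    """Sum the term counts of each class into one centroid vector per class."""
    centroids = defaultdict(Counter)
    for doc, label in zip(docs, labels):
        centroids[label].update(vectorize(doc))
    return centroids

def predict(centroids, docs):
    """Assign each document to the most similar centroid (ties go to the first class in sorted order)."""
    classes = sorted(centroids)
    return [max(classes, key=lambda c: cosine(centroids[c], vectorize(d))) for d in docs]

def cross_training_correct(test_docs, initial_labels):
    """Hypothetical corrective step: re-label each half of the test set
    with a model trained on the other half and its initial labels."""
    corrected = list(initial_labels)
    folds = [range(0, len(test_docs), 2), range(1, len(test_docs), 2)]
    for train_idx, fix_idx in (folds, folds[::-1]):
        model = fit_centroids([test_docs[i] for i in train_idx],
                              [initial_labels[i] for i in train_idx])
        for i, label in zip(fix_idx, predict(model, [test_docs[i] for i in fix_idx])):
            corrected[i] = label
    return corrected

# Toy data (invented for illustration only).
train_docs = ["football goal match", "stock market bank"]
train_labels = ["sports", "finance"]
test_docs = [
    "football goal derby",
    "football match replay",
    "bank stock prices",
    "market bank report",
    "stock market news",
    "derby tonight replay",  # shares no term with the training set
]

initial = predict(fit_centroids(train_docs, train_labels), test_docs)
corrected = cross_training_correct(test_docs, initial)
print(initial)
print(corrected)
```

In this toy run the last test document shares no vocabulary with the training set, so the base classifier can only guess; the vocabulary it shares with other test documents is what allows the second pass to revise its label. The sketch is fragile by design (each fold must contain every class, and noisy initial labels can propagate), which is the kind of problem a real corrective method has to handle.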

Bibliographic record

Source: International Journal of Computer Science and Applications, 2015, No. 1, pp. 40-47
Authors: Abdelbadie B.; Mohammed B.
Author affiliation: Computer Science Laboratory (LRI), Computer Science Department, Faculty of Science, Mohammed V-Agdal University
Original format: PDF
Language: English
Keywords: Corrective approach; KNN; Naïve Bayes; SVM; Web page classification
