相同应用领域,不同时间、地点或设备检测到的数据域不一定完整.文中针对如何进行数据域间知识传递问题,提出相同领域的概率分布差异可用两域最小包含球中心点表示且其上限与半径无关的定理.基于上述定理,在原有支持向量域描述算法基础上,提出一种数据域中心校正的领域自适应算法,并利用人造数据集和KDD CUP 99入侵检测数据集验证该算法.实验表明,这种领域自适应算法具有较好的性能.%The data fields detected from different times,places or devices are not always complete even if they come from the same data resource.To solve the problem of effectively transferring the knowledge between the two fields,the theorem is proposed that the difference between two probability distributions from two domains can be expressed by the center of each domain's minimum enclosing ball and its up limit has nothing to do with the radius.Based on the theorem,a fast center calibration domain adaptive algorithm,center calibration-core sets support vector data description (CC-CSVDD),is proposed for large domain adaptation by modifying the original support vector domain description (SVDD) algorithm.The validity of the proposed algorithm is experimentally verified on the artificial datasets and the real KDD CUP-99 datasets.Experimental results show that the proposed algorithm has good performance.
展开▼