A Generative Model with Network Regularization for Semi-Supervised Collective Classification

机译：具有半监控集体分类的网络正规的生成模型

获取原文

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

In recent years much effort has been devoted to Collective Classification (CC) techniques for predicting labels of linked instances. Given a large number of labeled data, conventional CC algorithms make use of local labeled neighbours to increase accuracy. However, in many real-world applications, labeled data are limited and very expensive to obtain. In this situation, most of the data have no connection to labeled data, and supervision knowledge cannot be obtained from the local connections. Recently, Semi-Supervised Collective Classification (SSCC) has been examined to leverage unlabeled data for enhancing the classification performance of CC. In this paper we propose a probabilistic generative model with network regularization (GMNR) for SSCC. Our main idea is to compute label probability distributions for unlabeled instances by maximizing both the log-likelihood in the generative model and the label smoothness on the network topology of data. The proposed generative model is based on the Probabilistic Latent Semantic Analysis (PLSA) method using attribute features of all instances. A network regularizer is employed to smooth the label probability distributions on the network topology of data. Finally, we develop an effective EM algorithm to compute the label probability distributions for label prediction. Experimental results on three real sparsely-labeled network datasets show that the proposed model GMNR outperforms state-of-the-art CC algorithms and other SSCC algorithms.

机译：近年来，努力致力于用于预测链接实例标签的集体分类（CC）技术。鉴于大量标记数据，传统的CC算法利用本地标记的邻居来提高精度。然而，在许多现实世界应用中，标记的数据有限且非常昂贵。在这种情况下，大多数数据都没有与标记数据的连接，并且无法从本地连接获得监督知识。最近，已经研究了半监督集体分类（SSCC）以利用未标记的数据来提高CC的分类性能。在本文中，我们提出了一种具有网络正规化（GMNR）的概率性生成模型，用于SSCC。我们的主要思想是通过最大化生成模型中的日志似然和数据的网络拓扑上的标签平滑度来计算未标记的实例的标签概率分布。所提出的生成模型基于使用所有实例的属性特征的概率潜在语义分析（PLSA）方法。采用网络规范器来平滑标签概率分布对数据的网络拓扑。最后，我们开发了一种有效的EM算法来计算标签预测的标签概率分布。三个真正稀疏标记的网络数据集的实验结果表明，建议的模型GMNR优于最先进的CC算法和其他SSCC算法。

著录项

来源
《SIAM International Conference on Data Mining》|2014年||共9页
会议地点
作者
Ruichao Shi; Qingyao Wu; Yunming Ye; Shen-Shyang Ho;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP274-53;
关键词

相似文献

外文文献
中文文献
专利

1. Multi-Label Regularized Generative Model for Semi-Supervised Collective Classification in Large-Scale Networks [J] . QingyaoWu, Jian Chen, Shen-Shyang Ho, Big Data Research . 2015,第4期

机译：大型网络中半监督集体分类的多标签正则化生成模型
2. Semi-Supervised Encrypted Traffic Classification With Deep Convolutional Generative Adversarial Networks [J] . Iliyasu Auwal Sani, Deng Huifang Quality Control, Transactions . 2020,第期

机译：具有深度卷积生成对抗网络的半监督加密流量分类
3. Semi-supervised learning with deep convolutional generative adversarial networks for canine red blood cells morphology classification [J] . Kitsuchart Pasupa, Suchat Tungjitnob, Supawit Vatathanavaro Multimedia Tools and Applications . 2020,第45a46期

机译：半监督学习与犬红细胞的深度卷积生成对抗网络形态分类
4. A Generative Model with Network Regularization for Semi-Supervised Collective Classification [C] . Ruichao Shi, Qingyao Wu, Yunming Ye, SIAM International Conference on Data Mining . 2014

机译：具有半监控集体分类的网络正规的生成模型
5. Semi-supervised Regression with Generative Adversarial Networks Using Minimal Labeled Data [D] . Olmschenk, Greg. 2019

机译：使用最小标记数据与生成对抗网络的半监督回归
6. Generative Adversarial Networks-Based Semi-Supervised Automatic Modulation Recognition for Cognitive Radio Networks [O] . Mingxuan Li, Ou Li, Guangyi Liu, 2018

机译：基于生成对抗网络的认知无线电网络半监督自动调制识别
7. A Generative Model with Network Regularization for Semi-Supervised Collective Classification [O] . Ruichao Shi, Qingyao Wu, Yunming Ye, 2014

机译：具有半监控集体分类的网络正规的生成模型

A Generative Model with Network Regularization for Semi-Supervised Collective Classification

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅