Improving Learning in Networked Data by Combining Explicit and Mined Links



Abstract

This paper is about using multiple types of information for classification of networked data in a semi-supervised setting: given a fully described network (nodes and edges) with known labels for some of the nodes, predict the labels of the remaining nodes. One method recently developed for doing such inference is a guilt-by-association model. This method has been independently developed in two different settings: relational learning and semi-supervised learning. In relational learning, the setting assumes that the networked data has explicit links such as hyperlinks between web pages or citations between research papers. The semi-supervised setting assumes a corpus of non-relational data and creates links based on similarity measures between the instances. Both use only the known labels in the network to predict the remaining labels, but they use very different information sources. The thesis of this paper is that if we combine these two types of links, the resulting network will carry more information than either type of link by itself. We test this thesis on six benchmark data sets, using a within-network learning algorithm, and show that we gain significant improvements in predictive performance by combining the links. We describe a principled way of combining multiple types of edges with different edge weights and semantics using an objective graph measure called node-based assortativity. We investigate the use of this measure to combine text-mined links with explicit links and show that using our approach significantly improves the performance of our classifier over naively combining these two types of links.
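The idea in the abstract can be illustrated with a small sketch. This is not the paper's algorithm: `combine_links`, `label_agreement`, and `guilt_by_association` are hypothetical names, the toy graph is invented, and `label_agreement` is only a crude homophily proxy standing in for the node-based assortativity measure, whose exact definition is not given here. The sketch assumes undirected edges with positive weights.

```python
from collections import defaultdict


def combine_links(explicit, mined, w_explicit=1.0, w_mined=1.0):
    """Merge two undirected edge lists of (u, v, weight) into one weighted graph."""
    adj = defaultdict(dict)
    for edges, scale in ((explicit, w_explicit), (mined, w_mined)):
        for u, v, w in edges:
            adj[u][v] = adj[u].get(v, 0.0) + scale * w
            adj[v][u] = adj[v].get(u, 0.0) + scale * w
    return adj


def label_agreement(edges, labels):
    """Weighted share of edges between labeled nodes whose labels match.

    Assumption: a simple stand-in for an assortativity-style score,
    not the measure defined in the paper.
    """
    same = total = 0.0
    for u, v, w in edges:
        if u in labels and v in labels:
            total += w
            if labels[u] == labels[v]:
                same += w
    return same / total if total else 0.0


def guilt_by_association(adj, known, classes, iterations=20):
    """Score each unlabeled node by the weighted average of its neighbors' scores."""
    scores = {n: {c: 1.0 if known[n] == c else 0.0 for c in classes} for n in known}
    unknown = [n for n in list(adj) if n not in known]
    for n in unknown:
        scores[n] = {c: 1.0 / len(classes) for c in classes}
    for _ in range(iterations):
        updated = {}
        for n in unknown:
            total = sum(adj[n].values())
            if total == 0:
                continue  # no usable evidence for this node
            updated[n] = {
                c: sum(w * scores[m][c] for m, w in adj[n].items()) / total
                for c in classes
            }
        scores.update(updated)  # known nodes are never overwritten
    return {n: max(classes, key=lambda c: scores[n][c]) for n in unknown}


# Invented toy data: explicit links (e.g. citations) and mined similarity links.
known = {"a": "X", "b": "X", "e": "Y", "f": "Y"}
explicit = [("a", "b", 1.0), ("b", "c", 1.0), ("e", "f", 1.0), ("a", "e", 1.0)]
mined = [("c", "d", 0.9), ("d", "f", 0.8), ("a", "b", 0.5)]

# Weight each link type by how well it agrees with the known labels.
w_explicit = label_agreement(explicit, known)
w_mined = label_agreement(mined, known)

combined = combine_links(explicit, mined, w_explicit, w_mined)
predictions = guilt_by_association(combined, known, ["X", "Y"])
explicit_only = guilt_by_association(combine_links(explicit, []), known, ["X", "Y"])
print(predictions, explicit_only)
```

In this toy graph, node `d` has no explicit links at all, so the explicit-only network cannot label it, while the combined network reaches it through the mined links. That is the kind of extra information the abstract argues a combined network carries.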


Bibliographic details

Source: AAAI Conference on Artificial Intelligence, 2007, 6 pages
Author: Sofus A. Macskassy
Original format: PDF
CLC classification: Artificial intelligence theory

Similar literature

1. Enriching regulatory networks by bootstrap learning using optimised GO-based gene similarity and gene links mined from PubMed abstracts [J]. Taylor RC, Sanfilippo A, McDermott JE. International Journal of Computational Biology and Drug Design. 2011, No. 1
2. Improving TCP performance for wireless cellular networks by adaptive FEC combined with explicit loss notification [J]. Masahiro Miyoshi, Masashi Sugano, Masayuki Murata. 電子情報通信学会技術研究報告. コミュニケーションクオリティ. Communication Quality. 2002, No. 24
3. Improving TCP performance for wireless cellular networks by adaptive FEC combined with explicit loss notification [J]. Masahiro Miyoshi, Masashi Sugano, Masayuki Murata. 電子情報通信学会技術研究報告. 無線通信システム. Radio Communication Systems. 2002, No. 22
4. Improving Learning in Networked Data by Combining Explicit and Mined Links [C]. Sofus A. Macskassy. AAAI Conference on Artificial Intelligence (AAAI-07); Innovative Applications of Artificial Intelligence Conference (IAAI-07); July 22-26, 2007; Vancouver (CA). 2007
5. Transport, network, and data link layer protocol designs to improve geo-stationary earth orbit satellite data set transmission performance [D]. Wiedemeier, Paul Douglas. 2005
6. Branching principles of animal and plant networks identified by combining extensive data machine learning and modelling [O]. Alexander B. Brummer, Panagiotis Lymperopoulos, Jocelyn Shen. 2021
7. Cross-Layer Explicit Link Status Notification to Improve TCP Performance in Wireless Networks [O]. Ji-Hoon Yun. 2009
