Improving Learning in Networked Data by Combining Explicit and Mined Links

Abstract

This paper is about using multiple types of information for classification of networked data in a semi-supervised setting: given a fully described network (nodes and edges) with known labels for some of the nodes, predict the labels of the remaining nodes. One method recently developed for doing such inference is a guilt-by-association model. This method has been independently developed in two different settings: relational learning and semi-supervised learning. In relational learning, the setting assumes that the networked data has explicit links such as hyperlinks between web-pages or citations between research papers. The semi-supervised setting assumes a corpus of non-relational data and creates links based on similarity measures between the instances. Both use only the known labels in the network to predict the remaining labels but use very different information sources. The thesis of this paper is that if we combine these two types of links, the resulting network will carry more information than either type of link by itself. We test this thesis on six benchmark data sets, using a within-network learning algorithm, where we show that we gain significant improvements in predictive performance by combining the links. We describe a principled way of combining multiple types of edges with different edge-weights and semantics using an objective graph measure called node-based assortativity. We investigate the use of this measure to combine text-mined links with explicit links and show that using our approach significantly improves performance of our classifier over naively combining these two types of links.
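
The idea of merging two link types and then labeling nodes by guilt-by-association can be illustrated with a minimal sketch. It is not the paper's implementation: the function names, the toy graph, and the per-type multipliers `w_explicit` and `w_mined` are assumptions made for illustration. In particular, the abstract says the edge types are weighted using node-based assortativity, but it does not give the formula, so the multipliers here are plain placeholders, and the classifier is a simple one-pass weighted vote over labeled neighbors.

```python
from collections import defaultdict


def combine_edges(explicit, mined, w_explicit=1.0, w_mined=1.0):
    """Merge two undirected weighted edge lists into one adjacency map.

    Each edge is a (u, v, weight) tuple. The per-type multipliers are
    hypothetical placeholders, not the paper's assortativity-based weights.
    """
    adj = defaultdict(lambda: defaultdict(float))
    for edges, scale in ((explicit, w_explicit), (mined, w_mined)):
        for u, v, w in edges:
            adj[u][v] += scale * w
            adj[v][u] += scale * w
    return adj


def weighted_vote(adj, known, nodes):
    """Give each unlabeled node the label with the largest summed edge
    weight among its labeled neighbors (a single pass, no iteration)."""
    predicted = {}
    for n in nodes:
        if n in known:
            continue
        votes = defaultdict(float)
        for neighbor, w in adj[n].items():
            if neighbor in known:
                votes[known[neighbor]] += w
        if votes:  # nodes with no labeled neighbors stay unlabeled
            predicted[n] = max(votes, key=votes.get)
    return predicted


# Toy data: explicit links (e.g. citations) and mined links (e.g. text similarity).
nodes = ["a", "b", "c", "d"]
known = {"a": "ml", "b": "db"}
explicit = [("c", "a", 1.0), ("c", "b", 1.0), ("d", "b", 1.0)]
mined = [("c", "a", 0.8), ("d", "a", 0.5)]

adj = combine_edges(explicit, mined)
predicted = weighted_vote(adj, known, nodes)
print(predicted)  # {'c': 'ml', 'd': 'db'}
```

In this toy graph the explicit links alone leave node `c` tied between the two labels; the mined link to `a` breaks the tie, which is the kind of added information the abstract argues for.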

Bibliographic details

Source: AAAI Conference on Artificial Intelligence (AAAI-07); Innovative Applications of Artificial Intelligence Conference (IAAI-07); July 22-26, 2007; Vancouver (CA) | 2007 | pp. 590-595 | 6 pages
Conference location: Vancouver (CA)
Author: Sofus A. Macskassy
Author affiliation: Fetch Technologies, 2041 Rosecrans Ave, El Segundo, CA 90245
Original format: PDF
Language: English
Chinese Library Classification: Artificial intelligence theory

Similar literature

1. Enriching regulatory networks by bootstrap learning using optimised GO-based gene similarity and gene links mined from PubMed abstracts. [J]. Taylor RC, Sanfilippo A, McDermott JE. International journal of computational biology and drug design. 2011, No. 1

2. Improving TCP performance for wireless cellular networks by adaptive FEC combined with explicit loss notification. [J]. Masahiro Miyoshi, Masashi Sugano, Masayuki Murata. 電子情報通信学会技術研究報告. コミュニケ-ションクオリティ. Communication Quality. 2002, No. 24

3. Improving TCP performance for wireless cellular networks by adaptive FEC combined with explicit loss notification. [J]. Masahiro Miyoshi, Masashi Sugano, Masayuki Murata. 電子情報通信学会技術研究報告. 無線通信システム. Radio Communication Systems. 2002, No. 22

4. Improving Learning in Networked Data by Combining Explicit and Mined Links. [C]. Sofus A. Macskassy. AAAI Conference on Artificial Intelligence. 2007

5. Transport, network, and data link layer protocol designs to improve geo-stationary earth orbit satellite data set transmission performance. [D]. Wiedemeier, Paul Douglas. 2005

6. Branching principles of animal and plant networks identified by combining extensive data machine learning and modelling. [O]. Alexander B. Brummer, Panagiotis Lymperopoulos, Jocelyn Shen. 2021

7. Cross-Layer Explicit Link Status Notification to Improve TCP Performance in Wireless Networks. [O]. Ji-Hoon Yun. 2009
