首页> 外文期刊>Chaos >Network-based stochastic competitive learning approach to disambiguation in collaborative networks
【24h】

Network-based stochastic competitive learning approach to disambiguation in collaborative networks

机译:基于网络的随机竞争学习方法可消除协作网络中的歧义

获取原文
获取原文并翻译 | 示例
       

摘要

Many patterns have been uncovered in complex systems through the application of concepts and methodologies of complex networks. Unfortunately, the validity and accuracy of the unveiled patterns are strongly dependent on the amount of unavoidable noise pervading the data, such as the presence of homonymous individuals in social networks. In the current paper, we investigate the problem of name disambiguation in collaborative networks, a task that plays a fundamental role on a myriad of scientific contexts. In special, we use an unsupervised technique which relies on a particle competition mechanism in a networked environment to detect the clusters. It has been shown that, in this kind of environment, the learning process can be improved because the network representation of data can capture topological features of the input data set. Specifically, in the proposed disambiguating model, a set of particles is randomly spawned into the nodes constituting the network. As time progresses, the particles employ a movement strategy composed of a probabilistic convex mixture of random and preferential walking policies. In the former, the walking rule exclusively depends on the topology of the network and is responsible for the exploratory behavior of the particles. In the latter, the walking rule depends both on the topology and the domination levels that the particles impose on the neighboring nodes. This type of behavior compels the particles to perform a defensive strategy, because it will force them to revisit nodes that are already dominated by them, rather than exploring rival territories. Computer simulations conducted on the networks extracted from the arXiv repository of preprint papers and also from other databases reveal the effectiveness of the model, which turned out to be more accurate than traditional clustering methods.
机译:通过应用复杂网络的概念和方法,在复杂系统中发现了许多模式。不幸的是,公开模式的有效性和准确性在很大程度上取决于数据中不可避免的噪声数量,例如社交网络中同名个人的存在。在当前的论文中,我们研究了协作网络中名称歧义消除的问题,该任务在众多科学环境中都起着基本作用。特别地,我们使用一种无​​监督技术,该技术依赖于网络环境中的粒子竞争机制来检测集群。已经表明,在这种环境中,由于数据的网络表示可以捕获输入数据集的拓扑特征,因此可以改善学习过程。具体来说,在提出的消歧模型中,一组粒子被随机生成到构成网络的节点中。随着时间的流逝,粒子采用由随机和优先行走策略的概率凸混合组成的运动策略。在前者中,行走规则仅取决于网络的拓扑结构,并且负责粒子的探索行为。在后者中,步行规则既取决于粒子施加于相邻节点的拓扑结构,也取决于粒子的控制级别。这种行为迫使粒子执行防御策略,因为它将迫使它们重新访问已经由其控制的节点,而不是探索敌对领土。在从预印本的arXiv存储库以及其他数据库中提取的网络上进行的计算机仿真显示了该模型的有效性,事实证明,该模型比传统的聚类方法更为准确。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号