首页> 外文OA文献 >Prestructuring Multilayer Perceptrons based on Information-Theoretic Modeling of a Partido-Alto-based Grammar for Afro-Brazilian Music: Enhanced Generalization and Principles of Parsimony, including an Investigation of Statistical Paradigms
【2h】

Prestructuring Multilayer Perceptrons based on Information-Theoretic Modeling of a Partido-Alto-based Grammar for Afro-Brazilian Music: Enhanced Generalization and Principles of Parsimony, including an Investigation of Statistical Paradigms

机译:基于非洲 - 巴西音乐的基于partido-alto的语法的信息理论建模的多层感知器的预构建:增强的泛化和简约原则,包括统计范式的调查

摘要

The present study shows that prestructuring based on domain knowledge leads to statistically significant generalization-performance improvement in artificial neural networks (NNs) of the multilayer perceptron (MLP) type, specifically in the case of a noisy real-world problem with numerous interacting variables. The prestructuring of MLPs based on knowledge of the structure of a problem domain has previously been shown to improve generalization performance. However, the problem domains for those demonstrations suffered from significant shortcomings: 1) They were purely logical problems, and 2) they contained small numbers of variables in comparison to most data-mining applications today. Two implications of the former were a) the underlying structure of the problem was completely known to the network designer by virtue of having been conceived for the problem at hand, and b) noise was not a significant concern in contrast with real-world conditions. As for the size of the problem, neither computational resources nor mathematical modeling techniques were advanced enough to handle complex relationships among more than a few variables until recently, so such problems were left out of the mainstream of prestructuring investigations. In the present work, domain knowledge is built into the solution through Reconstructability Analysis, a form of information-theoretic modeling, which is used to identify mathematical models that can be transformed into a graphic representation of the problem domainu27s underlying structure. Employing the latter as a pattern allows the researcher to prestructure the MLP, for instance, by disallowing certain connections in the network. Prestructuring reduces the set of all possible maps (SAPM) that are realizable by the NN. The reduced SAPM--according to the Lendaris-Stanley conjecture, conditional probability, and Occamu27s razor--enables better generalization performance than with a fully connected MLP that has learned the same I/O mapping to the same extent. In addition to showing statistically significant improvement over the generalization performance of fully connected networks, the prestructured networks in the present study also compared favorably to both the performance of qualified human agents and the generalization rates in classification through Reconstructability Analysis alone, which serves as the alternative algorithm for comparison.
机译:本研究表明,基于领域知识的预构造在多层感知器(MLP)类型的人工神经网络(NN)中具有统计上显着的泛化性能改进,特别是在嘈杂的现实世界问题中,其中存在许多交互变量。先前已经显示了基于对问题域的结构的了解而对MLP进行预构造可以提高泛化性能。但是,这些演示的问题领域存在重大缺陷:1)它们纯粹是逻辑问题,2)与当今大多数数据挖掘应用程序相比,它们包含少量变量。前者的两个含义是:a)问题的潜在结构由于已经针对当前问题而完全为网络设计人员所了解,并且b)与实际情况相比,噪声并不是一个重要的问题。至于问题的规模,直到最近,计算资源和数学建模技术都还不足以处理多个变量之间的复杂关系,因此此类问题被排除在结构研究的主流之外。在当前的工作中,通过可重构性分析(一种信息理论模型的形式)将领域知识构建到解决方案中,该知识用于识别可转换为问题域底层结构的图形表示形式的数学模型。采用后者作为一种模式,例如,通过禁止网络中的某些连接,研究人员可以预先构造MLP。预先构造减少了NN可实现的所有可能映射(SAPM)的集合。根据Lendaris-Stanley猜想,条件概率和Occam剃刀的要求,减少的SAPM可以提供比完全连接的MLP更好的泛化性能,而完全连接的MLP在相同程度上学习了相同的I / O映射。除了在统计上显示完全连接的网络的泛化性能有显着改善外,本研究中的预构建网络还仅通过可重构性分析即可与合格人员的性能和泛化率进行比较(仅通过可重构性分析)。比较算法。

著录项

  • 作者

    Vurkaç Mehmet;

  • 作者单位
  • 年度 2011
  • 总页数
  • 原文格式 PDF
  • 正文语种
  • 中图分类

相似文献

  • 外文文献

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号