首页> 外文学位 >Analysis of algorithms in learning theory and network analysis of knowledge bases.
【24h】

Analysis of algorithms in learning theory and network analysis of knowledge bases.

机译:学习理论中的算法分析和知识库的网络分析。

获取原文
获取原文并翻译 | 示例

摘要

This thesis is concerned with problems that arise in learning theory as well as with an investigation, using the tools of network analysis, of a popular commonsense knowledge base that is publicly available online and reasoning with the knowledge that is available in the database. We study the evolvability of monotone conjunctions through an intuitive neighborhood under the uniform distribution, give a structure theorem of best approximations, and improve the previous known results. We show the existence of local optima under nondegenerate product distributions for the same algorithm. Using covariance as the fitness metric we prove a similar structure theorem as well as the evolvability of short monotone conjunctions under these distributions. Using points from the moment curve we show that the VC dimension of halfspaces is under the multiple-instance learning framework, matching the previous known upper bound. This result is also shown to hold over any large point set in general position. Further, it is shown that the hypothesis finding problem is NP-complete. The disagreement coefficient of a concept class for a fixed distribution can be used to give bounds for the label complexity of active learning. We study the disagreement coefficient of monotone conjunctions under the uniform distribution and compute tight, or almost tight, asymptotic lower and upper bounds for every target. This is the first time that the disagreement coefficient is studied for Boolean concept classes. We are investigating the commonsense knowledge base ConceptNet 4 using the tools of network analysis. We identify missing links, spurious links, as well as provide additional knowledge to be added in ConceptNet. Using variants of spreading activation techniques we use the database for question answering and explain, as well as improve, candidate answers for the same questions from a previous study that relied in a low-rank approximation of the adjacency matrix. Aiming to add general rules for further reasoning we mine frequent triples from ConceptNet. This mining approach also identifies incorrect assertions already in ConceptNet. Finally, we also identify rules that may make sense as factual statements about the world but not in terms of natural language usage.
机译:本论文涉及学习理论中出现的问题以及使用网络分析工具对流行的常识性知识库进行的调查,该知识库可在线公开获得,并利用数据库中的可用知识进行推理。我们通过均匀分布下的直观邻域研究单调连词的可演化性,给出最佳逼近的结构定理,并改善先前的已知结果。对于相同的算法,我们展示了在非简并产品分布下局部最优的存在。使用协方差作为适应性度量,我们证明了在这些分布下相似的结构定理以及短单调连词的可演化性。使用力矩曲线中的点,我们表明半空间的VC维在多实例学习框架下,与先前已知的上限匹配。该结果还显示可以保留一般位置上的任何大点。此外,证明了假设发现问题是NP完全的。固定分布的概念类的分歧系数可用于为主动学习的标签复杂度划定界限。我们研究均匀分布下单调连词的不一致系数,并为每个目标计算紧密或几乎紧密的渐近下界和上限。这是首次针对布尔概念类研究分歧系数。我们正在使用网络分析工具来调查常识知识库ConceptNet 4。我们确定丢失的链接,虚假链接,并提供要在ConceptNet中添加的其他知识。使用扩展激活技术的变体,我们使用数据库来回答问题,并解释和改进先前研究中依赖于邻接矩阵的低秩近似的相同问题的候选答案。为了添加进一步的推理规则,我们从ConceptNet挖掘了频繁的三元组。这种挖掘方法还可以识别ConceptNet中已经存在的错误断言。最后,我们还确定了一些规则,这些规则可能是关于世界的事实陈述,而不是自然语言用法。

著录项

  • 作者单位

    University of Illinois at Chicago.;

  • 授予单位 University of Illinois at Chicago.;
  • 学科 Computer Science.
  • 学位 Ph.D.
  • 年度 2013
  • 页码 191 p.
  • 总页数 191
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 遥感技术;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号