International Semantic Web Conference

Optimize First, Buy Later: Analyzing Metrics to Ramp-Up Very Large Knowledge Bases



Abstract

As knowledge bases move into the landscape of larger ontologies and have terabytes of related data, we must work on optimizing the performance of our tools. We are easily tempted to buy bigger machines or to fill rooms with armies of little ones to address the scalability problem. Yet, careful analysis and evaluation of the characteristics of our data - using metrics - often leads to dramatic improvements in performance. Firstly, are current scalable systems scalable enough? We found that for large or deep ontologies (some as large as 500,000 classes) it is hard to say because benchmarks obscure the load-time costs for materialization. Therefore, to expose those costs, we have synthesized a set of more representative ontologies. Secondly, in designing for scalability, how do we manage knowledge over time? By optimizing for data distribution and ontology evolution, we have reduced the population time, including materialization, for the NCBO Resource Index, a knowledge base of 16.4 billion annotations linking 2.4 million terms from 200 ontologies to 3.5 million data elements, from one week to less than one hour for one of the large datasets on the same machine.

