首页> 外文会议>International Joint Conference on Artificial Intelligence >An Achievement Test for Knowledge-Based Systems: QUEM
【24h】

An Achievement Test for Knowledge-Based Systems: QUEM

机译:基于知识的系统的成就测试:QUEM

获取原文

摘要

This paper describes QUEM, a method for assessing the skill level of a knowledge-based system based on the quality of the solutions it produces. QUEM is demonstrated by using it to assess the performance of a particular knowledge-based system, P~3. QUEM can be viewed as an achievement or job placement test given to know ledge-based systems to help system designers determine how the system should be used, and in what capacity by what level of users. In general, it is difficult to find useful metrics for assessing a system's overall performance. Most literature on evaluation deals with validation, verification and testing in which the primary concern is the correctness and consistency in the databases and rule-bases. However, these properties alone may not be sufficient to determine how well a system performs its task. QUEM allows software developers to assess their system's performance by constructing a skill function based on human performance data that relates experience and solution quality. QUEM can be used to gauge the experience level of an individual system, compare two systems, or compare a system to its intended users. This represents an important advance in quantitative measures of over-all system performance that can be applied to a broad range of systems.
机译:本文介绍了基于其产生的解决方案的质量评估基于知识的系统的技能水平的方法。通过使用它来评估特定知识的系统,P〜3的性能来证明标准。可以被视为索引的成就或作业位置测试,以了解基于地段的系统,以帮助系统设计人员确定如何使用系统,以及通过什么用户的能力。通常,很难找到评估系统整体性能的有用度量。大多数关于评估的文献涉及验证,验证和测试,其中主要关注的是数据库和规则基础的正确性和一致性。然而,单独的这些属性可能不足以确定系统如何执行任务。 QUEM允许软件开发人员通过基于与人类性能数据构建技能功能来评估其系统的性能,这些功能和解决方法和解决方案质量。可用于衡量单个系统的体验级别,比较两个系统,或将系统与其预期用户进行比较。这代表了可以应用于广泛系统的全部系统性能的定量测量中的重要提前。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号