首页> 外文期刊>IBM Journal of Research and Development >Performance of natural language classifiers in a question-answering system
【24h】

Performance of natural language classifiers in a question-answering system

机译:自然语言分类器在问答系统中的表现

获取原文
获取原文并翻译 | 示例
       

摘要

Deep-learning algorithms are being used extensively in question–answering systems based on natural language classifiers to classify an incoming user question into a set of classes with the same answer. We treat a natural language classifier as a black box and study its performance with respect to the ground truth that is used to train and test the system. We have observed that maintaining ground truth is challenging; for example, 1) the number of answer classes can be large (in the several hundreds), 2) manual mapping of questions to answers can result in inconsistent mappings, leading to overlap and confusion among them, and 3) users ask questions within a context that is not apparent by examining the question standalone, leading to erroneous mappings. We propose a methodology for guided evolution of the ground truth, from its initial creation to its ongoing maintenance in the deployed production environment. We measure performance using two metrics: accuracy and confidence. Accuracy measures how many classifications are correct, based on an assessment, while confidence is a raw metric, output by the classifier, which correlates with accuracy. Confidence can further be used to effectively manage the perceived accuracy of the system from a user's perspective, appropriately trading off accuracy versus coverage.
机译:在基于自然语言分类器的问答系统中,广泛使用了深度学习算法,以将传入的用户问题分类为具有相同答案的一组类。我们将自然语言分类器视为黑匣子,并针对用于训练和测试系统的地面事实研究其性能。我们已经观察到保持基本真理是具有挑战性的。例如,1)答案类别的数量可能很大(数百个),2)手动将问题映射到答案可能会导致映射不一致,从而导致它们之间的重叠和混乱,以及3)用户在通过独立地检查问题而看不到的上下文,从而导致错误的映射。我们提出了一种指导性方法,以指导地面事实的发展,从最初的创建到在部署的生产环境中对其进行持续维护。我们使用两个指标来衡量绩效:准确性和置信度。准确度基于评估来衡量多少个正确分类,而置信度是分类器输出的原始指标,与准确度相关。从用户的角度来看,可以进一步使用置信度来有效地管理系统的感知准确性,适当地权衡准确性与覆盖范围。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号