BOOTSTRAPPING MULTIPLE VARIETIES OF GROUND TRUTH FOR A COGNITIVE SYSTEM

Abstract

Curating high-quality ground truth is an important but difficult part of training a cognitive system. The invention greatly simplifies this process by determining the value that particular training data has in improving existing ground truth. Candidate training data of different types (text, audio, images) is extracted from an interaction log, and each entry is analyzed to arrive at a training value score. The analysis generates multiple component scores which are combined for the final score. The component scores may include a per-feature variability score, a cross-feature variability score, and an accuracy score. A set of the unverified entries may be presented to a user based on the training value scores, and the user can select which of the entries in the set should be included as new ground truths. The ground truths can then be updated by adding the selected entries.
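The scoring and selection flow described above can be sketched in Python. This is only an illustrative sketch, not the patented method: the abstract does not say how the component scores are combined, so the weighted sum, the weight values, and the names `Candidate`, `training_value_score`, `shortlist`, and `update_ground_truth` are all assumptions. It also assumes each component score has already been computed and scaled so that a higher value means the entry is more useful for training.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Candidate:
    """A candidate entry extracted from an interaction log (hypothetical shape)."""
    entry_id: str
    per_feature_variability: float
    cross_feature_variability: float
    accuracy: float


def training_value_score(c, weights=(0.5, 0.25, 0.25)):
    """Combine the three component scores into one training value score.

    Assumption: a weighted sum. The abstract only says the component
    scores "are combined"; the weights here are arbitrary placeholders.
    """
    w_pf, w_cf, w_acc = weights
    return (w_pf * c.per_feature_variability
            + w_cf * c.cross_feature_variability
            + w_acc * c.accuracy)


def shortlist(candidates, k):
    """Return the k highest-scoring candidates to present to the user."""
    ranked = sorted(candidates, key=training_value_score, reverse=True)
    return ranked[:k]


def update_ground_truth(ground_truth, presented, selected_ids):
    """Add only the presented entries the user selected to the ground truth."""
    chosen = [c for c in presented if c.entry_id in selected_ids]
    return list(ground_truth) + chosen


if __name__ == "__main__":
    pool = [
        Candidate("a", 1.0, 0.0, 0.0),
        Candidate("b", 0.0, 1.0, 0.0),
        Candidate("c", 1.0, 1.0, 1.0),
    ]
    presented = shortlist(pool, k=2)
    # The user's choice is simulated here by a fixed set of entry IDs.
    updated = update_ground_truth([], presented, selected_ids={"a"})
    print([c.entry_id for c in presented], [c.entry_id for c in updated])
```

The human selection step is kept as a separate function so that ranking only proposes entries and the ground truth changes only through an explicit user choice, matching the order of steps in the abstract.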
