首页> 外国专利> METHOD AND APPARATUS FOR QUESTION-AND-ANSWER DATA ENHANCEMENT, COMPUTER DEVICE, AND STORAGE MEDIUM

METHOD AND APPARATUS FOR QUESTION-AND-ANSWER DATA ENHANCEMENT, COMPUTER DEVICE, AND STORAGE MEDIUM

机译:用于问答数据增强,计算机设备和存储介质的方法和装置

摘要

A method and an apparatus for question-and-answer data enhancement, a computer device, and a storage medium, relating to artificial intelligence technology, and specifically for use in deep learning. The method comprises acquiring a question and answer dataset, the question and answer dataset comprising a plurality of data points and real tags corresponding thereto (S1); on the basis of a pre-trained prediction model and the real tags, performing first soft tag prediction on each data point to obtain first soft tags corresponding to each data point (S2); constructing a soft tag dataset from each data point and the first soft tags corresponding thereto, and use knowledge distillation to generate a labeling model from the soft tag dataset and the prediction model (S3) acquiring a dataset to be tagged, inputting the dataset to be tagged into the labeling model for pre-labeling, and on the basis of labeling results, screening the dataset to be tagged to obtain a labeled sample set (S4). The described method further relates to blockchain technology, and the data in the labeled sample set and the dataset to be tagged are stored in a blockchain. The described method is able to improve efficiency and quality in labeling and tagging.
机译:用于问答数据增强,计算机设备和存储介质的方法和装置,与人工智能技术有关,专门用于深入学习。该方法包括获取问题和应答数据集,该问题和应答数据集包括多个数据点和与其对应的真实标签(S1);基于预先训练的预测模型和真实标签,对每个数据点执行第一软标签预测,以获得与每个数据点对应的第一软标签(S2);从每个数据点和对应的第一软标签构造软标签数据集,并使用知识蒸馏从软标签数据集和预测模型生成标签模型(S3)获取要标记的数据集,输入数据集标记为预标记的标签模型,并在标记结果的基础上,筛选要标记的数据集以获得标记的样本集(S4)。所描述的方法还涉及区块链技术,并且标记的样本集中的数据和要标记的数据集存储在区块链中。所描述的方法能够提高标记和标记中的效率和质量。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号