首页> 外国专利> TRAINING METHOD AND APPARATUS FOR TEXT SIMILARITY RECOGNITION MODEL, AND RELATED DEVICE

TRAINING METHOD AND APPARATUS FOR TEXT SIMILARITY RECOGNITION MODEL, AND RELATED DEVICE

机译:文本相似度识别模型的培训方法和装置,以及相关设备

摘要

The present application relates to the technical field of text recognition in artificial intelligence. Provided are a training method and apparatus for a text similarity recognition model, and a related device. The method comprises: obtaining a plurality of first sample groups comprising first text samples and second text samples; using as third text samples elements having a literal similarity to the first text samples that reaches a preset threshold; labelling the third text samples to obtain negative text samples, and forming a plurality of second sample groups; representing each sample in each second sample group with a representation vector; calculating a first similarity and a second similarity; and adjusting parameters according to the first similarity and the second similarity, and repeatedly obtaining representation vectors up to the present step so as to obtain a trained text similarity recognition model. By implementing the present application, the problem in the prior art that text similarity recognition methods are low in recognition accuracy can be solved; moreover, the present application further relates to blockchain technology, and the first sample groups and the second sample groups can be stored in a blockchain node.
机译:本申请涉及人工智能中文本识别技术领域。提供了文本相似度识别模型和相关设备的训练方法和装置。该方法包括:获得包括第一文本样本和第二文本样本的多个第一样本组;使用作为第三文本样本元素,其具有与达到预设阈值的第一文本样本的文字相似性;标记第三文本样本以获得阴性文本样本,并形成多个第二样品组;表示每个第二样本组中的每个样本,具有表示向量;计算第一个相似性和第二个相似性;根据第一相似性和第二相似度调整参数,并重复地获得对当前步骤的表示向量,以便获得训练的文本相似度识别模型。通过实现本申请,可以解决现有技术中的问题,即可以解决文本相似性识别方法的识别精度低;此外,本申请还涉及区块技术,并且第一样本组和第二样本组可以存储在块链节节节点中。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号