METHOD TO SELECT LEARNING TEXT FOR LANGUAGE MODEL, METHOD TO LEARN LANGUAGE MODEL BY USING THE SAME LEARNING TEXT, AND COMPUTER AND COMPUTER PROGRAM FOR EXECUTING THE METHODS

Abstract

PROBLEM TO BE SOLVED: To provide a technique for efficiently collecting, from a corpus outside the target domain, sentences that resemble those contained in the target-domain corpus.

SOLUTION: A technique for selecting learning text for a language model comprises: generating templates for selecting the learning text by at least one of (a) replacing one or more words in a word string from the corpus of a first domain with a special symbol representing any word or word string and using the resulting word string as a template, and (b) using a word string from the corpus of the first domain as a template; and selecting, as the learning text, text covered by the templates from the corpus of a second domain different from the first domain.

SELECTED DRAWING: Figure 2A
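
The selection idea in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation; several details are assumptions made only for illustration: templates are built from word trigrams, the special symbol (the hypothetical `WILDCARD`) replaces exactly one word and matches exactly one word, and a sentence counts as "covered" when the fraction of its trigrams matching some template reaches a hypothetical `threshold`. The function names `make_templates`, `coverage`, and `select_learning_text` are likewise invented here.

```python
from typing import Iterable, List, Set, Tuple

Template = Tuple[str, ...]
WILDCARD = "<*>"  # hypothetical special symbol standing for any single word


def ngrams(tokens: List[str], n: int) -> List[Template]:
    """Return all contiguous word n-grams of a token list."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def wildcard_variants(gram: Template) -> List[Template]:
    """Replace each single position of the n-gram with the wildcard in turn."""
    return [gram[:j] + (WILDCARD,) + gram[j + 1:] for j in range(len(gram))]


def make_templates(first_domain: Iterable[str], n: int = 3) -> Set[Template]:
    """Build templates from the first-domain corpus.

    Each word string is kept as-is and also stored with one word
    replaced by the wildcard (an assumption: one replacement per template).
    """
    templates: Set[Template] = set()
    for sentence in first_domain:
        for gram in ngrams(sentence.split(), n):
            templates.add(gram)
            templates.update(wildcard_variants(gram))
    return templates


def coverage(sentence: str, templates: Set[Template], n: int = 3) -> float:
    """Fraction of the sentence's n-grams matched by at least one template."""
    grams = ngrams(sentence.split(), n)
    if not grams:
        return 0.0
    matched = sum(
        1 for g in grams
        if g in templates or any(v in templates for v in wildcard_variants(g))
    )
    return matched / len(grams)


def select_learning_text(second_domain: Iterable[str],
                         templates: Set[Template],
                         n: int = 3,
                         threshold: float = 0.5) -> List[str]:
    """Keep second-domain sentences whose coverage reaches the threshold."""
    return [s for s in second_domain if coverage(s, templates, n) >= threshold]


if __name__ == "__main__":
    first = ["please book a flight to tokyo"]
    second = ["i want to book a train to osaka", "the weather is nice today"]
    print(select_learning_text(second, make_templates(first)))
```

With the toy corpora above, only the first second-domain sentence shares enough trigram patterns with the first-domain sentence to be selected. Letting the wildcard stand for a word string of arbitrary length, as the abstract allows, would require a different matching routine than the fixed-length lookup used here.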