首页> 外国专利> METHOD AND SYSTEM OF ADDING PUNCTUATION AND ESTABLISHING LANGUAGE MODEL

METHOD AND SYSTEM OF ADDING PUNCTUATION AND ESTABLISHING LANGUAGE MODEL

机译:建立和建立语言模型的方法和系统

摘要

A method of processing information content based on a language model is performed at a computer. The method includes the following steps: identifying a plurality of expressions in the information content that is queued to be processed; dividing the plurality of expressions into a plurality of characteristic units according to semantic features and predetermined characteristics associated with each of the plurality of characteristic units, each characteristic unit including a subset of the plurality of expressions and the predetermined characteristics at least including a respective integer number of expressions that are included in the characteristic unit; extracting, from the language model, a plurality of probabilities for a plurality of punctuation marks associated with each of the plurality of characteristic units; and in accordance with the extracted probabilities, associating a respective punctuation mark with each of the plurality of characteristic units included in the information content.
机译:在计算机上执行基于语言模型的处理信息内容的方法。该方法包括以下步骤:在排队等待处理的信息内容中识别多个表达;根据语义特征和与多个特征单元中的每个特征单元相关联的预定特征,将多个表达式划分为多个特征单元,每个特征单元包括多个表达式的子集,并且预定特征至少包括各自的整数特征单元中包含的表达式;从语言模型中,提取与多个特征单元中的每一个相关的多个标点符号的多个概率;根据提取的概率,将各个标点符号与信息内容中包括的多个特征单元中的每一个相关联。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号