首页> 外国专利> METHOD, APPARATUS AND DEVICE FOR IDENTIFYING TEXT TYPE

METHOD, APPARATUS AND DEVICE FOR IDENTIFYING TEXT TYPE

机译:文本类型识别的方法,装置和设备

摘要

Disclosed are a method, apparatus and device for identifying a text type. One particular embodiment of the method comprises: implementing a pre-process on a pre-acquired text to obtain a key word set of the text (201); calculating an appearance probability value of each key word in the key word set in the text (202); for each key word in the key word set, introducing the key word and the appearance probability value corresponding to the key word into a pre-established document theme generation model and determining an appearance probability value of each pre-arranged theme in the text in the document theme generation model (203); and according to the appearance probability value of each pre-arranged theme in the text, identifying the type to which the text belongs (204). The method improves the accuracy rate for identifying a text type.
机译:公开了一种用于识别文本类型的方法,装置和设备。该方法的一个特定实施例包括:对预先获取的文本执行预处理以获得文本的关键词集(201);计算文本中设置的关键词中每个关键词的出现概率值(202);对于关键词集中的每个关键词,将关键词和该关键词对应的出现概率值引入到预先建立的文档主题生成模型中,并确定文本中每个预先安排的主题在文本中的出现概率值。文档主题生成模型(203);根据所述文本中每个预先安排的主题的出现概率值,确定所述文本所属的类型(204)。该方法提高了识别文本类型的准确率。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号