首页> 外国专利> CHINESE CODING METHOD AND APPARATUS BASED ON BIDIRECTIONAL LONG SHORT-TERM MEMORY NETWORK MODEL

CHINESE CODING METHOD AND APPARATUS BASED ON BIDIRECTIONAL LONG SHORT-TERM MEMORY NETWORK MODEL

机译:基于双向长期短时记忆网络模型的中文编码方法和装置

摘要

Disclosed are a Chinese coding method and apparatus based on a bidirectional long short-term memory network model, relating to the technical field of artificial intelligence. The method comprises: converting training linguistic data into character-level data; partitioning the character-level data according to a preset symbol to obtain multiple pieces of first character-level data, and grouping the multiple pieces of first character-level data according to the lengths of the pieces of first character-level data to obtain K data sets; obtaining, according to the K data sets, K trained bidirectional long short-term memory network models; and processing target linguistic data and then inputting same into at least one trained bidirectional long short-term memory network model of the K trained bidirectional long short-term memory network models to obtain a coding result of the target linguistic data. Therefore, the problem of low accuracy of Chinese coding can be solved.
机译:本发明公开了一种基于双向长短期记忆网络模型的中文编码方法及装置,涉及人工智能技术领域。该方法包括:将训练语言数据转换为字符级数据;根据预设符号对所述字符级数据进行划分,得到多个第一字符级数据,根据所述第一字符级数据的长度对所述多个第一字符级数据进行分组,得到K个数据。套根据K个数据集,获得K个训练有素的双向长短期记忆网络模型;处理目标语言数据,然后将其输入到K个训练的双向长短期存储网络模型中的至少一个训练的双向长短期存储网络模型中,以获得目标语言数据的编码结果。因此,可以解决中文编码精度低的问题。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号