SYSTEMS AND METHODS FOR WORD SEGMENTATION BASED ON A COMPETING NEURAL CHARACTER LANGUAGE MODEL

Abstract

A computer-implemented method and system for word segmentation, the method comprising: receiving a plurality of characters for segmentation; converting each character of the plurality of characters into an embedding vector using an embedding model; feeding the embedding vectors to a forward language model to obtain a first output vector; feeding the embedding vectors to a reverse language model to obtain a second output vector; comparing the embedding vector with the first output vector and the second output vector; and segmenting the plurality of characters based on the comparison, wherein comparing the embedding vector with the first output vector and the second output vector includes determining an inverse exponent of the Euclidean distance between the embedding vector and each of the first output vector and the second output vector.
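The steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the abstract does not say how the comparison is turned into a segmentation decision, so the rule used here (a character joins the current word when the forward model's score is at least the reverse model's score, and starts a new word otherwise) is an assumption. The names `similarity`, `segment`, `embed`, `forward_lm`, and `reverse_lm` are hypothetical, and the toy models at the bottom are simple stand-ins for trained neural language models.

```python
import math
from typing import Callable, List, Sequence

Vector = Sequence[float]


def similarity(a: Vector, b: Vector) -> float:
    """Inverse exponent of the Euclidean distance: exp(-||a - b||)."""
    return math.exp(-math.dist(a, b))


def segment(
    chars: str,
    embed: Callable[[str], Vector],
    forward_lm: Callable[[List[Vector]], Vector],
    reverse_lm: Callable[[List[Vector]], Vector],
) -> List[str]:
    """Split `chars` into words by letting two language models compete.

    ASSUMPTION (not stated in the abstract): a character is attached to the
    current word when the forward model predicts it at least as well as the
    reverse model; otherwise it starts a new word.
    """
    if not chars:
        return []
    vectors = [embed(c) for c in chars]
    words = [chars[0]]
    for i in range(1, len(chars)):
        left = vectors[:i]              # context read left to right
        right = vectors[i + 1:][::-1]   # context read right to left
        fwd_score = similarity(vectors[i], forward_lm(left))
        # With no right context the reverse model cannot compete.
        rev_score = similarity(vectors[i], reverse_lm(right)) if right else 0.0
        if fwd_score >= rev_score:
            words[-1] += chars[i]
        else:
            words.append(chars[i])
    return words


# Toy stand-ins (hypothetical): 1-D embeddings in which characters of the
# same word sit one unit apart.
TOY_EMBEDDINGS = {"a": [0.0], "b": [1.0], "c": [5.0], "d": [6.0]}


def toy_embed(ch: str) -> Vector:
    return TOY_EMBEDDINGS[ch]


def toy_forward_lm(context: List[Vector]) -> Vector:
    # Predicts that the next character lies one unit above the last one seen.
    return [context[-1][0] + 1.0]


def toy_reverse_lm(context: List[Vector]) -> Vector:
    # Predicts that the previous character lies one unit below the last one seen.
    return [context[-1][0] - 1.0]


print(segment("abcd", toy_embed, toy_forward_lm, toy_reverse_lm))  # ['ab', 'cd']
```

In the toy run, the forward model predicts `b` and `d` exactly, so they attach to the preceding character, while the reverse model predicts `c` better than the forward model does, so `c` opens a new word.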

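Bibliographic data

  • Publication number: KR20210145700A

    Patent type:

  • Publication date: 2021-12-02

    Original format: PDF

  • Applicant/patentee: 쿠팡 주식회사

    Application number: KR20210160488

  • Inventors: 위 슈시; 리 징

    Filing date: 2021-11-19

  • Classification: G06F40/279; G06F16/31; G06F16/33; G06F16/35; G06F40/268; G06N3/08

  • Country: KR

  • Database entry time: 2022-08-24 22:36:18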
