首页> 外国专利> NAMED ENTITY RECOGNITION ON CHAT DATA

NAMED ENTITY RECOGNITION ON CHAT DATA

机译:聊天数据上的命名实体识别

摘要

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for receiving a plurality of word strings in a first language, each received word string comprising a plurality of words, identifying one or more named entities in each received word string using a statistical classifier that was trained using training data comprising a plurality of features, wherein one of the features is a word shape feature that comprises a respective token for each letter of a respective word wherein each token signifies a case of the letter or whether the letter is a digit, and translating the received word strings from the first language to a second language including preserving the respective identified named entities in each received word string during translation.
机译:方法,系统和装置,包括在计算机存储介质上编码的计算机程序,用于接收第一语言的多个单词串,每个接收到的单词串包括多个单词,标识每个接收到的单词串中的一个或多个命名实体使用统计分类器,该统计分类器是使用包含多个特征的训练数据进行训练的,其中,特征之一是单词形状特征,其包括针对相应单词的每个字母的相应标记,其中每个标记表示字母的大小写或字母是一个数字,并将接收到的单词字符串从第一语言翻译成第二语言,包括在翻译过程中在每个接收到的单词字符串中保留各自标识的命名实体。

著录项

  • 公开/公告号EP3400536A1

    专利类型

  • 公开/公告日2018-11-14

    原文格式PDF

  • 申请/专利权人 MZ IP HOLDINGS LLC;

    申请/专利号EP20170701607

  • 申请日2017-01-04

  • 分类号G06F17/27;G06F17/28;

  • 国家 EP

  • 入库时间 2022-08-21 12:26:24

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号