
CROSS-LINGUAL TEXT CLASSIFICATION USING CHARACTER EMBEDDED DATA STRUCTURES


Abstract

A device may be configured to obtain text from a document. The device may perform embedding to obtain a data structure indicating probabilities associated with characters included in the text, and may apply a first convolution to the data structure to obtain different representations of those characters. In addition, the device may apply parallel convolutions to the different representations to obtain multiple sets of character representations, subsample the multiple sets, and pool the subsampled sets into a merged data structure. The device may provide the merged data structure to a fully connected layer of a convolutional neural network to produce data representing features of the text, and may provide that data to an inference layer of the convolutional neural network, which outputs data indicating a classification for the text.
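The pipeline in the abstract can be sketched roughly as follows. This is a minimal, untrained illustration in plain Python, not the patented implementation. The abstract gives no dimensions, kernel widths, or activation functions, so every name and value here (`EMBED_DIM`, `BRANCH_WIDTHS`, `classify`, the random weights, ReLU, stride-2 subsampling, max-over-time pooling followed by concatenation) is an assumption chosen for illustration. The embedding is a randomly initialised per-character vector rather than the probability-based structure the abstract mentions.

```python
import math
import random

rng = random.Random(0)  # fixed seed so the untrained sketch is repeatable

# All sizes below are assumptions; the abstract does not specify them.
EMBED_DIM, FILTERS, HIDDEN, CLASSES = 8, 4, 6, 3
FIRST_WIDTH = 3
BRANCH_WIDTHS = (2, 3, 4)
MIN_LEN = FIRST_WIDTH + max(BRANCH_WIDTHS) - 1  # shortest text every branch can process

def rand_matrix(rows, cols):
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]

def embed(text, table):
    # Hypothetical embedding: one lazily created vector per character, so no
    # language-specific tokenizer or vocabulary is required.
    for ch in text:
        if ch not in table:
            table[ch] = [rng.uniform(-0.5, 0.5) for _ in range(EMBED_DIM)]
    return [table[ch] for ch in text]

def conv1d(seq, kernels, width):
    # seq is a list of T vectors; each kernel is a flat list covering `width` vectors.
    out = []
    for t in range(len(seq) - width + 1):
        window = [x for vec in seq[t:t + width] for x in vec]
        out.append([max(0.0, sum(w * x for w, x in zip(k, window))) for k in kernels])
    return out

def subsample(seq, stride=2):
    return seq[::stride]  # keep every `stride`-th position

def max_pool(seq):
    return [max(channel) for channel in zip(*seq)]  # max over time, per channel

def dense(vec, weights):
    return [sum(w * x for w, x in zip(row, vec)) for row in weights]

def softmax(logits):
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

# Untrained, randomly initialised parameters (illustrative only).
table = {}
first_kernels = rand_matrix(FILTERS, FIRST_WIDTH * EMBED_DIM)
branch_kernels = {w: rand_matrix(FILTERS, w * FILTERS) for w in BRANCH_WIDTHS}
fc_weights = rand_matrix(HIDDEN, FILTERS * len(BRANCH_WIDTHS))
out_weights = rand_matrix(CLASSES, HIDDEN)

def classify(text):
    text = text.ljust(MIN_LEN, "\0")              # pad short inputs (assumption)
    chars = embed(text, table)                    # character embedding
    reps = conv1d(chars, first_kernels, FIRST_WIDTH)  # first convolution
    merged = []
    for w in BRANCH_WIDTHS:                       # parallel convolutions
        branch = conv1d(reps, branch_kernels[w], w)
        merged.extend(max_pool(subsample(branch)))    # subsample, pool, merge
    features = [max(0.0, h) for h in dense(merged, fc_weights)]  # fully connected layer
    return softmax(dense(features, out_weights))  # inference layer: class probabilities

for sample in ("Invoice overdue", "发票已逾期"):
    print(sample, classify(sample))
```

Because the input is handled one character at a time, the same function accepts text in any script. With random weights the printed probabilities carry no meaning; a real system would learn the parameters from labelled documents.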
