
A TEXT PROCESSING METHOD AND DEVICE BASED ON AMBIGUOUS ENTITY WORDS


Abstract

The present invention proposes a text processing method and apparatus based on ambiguous entity words. The method obtains the context of a text to be disambiguated and at least two candidate entities that the text may denote. It generates a semantic vector of the context using a trained word vector model, generates a first entity vector for each of the candidate entities using a trained unsupervised neural network model, and computes the similarity between the context and each candidate entity to determine the target entity that the text denotes in that context. Because the unsupervised neural network model has already learned the text semantics of each entity and the relationships between entities, the first entity vectors it generates capture both the text semantics of the candidate entities and the relationships between entities. The entity information of the text to be disambiguated is therefore represented completely, and determining the target entity from its similarity to the context semantic vector improves the accuracy of disambiguation.
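The selection step described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: the abstract does not say how the context vector is pooled or which similarity measure is used, so mean pooling of word vectors and cosine similarity are assumptions here. The names `disambiguate`, `WORD_VECTORS`, and `ENTITY_VECTORS` are hypothetical, and the toy vectors stand in for the outputs of the trained word vector model and the trained unsupervised neural network model.

```python
import math


def mean_vector(vectors):
    """Average a non-empty list of equal-length vectors component-wise."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]


def cosine(u, v):
    """Cosine similarity between two vectors (0.0 if either has zero norm)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def disambiguate(context_tokens, word_vectors, entity_vectors):
    """Pick the candidate entity whose vector is most similar to the context.

    Assumption: the context semantic vector is the mean of the word vectors
    of the context tokens; the abstract does not specify the pooling.
    """
    known = [word_vectors[t] for t in context_tokens if t in word_vectors]
    if not known:
        raise ValueError("no context token has a word vector")
    context_vec = mean_vector(known)
    scores = {name: cosine(context_vec, vec) for name, vec in entity_vectors.items()}
    target = max(scores, key=scores.get)
    return target, scores


# Hypothetical toy vectors; real ones would come from the trained models.
WORD_VECTORS = {"fruit": [1.0, 0.0], "juice": [0.9, 0.1], "phone": [0.0, 1.0]}
ENTITY_VECTORS = {"Apple (fruit)": [1.0, 0.1], "Apple (company)": [0.1, 1.0]}

target, scores = disambiguate(["fruit", "juice"], WORD_VECTORS, ENTITY_VECTORS)
print(target)
```

With these toy inputs the context tokens point toward the fruit sense, so that candidate receives the higher score. In the method the abstract describes, the entity vectors would additionally encode relationships between entities, which this sketch does not model.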
