首页> 外国专利> Thematic segmentation of long content using deep learning and contextual cues

Thematic segmentation of long content using deep learning and contextual cues

机译:使用深度学习和上下文线索对长内容进行主题细分

摘要

A recurrent neural network (RNN) is trained to identify split positions in long content, wherein each split position is a position at which the theme of the long content changes. Each sentence in the long content is converted to a vector that corresponds to the meaning of the sentence. The sentence vectors are used as inputs to the RNN. The high-probability split points determined by the RNN may be combined with contextual cues to determine the actual split point to use. The split points are used to generate thematic segments of the long content. The multiple thematic segments may be presented to a user along with a topic label for each thematic segment. Each topic label may be generated based on the words contained in the corresponding thematic segment.
机译:训练循环神经网络(RNN)以识别长内容中的拆分位置,其中每个拆分位置是长内容主题发生变化的位置。长内容中的每个句子都转换为与句子含义相对应的向量。句子向量用作RNN的输入。由RNN确定的高概率分割点可以与上下文提示结合起来,以确定要使用的实际分割点。分割点用于生成较长内容的主题片段。多个主题片段可以与每个主题片段的主题标签一起呈现给用户。每个主题标签可以基于包含在相应主题片段中的单词来生成。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号