首页> 外文期刊>Journal of information and computational science >Similarity Measures for Chinese Short Text Based on Representation Learning
【24h】

Similarity Measures for Chinese Short Text Based on Representation Learning

机译:基于表征学习的中文短文本相似性度量

获取原文
获取原文并翻译 | 示例
           

摘要

Similarity measure in Chinese short text is an important prerequisite for many content-based texts or documents retrieval tasks. In this paper, we propose a fast method for representing Chinese short texts to calculate the similarity between texts. The method is based on the representation of Chinese words. First, Chinese word representation is learned by a deep neural network with local context embedding and global context. Then, the words in short text are replaced by the learned representations of Chinese words and the short text is represented by dynamic average-weighted function depending on target text. Next, the cosine similarity method is used for the similarity measurement between texts. Last, experiment shows the semantic by visualizing the result of Chinese word representation learning and the experiment on similarity measure demonstrates the effectiveness of our short text representation method.
机译:中文短文本的相似性度量是许多基于内容的文本或文档检索任务的重要前提。在本文中,我们提出了一种用于表示中文短文本的快速方法,以计算文本之间的相似度。该方法基于中文单词的表示。首先,通过具有局部上下文嵌入和全局上下文的深度神经网络来学习中文单词表示。然后,将短文本中的单词替换为学习过的中文单词表示形式,并根据目标文本,通过动态平均加权函数来表示短文本。接下来,将余弦相似度方法用于文本之间的相似度测量。最后,实验通过可视化汉字表示学习的结果来显示语义,而相似性度量的实验证明了我们的短文本表示方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号