首页> 外文期刊>Journal of information and computational science >A Link Strength Based Language Model for Chinese Short-text Categorization
【24h】

A Link Strength Based Language Model for Chinese Short-text Categorization

机译:基于链接强度的中文短文本分类语言模型

获取原文
获取原文并翻译 | 示例

摘要

For the sake of further mining information of high discrimination in Chinese short-text, this paper presents a link strength based language model, which could epurate the intension of the short-text fully by considering term relations and order deeply and remedy the deficiency that the short-text can't express information adequately. The proposed model changes the link structure in traditional language mode that could reduce information loss brought about by conventional categorization model and suits for short text classification. This is implemented by introducing a new scale-cumulative link distribution of depicting the link strength between terms, and it solves the problem that traditional language mode assumes all adjacent terms are related, but in practice it doesn't accord to the real fact. Compared with traditional language model, experimental results show the superiority of the proposed model in Chinese short-text categorization.
机译:为了进一步挖掘中文短文本中的高歧视信息,本文提出了一种基于链接强度的语言模型,该模型可以通过充分考虑术语关系和顺序来充分消除短文本的内涵,并弥补这种缺陷。短文本不能充分表达信息。所提出的模型改变了传统语言模式的链接结构,可以减少传统分类模型带来的信息丢失,适合于短文本分类。这是通过引入一种新的比例累积链接分布来实现的,该链接分布描述了词语之间的链接强度,解决了传统语言模式假定所有相邻词语都相关的问题,但实际上这并不符合实际情况。与传统语言模型相比,实验结果表明了该模型在中文短文本分类中的优越性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号