首页> 外文会议>Database systems for advanced applications >Wikipedia Based Short Text Classification Method
【24h】

Wikipedia Based Short Text Classification Method

机译:基于维基百科的短文本分类方法

获取原文
获取原文并翻译 | 示例

摘要

Short text is usually expressed in refined slightly, insufficient information, which makes text classification difficult. But we can try to introduce some information from the existing knowledge base to strengthen the performance of short text classification. Wikipedia [2,13,15] is now the largest human-edited knowledge base of high quality. It would benefit to short text classification if we can make full use of Wikipedia information in short text classification. This paper presents a new concept based [22] on Wikipedia short text representation method, by identifying the concept of Wikipedia mentioned in short text, and then expand the concept of wiki correlation and short text messages to the feature vector representation.
机译:短文本通常用略微修饰的信息来表示,这使文本分类变得困难。但是我们可以尝试从现有知识库中引入一些信息,以增强短文本分类的性能。维基百科[2,13,15]现在是最大的人工编辑的高质量知识库。如果我们可以在短文本分类中充分利用Wikipedia信息,将对短文本分类有利。本文提出了一种基于维基百科短文本表示方法的新概念[22],通过识别短文本中提到的维基百科的概念,然后将维基关联和短文本消息的概念扩展为特征向量表示。

著录项

  • 来源
  • 会议地点 Suzhou(CN)
  • 作者单位

    School of Software Engineering, South China University of Technology, Guangzhou, China;

    School of Software Engineering, South China University of Technology, Guangzhou, China;

    School of Software Engineering, South China University of Technology, Guangzhou, China;

    Department of Computer Science and Engineering, The Chinese University of Hong Kong, Sha Tin, China;

    School of Software Engineering, South China University of Technology, Guangzhou, China;

  • 会议组织
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Short text classification; Concept; Wikipedia;

    机译:短文本分类;概念;维基百科;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号