首页> 外文会议>International conference on artificial intelligence;ICAI 2011 >Contrasting Machine Learning Approaches for Microtext Classification
【24h】

Contrasting Machine Learning Approaches for Microtext Classification

机译:对比机器学习方法进行微文本分类

获取原文

摘要

The goal is classification of microtext: classifying lines of military chat, or posts, which contain items of interest. This paper evaluates non-linear statistical data modeling techniques, and compares with our previous results using several text categorization and feature selection methodologies. The chat posts are examples of 'microtext', or text that is generally very short in length, semi-structured, and characterized by unstructured or informal grammar and language. These three distinct attributes cause different results than traditional long-form free text. In this paper, we further characterize microtext. Highly accurate classification of microtext entries is crucial to facilitate more complex information extraction. Although this study focused specifically on tactical updates via chat, we believe the findings are applicable to content of a similar linguistic structure regardless of domain. This includes other microtext sources such as IM/XMPP, SMS, voice transcriptions, and micro-blogging such as Twitter(tm).
机译:目标是Microtext的分类:分类军事聊天行,或包含感兴趣的项目。本文评估了非线性统计数据建模技术,并使用多个文本分类和特征选择方法与我们以前的结果进行比较。聊天帖子是“MicroText”的示例,或通常非常短,半结构化的文本,并由非结构化或非正式语法和语言表征。这三个不同的属性导致不同的结果而不是传统的长形式自由文本。在本文中,我们进一步表征了Microtext。 Microtext条目的高度准确分类对于促进更复杂的信息提取至关重要。虽然这项研究专门专注于通过聊天的战术更新,但我们相信该发现适用于类似语言结构的内容,无论域如何。这包括其他MicroText来源,例如IM / XMPP,SMS,语音转录和微博(如Twitter)(TM)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号