首页> 外文会议>Data Mining Workshops, 2009. ICDMW '09 >Parameterized Contrast in Second Order Soft Co-occurrences: A Novel Text Representation Technique in Text Mining and Knowledge Extraction
【24h】

Parameterized Contrast in Second Order Soft Co-occurrences: A Novel Text Representation Technique in Text Mining and Knowledge Extraction

机译:二阶软共现中的参数化对比度:文本挖掘和知识提取中的一种新型文本表示技术

获取原文

摘要

In this article, we present a novel statistical representation method for knowledge extraction from a corpus containing short texts. Then we introduce the contrast parameter which could be adjusted for targeting different conceptual levels in text mining and knowledge extraction. The method is based on second order co-occurrence vectors whose efficiency for representing meaning has been established in many applications, especially for representing word senses in different contexts and for disambiguation purposes. We evaluate our method on two tasks: classification of textual description of dreams, and classification of medical abstracts for systematic reviews.
机译:在本文中,我们提出了一种新颖的统计表示方法,用于从包含短文本的语料库中提取知识。然后,我们介绍了对比度参数,可以针对文本挖掘和知识提取中的不同概念级别进行调整。该方法基于二阶共现向量,其在许多应用中已经确立了表示含义的效率,尤其是在不同上下文中表示词义并为消歧目的。我们在两个任务上评估我们的方法:对梦的文字描述进行分类,以及对系统综述的医学摘要进行分类。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号