首页> 外文会议>Fuzzy logic and applications >A Fuzzy Declarative Approach for Classifying Unlabeled Short Texts Using Thesauri
【24h】

A Fuzzy Declarative Approach for Classifying Unlabeled Short Texts Using Thesauri

机译:使用叙词表对未标记短文本进行分类的模糊声明方法

获取原文
获取原文并翻译 | 示例

摘要

The classic approach to text categorisation is based on a learning process that requires a large number of labelled training texts to achieve an accurate performance. The most notable problem is that labelled texts are difficult to generate because categorising shorts texts as snippets or messages must be done by human developers, although unlabelled short texts could be easily collected. In this paper, we present an approach to categorising unlabelled short texts which only require, as user input, the category names defined by means of an ontology of terms modelled by a set of proximity equations. The proposed classifica-tion process is based on the ability of a fuzzy extension of the standard Prolog language named Bousi~Prolog for flexible matching and knowledge representation. This declarative approach provides a text classifier which is fast and easy to build, as well as a classification process that is easy for the user to understand. The results of the experiment showed that the proposed method achieved a reasonably good performance.
机译:文本分类的经典方法基于学习过程,该过程需要大量带标签的训练文本才能实现准确的性能。最显着的问题是,标记的文本难以生成,因为将短裤文本分类为摘要或消息必须由人类开发人员完成,尽管可以轻松地收集未标记的文本。在本文中,我们提出了一种对未标记的短文本进行分类的方法,这些文本仅需要作为用户输入的类别名称,该类别名称是通过一组近似方程建模的术语本体定义的。提出的分类过程基于对标准Prolog语言Bousi〜Prolog的模糊扩展的能力,以实现灵活的匹配和知识表示。这种声明性方法提供了一种快速且易于构建的文本分类器,以及一个易于用户理解的分类过程。实验结果表明,该方法取得了较好的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号