首页> 外文会议>International Symposium on Methodologies for Intelligent Systems >The Impact of Supercategory Inclusion on Semantic Classifier Performance
【24h】

The Impact of Supercategory Inclusion on Semantic Classifier Performance

机译:超类别包含对语义分类器性能的影响

获取原文

摘要

It is a known phenomenon that text document classifiers may benefit from inclusion of hypernyms of the terms in the document. However, this inclusion may be a mixed blessing because it may fuzzify the boundaries between document classes [5,6,10]. We have elaborated a new type of document classifiers, so called semantic classifiers, trained not on the original data but rather on the categories assigned to the document by our semantic categorizer [1,4], that require significantly smaller corpus of training data and outperforms traditional classifiers used in the domain. With this research we want to clarify what is the advantage/disadvantage of using supercategories of the assigned categories (an analogon of hypernyms) on the quality of classification. In particular we concluded that supercategories should be added with restricted weight, for otherwise they may deteriorate the classification performance. We found also that our technique of aggregating the categories counteracts the fuzzifying of class boundaries.
机译:它是一种已知的现象,文本文本分类器可以从文档中包含术语的复杂性。然而,这种包含可能是混合祝福,因为它可以模糊文档类之间的边界[5,6,10]。我们已经详细阐述了一种新型文档分类器,所以称为语义分类器,不受原始数据的培训,而是通过我们的语义分类程序[1,4]分配给文档的类别,这需要显着较小的培训数据语料库和优于胜过域中使用的传统分类器。通过这项研究,我们希望澄清使用分配类别的超类别(一种超肾上腺素)的优势/缺点在分类的质量上。特别是我们得出结论,超类别应加入限制权重,否则它们可能会恶化分类性能。我们还发现,我们的聚合该类别的技术抵消了类边界的模糊。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号