首页> 外文会议>IEEE International Conference on Systems, Man and Cybernetics >Genre identification of Chinese finance text using machine learning method
【24h】

Genre identification of Chinese finance text using machine learning method

机译:使用机器学习方法的中国金融文本的类型鉴定

获取原文
获取外文期刊封面目录资料

摘要

Document genre information is one of the most distinguishing features in information retrieval, which brings order to the search results. What the genre classification concerned is not the topic but the genre of document. In this paper, we examine the effectiveness of using machine learning techniques to solve genre classification of Chinese text with the same topic, viz. finance. Based on the likelihood ratio test, we present a new method for selecting feature terms, which can improve the performance clearly and perform better than others with up to 80% terms removal. In empirical results with SVMs classifier on the real world corpora, we find that this method can gain a better selecting effect and likelihood ratio is a reliable measure for selecting informative features.
机译:文档类型信息是信息检索中最具区别的功能之一,它为搜索结果带来了命令。有关类型的分类是什么不是主题,而是文件类型。在本文中,我们研究了使用机器学习技术解决中文文本类型的有效性,具有同一主题,viz。金融。基于似然比测试,我们提出了一种选择特征术语的新方法,可以清楚地提高性能,比其他术语更好地表现更好。在现实世界上的SVMS分类器的经验结果中,我们发现该方法可以获得更好的选择效果,并且似然比是选择信息特征的可靠度量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号