Conference: International Conference on Machine Learning, Optimization, and Data Science
Generating Term Weighting Schemes Through Genetic Programming

Abstract

Term weighting is an important step in text classification: the term-weighting scheme (TWS) determines how documents are represented in the Vector Space Model (VSM). Although state-of-the-art TWSs perform well, many recent works propose new approaches and new TWSs that improve performance further. Moreover, it remains difficult to tell which TWS is best suited to a specific problem. In this paper, we automatically generate new TWSs using evolutionary algorithms, in particular genetic programming (GP). GP evolves and combines different statistical information and generates a new TWS based on the performance of the learning method. We evaluate the generated TWSs on three well-known benchmarks. Our study shows that even formulas generated early are quite competitive with state-of-the-art TWSs and in some cases outperform them.
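
To make the idea of GP combining statistical information into a weighting formula more concrete, here is a minimal sketch in Python. It is not the authors' implementation. The terminal set (`tf`, `df`, `N`), the function set, and the helper names `random_tree` and `evaluate` are all assumptions chosen for illustration, since the abstract does not list the statistics or operators actually used. The sketch only shows how a candidate TWS can be represented as an expression tree and evaluated; the fitness step described in the abstract (scoring each formula by the performance of the learning method) is omitted.

```python
import math
import random

# Hypothetical terminal set: per-term statistics a formula may draw on.
TERMINALS = ["tf", "df", "N"]

def protected_div(a, b):
    """Division that returns 1.0 instead of failing on a near-zero divisor."""
    return a / b if abs(b) > 1e-9 else 1.0

def protected_log(a):
    """Natural log of |a|, returning 0.0 for near-zero input."""
    return math.log(abs(a)) if abs(a) > 1e-9 else 0.0

# Hypothetical function set: name -> (arity, callable).
FUNCTIONS = {
    "add": (2, lambda a, b: a + b),
    "mul": (2, lambda a, b: a * b),
    "div": (2, protected_div),
    "log": (1, protected_log),
}

def random_tree(depth, rng):
    """Grow a random expression tree as nested tuples (name, child, ...)."""
    if depth == 0 or rng.random() < 0.3:
        return rng.choice(TERMINALS)
    name = rng.choice(sorted(FUNCTIONS))
    arity, _ = FUNCTIONS[name]
    return (name,) + tuple(random_tree(depth - 1, rng) for _ in range(arity))

def evaluate(tree, stats):
    """Compute the weight a formula assigns to one term, given its statistics."""
    if isinstance(tree, str):
        return float(stats[tree])
    name, *children = tree
    _, fn = FUNCTIONS[name]
    return fn(*(evaluate(child, stats) for child in children))

# Classic TF-IDF, tf * log(N / df), is one point in this search space.
TF_IDF = ("mul", "tf", ("log", ("div", "N", "df")))

# Illustrative statistics for a single term (made-up numbers).
stats = {"tf": 3, "df": 10, "N": 1000}
tfidf_weight = evaluate(TF_IDF, stats)

# A randomly generated candidate formula, as in an initial GP population.
candidate = random_tree(3, random.Random(0))
candidate_weight = evaluate(candidate, stats)
```

In a full GP loop, each tree in the population would be used to weight a training corpus, a classifier would be trained on the resulting vectors, and its score would serve as the tree's fitness before selection, crossover, and mutation.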