首页> 外文会议>International Conference on Information and Communication Technology Convergence >Feature word selection by iterative top-K aggregation for classifying recommended shops
【24h】

Feature word selection by iterative top-K aggregation for classifying recommended shops

机译:通过迭代top-K聚合选择特征词以对推荐店铺进行分类

获取原文

摘要

We propose a feature word selection method for classifying recommended shops using Yelp customer reviews. TextRank keywords are extracted from the customer reviews to construct the sorted positive and negative keyword lists based on each keyword's summed TextRank scores. The top-K keywords are then aggregated iteratively by multiples of K to construct the positive and negative keyword frequency lists. The negative keyword frequency list is then subtracted from the positive keyword frequency list, and the resulting list is standardized to generate the final positive and negative keyword lists. The performance of our feature selection method is evaluated using Naïve Bayes classifiers, and the binary classification accuracy of the selected feature words is 77.94%, which is better than the baseline χ2 feature word selection.
机译:我们提出了一种功能词选择方法,用于使用Yelp客户评论对推荐商店进行分类。从客户评论中提取TextRank关键字,以根据每个关键字的总TextRank得分构建排序的肯定和否定关键字列表。然后,将前K个关键字按K的倍数进行迭代聚合,以构建肯定和否定关键字频率列表。然后从肯定关键字频率列表中减去否定关键字频率列表,并对结果列表进行标准化以生成最终的肯定关键字列表和否定关键字列表。我们使用朴素贝叶斯分类器对我们的特征选择方法的性能进行了评估,所选特征词的二进制分类精度为77.94%,优于基线χ2特征词选择。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号