首页> 外文会议>2017 2nd International Conference on Knowledge Engineering and Applications >Yelp business rating classification using hybrid ensemble
【24h】

Yelp business rating classification using hybrid ensemble

机译:使用混合集成的Yelp业务评级分类

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

An algorithm that can predict the review rating of a potential business with only existing information about the location and business categories would be an invaluable tool in making investment decisions. Utilizing the Yelp business dataset, we built a model, that can do as such, by classifying whether a potential business belongs to a positively-reviewed class (star ratings greater than or equal to 4) or a negatively-reviewed class (star ratings less than 4) given its location in latitude and longitude and the categories the potential business belongs to. More specifically, we applied a feature engineering technique using Extremely Randomized Trees, and constructed a hybrid ensemble classifier using neural network, decision tree, and logistic regression. We compared our model with other popular ensemble algorithms such as random forest and neural network ensemble, and our hybrid ensemble model generates the best result with an accuracy rate of 67.37% and AUC of 0.7322.
机译:仅使用有关位置和业务类别的现有信息就可以预测潜在企业的评论等级的算法将是做出投资决策的宝贵工具。通过使用Yelp业务数据集,我们建立了一个模型,可以通过将潜在业务归为正面评价类别(星级评分大于或等于4)还是负面评价类别(星级评分较少)来进行分类比4)给出的经纬度位置以及潜在业务的类别。更具体地说,我们应用了使用极端随机树的特征工程技术,并使用神经网络,决策树和逻辑回归构建了一个混合集成分类器。我们将模型与其他流行的集成算法(例如随机森林和神经网络集成)进行了比较,并且我们的混合集成模型以67.37%的准确率和0.7322的AUC产生了最佳结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号