首页> 外文期刊>Fermentation >Wineinformatics: Regression on the Grade and Price of Wines through Their Sensory Attributes
【24h】

Wineinformatics: Regression on the Grade and Price of Wines through Their Sensory Attributes

机译:Wineinformatics:通过其感官属性对葡萄酒的等级和价格进行回归

获取原文
       

摘要

Wineinformatics is a field that uses machine-learning and data-mining techniques to glean useful information from wine. In this work, attributes extracted from a large dataset of over 100,000 wine reviews are used to make predictions on two variables: quality based on a “100-point scale”, and price per 750 mL bottle. These predictions were built using support vector regression. Several evaluation metrics were used for model evaluation. In addition, these regression models were compared to classification accuracies achieved in a prior work. When regression was used for classification, the results were somewhat poor; however, this was expected since the main purpose of the regression was not to classify the wines. Therefore, this paper also compares the advantages and disadvantages of both classification and regression. Regression models can successfully predict within a few points of the correct grade of a wine. On average, the model was only 1.6 points away from the actual grade and off by about $13 per bottle of wine. To the best of our knowledge, this is the first work to use a large-scale dataset of wine reviews to perform regression predictions on grade and price.
机译:Wineinformatics是一个使用机器学习和数据挖掘技术从葡萄酒中收集有用信息的领域。在这项工作中,从超过100,000个葡萄酒评论的大型数据集中提取的属性用于对两个变量进行预测:基于“ 100分制”的质量和每750毫升瓶的价格。这些预测是使用支持向量回归建立的。一些评估指标用于模型评估。此外,将这些回归模型与先前工作中获得的分类精度进行了比较。当使用回归进行分类时,结果有些差。但是,这是可以预期的,因为回归的主要目的不是对葡萄酒进行分类。因此,本文还比较了分类和回归的优缺点。回归模型可以成功预测葡萄酒正确等级的几个点。平均而言,该模型与实际等级仅相差1.6分,每瓶葡萄酒约降低13美元。据我们所知,这是使用大规模葡萄酒评论数据集对等级和价格进行回归预测的第一项工作。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号