首页> 外文会议>Industrial conference on data mining >The Computational Wine Wheel 2.0 and the TriMax Triclustering in Wineinformatics
【24h】

The Computational Wine Wheel 2.0 and the TriMax Triclustering in Wineinformatics

机译:Wineinformatics中的计算式Wine Wheel 2.0和TriMax Triclustering

获取原文

摘要

Even with the current state of technology, data growth is increasing so fast that without proper storage and analytical techniques, it is challenging to process and analyze large datasets. This applies to knowledge bases from all fields and all kinds of data. In Wineinformatics, various kind of data related to wine, including physicochemical laboratory data and wine reviews, are analyzed by data science related researches. In the previous work, we proposed the Computational Wine Wheel, derived from 2011's top 100 wine, to automatically process and extract key attributes from human-language-format wine expert reviews. In this work, past 10 year's top 100 wines are collected and formed a 1000 excellent wines dataset to further improve the Computational Wine Wheel. The extraction process led to the creation of what we call a Computational Wine Wheel 2.0, which is a wine attribute dictionary consisting of 985 categorized and normalized wine attributes. After the Computational Wine Wheel 2.0 is formed, we experiment it on a region- and grape type- specific dataset to seek new types of information in Wineinformatics. A novel TriMax Triclustering algorithm specifically used for the dataset processed by the Computational Wine Wheel is proposed and applied to discover three dimensional clusters (Wine × Attributes × Vintage) in wine. We found that the TriMax Triclustering algorithm produced promising and cohesive results that can be used in various aspects of the wine industry, such as defined palate grouping and wine searching.
机译:即使采用当前的技术水平,数据的增长速度仍然如此之快,以至于没有适当的存储和分析技术,处理和分析大型数据集就具有挑战性。这适用于来自所有领域和各种数据的知识库。在Wineinformatics中,通过与数据科学相关的研究来分析与葡萄酒有关的各种数据,包括物理化学实验室数据和葡萄酒评论。在之前的工作中,我们提出了源自2011年百强葡萄酒的计算酒轮,以自动处理和提取人类语言格式的葡萄酒专家评论的关键属性。在这项工作中,收集了过去10年的前100种葡萄酒,并形成了1000种出色的葡萄酒数据集,以进一步改进计算葡萄酒杯。提取过程导致创建了我们所说的“计算酒轮2.0”,它是由985个归类和归一化的酒属性组成的酒属性字典。在计算葡萄轮2.0形成之后,我们将在特定于区域和葡萄类型的数据集上进行实验,以在Wineinformatics中寻找新类型的信息。提出了一种新颖的TriMax Triclustering算法,该算法专门用于计算酒轮处理的数据集,并用于发现酒中的三维簇(酒×属性×年份)。我们发现TriMax Triclustering算法产生了有前途且具有凝聚力的结果,可用于葡萄酒行业的各个方面,例如定义的味觉分组和葡萄酒搜索。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号