首页> 外文会议>IEEE International Conference on Machine Learning and Applications >A Crowdsourcing Semi-Supervised LSTM Training Approach to Identify Novel Items in Emerging Artificial Intelligent Environments
【24h】

A Crowdsourcing Semi-Supervised LSTM Training Approach to Identify Novel Items in Emerging Artificial Intelligent Environments

机译:一种众包半监督的LSTM培训方法,以识别新兴人工智能环境中的新项目

获取原文

摘要

Nowadays always new kinds of cuisines appear on the market. Even though main cuisines such as French, Italian, Japanese, Chinese and Indian are always appreciated, they are not anymore the most popular. The new trend the fusion cuisine. A fusion cuisine is a combination of different main cuisines, this combination makes this cuisine new. The opening of a new restaurant proposing a new kind of cuisine produces a lot of excitement and people feel the need to try it and be part of this new culture. Yelp is a platform which publishes crowd-sourced reviews about different businesses, in particular, restaurants. Yelp allows the possibility to declare for each restaurant the kind of cuisine. Unfortunately, since the restaurants in the Yelp database are not often generated by the owners but by the users creating the reviews, there is no much information about the kind of cuisine, especially for restaurants providing fusion ones. In this paper, we address the problem of identifying restaurants proposing new kinds of cuisines by using their Yelp reviews. These new cuisines can be completely new or fusion cuisines. Discriminating between main cuisines and fusion cuisines is very difficult because fusion cuisines are similar to the main ones even if they are conceptually different. We propose 4Phase, a semi-supervised procedure that trains Long Short-Term Memory with only the text reviews of the restaurants providing main cuisines. The trained LSTM is ultimately used as a feature generator in combination with a standard novelty detection model (e.g., Gaussian Mixture Models). We perform experiments on Yelp to separate restaurants providing main cuisines from the ones providing completely new cuisines or fusion ones. In this experiments, our 4Phase procedure outperforms all the baselines (term frequency, Doc2Vec, autoencoder LSTM, etc.) and reaches 0.91 of both AUROC and MAP.
机译:如今,市场上出现了新的美食。尽管诸如法国,意大利语,日语,中餐和印度等主要美食,但总是欣赏,它们也不会是最受欢迎的。新趋势融合料理。这款组合可以融合美食是不同的主要美食,使这家美食新鲜。打开一家新餐厅,提出一种新型美食,产生了很多兴奋,人们觉得需要尝试它并成为这一新文化的一部分。 Yelp是一个公布关于不同企业,特别是餐馆的人群审查的平台。 Yelp允许申报每家餐厅的那种美食。不幸的是,由于yelp数据库的餐馆通常由业主产生但由用户创建评论而产生的,因此没有有关这些美食的众多信息,特别是餐厅提供融合器。在本文中,我们通过使用yelp评论来解决识别餐厅的问题,提出询问新鲜的美食。这些新美食可以全新或融合料理。歧视主要美食和融合疗法料理闻酒很难,因为即使它们在概念上不同,融合式内食也与主要的美食相似。我们提出4平方面,一个半监督程序,可以长期列出短期内存,只有在提供主要美食的餐厅的文本审查。训练的LSTM最终用作特征发生器,与标准新颖性检测模型组合(例如,高斯混合模型)。我们对Yelp进行实验到独立的餐厅,供应提供的主要美食,提供全新的美食或多国料理。在该实验中,我们的4个平静过程优于所有基线(术语频率,DOC2VEC,AutoEncoder LSTM等),并达到0.91的AUROC和地图。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号