【24h】

Evaluation of Cooking Recipes Using Their Texts and Images

机译:使用他们的文本和图像评估烹饪食谱

获取原文

摘要

Many people are using smartphone applications and websites with respect to cooking recipes in almost everyday life. By using cooking recipe sites, we easily search the cooking recipe for dishes that we want to eat and cook the dish based on the recipe. When we search cooking recipes, we refer to the two information that are pictures of cooked food and texts of cooking recipe. In this study, to investigate good cooking recipes, we built a multimodal coupled network to use these two kinds of information of cooking recipe sites. In this multimodal coupled network, as pre-learning, we learned images and texts separately. To learn texts, we first used word2vec to vectorize for each text, and then we use the deep averaging network to identify the objective function. To learn images, we performed transfer learning of images using VGG16 which is a famously trained model. We combined these two kinds of 300-dimensional feature vectors extracted from the final layer of the two models obtained by pre-learning, and then we learned a total of 600-dimensional feature vectors. As a result, the multimodal coupled network that combines image and text data has the best accuracy.
机译:在几乎日常生活中,许多人正在使用智能手机应用程序和网站了解烹饪食谱。通过使用烹饪配方站点,我们可以轻松搜索我们想要吃饭的菜肴和基于食谱的菜肴。当我们搜索烹饪配方时,我们指的是熟食食品和烹饪食谱的文本的两个信息。在这项研究中,要调查良好的烹饪配方,我们建立了一个多模式耦合网络,用于使用烹饪配方网站的这两种信息。在这种多模式耦合网络中,作为预学习,我们分别学习了图像和文本。要学习文本,我们首先将Word2VEC用于每个文本的向量化,然后我们使用深度平均网络来识别目标函数。要学习图像,我们使用VGG16执行了图像的转移学习,这是一个着名的训练有素的模型。我们将这两种300维特征向量组合从通过预学习获得的两种模型的最终层提取的这两种300维特征向量,然后我们学习了总共600维特征向量。结果,组合图像和文本数据的多模耦合网络具有最佳精度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号