首页> 外文期刊>Semantic web >A machine learning approach for product matching and categorization
【24h】

A machine learning approach for product matching and categorization

机译:产品匹配和分类的机器学习方法

获取原文
获取原文并翻译 | 示例
           

摘要

Consumers today have the option to purchase products from thousands of e-shops. However, the completeness of the product specifications and the taxonomies used for organizing the products differ across different e-shops. To improve the consumer experience, e.g., by allowing for easily comparing offers by different vendors, approaches for product integration on the Web are needed. In this paper, we present an approach that leverages neural language models and deep learning techniques in combination with standard classification approaches for product matching and categorization. In our approach we use structured product data as supervision for training feature extraction models able to extract attribute-value pairs from textual product descriptions. To minimize the need for lots of data for supervision, we use neural language models to produce word embeddings from large quantities of publicly available product data marked up with Microdata, which boost the performance of the feature extraction model, thus leading to better product matching and categorization performances. Furthermore, we use a deep Convolutional Neural Network to produce image embeddings from product images, which further improve the results on both tasks.
机译:今天消费者可以选择从成千上万的电子商店购买产品。但是,产品规格的完整性和用于组织产品的分类学各种不同电子商店的不同。为了提高消费者体验,例如,通过允许轻松比较不同供应商的优惠,需要在网上的产品集成方法。在本文中,我们提出了一种利用神经语言模型和深度学习技术与产品匹配和分类的标准分类方法结合使用的方法。在我们的方法中,我们将结构化产品数据用作能够从文本产品描述中提取属性值对的训练功能提取模型的监督。为了最大限度地减少对监督许多数据的需求,我们使用神经语言模型从带有Microdata的大量公开可用产品数据制作Word Embedings,这提高了特征提取模型的性能,从而导致更好的产品匹配和产品匹配分类表演。此外,我们使用深度卷积神经网络从产品图像中产生图像嵌入,这进一步改善了两个任务的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号