首页> 外文会议>International conference on computational linguistics >Large-scale Multi-class and Hierarchical Product Categorization for an E-commerce Giant
【24h】

Large-scale Multi-class and Hierarchical Product Categorization for an E-commerce Giant

机译:电子商务巨头的大规模多类和分层产品分类

获取原文

摘要

In order to organize the large number of products listed in e-commerce sites, each product is usually assigned to one of the multi-level categories in the taxonomy tree. It is a time-consuming and difficult task for merchants to select proper categories within thousands of options for the products they sell. In this work, we propose an automatic classification tool to predict the matching category for a given product title and description. We used a combination of two different neural models, i.e., deep belief nets and deep autoencoders, for both titles and descriptions. We implemented a selective reconstruction approach for the input layer during the training of the deep neural networks, in order to scale-out for large-sized sparse feature vectors. GPUs are utilized in order to train neural networks in a reasonable time. We have trained our models for around 150 million products with a taxonomy tree with at most 5 levels that contains 28,338 leaf categories. Tests with millions of products show that our first predictions matches 81% of merchants' assignments, when "others" categories are excluded.
机译:为了组织电子商务站点中列出的大量产品,通常将每种产品分配给分类树中的多级类别之一。对于商人来说,在成千上万的选择中为他们出售的产品选择合适的类别是一项既费时又困难的任务。在这项工作中,我们提出了一种自动分类工具来预测给定产品标题和描述的匹配类别。对于标题和描述,我们使用了两种不同的神经模型的组合,即深度置信网和深度自动编码器。在深度神经网络训练期间,我们对输入层实施了选择性重构方法,以横向扩展大型稀疏特征向量。利用GPU是为了在合理的时间内训练神经网络。我们已使用分类树(最多包含5个级别,包含28338个叶子类别)对大约1.5亿个产品的模型进行了训练。对数百万种产品进行的测试表明,如果排除“其他”类别,我们的第一个预测与商户分配的81%相匹配。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号