Large-scale Multi-class and Hierarchical Product Categorization for an E-commerce Giant

Abstract

To organize the large number of products listed on e-commerce sites, each product is usually assigned to one of the multi-level categories in a taxonomy tree. Selecting the proper category from thousands of options is a time-consuming and difficult task for merchants. In this work, we propose an automatic classification tool that predicts the matching category for a given product title and description. We use a combination of two different neural models, deep belief nets and deep autoencoders, for both titles and descriptions. To scale to large, sparse feature vectors, we implement a selective reconstruction approach for the input layer during training of the deep neural networks. GPUs are used to train the networks in a reasonable time. We trained our models on around 150 million products with a taxonomy tree of at most 5 levels that contains 28,338 leaf categories. Tests with millions of products show that our first predictions match 81% of merchants' assignments when "others" categories are excluded.
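
The abstract does not spell out how selective reconstruction works. One plausible reading, sketched here as an assumption and not as the authors' implementation, is that the reconstruction error is computed only on the non-zero input units plus a small random sample of zero units, so the cost per example depends on the number of active features and not on the full vocabulary size. The function names, the dictionary representation of the sparse vector, and the squared-error loss are all hypothetical choices made for illustration.

```python
import random


def select_indices(active, dim, n_zero, rng):
    """Pick the input units to reconstruct (illustrative assumption):
    every active (non-zero) unit plus up to n_zero randomly sampled zero units."""
    active_set = set(active)
    n_zero = min(n_zero, dim - len(active_set))
    sampled = set()
    # Rejection sampling is cheap when the vector is very sparse.
    while len(sampled) < n_zero:
        j = rng.randrange(dim)
        if j not in active_set:
            sampled.add(j)
    return sorted(active_set | sampled)


def selective_squared_error(x, reconstruct, indices):
    """Squared reconstruction error restricted to the selected units.

    x           -- sparse input as {index: value}; missing indices are zero
    reconstruct -- callable mapping an index to the model's reconstructed value
    """
    return sum((reconstruct(j) - x.get(j, 0.0)) ** 2 for j in indices)


rng = random.Random(0)
x = {3: 1.0, 17: 2.0, 42: 1.0}   # toy bag-of-words vector with 3 active terms
dim = 1_000_000                  # hypothetical vocabulary size
idx = select_indices(x.keys(), dim, n_zero=5, rng=rng)

# Stand-in "model" that reconstructs every unit as 0.0.
loss = selective_squared_error(x, lambda j: 0.0, idx)
print(len(idx), loss)
```

In this toy run only 8 of the 1,000,000 input units contribute to the loss, which is the kind of saving that makes very wide, sparse input layers affordable to train.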

Bibliographic details

Source
International Conference on Computational Linguistics | 2016 | pp. 525-535 | 11 pages
Authors
Ali Cevahir; Koji Murakami
Original format
PDF

