【24h】

Data Mining Learning Bootstrap Through Semantic Thumbnail Analysis

机译:通过语义缩略图分析进行数据挖掘学习的引导

获取原文
获取原文并翻译 | 示例

摘要

The rapid increase of technological innovations in the mobile phone industry induces the research community to develop new and advanced systems to optimize services offered by mobile phones operators (telcos) to maximize their effectiveness and improve their business. Data mining algorithms can run over data produced by mobile phones usage (e.g. image, video, text and logs files) to discover user's preferences and predict the most likely (to be purchased) offer for each individual customer. One of the main challenges is the reduction of the learning time and cost of these automatic tasks. In this paper we discuss an experiment where a commercial offer is composed by a small picture augmented with a short text describing the offer itself. Each customer's purchase is properly logged with all relevant information. Upon arrival of new items we need to learn who the best customers (prospects) for each item are, that is, the ones most likely to be interested in purchasing that specific item. Such learning activity is time consuming and, in our specific case, is not applicable given the large number of new items arriving every day. Basically, given the current customer base we are not able to learn on all new items. Thus, we need somehow to select among those new items to identify the best candidates. We do so by using a joint analysis between visual features and text to estimate how good each new item could be, that is, whether or not is worth to learn on it. Preliminary results show the effectiveness of the proposed approach to improve classical data mining techniques.
机译:移动电话行业中技术创新的迅速增长促使研究界开发新的先进系统,以优化移动电话运营商(telcos)提供的服务,以最大程度地发挥其效用并改善其业务。数据挖掘算法可以处理手机使用情况产生的数据(例如图像,视频,文本和日志文件),以发现用户的喜好并预测每个客户的最有可能(购买)的报价。主要挑战之一是减少这些自动任务的学习时间和成本。在本文中,我们讨论了一个实验,在该实验中,商业报价是由一幅小图片组成的,上面加上描述该报价本身的短文字。每个客户的购买都将正确记录所有相关信息。新商品到货后,我们需要了解谁是每个商品的最佳客户(潜在客户),即最有可能购买该特定商品的客户。这种学习活动非常耗时,在我们特定的情况下,鉴于每天有大量新物品到来,因此这种学习活动不适用。基本上,鉴于当前的客户群,我们无法学习所有新产品。因此,我们需要以某种方式在这些新项目中进行选择,以找出最佳候选人。我们通过在视觉特征和文本之间进行联合分析来估算每个新项目的质量,即是否值得学习。初步结果表明,该方法有效改善了经典数据挖掘技术。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号