首页> 外文会议>International Conference on Software Technologies >Machine Learning Models for Automatic Labeling: A Systematic Literature Review
【24h】

Machine Learning Models for Automatic Labeling: A Systematic Literature Review

机译:自动标签机器学习模型:系统文献综述

获取原文

摘要

Automatic labeling is a type of classification problem. Classification has been studied with the help of statistical methods for a long time. With the explosion of new better computer processing units (CPUs) and graphical processing units (GPUs) the interest in machine learning has grown exponentially and we can use both statistical learning algorithms as well as deep neural networks (DNNs) to solve the classification tasks. Classification is a supervised machine learning problem and there exists a large amount of methodology for performing such task. However, it is very rare in industrial applications that data is fully labeled which is why we need good methodology to obtain error-free labels. The purpose of this paper is to examine the current literature on how to perform labeling using ML, we will compare these models in terms of popularity and on what datatypes they are used on. We performed a systematic literature review of empirical studies for machine learning for labeling. We identified 43 primary studies relevant to our search. From this we were able to determine the most common machine learning models for labeling. Lack of unlabeled instances is a major problem for industry as supervised learning is the most widely used. Obtaining labels is costly in terms of labor and financial costs. Based on our findings in this review we present alternate ways for labeling data for use in supervised learning tasks.
机译:自动标签是一种分类问题。已经在统计方法的帮助下进行了分类了很长时间。随着新的更好的计算机处理单元(CPU)和图形处理单元(GPU),机器学习的兴趣是指数增长的,我们可以使用统计学习算法以及深神经网络(DNN)来解决分类任务。分类是一个监督机器学习问题,并且存在大量用于执行此类任务的方法。但是,在工业应用中非常罕见,数据完全标记为什么我们需要良好方法来获取无错误标签。本文的目的是检查当前关于如何使用mL执行标签的当前文献,我们将在流行方面和它们用于哪些数据类型上进行比较这些模型。我们对机器学习的实证研究进行了系统的文献综述。我们确定了43项与我们的搜索相关的初级研究。由此,我们能够确定标签最常见的机器学习模型。缺乏未标记的实例是行业的主要问题,因为监督学习是最广泛的使用。在劳动和财务成本方面获得标签昂贵。根据我们的调查结果,我们提出了备用方式来标记用于监督学习任务的数据。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号