Fast Data Acquisition in Cost-Sensitive Learning

Abstract

Data acquisition is the first and one of the most important steps in many data mining applications. It is a time-consuming and costly task. Acquiring too few examples makes the learned model and its future predictions inaccurate, while acquiring more examples than necessary wastes time and money. It is therefore important to estimate the number of examples a learning algorithm needs. However, most previous learning algorithms learn from a given, fixed set of examples. To our knowledge, little previous work in machine learning can acquire examples dynamically as it learns and decide the ideal number of examples needed. In this paper, we propose a simple on-line framework for fast data acquisition (FDA). FDA is an extrapolation method that estimates the number of examples needed in each acquisition and acquires them all at once. Compared with the naive step-by-step data acquisition strategy, FDA significantly reduces the number of data acquisition and model building rounds. This would significantly reduce the total cost, which comprises the costs of misclassification, data acquisition arrangement, computation, and the examples acquired.
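The abstract does not spell out how FDA extrapolates, so the following is only a minimal sketch of the general idea, not the paper's algorithm. It assumes a power-law learning curve, error(n) = a * n**(-b), fitted through the two most recent observations, and it picks the sample size that minimises acquisition cost plus misclassification cost. The function `observed_error`, the constants `EXAMPLE_COST` and `ERROR_COST`, and the step sizes are all hypothetical stand-ins chosen for illustration.

```python
import math

EXAMPLE_COST = 1.0     # assumed cost of acquiring one example
ERROR_COST = 10_000.0  # assumed total misclassification cost at error rate 1.0


def observed_error(n):
    """Stand-in for 'build a model on n examples and measure its error'.

    A synthetic learning curve; in practice this would be a held-out or
    cross-validated error estimate.
    """
    return 0.9 * n ** -0.5


def total_cost(n):
    """Acquisition cost plus misclassification cost for n examples."""
    return n * EXAMPLE_COST + observed_error(n) * ERROR_COST


def naive_acquisition(step=10):
    """Acquire `step` examples per round until total cost stops improving."""
    n, rounds = step, 1
    while True:
        candidate = n + step
        rounds += 1  # one more acquisition and one more model build
        if total_cost(candidate) >= total_cost(n):
            return candidate, rounds
        n = candidate


def fit_power_law(n1, e1, n2, e2):
    """Fit error(n) = a * n**(-b) through two (n, error) points.

    No guard for e1 == e2 or non-decreasing error; this is a sketch.
    """
    b = math.log(e1 / e2) / math.log(n2 / n1)
    a = e1 * n1 ** b
    return a, b


def extrapolated_acquisition(start=10):
    """Jump straight to the sample size the fitted curve says is optimal."""
    prev_n, n, rounds = start, 2 * start, 2  # two small batches to seed the fit
    while True:
        a, b = fit_power_law(prev_n, observed_error(prev_n), n, observed_error(n))
        # Minimiser of n*EXAMPLE_COST + ERROR_COST*a*n**(-b), in closed form.
        target = round((a * b * ERROR_COST / EXAMPLE_COST) ** (1.0 / (b + 1.0)))
        if target <= n:
            return n, rounds
        prev_n, n, rounds = n, target, rounds + 1


if __name__ == "__main__":
    print("naive:", naive_acquisition())
    print("extrapolated:", extrapolated_acquisition())
```

On this synthetic curve the naive loop needs 28 acquisition rounds to reach 280 examples, while the extrapolating loop stops after 3 rounds at 273 examples. With noisy real error estimates the fit would be less exact, so more rounds and a more robust curve fit would likely be needed.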

Bibliographic details

Source: Advances in data mining: Applications and theoretical aspects | 2011 | pp. 66-77 | 12 pages
Conference location: New York, NY (US)
Author: Victor S. Sheng
Author affiliation: Computer Science Department, University of Central Arkansas, Conway, AR, 72034, USA
Original format: PDF
Language: English
Chinese Library Classification: TP311.13
Keywords: data acquisition; cost-sensitive learning; machine learning; data mining; fast data acquisition

Similar literature

1. Cost-Sensitive Topical Data Acquisition from the Web [J]. Mahdi Naghibi, Reza Anvari, Ali Forghani. International Journal of Data Mining & Knowledge Management Process. 2019, Issue 2a3
2. Fast Analog Transmission for High-Mobility Wireless Data Acquisition in Edge Learning [J]. Du Yuqing, Huang Kaibin. IEEE Wireless Communications Letters. 2019, Issue 2
3. Fast Data Acquisition in Cost-Sensitive Learning [C]. Victor S. Sheng. Industrial Conference on Data Mining. 2011
4. Knowledge Discovery from Databases: Cost-sensitive and imbalance learning [D]. Yang, Zhuo. 2010
5. Applying Cost-Sensitive Extreme Learning Machine and Dissimilarity Integration to Gene Expression Data Classification [O]. Yanqiu Liu, Huijuan Lu, Ke Yan. 2016
6. Partial Example Acquisition in Cost-Sensitive Learning [O]. Victor S. Sheng, Charles X. Ling. 2007