首页> 外文学位 >Industrial Applications of Data Mining Engineering Effort Forecasting based on Mining and Analysis of Patterns in Historical Project Execution Data.
【24h】

Industrial Applications of Data Mining Engineering Effort Forecasting based on Mining and Analysis of Patterns in Historical Project Execution Data.

机译:基于历史项目执行数据的挖掘和模式分析的数据挖掘工程工作量预测的工业应用。

获取原文
获取原文并翻译 | 示例

摘要

Data mining is increasing in importance in solving a variety of industry problems. Our initiative involves the estimation of resource requirements by skill set for future projects by mining and analyzing actual resource consumption data from past projects in the semiconductor industry. To achieve this goal we face difficulties like data with relevant consumption information but stored in different format and insufficient data about project attributes to interpret consumption data. Our first goal is to clean the historical data and organize it into meaningful structures for analysis. Once the preprocessing on data is completed, different data mining techniques like clustering is applied to find projects which involve resources of similar skillsets and which involve similar complexities and size. This results in "resource utilization templates" for groups of related projects from a resource consumption perspective. Then project characteristics are identified which generate this diversity in headcounts and skillsets. These characteristics are not currently contained in the data base and are elicited from the managers of historical projects. This represents an opportunity to improve the usefulness of the data collection system for the future. The ultimate goal is to match the product technical features with the resource requirement for projects in the past as a model to forecast resource requirements by skill set for future projects. The forecasting model is developed using linear regression with cross validation of the training data as the past project execution are relatively few in number. Acceptable levels of forecast accuracy are achieved relative to human experts' results and the tool is applied to forecast some future projects' resource demand.
机译:在解决各种行业问题中,数据挖掘的重要性日益提高。我们的计划包括通过挖掘和分析半导体行业过去项目的实际资源消耗数据,通过技能来估算未来项目的资源需求。为了实现这一目标,我们面临着困难,例如具有相关消费信息但以不同格式存储的数据,以及有关项目属性的数据不足以解释消费数据。我们的首要目标是清除历史数据并将其组织为有意义的结构以进行分析。一旦完成对数据的预处理,就可以应用诸如聚类之类的不同数据挖掘技术来查找涉及相似技能资源,复杂性和规模相似的项目。从资源消耗的角度来看,这将为相关项目组提供“资源利用模板”。然后确定项目特征,从而在人员和技能上产生这种多样性。这些特征当前未包含在数据库中,而是从历史项目的管理者中得出的。这代表了一个机会,可以提高数据收集系统在未来的实用性。最终目标是使产品技术特征与过去项目的资源需求相匹配,以此作为模型来预测未来项目的技能需求。预测模型是使用线性回归开发的,并且对训练数据进行了交叉验证,因为过去的项目执行数量相对较少。相对于人类专家的结果,可以达到可接受的预测准确性水平,并且该工具可用于预测某些未来项目的资源需求。

著录项

  • 作者

    Bhattacharya, Indrani.;

  • 作者单位

    Arizona State University.;

  • 授予单位 Arizona State University.;
  • 学科 Engineering Industrial.;Computer Science.;Information Science.
  • 学位 M.S.
  • 年度 2013
  • 页码 81 p.
  • 总页数 81
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

  • 入库时间 2022-08-17 11:41:46

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号