【24h】

WEB-BASED MINING OF STATISTICAL INFORMATION

机译:基于网络的统计信息挖掘

获取原文

摘要

Data Mining can be defined as the task of extracting statistically meaningful information from data. In the past, statisticians have been doing this activity by performing "custom analysis" on an as needed basis. In the semiconductor field, we must generate numerous statistical process control (SPC) charts and associated data summaries on a regular basis to monitor process performance. Consequently, we must find more efficient methods of analyzing process data and making the information readily available to the engineering community. The information must be accessible in a timely and user-friendly manner. At the Merrimack, NH facility of Texas Instruments (previously Unitrode), we have done this by integrating the capabilities of SAS, the Unix operating system and the Promis material tracking system that operates in the OpenVMS environment. The resulting information is updated daily and placed at an "SPC Website" so that the users can view it using only the "point and click" techniques. This paper describes some of the steps in the long "chain of events" that are essential to make this happen without human intervention. The statistician can now devote his energies to other duties that require his personal attention, such as: design experiments, perform gauge studies, develop & deliver statistics / DoE courses for the engineering community, publish / upgrade SPC specification and support the Quality group's continuous improvement projects. This is in sharp contrast to the traditional methods for computing control limits that depended strongly on human interaction to: 1. Extract process data from Promis (engage repeatedly in almost identical dialogue) into a VMS file; 2. Access that file from the Windows operating system using FTP; 3. Convert it in the CSV format using Excel; 4. Determine the structure of that file to invoke just the right SAS 'macro' for making a dataset; 5. Invoke the right SAS macro to make data summaries, compute limits and make SPC charts; 6. Repeat all of the above for each new chart. Given that a semiconductor facility must track a large number of processes daily, the task of creating and maintaining SPC charts would have been a full time job for more than one statistician. There would be no time for other projects that more clearly required statistician's personal attention.
机译:数据挖掘可以定义为从数据中提取统计上有意义的信息的任务。在过去,统计学家通过根据需要执行“自定义分析”来完成此活动。在“半导体”字段中,我们必须定期生成许多统计过程控制(SPC)图表和相关数据摘要,以监视过程性能。因此,我们必须找到更有效的方法来分析过程数据并使信息易于使用工程界。信息必须及时和用户友好的方式访问。在德克萨斯乐器(以前Unitrode)的NH工厂的Merrimack,我们通过集成了SAS的功能,UNIX操作系统和在OpenVMS环境中运行的ProMIS材料跟踪系统的功能来完成此操作。结果信息每天更新并放置在“SPC网站”中,以便用户只能使用“点并单击”技术来查看。本文介绍了长期“事件链”中的一些步骤,这对于在没有人为干预的情况下使这种情况至关重要。统计名人现在可以将他的能量投入到其他需要他个人关注的其他职责,例如:设计实验,执行仪表研究,开发和交付统计/ DoE课程,为工程界,发布/升级SPC规范和支持质量集团的持续改进项目。这与传统的计算限制方法鲜明对比,依赖于人类交互的强烈计算限制:1。从PROMIS中提取过程数据(在几乎相同的对话中接触)到VM文件; 2.使用FTP从Windows操作系统访问该文件; 3.使用Excel以CSV格式转换它; 4.确定该文件的结构只能调用正确的SAS“宏”来制作数据集; 5.调用正确的SAS宏以使数据摘要,计算限制并制作SPC图表; 6.对每个新图表重复上述所有内容。鉴于半导体设施必须每天跟踪大量进程,创建和维护SPC图表的任务将是一个以上的统计学家的全职作用。其他项目将没有时间更明确要求统计学家的个人关注。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号