首页> 外文会议>IEEE International Conference on Big Data Computing Service and Applications >The Use of Domain Knowledge Models for Effective Data Mining of Unstructured Customer Service Data in Engineering Applications
【24h】

The Use of Domain Knowledge Models for Effective Data Mining of Unstructured Customer Service Data in Engineering Applications

机译:领域知识模型在工程应用中用于非结构化客户服务数据的有效数据挖掘的使用

获取原文

摘要

Despite the fact that enterprises are routinely collecting massive amounts of data from customers, only a relatively small body of knowledge engineering (KE) work has addressed methods and application of KE to the design, development, and maintenance of engineering systems and products. A major challenge when applying KE to such applications is that the data is often unstructured and in the form of text exchanges between the customer and the enterprise. While the importance of modelling domain knowledge in order to produce meaningful results from mining unstructured data has been recognized, most approaches are based primarily on the linguistic structure of the text and keyword taxonomies. These approaches share the common issue that the knowledge extraction results are often not properly structured for solving the engineering problem of interest and, therefore, require manual post-processing before they can be applied. Our hypothesis is that the a priori modelling of the engineering problem of interest is crucial for both (1) efficient (rapid) collection, representation, and structuring of domain knowledge, and (2) the proper integration of domain knowledge with analytical KE methods in order facilitate the extraction of useful knowledge. In order to validate our hypothesis, we apply this approach to the important real-world engineering problem of monitoring the occurrence of product failure modes, and thereby product quality, using customer support cases. In order to translate the free-form text provided by the customer into engineering failure modes we use two methods from engineering design, the Function Analysis System Technique (FAST) and Failure Modes and Effects Analysis (FMEA), to provide the necessary domain knowledge model. This model then drives the collection, representation, and structuring of the failure modes for the product of interest. These failure modes are used as the class labels when applying data mining classification techniques (e.g., Suppo- t Vector Machine) to the support case data. The labelled support case data then can be aggregated by failure mode in order to compute a number of failure mode metrics that can be used to monitor product quality. We have demonstrated our approach to monitor the quality of a network security product at a large computer networking company using a data set of 100,000 customer support cases.
机译:尽管企业经常从客户那里收集大量数据,但只有相对较小的知识工程(KE)工作解决了KE的方法和应用,以用于工程系统和产品的设计,开发和维护。将KE应用于此类应用程序时的主要挑战是,数据通常是非结构化的,并且以客户与企业之间的文本交换形式出现。虽然已经认识到建模领域知识以从挖掘非结构化数据中产生有意义的结果的重要性,但是大多数方法主要基于文本和关键字分类法的语言结构。这些方法存在一个共同的问题,即知识提取结果通常没有正确构造以解决感兴趣的工程问题,因此需要进行手动后处理才能应用。我们的假设是,感兴趣的工程问题的先验建模对于(1)领域知识的有效(快速)收集,表示和结构化,以及(2)领域知识与分析KE方法的正确整合至关重要。以便于提取有用的知识。为了验证我们的假设,我们使用客户支持案例将此方法应用于监视产品故障模式的发生并由此监视产品质量的重要的实际工程问题。为了将客户提供的自由格式文本转换为工程故障模式,我们使用了工程设计中的两种方法,即功能分析系统技术(FAST)和故障模式与效果分析(FMEA),以提供必要的领域知识模型。然后,该模型驱动感兴趣产品的故障模式的收集,表示和结构化。当将数据挖掘分类技术(例如,支持向量机)应用于支持案例数据时,这些故障模式将用作类别标签。然后,可以通过故障模式汇总已标记的支持案例数据,以便计算可用于监视产品质量的许多故障模式指标。我们已经展示了我们的方法,该方法使用100,000个客户支持案例的数据集监视大型计算机网络公司的网络安全产品的质量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号