首页> 外文期刊>Fusion Engineering and Design >Data mining technique for fast retrieval of similar waveforms in Fusion massive databases
【24h】

Data mining technique for fast retrieval of similar waveforms in Fusion massive databases

机译:用于快速检索Fusion海量数据库中相似波形的数据挖掘技术

获取原文
获取原文并翻译 | 示例

摘要

Fusion measurement systems generate similar waveforms for reproducible behavior. A major difficulty related to data analysis is the identification, in a rapid and automated way, of a set of discharges with comparable behaviour, i.e. discharges with "similar" waveforms. Here we introduce a new technique for rapid searching and retrieval of "similar" signals. The approach consists of building a classification system that avoids traversing the whole database looking for similarities. The classification system diminishes the problem dimensionality (by means of waveform feature extraction) and reduces the searching space to just the most probable "similar" waveforms (clustering techniques). In the searching procedure, the input waveform is classified in any of the existing clusters. Then, a similarity measure is computed between the input signal and all cluster elements in order to identify the most similar waveforms. The inner product of normalized vectors is used as the similarity measure as it allows the searching process to be independent of signal gain and polarity. This development has been applied recently to TJ-II stellarator databases and has been integrated into its remote participation system.
机译:融合测量系统会生成相似的波形以实现可重现的行为。与数据分析有关的主要困难是以快速和自动化的方式识别具有可比较行为的一组放电,即具有“相似”波形的放电。在这里,我们介绍了一种快速搜索和检索“相似”信号的新技术。该方法包括建立一个分类系统,该系统避免遍历整个数据库以寻找相似之处。分类系统减小了问题的维数(通过波形特征提取),并将搜索空间缩小到最可能的“相似”波形(聚类技术)。在搜索过程中,输入波形被分类为任何现有簇。然后,在输入信号和所有群集元素之间计算相似度,以识别最相似的波形。归一化向量的内积被用作相似性度量,因为它允许搜索过程独立于信号增益和极性。该开发成果最近已应用于TJ-II恒星数据库,并已集成到其远程参与系统中。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号