Using n-grams for the Automated Clustering of Structural Models


Abstract

Model comparison and clustering are important for dealing with many models in data analysis and exploration, e.g. in domain model recovery or model repository management. Particularly in structural models, information is captured not only in model elements (e.g. in names and types) but also in the structural context, i.e. the relation of one element to the others. Some approaches handle a large number of models but ignore the structural context of model elements; others handle very few (typically two) models but apply sophisticated structural techniques. In this paper we address both aspects and extend our previous work on model clustering based on the vector space model with a technique for incorporating structural context in the form of n-grams. We compare the n-gram accuracy on two datasets of Ecore metamodels from the AtlanMod Zoo: small random samples using up to trigrams, and a larger one (~100 models) using up to bigrams.
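The idea of adding structural context to a vector space representation can be illustrated with a minimal sketch. It is not the authors' implementation: the toy models, the dictionary-based graph encoding, and the helper names `ngrams`, `features`, and `cosine` are all assumptions made for illustration, and raw counts stand in for whatever term weighting the paper actually uses. Each model is a small directed graph of element names; a unigram is a single element, and a bigram is a pair of elements joined by a relation.

```python
from collections import Counter
from math import sqrt


def ngrams(model, n):
    """Return every path of n connected element names in a model graph.

    `model` maps an element name to the names it relates to
    (a hypothetical encoding chosen for this sketch).
    """
    paths = [(name,) for name in model]
    for _ in range(n - 1):
        # Extend each path by one step along an outgoing relation.
        paths = [p + (nxt,) for p in paths for nxt in model.get(p[-1], [])]
    return paths


def features(model, max_n):
    """Count all n-grams for n = 1..max_n (raw counts, no weighting)."""
    counts = Counter()
    for n in range(1, max_n + 1):
        counts.update(ngrams(model, n))
    return counts


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm = sqrt(sum(v * v for v in a.values())) * sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


# Two toy models with the same element names but different structure.
library_a = {"Library": ["Book", "Member"], "Book": ["Author"],
             "Member": [], "Author": []}
library_b = {"Library": ["Author"], "Author": ["Book"],
             "Book": ["Member"], "Member": []}

sim_unigram = cosine(features(library_a, 1), features(library_b, 1))
sim_bigram = cosine(features(library_a, 2), features(library_b, 2))
print("unigrams only:", round(sim_unigram, 3))
print("with bigrams: ", round(sim_bigram, 3))
```

With unigrams alone the two models look identical, because they contain the same element names. Once bigrams are included the similarity drops, since the models share no connected pairs. A matrix of such pairwise similarities (or the corresponding distances) is the kind of input a hierarchical clustering step would consume.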


Bibliographic record

Source: International Conference on Current Trends in Theory and Practice of Computer Science, 2017, pp. 510-524 (15 pages)
Authors: Oender Babur; Loek Cleophas
Original format: PDF
Keywords: Model-driven engineering; Model comparison; Vector space model; Hierarchical clustering; n-grams


Related literature


1. A context evaluation approach for structural comparison of proteins using cross entropy over n-gram modelling [J]. Razmara J., Deris S.B., Parvizpour S. Computers in Biology and Medicine. 2013, no. 10
2. Multi-class composite N-gram language model using multiple word clusters and word successions [J]. Hirofumi Yamamoto, Shuntarou Isogai, Yoshinori Sagisaka. 電子情報通信学会技術研究報告. 音声. Speech. 2001, no. 156
3. Using n-grams for the Automated Clustering of Structural Models [C]. Onder Babur, Loek Cleophas. International Conference on Current Trends in Theory and Practice of Computer Science. 2017
4. Identifying malware using n-gram clustering metrics [D]. Dowd, Christopher Ryan. 2014
5. Modeling Actions of PubMed Users with N-Gram Language Models [O]. Jimmy Lin, W. John Wilbur
6. Business Process Models Clustering Based on Multimodal Search, K-means, and Cumulative and No-Continuous N-Grams [O]. Hugo Ordoñez, Luis Merchán, Armando Ordoñez. 2016
7. Investigation of Back-off Based Interpolation Between Recurrent Neural Network and N-gram Language Models (Author's Manuscript) [R]. Chen, X., Liu, X., Gales, M. J. F. 2016
