首页> 外文期刊>Science of Computer Programming >WSDL term tokenization methods for IR-style Web services discovery
【24h】

WSDL term tokenization methods for IR-style Web services discovery

机译:用于IR样式Web服务发现的WSDL术语标记化方法

获取原文
获取原文并翻译 | 示例

摘要

The IR-style Web services discovery represents an important approach that applies proven techniques developed in the field of Information Retrieval (1R). Many studies exploited the Web Services Description Language (WSDL) syntax to extract useful service metadata for building indexes. However, a fundamental issue associated with this approach is the WSDL term tokenization. This paper proposes the application of three statistical methods for WSDL term tokenization-MDL, TP, and PPM. With the increasing need for effective IR-style Web services discovery facilities, term tokenization is of fundamental importance for properly indexing WSDL documents. We compare our applied methods with two baseline methods. The experiment suggests the superiority of MDL and PPM methods based on IR evaluation metrics. To the best of our knowledge, our work is the first to systematically investigate the issue of WSDL term tokenization for Web services discovery. Our solution can benefit source coding mining, in which a key step is to tokenize names (i.e. terms) of variables, functions, classes, modules, etc. for semantic analysis. Our methods could also be used for solving Web-related string tokenization problems such as URL analysis and Web scripts comprehension.
机译:IR样式的Web服务发现代表了一种重要的方法,该方法应用了在信息检索(1R)领域中开发的成熟技术。许多研究利用Web服务描述语言(WSDL)语法来提取有用的服务元数据以建立索引。但是,与此方法相关的一个基本问题是WSDL术语标记化。本文提出了三种用于WSDL术语标记化的统计方法-MDL,TP和PPM的应用。随着对有效的IR样式Web服务发现工具的需求不断增长,术语标记化对于正确索引WSDL文档至关重要。我们将应用的方法与两种基准方法进行比较。实验表明基于IR评估指标的MDL和PPM方法的优越性。据我们所知,我们的工作是第一个系统地研究用于Web服务发现的WSDL术语标记化问题。我们的解决方案可以有益于源代码编码挖掘,其中关键步骤是将变量,函数,类,模块等的名称(即术语)标记化以进行语义分析。我们的方法还可用于解决与Web相关的字符串标记化问题,例如URL分析和Web脚本理解。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号