SeSaMe: A Data Set of Semantically Similar Java Methods

机译：芝麻：语义类似的Java方法的数据集

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In the past, techniques for detecting similarly behaving code fragments were often only evaluated with small, artificial oracles or with code originating from programming competitions. Such code fragments differ largely from production codes. To enable more realistic evaluations, this paper presents SeSaMe, a data set of method pairs that are classified according to their semantic similarity. We applied text similarity measures on JavaDoc comments mined from 11 open source repositories and manually classified a selection of 857 pairs.

机译：过去，用于检测类似行为的代码片段的技术通常仅被小，人工oracles或源自编程竞争的代码进行评估。这些代码片段在很大程度上不同于生产代码。为了实现更现实的评估，本文提出了芝麻，根据其语义相似性对的方法对的数据集。我们应用于从11个开源存储库中挖掘的Javadoc评论的文本相似度措施，并手动分类为857对的选择。

著录项

来源
《IEEE/ACM International Conference on Mining Software Repositories》|2019年|xxxiv 606 p. :|共5页
会议地点
作者
Marius Kamp; Patrick Kreutzer; Michael Philippsen;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类安全保密;
关键词
data mining; Java; pattern classification; text analysis;

机译：数据挖掘;Java;模式分类;文本分析;
入库时间 2022-08-20 20:18:28

相似文献

外文文献
中文文献
专利

1. A set-oriented method definition language for object databases and its semantics [J] . Elisa Bertino, Giovanna Guerrini, Isabella Merlo Concurrency and Computation . 2003,第14期

机译：对象数据库的面向集合的方法定义语言及其语义
2. A Semantic Associative Search Method with Temporal Cause-and-effect Relationship for Event Data Sets [J] . Kosuke TAKANO, Taizo ZUSHI, Yasushi KIYOKl, 電子情報通信学会技術研究報告. デ-タ工学. Data Engineering . 2003,第191期

机译：事件数据集具有时间因果关系的语义联想搜索方法
3. A Semantic Associative Search Method with Temporal Cause-and-effect Relationship for Event Data Sets [J] . Kosuke TAKANO, Taizo ZUSHI, Yasushi KIYOKl, 電子情報通信学会技術研究報告. デ-タ工学. Data Engineering . 2003,第191期

机译：具有活动数据集的时间原因和效应关系的语义关联搜索方法
4. SeSaMe: A Data Set of Semantically Similar Java Methods [C] . Marius Kamp, Patrick Kreutzer, Michael Philippsen IEEE/ACM International Conference on Mining Software Repositories . 2019

机译：SeSaMe：语义相似的Java方法的数据集
5. CodeReco - A Semantic Java Method Recommender [D] . Singh, Shashank. 2017

机译：CodeReco-语义Java方法推荐器
6. Comparison of Different Classification Methods for Analyzing Electronic Nose Data to Characterize Sesame Oils and Blends [O] . Xiaolong Shao, Hui Li, Nan Wang, 2015

机译：分析电子鼻数据以表征芝麻油和混合物的不同分类方法的比较
7. Comparison of Different Classification Methods for Analyzing Electronic Nose Data to Characterize Sesame Oils and Blends [O] . Xiaolong Shao, Hui Li, Nan Wang, 2015

机译：用于分析电子鼻数据以表征芝麻油和混合物的不同分类方法的比较
8. Applications of the AVE-Sesame data sets to mesoscale studies [R] . Suomi, V. E., Mosher, F., Norton, C., 1984

机译：aVE-sesame数据集在中尺度研究中的应用

SeSaMe: A Data Set of Semantically Similar Java Methods

摘要

著录项

相似文献

相关主题

期刊订阅