首页> 外文会议>2019 IEEE/ACM 41st International Conference on Software Engineering: Software Engineering in Practice >Using Machine Learning to Recommend Correctness Checks for Geographic Map Data
【24h】

Using Machine Learning to Recommend Correctness Checks for Geographic Map Data

机译:使用机器学习推荐地理地图数据的正确性检查

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

Developing an industry application that serves geographic map data to users across the world presents the significant challenge of checking the data using "data correctness checks." The size of data that needs to be checked-the entire world-and data churn rate-thousands per day-makes executing the full set of candidate checks cost prohibitive. Current techniques rely on hand-curated static subsets of checks to be run at different stages of the data production pipeline, These hard-coded subsets are uninformed of data changes, and cause bug detection to be delayed to downstream quality assurance activities. To address these problems, we have developed new representations of map data changes and checks, formally defined "check safety," and built a recommender system that dynamically and automatically selects and ranks a relevant subset of checks using signals from latest data changes. Empirical evaluation shows that it improves (1) efficiency by eliminating 65% of checks unrelated to changes, (2) coverage by recommending and ranking change-related checks from the full set of candidate checks, previously excluded by the manual process, and (3) overall visibility into the data editing process by quickly and automatically identifying latest fault prone parts of the data.
机译:开发一个向全球用户提供地理地图数据的行业应用程序带来了使用“数据正确性检查”来检查数据的重大挑战。需要检查的数据大小-整个世界以及数据流失率-每天要成千上万,这使得执行整套候选检查的成本高昂。当前的技术依赖手工编制的支票静态子集在数据生产管道的不同阶段运行。这些硬编码子集不了解数据更改,并导致错误检测延迟到下游质量保证活动中。为了解决这些问题,我们开发了地图数据更改和支票的新表示形式,正式定义了“支票安全性”,并建立了一个推荐系统,该系统使用来自最新数据更改的信号动态地自动选择支票的相关子集并对其进行排名。经验评估表明,它可以提高(1)通过消除65%与变更无关的支票来提高效率;(2)通过从以前由手动流程排除的整套候选支票中推荐和排列与变更相关的支票并对其进行排名,来提高覆盖率;以及(3 ),可以快速,自动地识别数据中最新的容易发生故障的部分,从而全面了解数据编辑过程。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号