METHOD AND SYSTEM FOR MANAGING DATA QUALITY FOR SPANISH NAMES IN A DATABASE

Abstract

A method and system for identifying similar names and addresses in a given data set comprising a plurality of names and addresses. The invention specifically addresses the challenges of data quality assurance for Spanish names and addresses. The name and address data are passed through a parsing engine, which parses the plurality of Spanish names and addresses. The parsed Spanish names and addresses are then sent to a probable identification engine, which identifies the probable matches. The name matching and address matching processes can be combined to assure data quality for Spanish names and addresses. The Spanish name matching process consists of identifying probable matches and computing similarity percentages between those probable matches. Similarly, the Spanish address matching process consists of identifying probable matches (using criteria such as same city) and computing similarity percentages between those probable matches. The system includes a parsing engine, a probable identification engine, and a match percentage calculation engine.
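
The three stages named in the abstract (parsing, probable identification, match percentage calculation) can be illustrated with a minimal Python sketch for names. The abstract does not disclose the actual algorithms, so everything here is an assumption made for illustration: the particle list, the "last two tokens are surnames" parsing rule, the blocking key (first letter of the first surname), and the use of `difflib.SequenceMatcher` as the similarity measure. All function names are hypothetical.

```python
import unicodedata
from collections import defaultdict
from difflib import SequenceMatcher
from itertools import combinations

# Assumed list of Spanish name particles to ignore; not taken from the patent.
PARTICLES = {"de", "del", "la", "las", "los", "y"}


def normalize(text):
    """Lowercase, strip diacritics (so 'ñ' becomes 'n'), collapse whitespace."""
    decomposed = unicodedata.normalize("NFKD", text.lower())
    stripped = "".join(c for c in decomposed if not unicodedata.combining(c))
    return " ".join(stripped.split())


def parse_name(raw):
    """Toy parsing stage: drop particles, treat the last two tokens as surnames."""
    tokens = [t for t in normalize(raw).split() if t not in PARTICLES]
    if len(tokens) >= 3:
        return {"given": tokens[:-2], "surnames": tokens[-2:]}
    return {"given": tokens[:1], "surnames": tokens[1:]}


def block_key(parsed):
    """Assumed blocking criterion: first letter of the first surname."""
    return parsed["surnames"][0][0] if parsed["surnames"] else ""


def similarity_percent(a, b):
    """Similarity of two parsed names as a percentage (0 to 100)."""
    text_a = " ".join(a["given"] + a["surnames"])
    text_b = " ".join(b["given"] + b["surnames"])
    return 100.0 * SequenceMatcher(None, text_a, text_b).ratio()


def find_matches(names, threshold=85.0):
    """Parse, group into probable-match blocks, then score pairs within a block."""
    parsed = [parse_name(n) for n in names]
    blocks = defaultdict(list)
    for index, record in enumerate(parsed):
        blocks[block_key(record)].append(index)
    matches = []
    for indices in blocks.values():
        for i, j in combinations(indices, 2):
            score = similarity_percent(parsed[i], parsed[j])
            if score >= threshold:
                matches.append((i, j, score))
    return matches


if __name__ == "__main__":
    sample = [
        "José María García López",
        "Jose Maria Garcia Lopes",
        "Ana de la Torre Ruiz",
    ]
    for i, j, score in find_matches(sample):
        print(f"{sample[i]!r} ~ {sample[j]!r}: {score:.1f}%")
```

Restricting comparisons to records that share a blocking key mirrors the "probable matches" step: only plausible pairs are scored, which avoids comparing every record against every other. An address variant could follow the same shape with the city as the blocking key, as the abstract suggests.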
