China Economic Journal

A new algorithm for matching Chinese NBS firm-level with customs data

Abstract

Combining accounting-type firm data with transactions-type customs data has become increasingly important for research in international and industrial economics. The statistical authorities in several countries, such as the United States and France, provide such linked data without details on the sources, so researchers have to assume that the matching is correct and that the firm identifiers in the source data are unique and flawless. For other countries, such as Switzerland and China, firm and customs data contain information that permits such linking ex post, using string matching based on firm names and meta-information such as addresses. Because of spelling variations and typos, such matching is prone to errors. Obtaining the largest possible number of high-quality matches helps avoid potential biases while retaining crucial details. We report on a new algorithm that considerably improves on the hitherto available efforts to link the National Bureau of Statistics (NBS) firm-level data and the customs trade data for China.
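
To make the idea of ex post linking by string matching more concrete, here is a minimal Python sketch. It is not the algorithm reported in the paper, whose details are not given in this abstract. It only illustrates the general approach of comparing normalized firm names and using addresses as a secondary check. The record keys (`id`, `name`, `address`), the function names and the two cutoff values are assumptions chosen for illustration.

```python
from difflib import SequenceMatcher


def normalize(text):
    """Lowercase and drop everything except letters and digits."""
    return "".join(ch for ch in text.lower() if ch.isalnum())


def similarity(a, b):
    """Similarity ratio in [0, 1] between two normalized strings."""
    return SequenceMatcher(None, normalize(a), normalize(b)).ratio()


def link_records(firms, customs, name_cutoff=0.9, address_cutoff=0.6):
    """Link each firm record to its best-scoring customs record, if any.

    Both inputs are lists of dicts with the hypothetical keys
    "id", "name" and "address". The cutoffs are illustrative only.
    """
    links = {}
    for firm in firms:
        best_id, best_score = None, 0.0
        for record in customs:
            name_score = similarity(firm["name"], record["name"])
            if name_score < name_cutoff:
                continue
            # Exact name matches are accepted outright; near matches must
            # also agree on the address to limit false positives.
            if name_score < 1.0:
                addr_score = similarity(firm["address"], record["address"])
                if addr_score < address_cutoff:
                    continue
            if name_score > best_score:
                best_id, best_score = record["id"], name_score
        if best_id is not None:
            links[firm["id"]] = best_id
    return links
```

The sketch compares every firm with every customs record, which is quadratic in the number of records. On data of realistic size one would probably first restrict candidates to a block such as the same region or postal code, and the cutoffs would need to be tuned against hand-checked matches.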