首页> 外文期刊>Earth System Science Data >The National Eutrophication Survey: lake characteristics and historical nutrient concentrations
【24h】

The National Eutrophication Survey: lake characteristics and historical nutrient concentrations

机译:全国富营养化调查:湖泊特征和历史营养物浓度

获取原文
           

摘要

Historical ecological surveys serve as a?baseline and provide context for contemporary research, yet many of these records are not preserved in a?way that ensures their long-term usability. The National Eutrophication Survey (NES) database is currently only available as scans of the original reports (PDF files) with no embedded character information. This limits its searchability, machine readability, and the ability of current and future scientists to systematically evaluate its contents. The NES data were collected by the US Environmental Protection Agency between 1972 and 1975 as part of an effort to investigate eutrophication in freshwater lakes and reservoirs. Although several studies have manually transcribed small portions of the database in support of specific studies, there have been no systematic attempts to transcribe and preserve the database in its entirety. Here we use a?combination of automated optical character recognition and manual quality assurance procedures to make these data available for analysis. The performance of the optical character recognition protocol was found to be linked to variation in the quality (clarity) of the original documents. For each of the four archival scanned reports, our quality assurance protocol found an error rate between 5.9 and 17?%. The goal of our approach was to strike a?balance between efficiency and data quality by combining entry of data by hand with digital transcription technologies. The finished database contains information on the physical characteristics, hydrology, and water quality of about 800 lakes in the contiguous US (Stachelek et?al.(2017). Ultimately, this database could be combined with more recent studies to generate meta-analyses of water quality trends and spatial variation across the continental US.
机译:历史生态调查是当代研究的基线,并提供了背景资料,但许多记录并未以确保其长期可用性的方式保存。国家富营养化调查(NES)数据库当前仅可用于原始报告(PDF文件)的扫描,而没有嵌入式字符信息。这限制了它的可搜索性,机器可读性以及当前和将来的科学家系统地评估其内容的能力。 NES数据是美国环境保护局在1972年至1975年之间收集的,是研究淡水湖泊和水库富营养化的一部分。尽管一些研究已经手动转录了数据库的一小部分以支持特定研究,但还没有系统地尝试转录和完整保存数据库。在这里,我们结合使用了自动光学字符识别和手动质量保证程序,以使这些数据可用于分析。发现光学字符识别协议的性能与原始文档的质量(清晰度)的变化有关。对于这四个档案扫描报告中的每一个,我们的质量保证协议都发现错误率在5.9%到17%之间。我们方法的目标是通过将手工输入的数据与数字转录技术相结合,在效率和数据质量之间取得平衡。完整的数据库包含有关美国连续约800个湖泊的物理特征,水文和水质的信息(Stachelek et al。(2017)。最终,该数据库可以与更近期的研究相结合,以进行水的荟萃分析美国大陆各地的水质趋势和空间变化。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号