首页> 外文期刊>International Journal of Digital Curation >Migration to Intermediate XML for Electronic Data (MIXED): Repository of Durable File Format Conversions
【24h】

Migration to Intermediate XML for Electronic Data (MIXED): Repository of Durable File Format Conversions

机译:迁移到电子数据的中间XML(MIXED):持久文件格式转换的存储库

获取原文
           

摘要

Data Archiving and Networked Services (DANS), the Dutch scientific data archive for the social sciences and humanities, is engaged in the Migration to Intermediate XML for Electronic Data (MIXED) project to develop open source software that implements the smart migration strategy concerning the long-term archiving of file formats. Smart migration concerns the conversion upon ingest of specific kinds of data formats, such as spreadsheets and databases, to an intermediate XML formatted file. It is assumed that the long-term curation of the XML files is much less problematic than the migration of binary source files and that the intermediate XML file can be converted in an efficient way to file formats that are common in the future. The features of the intermediate XML files are stored in the so-called Standard Data Formats for Preservation (SDFP) specification. This XML schema can be considered an umbrella as it contains existing formal descriptions of file formats developed by others. SDFP also contain schemata developed by DANS, for example, a schema for file-oriented databases. It can be used, for example, for the binary DataPerfect format, that was used on a large scale about twenty years ago, and for which no existing XML schema could be found. The software developed in the MIXED project has been set up as a generic framework, together with a number of plug-ins. It can be considered as a repository of durable file format conversions. This paper contains an overview of the results of the MIXED project.
机译:荷兰社会科学和人文科学数据档案馆数据归档和网络服务(DANS)参与了电子数据中间XML的迁移(MIXED)项目,以开发开源软件,该软件可实现涉及长期的智能迁移策略文件格式的长期归档。智能迁移涉及在将特定类型的数据格式(例如电子表格和数据库)提取后转换为中间XML格式的文件。假定XML文件的长期管理比二进制源文件的迁移要容易得多,并且可以以有效的方式将中间XML文件转换为将来常见的文件格式。中间XML文件的功能存储在所谓的“标准数据保存格式”(SDFP)规范中。可以将这个XML模式视为一个伞,因为它包含其他人开发的文件格式的现有形式描述。 SDFP还包含DANS开发的模式,例如,面向文件数据库的模式。例如,它可以用于二进制DataPerfect格式,该格式已在20年前大规模使用,并且找不到现有的XML模式。 MIXED项目中开发的软件已与许多插件一起设置为通用框架。它可以被视为持久文件格式转换的存储库。本文概述了MIXED项目的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利