首页> 外文OA文献 >Building a Dictionary using XML Technology
【2h】

Building a Dictionary using XML Technology

机译:使用XmL技术构建字典

摘要

In this article we describe the workflow implemented to convert a dictionary saved as a PDF file into an XML document and posterior importation into an XML aware database, and the process to edit, add and delete new entries. The conversion process was challenging given the format of the PDF file, and the fine grained detail of the XML schema that was used. For that, an iterative filtering approach was used. To store the dictionary we decided to use an XML aware database (eXist-DB), that stores each dictionary entry as a separate resource. It can be queried used a web interface developed using XQuery. The lexicographers can edit entries using the oXygen XML editor, reading and storing them directly in the database. In order to guarantee incremental backups, it was defined a mechanism to import the XML database into a GIT repository. Finally, a couple of programs were created in order to prepare regular reports on the dictionary revision process, as well as to backup it in a GIT repository.
机译:在本文中,我们描述了实现将保存为PDF文件的字典转换为XML文档并向后导入XML感知数据库的工作流,以及编辑,添加和删除新条目的过程。考虑到PDF文件的格式以及所使用的XML模式的细粒度细节,转换过程具有挑战性。为此,使用了迭代过滤方法。为了存储字典,我们决定使用XML感知数据库(eXist-DB),该数据库将每个字典条目存储为单独的资源。可以使用XQuery开发的Web界面查询。词典编辑者可以使用oXygen XML编辑器编辑条目,并将其直接读取并存储在数据库中。为了保证增量备份,定义了一种将XML数据库导入GIT存储库的机制。最后,创建了两个程序,以准备有关字典修订过程的常规报告,并将其备份到GIT存储库中。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号