Transforming the arχiv to XML

机译：将Arχiv转化为XML

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We describe an experiment of transforming large collections of LATEX documents to more machine-understandable representations. Concretely, we are translating the collection of scientific publications of the Cornell e-Print Archive (ARXIV) using the LATEX to XML converter which is currently under development. The main technical task of our ARXMLIV project is to supply LaTeXML bindings for the (thousands of) LATEX classes and packages used in the ARXIV collection. For this we have developed a distributed build system that reiteratively runs LaTeXML over the ARXIV collection and collects statistics about e.g. the most sorely missing LaTeXML bindings and clusters common error events. This creates valuable feedback to both the developers of the LaTeXML package and to binding implementers. We have now processed the complete ARXIV collection of more than 400,000 documents from 1993 until 2006 (one run is a processor-year-size undertaking) and have continuously improved our success rate to more than 56% (i.e. over 56% of the documents that are LATEX have been converted by LaTeXML without noticing an error and are available as XHTML+MathML documents).

机译：我们描述了将大型乳胶文件转变为更多机可理解的陈述的实验。具体地，我们正在使用目前正在开发的乳胶到XML转换器转换康奈尔电子印刷存档（ARXIV）的科学出版物的集合。我们的ARXMLIV项目的主要技术任务是为ARXIV集合中使用的（数千个）乳胶类和软件包提供乳胶绑定。为此，我们开发了一个分布式构建系统，在Arxiv收集中重复运行LaTeXML，并收集大约一节的统计信息。最严重缺少的乳胶绑定和群集常见错误事件。这为乳胶包的开发人员和绑定实施者创造了有价值的反馈。我们现在已从1993年从1993年处理了超过40万个文件的完整Arxiv收集，直到2006年（一跑是一个处理器年级的承诺），并使我们的成功率不断提高到56％以上（即超过56％的文件乳胶是否已被乳胶转换，而不会注意到错误，并且可用作XHTML + MathML文档）。

著录项

来源
《International Conference on Intelligent Computer Mathematics》|2008年||共9页
会议地点
作者
Heinrich Stamerjohanns; Michael Kohlhase;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP18-53;
关键词
入库时间 2022-08-20 21:22:45

相似文献

外文文献
中文文献
专利

1. The New CLOCITCLOCIT Irradiation Facility for ^{4040 Ar/ ^{3939 Ar Geochronology: Characterisation, Comparison with CLICITCLICIT and Implications for High‐Precision Geochronology}} [J] . Rutte Daniel, Becker Tim A., Deino Alan L., Geostandards and geoanalytical research . 2018,第3期

机译：new clocit clocit照射工具为^{40 40ar / ^{39 39 AR地理学：表征，与}}
2. Noble gas composition, cosmic‐ray exposure age, ^{3939 Ar‐ ^{4040 Ar, and I‐Xe analyses of ungrouped achondrite NWANWA 7325}} [J] . Hopp Jens, Schr?ter Natalie, Pravdivtseva Olga, Meteoritics & planetary science . 2018,第6期

机译：贵族气体成分，宇宙射线暴露年龄， ^{39 39 ar- ^{40 40 AR和I-XE分析未分组的achondrite nwa nwa 7325.}}
3. Observation of transitions involving core-excited states in Ar III and Ar IV and high-lying singly excited states in Ar I-Ar IV [J] . Nandi T., Mishra A.P., Jagatap B.N. Journal of Quantitative Spectroscopy & Radiative Transfer . 2011,第18期

机译：观察到涉及Ar III和Ar IV中的核心激发态和Ar I-Ar IV中的高位单激发态的跃迁
4. Transforming the arχiv to XML [C] . Heinrich Stamerjohanns, Michael Kohlhase International Conference on Intelligent Computer Mathematics . 2008

机译：将Arχiv转化为XML
5. A generalized way of transforming meteorological data into XML. [D] . Chen, Mingli. 2005

机译：将气象数据转换为XML的通用方法。
6. Rethinking Molecular Mimicry in Rheumatic Heart Disease and Autoimmune Myocarditis: Laminin Collagen IV CAR and B1AR as Initial Targets of Disease [O] . Robert Root-Bernstein 2014

机译：风湿性心脏病和自身免疫性心肌炎的分子模仿的反思：层粘连蛋白胶原IVCAR和B1AR作为疾病的最初目标
7. Arşiv Malzemesini Tahrib Eden Unsurlar, Bunlara Karşı Korunma Metodları ve Arşiv Malzemesinin Restorasyonu [O] . Binark İsmet 1988

机译：破坏档案材料的要素，保护它们的方法和恢复档案材料

Transforming the arχiv to XML

摘要

著录项

相似文献

相关主题

期刊订阅