首页> 外文期刊>SoftwareX >TabbyXL: Software platform for rule-based spreadsheet data extraction and transformation
【24h】

TabbyXL: Software platform for rule-based spreadsheet data extraction and transformation

机译:TabbyXL:用于基于规则的电子表格数据提取和转换的软件平台

获取原文
           

摘要

Spreadsheets are widely used in science, engineering, business, and other activities. Overall, they conceal a large volume of data in a form intended to be interpreted by humans. We present a novel software platform facilitated for liberating such data. It provides rule-based spreadsheet data extraction and transformation to a structured form. Its core consists of a flexible table object model and a domain-specific rule language for table analysis. They serve to represent knowledge of table layout and content features, as well as their interpretation depending on transformation goals. This enables processing arbitrary tables originating from various domains. Our empirical results demonstrate that one ruleset can be applied to process arbitrary tables having the same features of layout, style, or content. The paper also describes two applications using the software platform to develop programs for rule-based converting data from arbitrary spreadsheet tables.
机译:电子表格广泛用于科学,工程,商业和其他活动。总体而言,它们以旨在由人类解释的形式隐藏了大量数据。我们提出了一种新颖的软件平台,可方便地释放此类数据。它提供基于规则的电子表格数据提取和转换为结构化形式。它的核心包括灵活的表对象模型和用于表分析的特定于域的规则语言。它们用于表示表布局和内容功能的知识,以及根据转换目标进行的解释。这使得能够处理源自各个域的任意表。我们的经验结果表明,一个规则集可用于处理具有相同布局,样式或内容特征的任意表。本文还描述了两个使用该软件平台的应用程序,以开发用于从任意电子表格中进行基于规则的数据转换的程序。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号