首页> 外国专利> Method for extracting, interpreting and standardizing tabular data from unstructured documents

Method for extracting, interpreting and standardizing tabular data from unstructured documents

机译：从非结构化文档中提取，解释和标准化表格数据的方法

页面导航

摘要
著录项
相似文献

摘要

A system, method, and computer program for automatically identifying, parsing, and interpreting tabular data from unstructured documents stored in various formats such as ASCII text, Unicode text, HTML, PDF text, and PDF image format is provided. A set of table identification, parsing/tokenizing, and interpreting/mapping rules are developed with grammar descriptors. These rules are then applied to a set of documents to identify a table, parse the content of the table, and interpret the parsed content, if required, thereby standardizing the tabular data.

机译：提供了一种用于自动识别，解析和解释来自以各种格式存储的非结构化文档的表格数据的系统，方法和计算机程序，该非结构化文档以各种格式存储，例如ASCII文本，Unicode文本，HTML，PDF文本和PDF图像格式。使用语法描述符开发了一组表标识，解析/标记和解释/映射规则。然后，将这些规则应用于一组文档以标识表，解析表的内容并解释解析的内容（如果需要），从而标准化表格数据。

著录项

公开/公告号US7590647B2

专利类型
公开/公告日2009-09-15

原文格式PDF
申请/专利权人 VENKATESAN SRINIVASAN;MAHANTESH KOTHIWALE;RUMMANA ALAM;SRINIVASAN BHARADWAJ;
展开▼

申请/专利号US20050140340
发明设计人 VENKATESAN SRINIVASAN;MAHANTESH KOTHIWALE;RUMMANA ALAM;SRINIVASAN BHARADWAJ;
展开▼

申请日2005-05-27
分类号G06F7;G06F9/44;G06F3;
国家 US
入库时间 2022-08-21 19:32:57

相似文献

专利
外文文献
中文文献

1. Tolerance Study for Standardized iMacleaya cordata/iExtract Added to Chicken Layer Diet [J] . Ray A. Matulka ,Sophie von Alvensleben ,Mauro Morlacchini . 动物科学期刊（英文） . 2018,第1期

2. Residue Study for a Standardized iMacleaya cordata/iExtract in Growing-Finishing Swine [J] . Lu Zhao ,Ray A. Matulka ,Sophie von Alvensleben . 动物科学期刊（英文） . 2017,第2期

3. Safety Evaluation of a Standardized iMacleaya cordata/iExtract in a Ninety Day Feeding Study in Weaned Piglets [J] . Lu Zhao ,Sophie von Alvensleben ,Giorgio Fusconi . 动物科学期刊（英文） . 2017,第2期

4. An Improved Fine-Grained Encryption Method for Unstructured Big Data [J] . Changli Zhou1 ,Chunguang Ma1 ,Songtao Yang1 . 国际计算机前沿大会会议论文集 . 2015,第001期

5. 数据元素方法理论研究及在油田数据标准化中的应用 [C] . 袁满 . 2005石油数据管理与应用国际学术研讨会 . 2005

6. 非结构化文档的版面分析及表格提取 [A] . 张昊玥 . 2019