Learning Table Extraction from Examples


Abstract

Information extraction from tables in web pages is a challenging problem because table formats are diverse and attribute names vary in vocabulary. This paper presents a new approach to automated table extraction that exploits formatting cues in semi-structured HTML tables, learns lexical variants from training examples, and uses a vector space model to handle non-exact matches among labels. We conducted experiments with this method on a set of tables collected from 157 university web sites and obtained an F1-measure of 91.4% for information extraction, showing the effectiveness of combining structural table parsing with example-based label learning.
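The label-matching idea in the abstract can be illustrated with a minimal sketch. It assumes plain term-count vectors and cosine similarity; the abstract does not state the paper's actual term weighting, tokenization, or threshold, so the function names (`build_profiles`, `match_label`), the `threshold` value, and the training labels below are all hypothetical and chosen only for illustration.

```python
import math
import re
from collections import Counter


def tokenize(label):
    """Lowercase a label and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", label.lower())


def cosine(a, b):
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(count * b[term] for term, count in a.items() if term in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def build_profiles(examples):
    """Sum the token counts of every training label seen for each attribute."""
    profiles = {}
    for attribute, variants in examples.items():
        counts = Counter()
        for variant in variants:
            counts.update(tokenize(variant))
        profiles[attribute] = counts
    return profiles


def match_label(label, profiles, threshold=0.3):
    """Return (attribute, score) for the closest profile, or (None, score)
    when no profile reaches the (assumed) similarity threshold."""
    vector = Counter(tokenize(label))
    best_attribute, best_score = None, 0.0
    for attribute, profile in profiles.items():
        score = cosine(vector, profile)
        if score > best_score:
            best_attribute, best_score = attribute, score
    if best_score < threshold:
        return None, best_score
    return best_attribute, best_score


# Hypothetical training examples: attribute -> label variants seen in tables.
EXAMPLES = {
    "tuition": ["Tuition", "Tuition and fees", "Tuition fee"],
    "housing": ["Room and board", "Housing", "Room"],
}
PROFILES = build_profiles(EXAMPLES)
```

Calling `match_label("Tuition & Fees", PROFILES)` maps an unseen label variant to the `tuition` attribute even though that exact string never appeared in the examples, while a label that shares no tokens with any profile is left unmatched.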

Bibliographic details

Source: 20th International Conference on Computational Linguistics, vol. 2 | 2004 | pp. 987-993 | 7 pages
Conference location: Geneva (CH)
Authors: Ashwin Tengli; Yiming Yang; Nian Li Ma
Author affiliation: School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213
Original format: PDF
Language: English
Chinese Library Classification: Programming languages, algorithmic languages

