首页> 外文会议>Insternational Joint Conference on Natural Language Processing >Mining Table Information on the Internet

【24h】

Mining Table Information on the Internet

机译：互联网上的挖掘表信息

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Making HTML documents, the authors use various methods for clearly conveying their intension. In those various methods, this paper pays special attention to tables because tables are commonly used within many documents to make the meanings clear, which are well recognized because web documents use tags for additional information. On the Internet, tables are used for the purpose of the knowledge structuring as well as design of documents. Thus, we are firstly interested in classifying tables into two types: meaningful tables and decorative tables. However, this is not easy because HTML does not separate presentation and structure. This paper proposes a method of extracting meaningful tables using a modified k-means and compares it with other methods. The experiment results show that classifying on web documents is promising.

机译：制作HTML文件，作者使用各种方法来清楚地传达其内涵。在这些各种方法中，本文对表格表示特别关注表，因为表通常在许多文档中使用，以使含义清晰，这很清楚，因为Web文档使用标签以获取其他信息。在互联网上，表格用于知识结构的目的以及文档的设计。因此，我们首先对分类表分为两种类型：有意义的表和装饰表。但是，这并不容易，因为HTML不分隔演示和结构。本文提出了一种使用改进的k型方式提取有意义表的方法，并将其与其他方法进行比较。实验结果表明，在Web文件上进行分类是有前途的。

著录项

来源
《Insternational Joint Conference on Natural Language Processing 》|2004年||共6页
会议地点
作者
Sung-won Jung; Gi-deuk Han; Hyuk-chul Kwon; Association for Computational Linguistics(ACL); Association for Computational Linguistics and Chinese Language Processing(ACLCLP); Association of Natural Language Processing(ANLP);
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类程序语言、算法语言 ;
关键词

相似文献

外文文献
中文文献
专利

1. Malware Detection Through Mining Symbol Table of Linux Executables [J] . Jinrong Bai, Yanrong Yang, Shiguang Mu, Information Technology Journal . 2013 ,第2期

机译：通过挖掘Linux可执行文件的符号表进行恶意软件检测
2. Malware Detection Through Mining Symbol Table of Linux Executables [J] . Jinrong Bai, Yanrong Yang, Shiguang Mu, Information Technology Journal . 2013 ,第2期

机译：恶意软件检测通过Linux可执行文件的挖掘符号表
3. Effects of Groundwater Table Decline on Vegetation Transpiration in an Arid Mining Area: A Case Study of the Yushen Mining Area, Shaanxi Province, China [J] . Wang Qiangmin, Dong Shuning, Wang Hao, Mine water and the environment . 2020 ,第4期

机译：地下水位下降对干旱矿区植被蒸腾的影响 - 以陕西省玉仁矿区为例
4. Mining Table Information on the Internet [C] . Sung-won Jung, Gi-deuk Han, Hyuk-chul Kwon International Joint Conference on Natural Language Processing . 2005

机译：互联网上的挖掘表信息
5. Mining rules in single-table and multiple-table databases. [D] . Cristofor, Laurentiu Bogdan. 2002

机译：单表和多表数据库中的挖掘规则。
6. Health Risk Assessment of Dietary Heavy Metals Intake from Fruits and Vegetables Grown in Selected Old Mining Areas—A Case Study: The Banat Area of Southern Carpathians [O] . Dan Nicolae Manea, Anişoara Aurelia Ienciu, Ramona Ştef, 2020

机译：选定的旧采矿区生长的膳食重金属摄入的健康风险评估 - 以案例研究：南喀尔巴阡植物的巴纳特地区
7. An Improvised Frequent Pattern Tree Based Association Rule Mining Technique with Mining Frequent Item Sets Algorithm and a Modified Header Table [O] . Agarwal, Vandit, Kushal, Mandhani, Kumar, Dr. Preetham 2015

机译：基于简易频繁模式树的关联规则挖掘挖掘频繁项集算法和修改后的标题技术表

Mining Table Information on the Internet

摘要

著录项

相似文献

相关主题

期刊订阅