A New Vision-Based Method for Extracting Academic Information from Conference Web Pages

机译：一种新的基于视觉信息，用于从会议网页提取学术信息

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper proposes a new vision-based method for extracting academic information from conference Web pages. The main contributions include: (1) An new vision-based page segmentation algorithm is proposed to improve the result of classical VIPS algorithm. This algorithm can divide pages into text blocks. (2) All text blocks are classified as 10 categories according to vision features, keyword features and text content features. The initial classification results have 75% precision and 67% recall. (3) The context information of text blocks are employed to repair and refine initial classification results, which are improved to 96% precision and 98% recall. Finally, academic information is extracted from classified text blocks. Our experimental results on real-world datasets show that the proposed method is effective and efficient for extracting academic information from conference Web pages.

机译：本文提出了一种新的基于视觉信息，用于从会议网页提取学术信息。主要贡献包括：（1）提出了一种新的基于视觉的页面分段算法来改善经典VIP算法的结果。此算法可以将页面划分为文本块。（2）根据Vision功能，关键字功能和文本内容功能，所有文本块都被分类为10类。初始分类结果具有75％的精确度和67％的召回。（3）采用文本块的上下文信息来修复和改进初始分类结果，其提高到96％的精度和98％的召回。最后，从分类的文本块中提取学术信息。我们对现实世界数据集的实验结果表明，该方法是从会议网页提取学术信息的有效和有效。

著录项

来源
《International Conference on Tools with Artificial Intelligence》|2012年||共6页
会议地点
作者
Wang Peng; Zhou Mingqi; You Yue; Zhang Xiang;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP18-53;
关键词
Web information extraction; Web page segmentation; bayesian network classifier;

机译：Web信息提取;网页分段;贝叶斯网络分类器;
入库时间 2022-08-21 10:23:20

相似文献

外文文献
中文文献
专利

1. Book Review: Essential Mathematical Methods for Physicists. By Hans Weber and George Arfken, Academic Press, San Diego, California, U.S.A., 2003, xxii+932 pp., $89.95 (hardcover). ISBN 0-12-059877-9 [J] . Donald Spector Foundations of Physics . 2004,第8期

机译：书评：物理学家的基本数学方法。汉斯·韦伯（Hans Weber）和乔治·阿夫肯（George Arfken），美国加利福尼亚州圣地亚哥，学术出版社，2003年，xxii + 932页，89.95美元（精装）。书号0-12-059877-9
2. Evaluating webinar‐based training: a mixed methods study of trainee reactions toward digital web conferencing [J] . Gegenfurtner Andreas, Zitt Alexander, Ebner Christian International Journal of Training and Development . 2020,第1期

机译：评估基于网络研讨会的培训：一种混合方法研究实习生对数字网会议的影响
3. Box clustering segmentation: A new method for vision-based web page preprocessing [J] . Jan Zeleny, Radek Burget, Jaroslav Zendulka Information Processing & Management . 2017,第3期

机译：框群分割：基于视觉的网页预处理的新方法
4. A New Vision-Based Method for Extracting Academic Information from Conference Web Pages [C] . Wang Peng, Zhou Mingqi, You Yue, IEEE International Conference on Tools with Artificial Intelligence . 2012

机译：从会议网页中提取学术信息的基于视觉的新方法
5. Web Conference vs. Webcast: The Perceived Effectiveness of Training Sessions at a Southeastern Community College [D] . Jones, Jenny Bailey. 2017

机译：网络会议与网络广播：东南社区学院培训课程的感知效果
6. Proceedings of the 22nd Academic Conference of the Bauru School of Dentistry Dr. Waldyr Antonio Janson the 16th Academic Conference of Speech-Language Pathology and Audiology of the Bauru School of Dentistry Dr. Kátia de Freitas Alvarenga and the 3rd Meeting of Latin American Region of the IADR and 8th Meeting of the Venezuelan Division of the IADR [O] . 2009

机译：鲍鲁牙科学院第22届学术会议论文集。 Waldyr Antonio JansonBauru牙科学院第16届语言病理学和听力学学学术会议 Dr. Kátiade Freitas Alvarenga和IADR拉丁美洲地区第三次会议以及IADR委内瑞拉分部第八次会议
7. A Linked Data Generation Method for Academic Conference Websites [O] . Peng Wang, Mingqi Zhou, Xiang Zhang, 2014

机译：学术会议网站的关联数据生成方法

A New Vision-Based Method for Extracting Academic Information from Conference Web Pages

摘要

著录项

相似文献

相关主题

期刊订阅