Semantics-based Extraction of Webpage Main Text

Abstract

Extracting the main text of a webpage is one of the most efficient ways to improve a search engine. The traditional method uses the similarity of DOM sub-trees as the end condition for traversing the DOM tree, but its speed is unsatisfactory on complex webpage structures. To raise both the speed and the accuracy of DOM sub-tree traversal, we propose a method called Semantics-based Extraction of Webpage Main Text.
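The abstract does not spell out either algorithm, so the following is only an illustrative sketch of the general idea it mentions: walking a DOM tree and using sub-tree similarity as the end condition. It is not the authors' method. The names `Node`, `TreeBuilder`, `find_main_text`, the Jaccard tag-set similarity, and the 0.8 threshold are all assumptions made for this example.

```python
from html.parser import HTMLParser

# Elements that never have a closing tag in HTML.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img",
             "input", "link", "meta", "source", "track", "wbr"}


class Node:
    """Minimal DOM-like node (hypothetical helper for this sketch)."""
    def __init__(self, tag, attrs=None, parent=None):
        self.tag = tag
        self.attrs = dict(attrs or [])
        self.parent = parent
        self.children = []
        self.text = ""  # text that sits directly inside this element


class TreeBuilder(HTMLParser):
    """Builds a Node tree from HTML using the standard-library parser."""
    def __init__(self):
        super().__init__()
        self.root = Node("#root")
        self.current = self.root

    def handle_starttag(self, tag, attrs):
        node = Node(tag, attrs, self.current)
        self.current.children.append(node)
        if tag not in VOID_TAGS:
            self.current = node

    def handle_endtag(self, tag):
        # Close the nearest open element with this tag, if there is one.
        node = self.current
        while node is not None and node.tag != tag:
            node = node.parent
        if node is not None and node.parent is not None:
            self.current = node.parent

    def handle_data(self, data):
        self.current.text += data.strip()


def text_length(node):
    """Total number of text characters in the sub-tree rooted at node."""
    return len(node.text) + sum(text_length(c) for c in node.children)


def tags(node):
    """Set of tag names that occur in the sub-tree rooted at node."""
    found = {node.tag}
    for child in node.children:
        found |= tags(child)
    return found


def similarity(a, b):
    """Assumed similarity measure: Jaccard overlap of the two tag sets."""
    ta, tb = tags(a), tags(b)
    return len(ta & tb) / len(ta | tb)


def find_main_text(node, threshold=0.8):
    """Descend toward the text-heaviest child; stop when the text-bearing
    children of the current node are all similar to the first one."""
    kids = [c for c in node.children if text_length(c) > 0]
    if not kids:
        return node
    if len(kids) >= 2 and all(similarity(kids[0], k) >= threshold for k in kids[1:]):
        return node  # end condition: sibling sub-trees look alike
    return find_main_text(max(kids, key=text_length), threshold)


SAMPLE = (
    "<html><body>"
    "<div id='nav'><a>Home</a></div>"
    "<div id='content'>"
    "<p>First paragraph of the article body.</p>"
    "<p>Second paragraph of the article body.</p>"
    "</div></body></html>"
)

builder = TreeBuilder()
builder.feed(SAMPLE)
main_node = find_main_text(builder.root)
print(main_node.tag, main_node.attrs)
```

On the sample page the traversal skips the navigation block and stops at the container whose children are two structurally identical paragraphs. A real extractor would need a more robust similarity measure and handling for scripts, styles, and malformed markup.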

Bibliographic details

Source: 2012 Eighth International Conference on Semantics, Knowledge and Grids, 2012, pp. 181-184 (4 pages)
Conference location: Beijing (CN)
Authors: Han Fengjiao; Zhou Zhurong
Author affiliation: College of Computer and Information Science, Southwest University, Chongqing, China (listed for both authors)
Original format: PDF
Language: English
Chinese Library Classification: Program design
Keywords: Semantics; Extraction; Webpage

Similar literature

1. Extracting Chemical Reactions from Thai Text for Semantics-Based Information Retrieval [J]. Peerasak INTARAPAIBOON, Ekawit NANTAJEEWARAWAT, Thanaruk THEERAMUNKONG. IEICE Transactions on Information and Systems, 2011, No. 3.
2. Webpage reading: Psychophysiological correlates of emotional arousal and regulation predict multiple-text comprehension [J]. Mason Lucia, Scrimin Sara, Zaccoletti Sonia. Computers in Human Behavior, 2018, No. OCTa.
3. Semantics-Based Extraction of Webpage Main Text [C]. Fengjiao Han, Zhurong Zhou. 2012 Eighth International Conference on Semantics, Knowledge and Grids, 2012.
4. Semantics-based language models for information retrieval and text mining [D]. Zhou, Xiaohua. 2008.
5. Layout-aware text extraction from full-text PDF of scientific articles [O]. Cartic Ramakrishnan, Abhishek Patnia, Eduard Hovy. 2012.
6. Improving Webpage Content Extraction by extending a novel single page extraction approach: A case study with Thai websites [O]. Thanadechteemapat W., Fung C.C. 2012.
7. Semantics-Based Reference Resolution in Technical Text Processing: An Exploration of Using the WordNet Database in the Computerized Comprehensibility System [R]. Kieras, D. E. 1992.
