Semiautomatic Extraction of Topic Maps from Web Pages Using Clustering with Web Contents and Structure


Abstract

In this paper, we describe a method for semi-automatically extracting Topic Maps from a set of Web pages. We extend an existing clustering method in two ways. First, we merge only linked Web pages, so that the clusters reflect the underlying relationships among topics. Second, to generate dense clusters, we define similarity using the contents of the Web pages, the types of links between them, and the distance between the directories in which the pages are located. From the clustering result, we generate the topic map by treating the clusters as topics, the edges as associations, and the Web pages related to each topic as occurrences. We experimentally extracted a topic map and evaluated it.
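The pipeline outlined in the abstract can be illustrated with a minimal Python sketch. The abstract does not give the paper's similarity formula, link-type weights, linkage rule, or stopping criterion, so everything concrete here is an assumption made for illustration: Jaccard overlap of term sets as the content measure, a numeric weight per link standing in for link type, the mixing weights `w`, average linkage, the `threshold` stopping rule, and the example URLs.

```python
from itertools import combinations
from pathlib import PurePosixPath
from urllib.parse import urlparse


def content_similarity(a: set, b: set) -> float:
    """Jaccard overlap of term sets (assumed stand-in for the content measure)."""
    return len(a & b) / len(a | b) if a | b else 0.0


def directory_distance(url_a: str, url_b: str) -> int:
    """Directory steps between the folders that hold the two pages."""
    pa = PurePosixPath(urlparse(url_a).path).parent.parts
    pb = PurePosixPath(urlparse(url_b).path).parent.parts
    common = 0
    for x, y in zip(pa, pb):
        if x != y:
            break
        common += 1
    return (len(pa) - common) + (len(pb) - common)


def page_similarity(pages, links, u, v, w=(0.5, 0.3, 0.2)):
    """Weighted mix of content, link type, and directory closeness (weights assumed)."""
    link_w = max(links.get((u, v), 0.0), links.get((v, u), 0.0))
    return (w[0] * content_similarity(pages[u], pages[v])
            + w[1] * link_w
            + w[2] / (1 + directory_distance(u, v)))


def cluster_linked(pages, links, threshold=0.5):
    """Agglomerative clustering that only merges clusters joined by a link."""
    clusters = [frozenset([u]) for u in pages]

    def linked(a, b):
        return any((u, v) in links or (v, u) in links for u in a for v in b)

    def avg_sim(a, b):
        total = sum(page_similarity(pages, links, u, v) for u in a for v in b)
        return total / (len(a) * len(b))

    while True:
        candidates = [(avg_sim(a, b), a, b)
                      for a, b in combinations(clusters, 2) if linked(a, b)]
        if not candidates:
            break
        best, a, b = max(candidates, key=lambda c: c[0])
        if best < threshold:
            break
        clusters = [c for c in clusters if c not in (a, b)] + [a | b]
    return clusters


def to_topic_map(clusters, links):
    """Clusters become topics, cross-cluster links associations, pages occurrences."""
    owner = {u: i for i, c in enumerate(clusters) for u in c}
    associations = {tuple(sorted((owner[u], owner[v])))
                    for (u, v) in links if owner[u] != owner[v]}
    return {
        "topics": list(range(len(clusters))),
        "associations": sorted(associations),
        "occurrences": {i: sorted(c) for i, c in enumerate(clusters)},
    }


# Hypothetical input: page URL -> term set, (source, target) -> link-type weight.
JAZZ = "http://example.org/jazz/index.html"
MILES = "http://example.org/jazz/miles.html"
ROCK = "http://example.org/rock/index.html"
pages = {
    JAZZ: {"jazz", "music", "band"},
    MILES: {"jazz", "trumpet", "music"},
    ROCK: {"rock", "guitar", "music"},
}
links = {(JAZZ, MILES): 1.0, (JAZZ, ROCK): 0.5}

clusters = cluster_linked(pages, links)
topic_map = to_topic_map(clusters, links)
```

With this toy input, the two jazz pages are merged into one topic, the rock page stays a separate topic, and the link between the jazz index and the rock index becomes the single association. Restricting merge candidates to linked clusters is what keeps unlinked but textually similar pages apart, which is the first of the two changes the abstract describes.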


Bibliographic details

Source: Workshop on Collective Intelligence on Semantic Web, 2007, 4 pages
Authors: Mase Motohiro; Yamada Seiji; Nitta Katsumi; WI-IAT Workshops
Original format: PDF
Chinese Library Classification: TP18-53
Keywords: information extraction; Topic Maps; clustering

Similar literature

1. Multi Level Web Data Extraction Based Topical Visual Structure Clustering for Efficient Web Search [J]. Sureshkumar T, Shanthi N. Journal of Computational and Theoretical Nanoscience. 2017, No. 9.
2. Automatic sitemaps generation: Exploring website structures using block extraction and hyperlink analysis [J]. Shian-Hua Lin, Kuan-Pak Chu, Chun-Ming Chiu. Expert Systems with Applications. 2011, No. 4.
3. Relation Extraction from Web Contents with Linguistic and Web Features（言語分析およびWeb上の情報を用いたコンテンツからの関係の抽出） [J]. 顔玉蘭. 人工知能学会志. 2011, No. 1.
4. Semiautomatic Extraction of Topic Maps from Web Pages Using Clustering with Web Contents and Structure [C]. Mase Motohiro, Yamada Seiji, Nitta Katsumi. Workshop on Collective Intelligence on Semantic Web. 2007.
5. Websites through genre lenses: Recognizing emergent regularities in websites' content structure [D]. Symonenko, Svetlana. 2007.
6. 2StrucCompare: a webserver for visualizing small but noteworthy differences between protein tertiary structures through interrogation of the secondary structure content [O]. Elliot D Drew, Robert W Janes. 2019.
7. Extracting Topic Maps from Web histories by clustering with Web structure and contents [O]. Motohiro Mase. 2008.
