Automatic generation of semantically enriched web pages by a text mining approach

Hsin-Chang Yang

首页> 外文期刊>Expert systems with applications >Automatic generation of semantically enriched web pages by a text mining approach

【24h】

Automatic generation of semantically enriched web pages by a text mining approach

机译：通过文本挖掘方法自动生成语义丰富的网页

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Nowadays most of the Web pages contain little amount of structure and supporting information that can reveal their semantics or meanings. To enable automated processing of the Web pages, semantic infor-mation such as metadata and tags regarding to each page should be added to it. Several authoring tools have been developed to help users tackling this task. However, manual or semi-automatic authoring is implausible when we intend to annotate large amount of Web pages. In this work, we proposed a method to automatically generate some descriptive metadata and tags for a Web page. The idea is to apply the self-organizing map algorithm to cluster the Web pages and discover the relationships between these clusters. In the mean time, the themes of each cluster are also identified. We then use such relationships and themes to tag the Web pages and generate metadata for the Web pages. The result of experiments shows that our method may generate semantically relevant metadata and tags for the Web pages.

机译：如今，大多数Web页面都包含很少的结构和支持信息，这些信息可以揭示其语义或含义。为了能够自动处理Web页面，应该向其添加语义信息，例如与每个页面有关的元数据和标签。已经开发了多种创作工具来帮助用户完成此任务。但是，当我们打算注释大量Web页面时，手动或半自动创作是不可行的。在这项工作中，我们提出了一种为网页自动生成一些描述性元数据和标签的方法。这个想法是应用自组织映射算法对网页进行聚类并发现这些聚类之间的关系。同时，每个集群的主题也被确定。然后，我们使用这种关系和主题来标记Web页面并为Web页面生成元数据。实验结果表明，我们的方法可以为网页生成语义相关的元数据和标签。

著录项

来源
《Expert systems with applications》 |2009年第6期|9709-9718|共10页
作者
Hsin-Chang Yang;
展开▼
作者单位

Department of Information Management, National University of Kaohsiung, Kaohsiung 811, Taiwan, ROC;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
metadata generation; semantic tagging; text mining; self-organizing map;

机译：元数据生成;语义标记;文本挖掘;自组织图;

相似文献

外文文献
中文文献
专利

1. A method for automatic construction of learning contents in semantic web by a text mining approach [J] . Hsin-Chang Yang International journal of knowledge and learning . 2006,第1a2期

机译：一种基于文本挖掘的语义网学习内容自动构建方法
2. A text mining approach on automatic generation of web directories and hierarchies [J] . Hsin-Chang Yang, Chung-Hong Lee Expert Systems with Application . 2004,第4期

机译：自动生成Web目录和层次结构的文本挖掘方法
3. From Semantic Segmentation to Semantic Registration: Derivative-Free Optimization-Based Approach for Automatic Generation of Semantically Rich As-Built Building Information Models from 3D Point Clouds [J] . Xue Fan, Lu Weisheng, Chen Ke, Journal of Computing in Civil Engineering . 2019,第4期

机译：从语义分割到语义注册：基于无导数优化的方法，可从3D点云自动生成语义丰富的竣工建筑物信息模型
4. Automatic Metadata Generation forWeb Pages Using a Text Mining Approach [C] . Hsin-Chang Yang, Chung-Hong Lee . 2005

机译：使用文本挖掘方法自动生成网页的元数据
5. Methods of Enriching Domain Knowledge with Universal Semantics for Higher Text Mining Performance [D] . Qazanfari, Kazem . 2020

机译：以普通语义丰富域知识的方法，以获得更高的文本挖掘性能
6. Visual and Semantic Enrichment of Analytical ChemistryLiterature Searches by Combining Text Mining and Computational Chemistry [O] . Magnus Palmblad, * -1

机译：视觉和语义丰富的分析化学结合文本挖掘和计算化学进行文献检索
7. Text mining with semantic annotation : using enriched text representation for entity-oriented retrieval, semantic relation identification and text clustering [O] . Hou Jun 2014

机译：具有语义注释的文本挖掘：使用丰富的文本表示法进行面向实体的检索，语义关系识别和文本聚类

Automatic generation of semantically enriched web pages by a text mining approach

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅