Research on a Web Page Semantic Annotation Algorithm Based on the PLSA Model

王云英

Abstract

Efficient semantic annotation of Web pages is key to improving the utilization of Web information resources and to supporting knowledge innovation. To address the shortcomings of existing annotation methods, this paper designs a Web page semantic annotation algorithm based on the PLSA topic model, drawing on the structural features and text features of Web pages and on their topic distributions. The algorithm builds an independent PLSA topic model for the structural features and another for the text features, integrates and optimizes these models with an adaptive asymmetric learning approach, and uses the resulting comprehensive PLSA topic model to annotate unseen Web pages automatically. Experimental results show that the algorithm significantly improves the accuracy and efficiency of Web page semantic annotation and can handle large-scale Web page annotation effectively.

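Bibliographic details

Source
《情报杂志》 (Journal of Intelligence) | 2013, No. 1 | pp. 141-144 | 4 pages
Author
王云英 (Wang Yunying)
Affiliation

Library of Xiangnan University, Chenzhou 423000

Original format: PDF
Language of text: Chinese
Chinese Library Classification: Information science
Keywords
semantic annotation; PLSA model; latent semantic topic; annotation algorithm; Web page
Date indexed: 2022-08-17 11:29:31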
