首页> 外文期刊>SICE Journal of Control, Measurement, and System Integration (SICE JCMSI) >A Clustering Method for Web Mining Based on Probabilistic Latent Semantic Indexing
【24h】

A Clustering Method for Web Mining Based on Probabilistic Latent Semantic Indexing

机译:基于概率潜在语义索引的Web挖掘聚类方法

获取原文
获取原文并翻译 | 示例
           

摘要

Exploring an intranet or internet database enables us to discover useful knowledge. In this process, a search engine plays a pivotal role. To this end, various search engines have been proposed to heighten information accuracy by exploiting key content relations in semantic web resources. But a general-purpose search engine always includes useless or irrelevant web pages in the search results. The next generation of web architecture, known as Semantic Web, can build a layered architecture to possibly mitigate this deficiency by decreasing the noisy data in a searched result. The objective of this paper is to propose a Probabilistic Latent Semantic Indexing (PLSI) method used in semantic web search engines. The method can better return appropriate information for user queries; in particular, a novel ranking strategy is provided to measure the relevance score of an annotated set of web results by considering user queries, data annotation, and the underlying ontology.
机译:探索Intranet或Internet数据库使我们能够发现有用的知识。在此过程中,搜索引擎起着举足轻重的作用。为此,已经提出了各种搜索引擎,以通过利用语义Web资源中的关键内容关系来提高信息准确性。但是,通用搜索引擎始终在搜索结果中包含无用或无关的网页。下一代Web体系结构称为语义Web,可以构建分层体系结构,以通过减少搜索结果中的嘈杂数据来减轻这种缺陷。本文的目的是提出一种在语义Web搜索引擎中使用的概率潜在语义索引(PLSI)方法。该方法可以更好地返回适当的信息以供用户查询;特别是,提供了一种新颖的排名策略,通过考虑用户查询,数据注释和基础本体来测量一组带注释的Web结果的相关性得分。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号