首页> 外文期刊>Asian Journal of Information Technology >Automatic Ontology Generation for Semantic Search System Using Data Mining Techniques
【24h】

Automatic Ontology Generation for Semantic Search System Using Data Mining Techniques

机译:使用数据挖掘技术的语义搜索系统自动生成本体

获取原文
           

摘要

Here we present about automatically generated ontologies for a semantic web search system using data mining techniques. This will improve the query process and will get better semantic results. Ranking algorithm is used to search and analyze web documents in a more flexible and effective way. Hyperlink structure of web document is utilized to rank the results. We use association rule mining to find the maximal keyword patterns. Clustering is used to group retrieved documents into distinct sets. This will extract knowledge about qury from the web,populate a knowledge base. The search engine that searches the web documents so far are syntactic oriented. Here we develop a searching system that semantically searches the documents. The semantics of the terms is achieved using the ontologies. Ontology serves as Meta data schemas, providing a controlled vocabulary of concepts, each with explicitly defined meaning. Ranking algorithm used here is the hyper textual ranking algorithm that scans both the contents of the documents and also the reciprocally linked documents. This technique has several advantages that include providing better semantic notion during the search. It also serves for multiple frame documents. There is a need for automatic generation of ontologies when using the semantic searching system. The paper here focuses on how the automatic generation of ontologies could be done for a semantic search system using datamining techniques.
机译:在这里,我们介绍了使用数据挖掘技术为语义Web搜索系统自动生成的本体。这将改善查询过程并获得更好的语义结果。排名算法用于以更灵活和有效的方式搜索和分析Web文档。 Web文档的超链接结构用于对结果进行排名。我们使用关联规则挖掘来找到最大的关键字模式。聚类用于将检索到的文档分为不同的集合。这将从网络上提取有关查询的知识,填充知识库。到目前为止,搜索Web文档的搜索引擎都是面向语法的。在这里,我们开发了一种语义上搜索文档的搜索系统。术语的语义是使用本体实现的。本体充当元数据模式,提供概念的受控词汇表,每个概念都有明确定义的含义。此处使用的排名算法是超文本排名算法,它既可以扫描文档的内容,也可以扫描相互链接的文档。该技术具有几个优点,包括在搜索过程中提供更好的语义概念。它还可用于多个框架文档。当使用语义搜索系统时,需要自动生成本体。本文重点关注如何使用数据挖掘技术为语义搜索系统自动生成本体。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号