首页> 外文会议> >A binary-categorization approach for classifying multiple-record Web documents using application ontologies and a probabilistic model
【24h】

A binary-categorization approach for classifying multiple-record Web documents using application ontologies and a probabilistic model

机译:使用应用程序本体和概率模型对多记录Web文档进行分类的二进制分类方法

获取原文

摘要

The amount of information available on the World Wide Web has been increasing dramatically in recent years. To enhance speedy searching and retrieving Web documents of interest, researchers and practitioners have partially relied on various information retrieval techniques. We propose a probabilistic model to classify Web documents into relevant documents and irrelevant documents with respect to a particular application ontology, which is a conceptual-model snippet of standard ontologies. Our probabilistic model is based on multivariate statistical analysis and is different from the conventional probabilistic information retrieval models. The experiments we have conducted on a set of representative Web documents indicate that the proposed probabilistic model is promising in binary-categorization of multiple-record Web documents.
机译:近年来,万维网上可用的信息量急剧增加。为了提高快速搜索和检索感兴趣的Web文档的能力,研究人员和从业人员已部分依赖各种信息检索技术。我们提出一种概率模型,将特定于特定应用程序本体的Web文档分为相关文档和不相关文档,这是标准本体的概念模型片段。我们的概率模型基于多元统计分析,与传统的概率信息检索模型不同。我们对一组代表性Web文档进行的实验表明,所提出的概率模型在多记录Web文档的二进制分类中很有希望。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号