首页> 外文会议>2012 CSI Sixth International Conference on Software Engineering. >Identifying context of text documents using Na#x00EF;ve Bayes classification and Apriori association rule mining
【24h】

Identifying context of text documents using Na#x00EF;ve Bayes classification and Apriori association rule mining

机译:使用朴素贝叶斯分类和Apriori关联规则挖掘来识别文本文档的上下文

获取原文
获取原文并翻译 | 示例

摘要

Huge amount of unstructured data is available in the form of text documents. Ranking these text documents by considering their context will be very useful in information retrieval. We propose classification of abstracts by considering their context using Naïve Bayes classifier and Apriori association rule algorithm — i.e. Context Based Naive Bayesian and Apriori (CBNBA). In proposed approach, we initially classify the documents using Naïve Bayes. We find the context of an abstract by looking for associated terms which help us understand the focus of the abstract and interpret the information beyond simple keywords. The results indicate that context based classification increases accuracy of classification to great extent and in turn discovers different contexts of the documents. Further this approach can found to be very useful for applications beyond abstract classification where word speaks very little and lead to ambiguous state but context can lead you to right decision/classification.
机译:大量非结构化数据以文本文档的形式提供。通过考虑它们的上下文对这些文本文档进行排名将对信息检索非常有用。我们建议通过使用朴素贝叶斯分类器和Apriori关联规则算法(即基于上下文的朴素贝叶斯和Apriori(CBNBA))考虑摘要的上下文来提出摘要分类。在提议的方法中,我们最初使用朴素贝叶斯对文档进行分类。我们通过寻找相关的术语来找到摘要的上下文,这些术语有助于我们理解摘要的重点并解释简单关键字之外的信息。结果表明,基于上下文的分类在很大程度上提高了分类的准确性,进而发现了文档的不同上下文。此外,该方法对于抽象分类以外的应用程序非常有用,在抽象分类中,单词讲得很少,并导致模棱两可的状态,但是上下文可以使您做出正确的决策/分类。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号