Conference: International Conference on Computing, Communication, Control and Automation

Implementation of an efficient web crawler to search medicinal plants and relevant diseases



Abstract

In the Indian Ayurvedic system, medicinal plants are an important factor in various therapeutic uses. Medicinal plants are distributed across India, so collecting correct information about them is necessary for research. A huge amount of information related to the medicinal plant domain is available on the Internet. A focused web crawler is useful for collecting such domain-specific information from the Internet. In a focused web crawler, a classification method is used to extract relevant web pages: the classification algorithm classifies each web page as relevant or not relevant to a given query. This paper proposes an efficient focused web crawler that searches for web pages in the medicinal plant domain. A Naive Bayes classifier is used to classify the web pages. The proposed focused web crawler uses a manually built thesaurus of medicinal plant information for query expansion.
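The two components named in the abstract, thesaurus-based query expansion and Naive Bayes page classification, can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the thesaurus entries, the four training sentences, the tokenizer, the Laplace smoothing, and the names `expand_query`, `NaiveBayes`, and `model`. Page fetching and the crawl frontier are left out, and the sketch does not claim to show how the authors connect the expanded query to the classifier.

```python
import math
import re
from collections import Counter

# Hypothetical thesaurus entries; the paper's manual thesaurus is not reproduced here.
THESAURUS = {
    "tulsi": ["holy basil", "ocimum sanctum"],
    "neem": ["azadirachta indica"],
}

# Hypothetical toy training data standing in for labeled web pages.
TRAIN_DOCS = [
    "tulsi leaves are used in ayurvedic treatment of cough and fever",
    "neem bark extract is a medicinal plant remedy for skin disease",
    "cricket match scores and football league results",
    "stock market prices and mobile phone reviews",
]
TRAIN_LABELS = ["relevant", "relevant", "irrelevant", "irrelevant"]


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def expand_query(query, thesaurus=THESAURUS):
    """Return the query tokens followed by the tokens of any thesaurus synonyms."""
    terms = tokenize(query)
    expanded = list(terms)
    for term in terms:
        for synonym in thesaurus.get(term, []):
            expanded.extend(tokenize(synonym))
    return expanded


class NaiveBayes:
    """Multinomial Naive Bayes with add-one (Laplace) smoothing."""

    def fit(self, docs, labels):
        self.classes = sorted(set(labels))
        self.doc_counts = Counter(labels)
        self.word_counts = {c: Counter() for c in self.classes}
        for doc, label in zip(docs, labels):
            self.word_counts[label].update(tokenize(doc))
        # Vocabulary is the union of all words seen in any class.
        self.vocab = set().union(*self.word_counts.values())
        return self

    def log_score(self, tokens, label):
        """Log prior plus the sum of smoothed log likelihoods for the tokens."""
        total_words = sum(self.word_counts[label].values())
        total_docs = sum(self.doc_counts.values())
        score = math.log(self.doc_counts[label] / total_docs)
        for token in tokens:
            count = self.word_counts[label][token]
            score += math.log((count + 1) / (total_words + len(self.vocab)))
        return score

    def predict(self, text):
        """Return the class with the highest log score for the page text."""
        tokens = tokenize(text)
        return max(self.classes, key=lambda c: self.log_score(tokens, c))


model = NaiveBayes().fit(TRAIN_DOCS, TRAIN_LABELS)

print(expand_query("tulsi cough"))
print(model.predict("holy basil leaves treat cough"))
```

In a full crawler, the predicted label would typically decide whether a fetched page is kept and whether its outgoing links are followed; that policy is a design choice the abstract does not specify.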
