首页> 外文会议>International Conference on Big Data, IoT and Data Science >Use of noun phrases in identification of a website
【24h】

Use of noun phrases in identification of a website

机译:使用名词短语在识别网站

获取原文

摘要

In this paper a methodology is proposed to prepare a domain ontology of a given website. For the same we parse through the contents of website for noun pharses. We apply Term Frequency and Inverse Document Frequency (TF-IDF). This gives us the required information about the domain which can be used to build an ontology. The ontology thus created will represent the proper taxonomy of classes and their interrelationships. This ontology will help in identification of topics in website and ability to give relevant output when complex queries are given. The system will then make use of the ontology to classify it in comparison with the ontologies in the database. The system is tested on an academic instritutes website, namely, College of Engineering Pune (COEP) and proves to be useful.
机译:在本文中,提出了一种方法来准备给定网站的域本体。对于同样的我们通过网站的内容来解析名词小组的内容。我们应用术语频率和逆文档频率(TF-IDF)。这为我们提供有关域的所需信息,可用于构建本体。如此创建的本体论将代表课程的适当分类及其相互关系。此本体将有助于确定网站中的主题和在给出复杂查询时提供相关输出的能力。然后,系统将使用本体进行对其进行分类,与数据库中的本体相比。该系统在学术侦察网站上进行了测试,即工程浦那(COEP)学院,并证明是有用的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号