首页> 外文会议>IEEE International Conference on Communication, Networks and Satellite >Resource Description Framework Generation for Tropical Disease Using Web Scraping
【24h】

Resource Description Framework Generation for Tropical Disease Using Web Scraping

机译:使用Web爬虫生成热带疾病的资源描述框架

获取原文
获取外文期刊封面目录资料

摘要

Tropical diseases are diseases that commonly occur in tropical areas like Indonesia. The most frequent diseases plaguing Indonesia community consists of common diseases that are categorized as tropical diseases, e.g., malaria, leprosy, and lymphatic filariasis. As first aid, Indonesian people usually rely on a search engine to search for information about diseases and medication especially of general illnesses like coughs, colds, and fever. However, some result of conventional search engine still generates un-related articles. Another drawback of a conventional search engine, sometimes it cannot handle synonym meaning of disease in searching especially for the non-medical term in Bahasa Indonesia. For example, for lymphatic filariasis is better known as kaki gajah (In English: Elephant foot). One solution is to build a search engine based on semantic web technology. The semantic web can show the relationships between the data, including synonym of disease name in Bahasa Indonesia in Resource Description Framework (RDF) serialization form. Unfortunately, based on our initial survey, the websites that provide health information in Bahasa Indonesia do not provide RDF files. This research aims to generate an RDF serialization that describes the relationship ontology in tropical diseases and treatments terms. We implemented a web scraping method to harvesting and extracting vocabularies from some popular health websites, creating the relationship between terms with ontology and convert this information into an RDF serialization.
机译:热带疾病是常见于印度尼西亚等热带地区的疾病。困扰印度尼西亚社区的最常见疾病包括被归类为热带疾病的常见疾病,例如疟疾,麻风病和淋巴丝虫病。作为急救,印度尼西亚人通常依靠搜索引擎来搜索有关疾病和药物的信息,尤其是关于诸如咳嗽,感冒和发烧之类的一般疾病的信息。但是,常规搜索引擎的某些结果仍会生成不相关的文章。传统搜索引擎的另一个缺点是,有时在搜索印度尼西亚语中的非医学术语时,有时无法处理疾病的同义词。例如,对于淋巴丝虫病,众所周知的是kaki gajah(英文:Elephant foot)。一种解决方案是构建基于语义Web技术的搜索引擎。语义网可以显示数据之间的关系,包括资源描述框架(RDF)序列化形式中印度尼西亚语中疾病名称的同义词。不幸的是,根据我们的初步调查,在印度尼西亚语中提供健康信息的网站不提供RDF文件。这项研究旨在生成描述热带疾病和治疗术语中的关系本体的RDF序列化。我们实施了一种Web抓取方法,以从一些流行的健康网站中收集和提取词汇,创建术语与本体之间的关系,并将此信息转换为RDF序列化。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号