Conference of Open Innovations Association

Automated extraction of concept matcher thesaurus from semi-structured catalogue-like sources of data on the web


Abstract

Designing an ontology and populating a data-set with knowledge that follows the chosen or developed ontology, so as to fit the principles of the Semantic Web and Linked Open Data, is a time-consuming and iterative process that requires either expert knowledge or a set of tools for scraping data from the web. A valid and consistent ontology, and consistent knowledge within the data-set, require unification of concepts, which means overcoming the ambiguity and synonymy of the terms that become individuals of the ontology. In this paper we focus on the techniques used for organising a Russian food product data-set under a light-weight FOOD Ontology, and on concept matching in particular. The main approaches to data-set concept unification and synonymic term matching, and ways to collect dictionaries for a matcher, are discussed. A tool for parsing catalogue-like semi-structured resources and extracting a thesaurus is developed and introduced for the task of on-the-fly concept matching.
