International Conference on Big Data, Small Data, Linked Data and Open Data

Creating Data-Driven Ontologies: An Agriculture Use Case



Abstract

The manual creation of an ontology is a tedious task. In the field of ontology learning, Natural Language Processing (NLP) techniques are used to automatically create ontologies. In this paper, we present a methodology using data-driven techniques to create ontologies from unstructured documents in the agriculture domain. We use state-of-the-art NLP techniques based on Stanford OpenIE, Hearst patterns and co-occurrences to create ontologies. We add an NLP method that uses dependency parsing and transformation rules based on linguistic patterns. In addition, we use keyword-driven techniques from the query expansion field, based on Word2vec, WordNet and ConceptNet, to create ontologies. We add a method that takes the union of the ontologies produced by the keyword-based methods. The semantic quality of the different ontologies is calculated using automatically extracted keywords. We define recall, precision and F1-score based on the concepts and relations in which the keywords are present. The results show that 1) the method based on co-occurrences has the best F1-score with more than 100 keywords; 2) the keyword-based methods have a higher F1-score than the NLP-based methods with fewer than 100 keywords in the evaluation; and 3) the combined keyword-based method always has a higher F1-score than each single method. In our future work, we will focus on improving the dependency parsing algorithm, improving the combination of different ontologies, and improving our quality evaluation methodology.
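
To illustrate one of the extraction techniques named in the abstract, the following is a minimal Python sketch, not the authors' implementation, of harvesting hypernym-hyponym pairs with the classic Hearst pattern "X such as Y". The regular expression, the head-noun heuristic and the example sentence are our own illustrative assumptions.

import re

# One classic Hearst pattern: "<hypernym> such as <hyponym>(, <hyponym>)*".
SUCH_AS = re.compile(r"(?P<hyper>\w[\w ]*?)\s+such as\s+(?P<hypos>\w[\w ,]*)",
                     re.IGNORECASE)

def hearst_pairs(sentence):
    """Return (hyponym, hypernym) pairs found by the 'such as' pattern."""
    pairs = []
    match = SUCH_AS.search(sentence)
    if match:
        # Head-noun heuristic: keep the last token of the hypernym phrase.
        hypernym = match.group("hyper").split()[-1]
        for hypo in match.group("hypos").split(","):
            hypo = hypo.strip()
            if hypo:
                pairs.append((hypo, hypernym))
    return pairs

print(hearst_pairs("cereal crops such as wheat, barley, maize"))
# -> [('wheat', 'crops'), ('barley', 'crops'), ('maize', 'crops')]

In a full pipeline such pairs would be merged with the OpenIE triples and co-occurrence relations mentioned above before being turned into ontology concepts and relations.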
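The abstract also defines recall, precision and F1-score over the concepts and relations in which extracted keywords appear. Below is a minimal sketch of one possible reading of that evaluation, assuming precision is the share of ontology labels containing a keyword, recall the share of keywords covered by the ontology, and F1 their harmonic mean; the paper's exact definitions may differ.

def keyword_scores(ontology_terms, keywords):
    """ontology_terms: concept/relation labels; keywords: extracted keywords."""
    terms = {t.lower() for t in ontology_terms}
    kws = {k.lower() for k in keywords}

    # Ontology labels that contain at least one keyword, and keywords
    # that occur in at least one label.
    matched_terms = {t for t in terms if any(k in t for k in kws)}
    covered_kws = {k for k in kws if any(k in t for t in terms)}

    precision = len(matched_terms) / len(terms) if terms else 0.0
    recall = len(covered_kws) / len(kws) if kws else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    return precision, recall, f1

print(keyword_scores(["crop", "soil moisture", "grows in"],
                     ["crop", "soil", "irrigation"]))
# -> (0.666..., 0.666..., 0.666...)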
