首页> 外文会议>International conference on conceptual modeling >Semantically Accessing Documents Using Conceptual Model Descriptions
【24h】

Semantically Accessing Documents Using Conceptual Model Descriptions

机译:使用概念模型描述语义访问文档

获取原文

摘要

When publishing documents on the Web, the user needs to describe and classify her documents for the benefit of later retrieval and use. This paper presents an approach to semantic document classification and retrieval based on Natural Language Processing and Conceptual Modeling. The Referent Model language is used in combination with a lexical analysis tool to define a controlled vocabulary for classifying documents. Documents are classified by means of sentences that contain the high frequency words in the document that also occur in the domain model defining the vocabulary. The sentences are parsed using a DCG-like grammar, mapped into a Referent Model fragment and stored along with the document using RDF-XML syntax. The model fragment represents the connection between the document and the domain model and serves as a document index. The approach is being implemented for a document collection published by the Norwegian Center for Medical Informatics (KITH).
机译:在Web上发布文档时,用户需要描述和分类她的文档,以便稍后检索和使用。本文介绍了一种基于自然语言处理和概念建模的语义文档分类和检索方法。参考模型语言与词汇分析工具结合使用,以定义用于对文档进行分类的受控词汇。文件通过包含在定义词汇表的域模型中的文档中包含的高频字的句子进行分类。使用DCG类似的语法解析句子,映射到指示模型片段中并使用RDF-XML语法与文档一起存储。模型片段表示文档和域模型之间的连接,并用作文档索引。该方法正在为由挪威医学信息学中心(KITH)发布的文件收集。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号