首页> 外文期刊>Information Technology Journal >Design and Implementation of a Semantic Document Management System
【24h】

Design and Implementation of a Semantic Document Management System

机译:语义文档管理系统的设计与实现

获取原文
       

摘要

Easily accessible information on the World Wide Web (WWW) and affordable large capacity secondary storage make it easy to build up very large document collections even in personal computers. However, the method of organizing files in computers has not been changed too much for decades. Searching for a particular document or file from a gegabytes collection based on traditional tree structured file directories becomes never an easy task. This study presents a system where documents are no longer identified by their file names. Instead, a document is represented by its semantics in terms of descriptor and contents vector. The descriptor of a document consists of a set of attributes, such as date of creation, its type, its size, annotations, etc. The content vector of a document consists of a set of terms extracted from the document. Such semantic information provides the user with associative searching capability, that is, documents can be obtained by giving required properties. The representation of document semantics and document organization and key word-based indexing techniques are discussed. Furthermore, for the largely used XML data in Web representing and exchanging, some structure-based querying techniques are proposed in this study, i.e. structural indexes and path expression optimization principles. A prototype visual based explorer that makes use of semantics of documents is also described.
机译:万维网(WWW)上易于访问的信息和负担得起的大容量二级存储,即使在个人计算机中也可以轻松建立非常大的文档集。但是,几十年来,计算机中组织文件的方法并未发生太大变化。根据传统的树形文件目录从千兆字节集合中搜索特定文档或文件绝非易事。这项研究提出了一个不再使用文件名标识文件的系统。取而代之的是,文档通过描述符和内容向量的语义来表示。文档的描述符由一组属性组成,例如创建日期,其类型,大小,注释等。文档的内容向量由从文档中提取的一组术语组成。这样的语义信息为用户提供了关联的搜索能力,也就是说,可以通过提供所需的属性来获得文档。讨论了文档语义和文档组织的表示以及基于关键字的索引技术。此外,针对Web表示和交换中大量使用的XML数据,本研究提出了一些基于结构的查询技术,即结构索引和路径表达式优化原理。还描述了利用文档语义的基于视觉的原型浏览器。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号