
Effective, efficient retrieval in a network of digital information objects.

Abstract

Although different authors mean different things by the term “digital libraries,” one common thread is that they include or are built around collections of digital objects. Digital libraries also provide services to large communities, one of which is almost always search. Digital library collections, however, have several characteristic features that make search difficult. They are typically very large. They typically involve many different kinds of objects, including but not limited to books, e-published documents, images, and hypertexts, and often including items as esoteric as subtitled videos, simulations, and entire scientific databases. Even within a category, these objects may have widely different formats and internal structure. Furthermore, they are typically in complex relationships with each other and with such non-library objects as persons, institutions, and events.

Relationships are a common feature of traditional libraries in the form of “See/See also” pointers, hierarchical relationships among categories, and relations between bibliographic and non-bibliographic objects such as having an author or being on a subject. Binary relations (typically in the form of directed links) are a common representational tool in computer science for structures from trees and graphs to semantic networks. And in recent years the World-Wide Web has made the construct of linked information objects commonplace for millions. Despite this, relationships have rarely been given “first-class” treatment in digital library collections or software.

MARIAN is a digital library system designed and built to store, search over, and retrieve large numbers of diverse objects in a network of relationships. It is designed to run efficiently over large collections of digital library objects. It addresses the problem of object diversity through a system of classes unified by common abilities including searching and presentation. Divergent internal structure is exposed and interpreted using a simple and powerful graphical representation, and varied format is handled through a unified system of presentation. Most importantly, MARIAN collections are designed specifically to include relations in the form of an extensible collection of different sorts of links.

This thesis presents MARIAN and argues that it is both effective and efficient. MARIAN is effective in that it provides new and useful functionality to digital library end-users, and in that it makes constructing, modifying, and combining collections easy for library builders and maintainers. MARIAN collections to define on the one hand common operations required to implement a broad class of search engines, and on the other performance standards for those operations. Although some operations involve a high minimum cost under the most general assumptions, lower costs can be achieved when additional constraints are present. In particular, it is argued that the statistics of digital library collections can be exploited to obtain significant savings. MARIAN is designed to do exactly that, and, on the evidence from early versions, appears to succeed.

In conclusion, MARIAN presents a powerful and flexible platform for retrieval on large, diverse collections of networked information, significantly extending the representation and search capabilities of digital libraries.
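
The abstract describes objects of different classes that share search and presentation abilities and are connected by typed links. The following is a minimal illustrative sketch of that general idea in Python, not MARIAN's actual design or API: every name in it (`InfoObject`, `LinkedCollection`, the `hasAuthor` link type, the sample records) is a hypothetical chosen for this example, and the linear scan makes no attempt at the efficiency the thesis argues for.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class InfoObject:
    """A node in the network. `kind` stands in for an object class (hypothetical)."""
    ident: str
    kind: str   # e.g. "book" or "person"; illustrative class names only
    text: str = ""

    def matches(self, term: str) -> bool:
        # Shared "search" ability: case-insensitive whole-word match.
        return term.lower() in self.text.lower().split()

    def present(self) -> str:
        # Shared "presentation" ability: a one-line summary.
        return f"[{self.kind}] {self.ident}: {self.text}"


class LinkedCollection:
    """Objects plus an extensible set of typed, directed links between them."""

    def __init__(self):
        self.objects = {}
        self.links = defaultdict(set)   # (source id, link type) -> target ids

    def add(self, obj):
        self.objects[obj.ident] = obj

    def link(self, source, link_type, target):
        # Any string may serve as a link type, so new relations need no schema change.
        self.links[(source, link_type)].add(target)

    def search(self, term, link_type=None, target_term=None):
        """Return objects matching `term`; if `link_type` is given, also require
        a link of that type to some object matching `target_term`."""
        hits = []
        for obj in self.objects.values():
            if not obj.matches(term):
                continue
            if link_type is not None:
                targets = self.links.get((obj.ident, link_type), set())
                if not any(self.objects[t].matches(target_term) for t in targets):
                    continue
            hits.append(obj)
        return hits


# Hypothetical sample data, invented for this sketch.
lib = LinkedCollection()
lib.add(InfoObject("b1", "book", "digital library retrieval"))
lib.add(InfoObject("b2", "book", "network retrieval"))
lib.add(InfoObject("p1", "person", "Smith"))
lib.link("b1", "hasAuthor", "p1")

plain = [o.ident for o in lib.search("retrieval")]
by_author = [o.ident for o in lib.search("retrieval", "hasAuthor", "smith")]
```

Here `plain` contains both books, while `by_author` keeps only the book linked to a matching person, which is the kind of query that treating relations as first-class data makes possible.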
