Semantic Indexing for a Complete Subject Discipline

机译：完整主题纪律的语义索引

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

As part of the Illinois Digital Library Initiative (DLI) project we developed "scalable semantics" technologies. These statistical techniques enabled us to index large collections for deeper search than word matching. Through the auspices of the DARPA Information Management program, we are developing an integrated analysis environment, the Interspace Prototype, that uses "semantic indexing" as the foundation for supporting concept navigation. These semantic indexes record the contextual correlation of noun phrases, and are computed generically, independent of subject domain. Using this technology, we were able to compute semantic indexes for a subject discipline. In particular, in the summer of 1998, we computed concept spaces for 9.3M MEDLINE bibliographic records from the National Library of Medicine (NLM) which extensively covered the biomedical literature for the period from 1966 to 1997. In this experiment, we first partitioned the collection into smaller collections (repositories) by subject, extracted noun phrases from titles and abstracts, then performed semantic indexing on these sub-collections by creating a concept space for each repository. The computation required 2 days on a 128-node SGI/CRAY Origin 2000 at the National Center for Supercomputer Applications (NCSA). This experiment demonstrated the feasibility of scalable semantics techniques for large collections. With the rapid increase in computing power, we believe this indexing technology will shortly be feasible on personal computers.

机译：作为伊利诺伊州数字图书馆倡议（DLI）项目的一部分，我们开发了“可扩展语义”技术。这些统计技术使我们能够为更深入的搜索索引大型集合而不是单词匹配。通过DARPA信息管理计划的主持，我们正在开发一个综合分析环境，Interspace原型，使用“语义索引”作为支持概念导航的基础。这些语义索引记录了名词短语的上下文相关性，并且在常工上计算，独立于主题域。使用此技术，我们能够计算主题纪律的语义索引。特别是，在1998年夏天，我们计算了来自国家医学图书馆（NLM）的9.3M Medline书目记录的概念空间，这在1966年至1997年的时间内广泛地涵盖了生物医学文献。在这项实验中，我们首先分区由主题收集到较小的集合（存储库），从标题和摘要中提取名词短语，然后通过为每个存储库创建概念空间来对这些子集合执行语义索引。在全国超级计算机应用程序（NCSA）的128节点SGI / Cray Origin 2000上需要计算2天。该实验表明了可扩展语义技术用于大型收藏品的可行性。随着计算能力的迅速增加，我们认为这种索引技术在个人计算机上很快就会得到可行的。

著录项

来源
《ACM conference on digital libraries》|1999年||共10页
会议地点
作者
Yi-Ming Chung; Qin He; Kevin Powell; Bruce Schatz;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类各类型图书馆;
关键词
semantic indexing; semantic retrieval; concept space; scalable semantics; interspace; MEDSPACE; MEDLINE; medical informatics;

机译：语义索引;语义检索;概念空间;可扩展语义;interspace;medspace;medline;医学信息学;

相似文献

外文文献
中文文献
专利

1. Architecture of a Semantic Data Integration System Based on a Semantically Complete Model and a Semantically Complete Query Language [J] . V. V. Ovchinnikov Programming and Computer Software . 2006,第4期

机译：基于语义完整模型和语义完整查询语言的语义数据集成系统架构
2. Semantic indexing of hybrid frequent pattern-based clustering of documents with missing semantic information [J] . E. Anupriya, N.Ch.S.N. Iyengar International journal of computational i . 2015,第1期

机译：缺少语义信息的基于混合频繁模式的文档聚类的语义索引
3. Semantic Indexing of Medical Learning Objects: Medical Students' Usage of a Semantic Network [J] . Nadine Tix, Paul Gie?ler, Ursula Ohnesorge-Radtke, JMIR medical education. . 2015,第2期

机译：医学学习对象的语义索引：医学生对语义网络的使用
4. Semantic indexing for a complete subject discipline [C] . Yi-Ming Chung, Qin He, Kevin Powell, ACM conference on Digital libraries . 1999

机译：完整学科学科的语义索引
5. Enhancing user search experience in digital libraries with rotated latent semantic indexing [D] . Polyakov, Serhiy. 2015

机译：通过旋转的潜在语义索引增强数字图书馆中的用户搜索体验
6. Journal Descriptor Indexing Tool for Categorizing Text According to Discipline or Semantic Type [O] . Susanne M. Humphrey, Chris J. Lu, Willie J. Rogers, 2006

机译：用于根据学科或语义类型对文本进行分类的日记描述符索引工具
7. Semantic Indexing for a Complete Subject Discipline [O] . Yi-Ming Chung, Qin He, Kevin Powell, 1999

机译：完整主题学科的语义索引

Semantic Indexing for a Complete Subject Discipline

摘要

著录项

相似文献

相关主题

期刊订阅