Source: Journal of the American Society for Information Science

Using Clustering Strategies for Creating Authority Files


Abstract

As more online databases are integrated into digital libraries, the issue of quality control of the data becomes increasingly important, especially as it relates to the effective retrieval of information. Authority work, the need to discover and reconcile variant forms of strings in bibliographic entries, will become more critical in the future. Spelling variants, misspellings, and transliteration differences will all increase the difficulty of retrieving information. We investigate a number of approximate string matching techniques that have traditionally been used to help with this problem. We then introduce the notion of approximate word matching and show how it can be used to improve detection and categorization of variant forms. We demonstrate the utility of these approaches using data from the Astrophysics Data System and show how we can reduce the human effort involved in the creation of authority files.
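The abstract does not say which approximate string matching techniques were evaluated. As a rough illustration of the general idea only, the sketch below uses Levenshtein edit distance (an assumption, not necessarily one of the paper's methods) to group variant strings with a simple greedy pass. The function names, the 0.8 threshold, and the sample strings are all hypothetical and are not taken from the paper or from the Astrophysics Data System.

```python
from typing import List


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance computed with a two-row dynamic program."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def similarity(a: str, b: str) -> float:
    """Case-insensitive similarity in [0, 1]; 1.0 means identical strings."""
    a, b = a.lower(), b.lower()
    longest = max(len(a), len(b))
    return 1.0 if longest == 0 else 1.0 - edit_distance(a, b) / longest


def cluster_variants(names: List[str], threshold: float = 0.8) -> List[List[str]]:
    """Greedy clustering: attach each name to the first cluster whose
    representative (its first member) is similar enough, else start a new one.
    The threshold is an illustrative guess, not a value from the paper."""
    clusters: List[List[str]] = []
    for name in names:
        for cluster in clusters:
            if similarity(name, cluster[0]) >= threshold:
                cluster.append(name)
                break
        else:
            clusters.append([name])
    return clusters


# Hypothetical sample strings, made up for illustration.
samples = [
    "Astrophysical Journal",
    "Astrophysical Jornal",
    "Astrophysics Journal",
    "Physical Review",
]
clusters = cluster_variants(samples)
for group in clusters:
    print(group)
```

In a workflow like the one the abstract describes, each resulting group would be a candidate set of variant forms for a person to confirm, so the reviewer checks a few clusters instead of every pair of strings.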
