首页> 外文期刊>Multimedia Tools and Applications >Computational Linguistics For Metadata Building (climb): Using Text Mining For The Automatic Identification, Categorization, And Disambiguation Of Subject Terms For Image Metadata
【24h】

Computational Linguistics For Metadata Building (climb): Using Text Mining For The Automatic Identification, Categorization, And Disambiguation Of Subject Terms For Image Metadata

机译:元数据构建的计算语言学(爬升):使用文本挖掘自动识别,分类和消除图像元数据主题的歧义

获取原文
获取原文并翻译 | 示例
       

摘要

In this paper, we present a system using computational linguistic techniques to extract metadata for image access. We discuss the implementation, functionality and evaluation of an image catalogers' toolkit, developed in the Computational Linguistics for Metadata Building (CLiMB) research project. We have tested components of the system, including phrase finding for the art and architecture domain, functional semantic labeling using machine learning, and disambiguation of terms in domain-specific text vis a vis a rich thesaurus of subject terms, geographic and artist names. We present specific results on disambiguation techniques and on the nature of the ambiguity problem given the thesaurus, resources, and domain-specific text resource, with a comparison of domain-general resources and text. Our primary user group for evaluation has been the cataloger expert with specific expertise in the fields of painting, sculpture, and vernacular and landscape architecture.
机译:在本文中,我们提出了一种使用计算语言技术来提取用于图像访问的元数据的系统。我们将讨论在元数据构建计算语言(CLiMB)研究项目中开发的图像分类人员工具包的实现,功能和评估。我们已经测试了系统的组件,包括针对艺术和建筑领域的短语查找,使用机器学习的功能语义标记以及针对主题词,地理和艺术家名称的丰富词库的领域特定文本中的术语消歧。我们给出了关于歧义消除技术和歧义问题性质的具体结果,并给出了同义词库,资源和特定领域的文本资源,并与一般领域的资源和文本进行了比较。我们的主要评估用户群体是编目专家,在绘画,雕塑,乡土建筑和风景园林领域具有特定的专业知识。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号