首页> 外文期刊>ACM Journal on Computing and Cultural Heritage >Visual Recognition of Ancient Inscriptions Using Convolutional Neural Network and Fisher Vector
【24h】

Visual Recognition of Ancient Inscriptions Using Convolutional Neural Network and Fisher Vector

机译:利用卷积神经网络和Fisher矢量视觉识别古代铭文

获取原文
获取原文并翻译 | 示例
       

摘要

By bringing together the most prominent European institutions and archives in the field of Classical Latin and Greek epigraphy, the EAGLE project has collected the vast majority of the surviving Greco-Latin inscriptions into a single readily-searchable database. Text-based search engines are typically used to retrieve information about ancient inscriptions (or about other artifacts). These systems require that the users formulate a text query that contains information such as the place where the object was found or where it is currently located. Conversely, visual search systems can be used to provide information to users (like tourists and scholars) in a most intuitive and immediate way, just using an image as query. In this article, we provide a comparison of several approaches for visual recognizing ancient inscriptions. Our experiments, conducted on 17, 155 photos related to 14, 560 inscriptions, show that BoW and VLAD are outperformed by both Fisher Vector (FV) and Convolutional Neural Network (CNN) features. More interestingly, combining FV and CNN features into a single image representation allows achieving very high effectiveness by correctly recognizing the query inscription in more than 90% of the cases. Our results suggest that combinations of FV and CNN can be also exploited to effectively perform visual retrieval of other types of objects related to cultural heritage such as landmarks and monuments.
机译:通过将古典拉丁和希腊史学领域中最杰出的欧洲机构和档案馆汇集在一起​​,EAGLE项目将幸存的绝大多数希腊拉丁铭文收集到一个易于搜索的数据库中。基于文本的搜索引擎通常用于检索有关古代铭文(或其他文物)的信息。这些系统要求用户制定一个包含信息的文本查询,例如对象的发现位置或对象的当前位置。相反,视觉搜索系统可用于以最直观,最直接的方式向用户(如游客和学者)提供信息,仅使用图像作为查询即可。在本文中,我们提供了几种视觉识别古代铭文的方法的比较。我们对17张155张与14张560张铭文相关的照片进行了实验,结果表明BoW和VLAD在费希尔向量(FV)和卷积神经网络(CNN)功能方面均胜过。更有趣的是,将FV和CNN功能组合到单个图像表示中,可以通过在超过90%的情况下正确识别查询题字来实现非常高的有效性。我们的结果表明,也可以利用FV和CNN的组合来有效地视觉检索与文化遗产相关的其他类型的对象,例如地标和纪念碑。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号