...
【24h】

Retro-Remote Sensing: Generating Images From Ancient Texts

机译:复古遥感:从古代文本生成图像

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

The data available in the world come in various modalities, such as audio, text, image, and video. Each data modality has different statistical properties. Understanding each modality, individually, and the relationship between the modalities is vital for a better understanding of the environment surrounding us. Multimodal learning models allow us to process and extract useful information from multimodal sources. For instance, image captioning and text-to-image synthesis are examples of multimodal learning, which require mapping between texts and images. In this paper, we introduce a research area that has never been explored by the remote sensing community, namely the synthesis of remote sensing images from text descriptions. More specifically, in this paper, we focus on exploiting ancient text descriptions of geographical areas, inherited from previous civilizations, to generate equivalent remote sensing images. From a methodological perspective, we propose to rely on generative adversarial networks (GANs) to convert the text descriptions into equivalent pixel values. GANs are a recently proposed class of generative models that formulate learning the distribution of a given dataset as an adversarial competition between two networks. The learned distribution is represented using the weights of a deep neural network and can be used to generate more samples. To fulfill the purpose of this paper, we collected satellite images and ancient texts to train the network. We present the interesting results obtained and propose various future research paths that we believe are important to further develop this new research area.
机译:世界上可用的数据有各种方式,如音频,文本,图像和视频。每个数据模态具有不同的统计属性。单独地了解每个码形,并且模态之间的关系对于更好地了解我们周围的环境至关重要。多模式学习模型允许我们从多模式源进行处理和提取有用的信息。例如,图像标题和文本到图像合成是多模式学习的示例,其需要在文本和图像之间映射。在本文中,我们介绍了从未被遥感群落探索过的研究区,即遥感图像的遥感图像来自文本描述。更具体地说,在本文中,我们专注于利用从先前文明继承的地理区域的古代文本描述,以生成等效的遥感图像。从方法的角度来看,我们建议依靠生成的对抗性网络(GANS)将文本描述转换成等效像素值。 GAN是最近提出的一类一类生成模型,可以制定学习给定数据集的分发作为两个网络之间的对抗竞争。使用深神经网络的权重表示学习的分布,并且可以用于生成更多样本。为了满足本文的目的,我们收集了卫星图像和古代文本来培训网络。我们展示了所获得的有趣结果,并提出了我们认为我们相信进一步发展这一新研究区的各种研究路径。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号