Knowledge-driven description synthesis for floor plan interpretation

Goyal Shreya; Chattopadhyay Chiranjoy; Bhatnagar Gaurav

Abstract

Image captioning is a well-known problem in artificial intelligence. Caption generation from floor plan images has applications in indoor path planning, real estate, and providing architectural solutions. Several methods have been explored in the literature for generating captions or semi-structured descriptions from floor plan images. Because a caption alone is insufficient to capture fine-grained details, researchers have also proposed generating descriptive paragraphs from images. However, these descriptions have a rigid structure and lack flexibility, which makes them difficult to use in real-time scenarios. This paper proposes two models for text generation from floor plan images: description synthesis from image cue (DSIC) and transformer-based description generation (TBDG). Both models use modern deep neural networks for visual feature extraction and text generation. They differ in the input they take from the floor plan image. The DSIC model takes only visual features automatically extracted by a deep neural network, while the TBDG model learns from textual captions extracted from the input floor plan images together with paragraphs. Generating specific keywords in TBDG and learning them alongside paragraphs makes the model more robust on general floor plan images. Experiments were carried out on a large-scale, publicly available dataset, and comparisons with state-of-the-art techniques show the superiority of the proposed models.


Bibliographic information

Source
International Journal on Document Analysis and Recognition | 2021, Issue 2 | pp. 19-32 | 14 pages
Authors
Goyal Shreya; Chattopadhyay Chiranjoy; Bhatnagar Gaurav
Author affiliations
Indian Inst Technol Jodhpur 342037 Rajasthan India (listed for each of the three authors)
Original format
PDF
Language
English
Keywords
Floor plan; Captioning; Evaluation; Language modeling
Date added to database
2022-08-19 02:30:08

