Knowledge-driven description synthesis for floor plan interpretation

S. Goyal; Chiranjoy Chattopadhyay; Gaurav Bhatnagar

doi:10.1007/s10032-021-00367-3

Profiles Research Units Publications

Articles

Knowledge-driven description synthesis for floor plan interpretation

S. Goyal, ,

Published in Springer Science and Business Media Deutschland GmbH

2021

DOI: 10.1007/s10032-021-00367-3

Volume: 24

Issue: 1-2

Pages: 19 - 32

Abstract

Image captioning is a widely known problem in the area of AI. Caption generation from floor plan images has applications in indoor path planning, real estate, and providing architectural solutions. Several methods have been explored in the literature for generating captions or semi-structured descriptions from floor plan images. Since only the caption is insufficient to capture fine-grained details, researchers also proposed descriptive paragraphs from images. However, these descriptions have a rigid structure and lack flexibility, making it difficult to use them in real-time scenarios. This paper offers two models, description synthesis from image cue (DSIC) and transformer-based description generation (TBDG), for text generation from floor plan images. These two models take advantage of modern deep neural networks for visual feature extraction and text generation. The difference between both models is in the way they take input from the floor plan image. The DSIC model takes only visual features automatically extracted by a deep neural network, while the TBDG model learns textual captions extracted from input floor plan images with paragraphs. The specific keywords generated in TBDG and understanding them with paragraphs make it more robust in a general floor plan image. Experiments were carried out on a large-scale publicly available dataset and compared with state-of-the-art techniques to show the proposed model’s superiority. © 2021, The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature.

PDFPostprint

Postprint Version

Content may be subject to copyright.

PDF

Figures & Tables (19)

Journal Details

Authors (2)

About the journal

Journal	Data powered by SciSpaceInternational Journal on Document Analysis and Recognition
Publisher	Data powered by SciSpaceSpringer Science and Business Media Deutschland GmbH
ISSN	14332833
Open Access	No

Authors (2)

Chiranjoy Chattopadhyay
- Department of Computer Science & Engineering
Gaurav Bhatnagar
- Department of Mathematics

ACADEMICS

RESEARCH

STUDENTS

FACULTY