Indoor home scene recognition through instance segmentation using a combination of neural networks

Basu, Amlan and Kaewrak, Keerati and Petropoulakis, Lykourgos and Di Caterina, Gaetano and Soraghan, John J. (2022) Indoor home scene recognition through instance segmentation using a combination of neural networks. In: 2022 IEEE World Conference on Applied Intelligence and Computing, 2022-06-17 - 2022-06-19, Rajkiya Engineering College.

[thumbnail of Basu-etal-AIC-2022-Indoor-home-scene-recognition-through-instance-segmentation-using-a-combination-of-neural-networks]
Text. Filename: Basu_etal_AIC_2022_Indoor_home_scene_recognition_through_instance_segmentation_using_a_combination_of_neural_networks.pdf
Accepted Author Manuscript
License: Strathprints license 1.0

Download (1MB)| Preview


This work presents a technique for recognizing indoor home scenes by using object detection. The object detection task is achieved through pre-trained Mask-RCNN (Regional Convolutional Neural Network), whilst the scene recognition is performed through a Convolutional Neural Network (CNN). The output of the Mask-RCNN is fed in input to the CNN, as this provides the CNN with the information of objects detected in one scene. So, the CNN recognizes the scene by looking at the combination of objects detected. The CNN is trained using the various object detection outputs of Mask-RCNN. This helps the CNN learn about the various combinations of objects that a scene can have. The CNN is trained using 500 combinations of 5 different scenes (bathroom, bedroom, kitchen, living room, and dining room) of the indoor home generated by Mask-RCNN. The trained network was tested on 24,000 indoor home scene images. The final accuracy produced by the CNN is 97.14%.


Basu, Amlan ORCID logoORCID:, Kaewrak, Keerati, Petropoulakis, Lykourgos ORCID logoORCID:, Di Caterina, Gaetano ORCID logoORCID: and Soraghan, John J. ORCID logoORCID:;