An Enhanced Object Detection Model for Scene Graph Generation

Essam, Mohammad; Khattab, Dina; Shedeed, Howida A.; Tolba, Mohamed F.

Abstract


As computer vision improves, a higher level of understanding is needed to solve more complex problems such as semantic image retrieval, image captioning, and scene understanding. Scene understanding is a long-studied problem because of its complexity and the lack of a proper data representation. The scene graph is one of the most powerful data representations for capturing scene context. A scene graph encodes the objects present in the scene, their attributes, and the relationships between these objects. Because scene graphs have proven their capabilities in complicated tasks, automating scene graph generation has become a necessity. Considerable research has been devoted to obtaining accurate scene graphs using different deep learning architectures. The module common to these architectures is the object detection module, in which objects are first located in the input image. In this work, we propose using the most recent object detectors from the YOLOv5 family for the scene graph generation task. The proposed YOLOv5x6 achieved a state-of-the-art result of 32.7 mean average precision compared to previous works. Furthermore, the paper reviews the different object detectors used in the literature for scene graph generation.
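The scene graph representation described in the abstract (objects, their attributes, and the relationships between objects) can be illustrated with a minimal Python sketch. This is not the paper's implementation: the class names `SceneObject` and `SceneGraph`, the bounding-box convention, and the example objects are all hypothetical, chosen only to show the shape of the data an object detector would feed into.

```python
from dataclasses import dataclass, field
from typing import List, Tuple


@dataclass
class SceneObject:
    """One detected object: class label, bounding box, and attributes."""
    label: str
    # Assumed box convention (x_min, y_min, x_max, y_max); hypothetical.
    box: Tuple[float, float, float, float]
    attributes: List[str] = field(default_factory=list)


@dataclass
class SceneGraph:
    """Objects are nodes; relationships are directed, labeled edges."""
    objects: List[SceneObject] = field(default_factory=list)
    # Each relationship is (subject index, predicate, object index).
    relationships: List[Tuple[int, str, int]] = field(default_factory=list)

    def add_object(self, label, box, attributes=()):
        """Store an object and return its index for use in relationships."""
        self.objects.append(SceneObject(label, box, list(attributes)))
        return len(self.objects) - 1

    def add_relationship(self, subject, predicate, obj):
        """Add a directed edge between two already stored objects."""
        n = len(self.objects)
        if not (0 <= subject < n and 0 <= obj < n):
            raise IndexError("relationship refers to an unknown object")
        self.relationships.append((subject, predicate, obj))

    def triples(self):
        """Return relationships as (subject label, predicate, object label)."""
        return [
            (self.objects[s].label, p, self.objects[o].label)
            for s, p, o in self.relationships
        ]


# Illustrative values only; in practice the boxes and labels would come
# from the object detection module.
graph = SceneGraph()
man = graph.add_object("man", (10, 20, 110, 300), ["tall"])
horse = graph.add_object("horse", (60, 120, 320, 360), ["brown"])
graph.add_relationship(man, "riding", horse)
print(graph.triples())
```

Storing relationships as index triples keeps the objects list as the single source of labels and boxes, which is why the quality of the detection stage directly bounds the quality of the resulting graph.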


Other data

Title An Enhanced Object Detection Model for Scene Graph Generation
Authors Essam, Mohammad; Khattab, Dina; Shedeed, Howida A.; Tolba, Mohamed F.
Keywords Object detection; Scene graph; Scene graph generation; YOLO
Issue Date 1-Jan-2023
Publisher Springer
Journal Lecture Notes on Data Engineering and Communications Technologies 
Volume 152
Start page 333
End page 343
Conference International Conference on Advanced Intelligent Systems and Informatics
ISBN 978-3-031-20600-9; 978-3-031-20601-6
ISSN 2367-4512
DOI 10.1007/978-3-031-20601-6_30
Scopus ID 2-s2.0-85142627056

Citations 1 in Scopus


Items in Ain Shams Scholar are protected by copyright, with all rights reserved, unless otherwise indicated.