Scene Attribute Semantic Relational Regularization for Transport-travel Scene Understanding

Wei Xinlei, Cheng Ruifen, Liu Yingji, Zhou Wei, Tian Daxin
2020 IEEE Access  
Attribute learning has improved the performance in scene understanding and scene recognition. However, there are many attributes described by words or short texts in a static scene and traffic crowd scene. If there are two similar scenes, the semantic relationship topology structures of corresponding attribute groups of the two scenes are also homogeneity. But it is difficult to learn a semantic relation topology projection across semantic text data and visual data. To solve the problem, we
more » ... truct approximate homeomorphism mapping based on the scene attributes semantic relational regularization. Hence, we propose a novel attribute semantic topological relationship regularization based scene attribute semantic learning(ARSL) method for scene semantic understanding. We establish a transport and travel scene recognition model based on attribute semantic features which are achieved by the proposed ARSL algorithm. In order to verify the proposed method, the experiments are implemented on the static transport-travel scene dataset and dynamic transport-travel crowds scene dataset respectively. The static transport-travel scene dataset is constructed by the SUN Attribute dataset including images and texts. However, the dynamic transport-travel crowds scene dataset is constructed through the WWW Crowd dataset including videos and texts, and the dynamic transport-travel crowd scene dataset is named as the WWW Crowd-Sub dataset. The performances of the proposed method are improved by 38.48% and 17.51% on the SUN Attribute dataset and WWW Crowd-Sub dataset respectively. The experimental results on the SUN Attribute dataset and WWW Crowd-Sub dataset demonstrate that the proposed approach has superior performance compared to state of the art. It can be demonstrated that the performance of the proposed ARSL method is effective against static transport-travel scene and dynamic transport-travel crowd scene.
doi:10.1109/access.2020.3001294 fatcat:jvtr22of5jaqbkkphfnrzkci7i