6,203 Hits in 5.9 sec

Panoptic Multi-TSDFs: a Flexible Representation for Online Multi-resolution Volumetric Mapping and Long-term Dynamic Scene Consistency [article]

Lukas Schmid, Jeffrey Delmerico, Johannes Schönberger, Juan Nieto, Marc Pollefeys, Roland Siegwart, Cesar Cadena
2022 arXiv   pre-print
For robotic interaction in environments shared with other agents, access to volumetric and semantic maps of the scene is crucial.  ...  Through reasoning on the object level, semantic consistency over time is achieved.  ...  In this work, we aim to invert this paradigm and explore how semantic information can be leveraged to improve the modeling of geometry and achieve temporal consistency.  ... 
arXiv:2109.10165v2 fatcat:hcpgw3z6pjbj3ijqa4hjr6x7em

Semantic Motion Concept Retrieval in Non-Static Background Utilizing Spatial-Temporal Visual Information

Dianting Liu, Mei-Ling Shyu
2013 International Journal of Semantic Computing (IJSC)  
Based on the motion region detection model, moving object-level information is extracted for semantic retrieval.  ...  In the proposed conceptual retrieval model, temporally semantic consistency among the consecutive shots is analyzed and presented into a conditional probability model, which is then used to re-rank the  ...  Acknowledgments We thank Anhui Huang for his technical suggestions on modeling the temporally semantic consistency into the conditional probability model.  ... 
doi:10.1142/s1793351x13400035 fatcat:6zjbcbrtpfaqjjooik3k7wmybm

Bidirectional Temporal Context Fusion with Bi-Modal Semantic Features using a gating mechanism for Dense Video Captioning

Noorhan Khaled, M Aref, Mohammed Marey
2021 International Journal of Intelligent Computing and Information Sciences  
Secondly, we propose to explicitly extract bi-modal semantic concepts (nouns and verbs) from a detected event segment and equilibrate the contributions from the proposed event representation and the semantic  ...  Most recent works attempted to make use of an encoder-decoder neural network framework which employs a 3D-CNN as an encoder for representing a detected event frames, and an RNN as a decoder for caption  ...  Using an attention based mechanism, for the fusion produced superior results compared to using the context alone.  ... 
doi:10.21608/ijicis.2021.60216.1055 fatcat:o6is54gsjjcizohvh2jurn6xbm

Markov random fields for sketch based video retrieval

Rui Hu, Stuart James, Tinghuai Wang, John Collomosse
2013 Proceedings of the 3rd ACM conference on International conference on multimedia retrieval - ICMR '13  
The MRF energy function is used to rank videos for relevance and contains unary, pairwise and higher-order potentials that reflect the colour, shape, motion and type of sketched objects.  ...  Our query sketches depict both object appearance and motion, and are annotated with keywords that indicate the semantic category of each object.  ...  A clip is considered relevant with the query sketch when it shares approximate shape, color, motion and semantics to the sketched foreground object (and background if sketched). The P-R curve for 'fused  ... 
doi:10.1145/2461466.2461510 dblp:conf/mir/HuJWC13 fatcat:r2ybk5kmc5hqrbafjebbyj2yae

Semantic Retrieval for Videos in Non-static Background Using Motion Saliency and Global Features

Dianting Liu, Mei-Ling Shyu
2013 2013 IEEE Seventh International Conference on Semantic Computing  
In this paper, a video semantic retrieval framework is proposed based on a novel unsupervised motion region detection algorithm which works reasonably well with dynamic background and camera motion.  ...  the global texture and local motion information.  ...  Mahadevan and Vasconcelos proposed an algorithm for spatio-temporal saliency based on a center-surround framework [15].  ... 
doi:10.1109/icsc.2013.57 dblp:conf/semco/LiuS13 fatcat:5xgf3ondqndbbluijkfwdtkxey

Temporal Fusion Based Mutli-scale Semantic Segmentation for Detecting Concealed Baggage Threats [article]

Muhammed Shafay and Taimur Hassan and Ernesto Damiani and Naoufel Werghi
2021 arXiv   pre-print
However, to our knowledge, no framework exists that utilizes temporal baggage X-ray imagery to effectively screen highly concealed and occluded objects which are barely visible even to the naked eye.  ...  To address this, we present a novel temporal fusion driven multi-scale residual fashioned encoder-decoder that takes series of consecutive scans as input and fuses them to generate distinct feature representations  ...  CONCLUSION In this paper, we proposed an original temporal fusion and multi-scale semantic segmentation-based baggage threat detection framework to recognize extremely concealed and cluttered contraband  ... 
arXiv:2111.02651v2 fatcat:ufeivmaervabbjhfjyeu6bn6tm

Bringing Background into the Foreground: Making All Classes Equal in Weakly-Supervised Video Semantic Segmentation

Fatemeh Sadat Saleh, Mohammad Sadegh Aliakbarian, Mathieu Salzmann, Lars Petersson, Jose M. Alvarez
2017 2017 IEEE International Conference on Computer Vision (ICCV)  
Pixel-level annotations are expensive and time-consuming to obtain. Hence, weak supervision using only image tags could have a significant impact in semantic segmentation.  ...  We then develop a two-stream deep architecture that jointly leverages appearance and motion, and design a loss based on our heatmaps to train it.  ...  Qualitative results on CityScapes, CamVid, and YouTube-Objects. Note that for each dataset, from top to bottom, there is the RGB frame, Ground-truth and the prediction of our two-stream network.  ... 
doi:10.1109/iccv.2017.232 dblp:conf/iccv/SalehASPA17 fatcat:c24wvnhaovgbvm6wcvfbrfvw5y

Survey on Semantic Segmentation using Deep Learning Techniques

Fahad Lateef, Yassine Ruichek
2019 Neurocomputing  
Semantic segmentation is a challenging task in computer vision systems.  ...  For this reason, we propose to survey these methods by, first categorizing them into ten different classes according to the common concepts underlying their architectures.  ...  ACKNOWLEDGMENT The authors express their gratitude to University Technology Belfort-Montbeliard and Higher Education Commission of Pakistan for providing the support and necessary requirement for completion  ... 
doi:10.1016/j.neucom.2019.02.003 fatcat:aelsfl7unvdw5j2rtyqhtgqrsm

Pedestrian segmentation based on a spatio-temporally consistent graph-cut with optimal transport

Yang Yu, Yasushi Makihara, Yasushi Yagi
2019 IPSJ Transactions on Computer Vision and Applications  
To maintain better temporal consistency of segmentation even under relatively large motions, we introduce a transportation minimization framework that provides a temporal correspondence.  ...  instance level and semantic level.  ...  Acknowledgments We thank Glenn Pennycook, MSc, from Edanz Group (www.edanzediting.com/ ac) for editing a draft of this manuscript.  ... 
doi:10.1186/s41074-019-0062-2 fatcat:hjsf6pgmkjatjlsjwbukk7gycm

How to Reduce Change Detection to Semantic Segmentation [article]

Guo-Hua Wang, Bin-Bin Gao, Chengjie Wang
2022 arXiv   pre-print
Based on it, we devise a module named MTF to extract the change information and fuse temporal features. MTF enjoys high interpretability and reveals the essential characteristic of CD.  ...  In this paper, we propose a new paradigm that reduces CD to semantic segmentation which means tailoring an existing and powerful semantic segmentation network to solve CD.  ...  With our paradigm, applying a more powerful semantic segmentation network is a promising way to further boost the performance.  ... 
arXiv:2206.07557v1 fatcat:sjt7pf3ifzcfpprezwhfkasjha

Multiscale matters for part segmentation of instruments in robotic surgery

Wenhao He, Haitao Song, Yue Guo, Guibin Bian, Yuejie Sun, Xiaowei Zhou, Xiaonan Wang
2020 IET Image Processing  
. 4 Segmentation masks of baselines and our method: three columns, respectively, correspond to results in Endovis15, Endovis17, and Endovis18, and every colour corresponds to a specific semantic class.  ...  In this work, the authors introduce an end-to-end recurrent model that comprises a multiscale semantic segmentation network and a refinement model.  ...  For future work, we plan to build a recurrent refinement model for temporal semantic segmentation and establish an effective framework to segment instruments across multiple domains.  ... 
doi:10.1049/iet-ipr.2020.0320 fatcat:oir6g37vvvawnbeelto2tamcla

Video Panoptic Segmentation [article]

Dahun Kim, Sanghyun Woo, Joon-Young Lee, In So Kweon
2020 arXiv   pre-print
To provide appropriate metrics for this task, we propose a video panoptic quality (VPQ) metric and evaluate our method and several other baselines.  ...  Panoptic segmentation has become a new standard of visual recognition task by unifying previous semantic segmentation and instance segmentation tasks in concert.  ...  The result implies that the Fuse and Track modules share information, and synergize each other to learn more discriminative features for both segmentation and tracking.  ... 
arXiv:2006.11339v1 fatcat:5n24gqouzbchbciwzxmnrjvcny

3D Semantic Scene Perception using Distributed Smart Edge Sensors [article]

Simon Bultmann, Sven Behnke
2022 arXiv   pre-print
We present a system for 3D semantic scene perception consisting of a network of distributed smart edge sensors.  ...  well as semantically annotated point clouds are streamed from the sensors to a central backend, where multiple viewpoints are fused into an allocentric 3D semantic scene model.  ...  Acknowledgments This work was funded by grant BE 2556/16-2 of the German Research Foundation (DFG) and Fraunhofer IAIS.  ... 
arXiv:2205.01460v1 fatcat:fpe5lpemz5al5p5l4fdiltraoa

Color and Depth-Based Superpixels for Background and Object Segmentation

Islem Jebari, David Filliat
2012 Procedia Engineering  
We present an approach to multimodal semantic segmentation based on both color and depth information.  ...  Our goal is to build a semantic map containing high-level information, namely objects and background categories (carpet, parquet, walls ...).  ...  Multimodal semantic segmentation We present the three semantic segmentation algorithms that we used for evaluating the interest of color and depth information for semantic segmentation.  ... 
doi:10.1016/j.proeng.2012.07.315 fatcat:bpqwhy36lvgixdr3xd3s7kp2w4

Video Object Segmentation and Tracking: A Survey [article]

Rui Yao, Guosheng Lin, Shixiong Xia, Jiaqi Zhao, Yong Zhou
2019 arXiv   pre-print
First, we provide a hierarchical categorization of existing approaches, including unsupervised VOS, semi-supervised VOS, interactive VOS, weakly supervised VOS, and segmentation-based tracking methods.  ...  Object segmentation and object tracking are fundamental research areas in the computer vision community.  ...  Moreover, a probabilistic 3D segmentation method [66] is proposed to combine spatial, temporal, and semantic information to make better-informed decisions. Discussion.  ... 
arXiv:1904.09172v3 fatcat:nm3zptbidvgxfkxezqjekwdpdi