Filters








12,553 Hits in 6.9 sec

Light-weight spatio-temporal graphs for segmentation and ejection fraction prediction in cardiac ultrasound [article]

Sarina Thomas, Andrew Gilbert, Guy Ben-Yosef
2022 arXiv   pre-print
Compared to semantic segmentation, GCNs show accurate segmentation and improvements in robustness and inference runtime.  ...  Models for direct coordinate regression based on Graph Convolutional Networks (GCNs) are used to detect the keypoints.  ...  Right: Hausdorff distance (in pixels) box plot of segmentation results. Compared to semantic segmentation, the EchoGraphs (with MobileNet2 backbone) is more robust.  ... 
arXiv:2207.02549v1 fatcat:2ojsekpubrfu3ifigizgtrympa

VCIP 2020 Index

2020 2020 IEEE International Conference on Visual Communications and Image Processing (VCIP)  
Method for Semantic Segmentation Wu, Qingbo Mono is Enough: Instance Segmentation from Single Annotated Sample Wu, Xinju Sparse Representation-Based Intra Prediction Lossless/Near Lossless Video  ...  with Lyapunov Optimization Zhang, Honggang Learning Graph Topology Representation with Attention Networks Zhang, Honggang Robust Visual Tracking Via An Imbalance- Elimination Mechanism Zhang  ... 
doi:10.1109/vcip49819.2020.9301896 fatcat:bdh7cuvstzgrbaztnahjdp5s5y

AirObject: A Temporally Evolving Graph Embedding for Object Identification [article]

Nikhil Varma Keetha, Chen Wang, Yuheng Qiu, Kuan Xu, Sebastian Scherer
2022 arXiv   pre-print
However, such systems are limited to a "fixed" partial object representation from a single viewpoint.  ...  We demonstrate that AirObject achieves the state-of-the-art performance for video object identification and is robust to severe occlusion, perceptual aliasing, viewpoint shift, deformation, and scale transform  ...  Video Instance Segmentation (OVIS) [35] , and Tracking Any Object with Video Object Segmentation (TAO-VOS) [8, 44] .  ... 
arXiv:2111.15150v2 fatcat:cql6zqnvbngk3f6t3rlft4dgw4

Multi-source Multi-modal Activity Recognition in Aerial Video Surveillance

Riad I. Hammoud, Cem S. Sahin, Erik P. Blasch, Bradley J. Rhodes
2014 2014 IEEE Conference on Computer Vision and Pattern Recognition Workshops  
Recognizing activities in wide aerial/overhead imagery remains a challenging problem due in part to low-resolution video and cluttered scenes with a large number of moving objects.  ...  segments of targetsof-interest (TOIs) (in both pixel and geo-coordinates).  ...  Multi-Graph Representation of a Single FMV Track The multi-source association framework is based on a graph representation and matching of target tracks and chat messages.  ... 
doi:10.1109/cvprw.2014.44 dblp:conf/cvpr/HammoudSBR14 fatcat:p7pqnfbzefdplpm5eztcgk2lge

Learning event representations for temporal segmentation of image sequences by dynamic graph embedding [article]

Mariella Dimiccoli, Herwig Wendt
2020 arXiv   pre-print
In particular, it achieves robust temporal segmentation on the EDUBSeg and EDUBSeg-Desc benchmark datasets, outperforming the state of the art.  ...  representation to take into account the current data graph structure.  ...  Section 2 highlights related work on data representation learning on graphs and on the temporal segmentation of videos and image sequences.  ... 
arXiv:1910.03483v3 fatcat:fie43dv7srhxdh4sd42ij2dwoe

2021 Index IEEE Transactions on Image Processing Vol. 30

2021 IEEE Transactions on Image Processing  
., +, TIP 2021 6906-6916 SRGAT: Single Image Super-Resolution With Graph Attention Network.  ...  Shao, H., +, TIP 2021 3764-3777 Image segmentation 3D Interactive Segmentation With Semi-Implicit Representation and Active Learning.  ... 
doi:10.1109/tip.2022.3142569 fatcat:z26yhwuecbgrnb2czhwjlf73qu

Automatic Association of Chats and Video Tracks for Activity Learning and Recognition in Aerial Video Surveillance

Riad Hammoud, Cem Sahin, Erik Blasch, Bradley Rhodes, Tao Wang
2014 Sensors  
VIVA utilizes analyst call-outs (ACOs) in the form of chat messages (voice-to-text) to associate labels with video target tracks, to designate spatial-temporal activity boundaries and to augment video  ...  VIVA and MINER examples are demonstrated for wide aerial/overhead imagery over common data sets affording an improvement in tracking from video data alone, leading to 84% detection with modest misdetection  ...  Multi-Graph Representation of a Single FMV Track The multi-source association framework is based on a graph representation and matching of target tracks and chat messages.  ... 
doi:10.3390/s141019843 pmid:25340453 pmcid:PMC4239870 fatcat:ony3ylej4nhzxbnap2zide3kwi

Joint Future Semantic and Instance Segmentation Prediction [chapter]

Camille Couprie, Pauline Luc, Jakob Verbeek
2019 Lecture Notes in Computer Science  
In this work, we introduce a novel prediction approach that encodes instance and semantic segmentation information in a single representation based on distance maps.  ...  However, predicting directly in the image color space seems an overly complex task, and predicting higher level representations using semantic or instance segmentation approaches were shown to be more  ...  We extend semantic segmentation forecasting by proposing a novel representation that encodes both semantic and instance information, with low training requirements and temporally consistent predictions  ... 
doi:10.1007/978-3-030-11015-4_14 fatcat:r6srpgaw6vasdeg5l4ayzqqqca

SMOR: A Semantic Multi-View Object Representation System in 2D Image Sequences

Mehran Yazdi, Arash Golibagh Mahyari
2013 The Arabian Journal for Science and Engineering  
The system includes the use of a set of simple algorithms to segment, identify, group and track the semantic objects modeled as a set of regions with compatible surface features.  ...  Then we track the segmented regions/objects throughout the sequence using motion information and color constancy. The result of the system is a multi-view representation of objects of a static scene.  ...  In this paper, we investigate the combination of object segmentation from a single image with object tracking in 2D image sequences to achieve more robust objects representation of a scene containing multiple  ... 
doi:10.1007/s13369-013-0693-z fatcat:apv3yafakzh5ldgytjsliye2oa

2020 Index IEEE Transactions on Image Processing Vol. 29

2020 IEEE Transactions on Image Processing  
., +, TIP 2020 8476-8489 Adaptive Graph Representation Learning for Video Person Re-Identifica- tion.  ...  Soner, B., +, TIP 2020 4505-4515 Semantic Segmentation With Context Encoding and Multi-Path Decoding.  ... 
doi:10.1109/tip.2020.3046056 fatcat:24m6k2elprf2nfmucbjzhvzk3m

MAN: Moment Alignment Network for Natural Language Moment Retrieval via Iterative Graph Adjustment

Da Zhang, Xiyang Dai, Xin Wang, Yuan-Fang Wang, Larry S. Davis
2019 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)  
MAN naturally assigns candidate moment representations aligned with language semantics over different temporal locations and scales.  ...  This research strives for natural language moment retrieval in long, untrimmed video streams.  ...  Language features are encoded as efficient dynamic filters and convolved with input visual representations to deal with semantic misalignment.  ... 
doi:10.1109/cvpr.2019.00134 dblp:conf/cvpr/ZhangDWWD19 fatcat:mbglkapzw5hr5aoxrz6msvll5m

State of the Art: A Summary of Semantic Image and Video Retrieval Techniques

S. Suguna, C. Ranjith Kumar, D. Sheela Jeyarani
2015 Indian Journal of Science and Technology  
efficient semantic video retrieval.  ...  Due to these reasons semantic video retrieval became a challenging issue in various industries.  ...  The graph-based representation enables us to coherently fuse multimodal resources through graphs with proper probabilistic interpretation.  ... 
doi:10.17485/ijst/2015/v8i35/77061 fatcat:2htopyojqjd7bkjt6mx66cf24i

Semi-Supervised Learning for Multi-Task Scene Understanding by Neural Graph Consensus [article]

Marius Leordeanu, Mihai Pirvu, Dragos Costea, Alina Marcu, Emil Slusanschi, Rahul Sukthankar
2020 arXiv   pre-print
We show how prediction of different representations such as depth, semantic segmentation, surface normals and pose from RGB input could be effectively learned through self-supervised consensus in our graph  ...  By optimizing such consensus between different paths, the graph reaches consistency and robustness over multiple interpretations and generations, in the face of unknown labels.  ...  Making code available There is a strong need today for video data with multiple labeled representations.  ... 
arXiv:2010.01086v2 fatcat:ng27p5utdnabplogh5qhokdlh4

MAN: Moment Alignment Network for Natural Language Moment Retrieval via Iterative Graph Adjustment [article]

Da Zhang, Xiyang Dai, Xin Wang, Yuan-Fang Wang, Larry S. Davis
2019 arXiv   pre-print
MAN naturally assigns candidate moment representations aligned with language semantics over different temporal locations and scales.  ...  This research strives for natural language moment retrieval in long, untrimmed video streams.  ...  Language features are encoded as efficient dynamic filters and convolved with input visual representations to deal with semantic misalignment.  ... 
arXiv:1812.00087v2 fatcat:cbxtybz4cnf3xbqiudlyj3rlm4

Temporally coherent interpretations for long videos using pattern theory

Fillipe Souza, Sudeep Sarkar, Anuj Srivastava, Jingyong Su
2015 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)  
Graph-theoretical methods have successfully provided semantic and structural interpretations of images and videos.  ...  Interpretations for long video segments were able to yield performance increases of about 70% and, in addition, proved to be more robust to different severe scenarios of classification errors.  ...  A video interpretation is a sequence of interpretations for a temporal window containing a single video segment or multiple consecutive segments.  ... 
doi:10.1109/cvpr.2015.7298727 dblp:conf/cvpr/SouzaSSS15 fatcat:ijytw67pknhk5lmfzic4y67qna
« Previous Showing results 1 — 15 out of 12,553 results