Light-weight spatio-temporal graphs for segmentation and ejection fraction prediction in cardiac ultrasound
[article]
2022
arXiv
pre-print
Compared to semantic segmentation, GCNs show accurate segmentation and improvements in robustness and inference runtime. ...
Models for direct coordinate regression based on Graph Convolutional Networks (GCNs) are used to detect the keypoints. ...
Right: Hausdorff distance (in pixels) box plot of segmentation results. Compared to semantic segmentation, the EchoGraphs (with MobileNet2 backbone) is more robust. ...
arXiv:2207.02549v1
fatcat:2ojsekpubrfu3ifigizgtrympa
VCIP 2020 Index
2020
2020 IEEE International Conference on Visual Communications and Image Processing (VCIP)
Method for Semantic Segmentation
Wu, Qingbo
Mono is Enough: Instance Segmentation from Single Annotated Sample
Wu, Xinju
Sparse Representation-Based Intra Prediction
Lossless/Near Lossless Video ...
with
Lyapunov Optimization
Zhang, Honggang
Learning Graph Topology Representation with Attention Networks
Zhang, Honggang
Robust Visual Tracking Via An Imbalance-Elimination Mechanism
Zhang ...
doi:10.1109/vcip49819.2020.9301896
fatcat:bdh7cuvstzgrbaztnahjdp5s5y
AirObject: A Temporally Evolving Graph Embedding for Object Identification
[article]
2022
arXiv
pre-print
However, such systems are limited to a "fixed" partial object representation from a single viewpoint. ...
We demonstrate that AirObject achieves the state-of-the-art performance for video object identification and is robust to severe occlusion, perceptual aliasing, viewpoint shift, deformation, and scale transform ...
Video Instance Segmentation (OVIS) [35], and Tracking Any Object with Video Object Segmentation (TAO-VOS) [8, 44]. ...
arXiv:2111.15150v2
fatcat:cql6zqnvbngk3f6t3rlft4dgw4
Multi-source Multi-modal Activity Recognition in Aerial Video Surveillance
2014
2014 IEEE Conference on Computer Vision and Pattern Recognition Workshops
Recognizing activities in wide aerial/overhead imagery remains a challenging problem due in part to low-resolution video and cluttered scenes with a large number of moving objects. ...
segments of targets-of-interest (TOIs) (in both pixel and geo-coordinates). ...
Multi-Graph Representation of a Single FMV Track The multi-source association framework is based on a graph representation and matching of target tracks and chat messages. ...
doi:10.1109/cvprw.2014.44
dblp:conf/cvpr/HammoudSBR14
fatcat:p7pqnfbzefdplpm5eztcgk2lge
Learning event representations for temporal segmentation of image sequences by dynamic graph embedding
[article]
2020
arXiv
pre-print
In particular, it achieves robust temporal segmentation on the EDUBSeg and EDUBSeg-Desc benchmark datasets, outperforming the state of the art. ...
representation to take into account the current data graph structure. ...
Section 2 highlights related work on data representation learning on graphs and on the temporal segmentation of videos and image sequences. ...
arXiv:1910.03483v3
fatcat:fie43dv7srhxdh4sd42ij2dwoe
2021 Index IEEE Transactions on Image Processing Vol. 30
2021
IEEE Transactions on Image Processing
., +, TIP 2021 6906-6916 SRGAT: Single Image Super-Resolution With Graph Attention Network. ...
Shao, H., +, TIP 2021 3764-3777 Image segmentation 3D Interactive Segmentation With Semi-Implicit Representation and Active Learning. ...
doi:10.1109/tip.2022.3142569
fatcat:z26yhwuecbgrnb2czhwjlf73qu
Automatic Association of Chats and Video Tracks for Activity Learning and Recognition in Aerial Video Surveillance
2014
Sensors
VIVA utilizes analyst call-outs (ACOs) in the form of chat messages (voice-to-text) to associate labels with video target tracks, to designate spatial-temporal activity boundaries and to augment video ...
VIVA and MINER examples are demonstrated for wide aerial/overhead imagery over common data sets affording an improvement in tracking from video data alone, leading to 84% detection with modest misdetection ...
Multi-Graph Representation of a Single FMV Track The multi-source association framework is based on a graph representation and matching of target tracks and chat messages. ...
doi:10.3390/s141019843
pmid:25340453
pmcid:PMC4239870
fatcat:ony3ylej4nhzxbnap2zide3kwi
Joint Future Semantic and Instance Segmentation Prediction
[chapter]
2019
Lecture Notes in Computer Science
In this work, we introduce a novel prediction approach that encodes instance and semantic segmentation information in a single representation based on distance maps. ...
However, predicting directly in the image color space seems an overly complex task, and predicting higher level representations using semantic or instance segmentation approaches were shown to be more ...
We extend semantic segmentation forecasting by proposing a novel representation that encodes both semantic and instance information, with low training requirements and temporally consistent predictions ...
doi:10.1007/978-3-030-11015-4_14
fatcat:r6srpgaw6vasdeg5l4ayzqqqca
SMOR: A Semantic Multi-View Object Representation System in 2D Image Sequences
2013
The Arabian Journal for Science and Engineering
The system includes the use of a set of simple algorithms to segment, identify, group and track the semantic objects modeled as a set of regions with compatible surface features. ...
Then we track the segmented regions/objects throughout the sequence using motion information and color constancy. The result of the system is a multi-view representation of objects of a static scene. ...
In this paper, we investigate the combination of object segmentation from a single image with object tracking in 2D image sequences to achieve more robust objects representation of a scene containing multiple ...
doi:10.1007/s13369-013-0693-z
fatcat:apv3yafakzh5ldgytjsliye2oa
2020 Index IEEE Transactions on Image Processing Vol. 29
2020
IEEE Transactions on Image Processing
., +, TIP 2020 8476-8489
Adaptive Graph Representation Learning for Video Person Re-Identification. ...
Soner, B., +, TIP 2020 4505-4515
Semantic Segmentation With Context Encoding and Multi-Path Decoding. ...
doi:10.1109/tip.2020.3046056
fatcat:24m6k2elprf2nfmucbjzhvzk3m
MAN: Moment Alignment Network for Natural Language Moment Retrieval via Iterative Graph Adjustment
2019
2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
MAN naturally assigns candidate moment representations aligned with language semantics over different temporal locations and scales. ...
This research strives for natural language moment retrieval in long, untrimmed video streams. ...
Language features are encoded as efficient dynamic filters and convolved with input visual representations to deal with semantic misalignment. ...
doi:10.1109/cvpr.2019.00134
dblp:conf/cvpr/ZhangDWWD19
fatcat:mbglkapzw5hr5aoxrz6msvll5m
State of the Art: A Summary of Semantic Image and Video Retrieval Techniques
2015
Indian Journal of Science and Technology
efficient semantic video retrieval. ...
Due to these reasons semantic video retrieval became a challenging issue in various industries. ...
The graph-based representation enables us to coherently fuse multimodal resources through graphs with proper probabilistic interpretation. ...
doi:10.17485/ijst/2015/v8i35/77061
fatcat:2htopyojqjd7bkjt6mx66cf24i
Semi-Supervised Learning for Multi-Task Scene Understanding by Neural Graph Consensus
[article]
2020
arXiv
pre-print
We show how prediction of different representations such as depth, semantic segmentation, surface normals and pose from RGB input could be effectively learned through self-supervised consensus in our graph ...
By optimizing such consensus between different paths, the graph reaches consistency and robustness over multiple interpretations and generations, in the face of unknown labels. ...
Making code available There is a strong need today for video data with multiple labeled representations. ...
arXiv:2010.01086v2
fatcat:ng27p5utdnabplogh5qhokdlh4
MAN: Moment Alignment Network for Natural Language Moment Retrieval via Iterative Graph Adjustment
[article]
2019
arXiv
pre-print
MAN naturally assigns candidate moment representations aligned with language semantics over different temporal locations and scales. ...
This research strives for natural language moment retrieval in long, untrimmed video streams. ...
Language features are encoded as efficient dynamic filters and convolved with input visual representations to deal with semantic misalignment. ...
arXiv:1812.00087v2
fatcat:cbxtybz4cnf3xbqiudlyj3rlm4
Temporally coherent interpretations for long videos using pattern theory
2015
2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
Graph-theoretical methods have successfully provided semantic and structural interpretations of images and videos. ...
Interpretations for long video segments were able to yield performance increases of about 70% and, in addition, proved to be more robust to different severe scenarios of classification errors. ...
A video interpretation is a sequence of interpretations for a temporal window containing a single video segment or multiple consecutive segments. ...
doi:10.1109/cvpr.2015.7298727
dblp:conf/cvpr/SouzaSSS15
fatcat:ijytw67pknhk5lmfzic4y67qna