A QUALITATIVE APPROACH FOR ONLINE ACTIVITY RECOGNITION [chapter]

MUHANNAD ALOMARI, PAUL DUCKWORTH, YIANNIS GATSOULIS, DAVID C. HOGG, ANTHONY G. COHN
2016 Advances in Cooperative Robotics  
where a Bayesian framework is used to temporally segment videos containing actions into atomic movements.  ...  A variety of applications rely on learning activity models and temporal segmentation of video into human activities, such as: human robot interaction, smart surveillance systems, and semantic video database  ... 
doi:10.1142/9789813149137_0086 fatcat:34irteyvc5hxrghp4fqy3bpl4q

Improving Action Localization by Progressive Cross-Stream Cooperation

Rui Su, Wanli Ouyang, Luping Zhou, Dong Xu
2019 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)  
To improve action localization results at the video level, we additionally propose a new strategy to train class-specific actionness detectors for better temporal segmentation, which can be readily learnt  ...  Spatio-temporal action localization consists of three levels of tasks: spatial localization, action classification, and temporal segmentation.  ...  On the other hand, the detected bounding boxes for actions in individual frames need to be linked in order to form action tubes and temporally segmented out from the entire video clip.  ... 
doi:10.1109/cvpr.2019.01229 dblp:conf/cvpr/SuOZX19 fatcat:dsjzouvjynhs7ccw7bvshwhuc4

Towards Collaborative Video Authoring [chapter]

Boris Novikov, Oleg Proskurnin
2003 Lecture Notes in Computer Science  
Our main contribution is the development of a new data model called concurrent video, especially intended for cooperative authoring environments.  ...  We demonstrate that the presented model provides efficient means of organizing and manipulating video data, at the same time enabling direct use of merging mechanisms, which constitute a formal basis for  ...  A set of basic operations for temporal and spatial composition of video segments is defined in video algebra [23].  ... 
doi:10.1007/978-3-540-39403-7_28 fatcat:6fr6znlvnzcsxgw5zyadlrdecy

Coherent Loss: A Generic Framework for Stable Video Segmentation [article]

Mingyang Qian, Yi Fu, Xiao Tan, Yingying Li, Jinqing Qi, Huchuan Lu, Shilei Wen, Errui Ding
2020 arXiv   pre-print
We investigate how this jittering artifact degrades the visual quality of video segmentation results and proposed a metric of temporal stability to numerically evaluate it.  ...  Video segmentation approaches are of great importance for numerous vision tasks especially in video manipulation for entertainment.  ...  Owing it is orthogonal to the network architecture, it hence can be used to cooperate with different segmentation models.  ... 
arXiv:2010.13085v1 fatcat:hgra5ikjzfbchaj62h5n7mttam

Combining Supervised and Un-supervised Learning for Automatic Citrus Segmentation [article]

Heqing Huang, Tongbin Huang, Zhen Li, Zhiwei Wei, Shilei Lv
2021 arXiv   pre-print
and the temporal information of citrus changes to enhance the segmentation effect.  ...  Compared with most of the existing citrus segmentation methods, our method uses a small amount of supervised data and a large number of unsupervised data, while learning the pixel level location information  ...  CONCLUSION We propose a cooperative citrus segmentation method for video of learning temporal information and pixel level generation at the same time.  ... 
arXiv:2105.01553v1 fatcat:fjjsc7zpg5fxxc7gsgixqfi3mu

ASFormer: Transformer for Action Segmentation [article]

Fangqiu Yi and Hongyu Wen and Tingting Jiang
2021 arXiv   pre-print
Algorithms for the action segmentation task typically use temporal models to predict what action is occurring at each frame for a minute-long daily activity.  ...  long input sequence, and the limitation of the decoder architecture to utilize temporal relations among multiple action segments to refine the initial predictions.  ...  Instead of using raw RGB video sequences as the input, action segmentation methods operate on pre-extracted frame-wise feature sequences and focus on modeling the temporal relations among frames.  ... 
arXiv:2110.08568v1 fatcat:aoarkiutnnf3zi2oe32x3nqlk4

Trajectory based event tactics analysis in broadcast sports video

Guangyu Zhu, Qingming Huang, Changsheng Xu, Yong Rui, Shuqiang Jiang, Wen Gao, Hongxun Yao
2007 Proceedings of the 15th international conference on Multimedia - MULTIMEDIA '07  
Compared with existing work, we proposed an effective tactic representation called aggregate trajectory which is constructed based on multiple trajectories using a novel analysis of temporal-spatial interaction  ...  However, professionals, such as soccer coaches, are more interested in the tactics used in the events.  ...  Local temporal-spatial analysis is carried out on the segmented temporal intervals of the whole goal event.  ... 
doi:10.1145/1291233.1291250 dblp:conf/mm/ZhuHXRJGY07 fatcat:tvhnzhjio5fyxfplhqdish2bay

Communication Audio and Video Coding and Decoding Methods in Wireless Cooperative Communication Network

Shenghui Li, Wenmin Wang, Hasan Ali Khattak
2022 Mobile Information Systems  
The purpose of this paper is to study the communication audio and video codec methods in wireless cooperative communication networks and to understand the development status of audio and video codec by  ...  The proposed network coding can effectively eliminate the transmission bottleneck problem faced by wireless cooperative communication.  ...  Research on Communication Audio and Video Coding and Decoding Methods in Wireless Cooperative Communication Networks 2.1. Wireless Cooperative Communication System Model.  ... 
doi:10.1155/2022/1587662 fatcat:f4nxknoyznfytglatkpf6jq4w4

Event Tactic Analysis Based on Broadcast Sports Video

Guangyu Zhu, Changsheng Xu, Qingming Huang, Yong Rui, Shuqiang Jiang, Wen Gao, Hongxun Yao
2009 IEEE transactions on multimedia  
We extract the attack events with far-view shots using the analysis and alignment of web-casting text and broadcast video.  ...  Most existing approaches on sports video analysis have concentrated on semantic event detection.  ...  Secondly, the graph modeling is introduced for trajectory temporal-spatial analysis to increase the robustness of original algorithm, where the temporal relationship between the successive trajectory segments  ... 
doi:10.1109/tmm.2008.2008918 fatcat:sxpny4cbgbbznbsy4aejchynmu

Hepatocellular Carcinoma Segmentation from Digital Subtraction Angiography Videos using Learnable Temporal Difference [article]

Wenting Jiang, Yicheng Jiang, Lu Zhang, Changmiao Wang, Xiaoguang Han, Shuixing Zhang, Xiang Wan, Shuguang Cui
2021 arXiv   pre-print
Few studies have investigated HCC segmentation from DSA videos.  ...  We also propose a novel segmentation network called DSA-LTDNet, including a segmentation sub-network, a temporal difference learning (TDL) module and a liver region segmentation (LRS) sub-network for providing  ...  It is regretful that few studies have investigated segmentation of DSA videos before. Segmentation of DSA Videos is similar to the video object segmentation (VOS) task.  ... 
arXiv:2107.04306v3 fatcat:htvshmhvonbppccu7sbyabql6m

Object Detection and Tracking: A Review

Prof. Mukund R. Joshi
2016 INTERNATIONAL JOURNAL OF EMERGING TRENDS IN SCIENCE AND TECHNOLOGY  
Object tracking is performed using monitoring objects spatial and temporal changes during a video sequence, including its presence, position, size, shape, etc.  ...  In this framework, the detection and recognition of objects proceed simultaneously with image segmentation in a competitive and cooperative manner.  ...  We try to make the detection and recognition of objects proceed simultaneously with image segmentation in a competitive and cooperative manner.  ... 
doi:10.18535/ijetst/v3i04.03 fatcat:ds6xwlyvonhvfbihkflf7dvhze

Temporal Relational Modeling with Self-Supervision for Action Segmentation [article]

Dong Wang, Di Hu, Xingjian Li, Dejing Dou
2020 arXiv   pre-print
Temporal relational modeling in video is essential for human action understanding, such as action recognition and action segmentation.  ...  The main reason is that large number of nodes (i.e., video frames) makes GCNs hard to capture and model temporal relations in videos.  ...  (Fathi, Farhadi, and Rehg 2011; Fathi, Ren, and Rehg 2011; Fathi and Rehg 2013) attempted to use a segmental model to predict the temporally consistent action segments. Cheng et al.  ... 
arXiv:2012.07508v1 fatcat:au2hj2vz6vblnch2lxefyncvwa

Cooperative Cross-Stream Network for Discriminative Action Representation [article]

Jingran Zhang, Fumin Shen, Xing Xu, Heng Tao Shen
2019 arXiv   pre-print
Spatial and temporal stream model has gained great success in video action recognition.  ...  The jointly spatial and temporal stream networks feature extraction is accomplished by an end-to-end learning manner.  ...  Here, we extract three segments of a video and randomly sample a video snippet of 10 frames on each segment as input for training. During testing, 25 frames are sampled for each video.  ... 
arXiv:1908.10136v1 fatcat:hwu2pnudxffmfg3iajq7s3ymsm

A-MAL: Automatic Movement Assessment Learning from Properly Performed Movements in 3D Skeleton Videos [article]

Tal Hakim, Ilan Shimshoni
2020 arXiv   pre-print
time-segmentation algorithm, a parameter relevance detection algorithm, a novel time-warping algorithm that is based on automatic detection of common temporal points-of-interest and a textual-feedback  ...  The ability to automatically assess subject movement in videos that were captured by affordable devices, such as Kinect cameras, is essential for monitoring clinical rehabilitation processes, for improving  ...  We would like to thank the department of occupational therapy in the Galilee Medical Center for their cooperation and in particular Dorit Itah, who guided and labeled the FMA tests.  ... 
arXiv:1907.10004v4 fatcat:jpcpabgsobf6tjln6yhv2hqnk4

MPEG-7 and multimedia database systems

Harald Kosch
2002 SIGMOD record  
We argue that MPEG-7 has to be considered complementary to, rather than competing with, data models employed in MMDBSs.  ...  All the video segments are temporally connected. Text annotations are associated with the video segments using the TextAnnotation D.  ...  This useful description scheme describes temporal intervals or segments of video data which can correspond to an arbitrary sequence of frames, a single frame, or even the full video sequence.  ... 
doi:10.1145/565117.565123 fatcat:cyfthkvgbfctfe6kbtqistprtm
Showing results 1-15 out of 26,891 results