Filters








14,073 Hits in 4.5 sec

Gaussian Temporal Awareness Networks for Action Localization [article]

Fuchen Long and Ting Yao and Zhaofan Qiu and Xinmei Tian and Jiebo Luo and Tao Mei
2019 arXiv   pre-print
Specifically, we present Gaussian Temporal Awareness Networks (GTAN) --- a new architecture that novelly integrates the exploitation of temporal structure into an one-stage action localization framework  ...  Temporally localizing actions in a video is a fundamental challenge in video understanding.  ...  Conclusions We have presented Gaussian Temporal Awareness Networks (GTAN) which aim to explore temporal structure of actions for temporal action localization.  ... 
arXiv:1909.03877v1 fatcat:t2anehy6pran7gp5tqejuegllm

Temporal Action Localization with Variance-Aware Networks [article]

Ting-Ting Xie, Christos Tzelepis, Ioannis Patras
2020 arXiv   pre-print
This work addresses the problem of temporal action localization with Variance-Aware Networks (VAN), i.e., DNNs that use second-order statistics in the input and/or the output of regression tasks.  ...  To train the network, we define a differentiable loss based on the KL-divergence between the predicted Gaussian and a Gaussian around the ground truth action borders, and use standard back-propagation.  ...  Variance Aware Network (VAN) In this section we describe our Variance Aware Networks (VAN) for action localization.  ... 
arXiv:2008.11254v1 fatcat:vtxg7ia5wfbjtb2iopj7nc6muy

A Context-Aware Loss Function for Action Spotting in Soccer Videos [article]

Anthony Cioppa, Adrien Deliège, Silvio Giancola, Bernard Ghanem, Marc Van Droogenbroeck, Rikke Gade, Thomas B. Moeslund
2020 arXiv   pre-print
In video understanding, action spotting consists in temporally localizing human-induced events annotated with single timestamps.  ...  Finally, we qualitatively illustrate how our loss induces a precise temporal understanding of actions and show how such semantic knowledge can be used for automatic highlights generation.  ...  temporal scale of each action proposal with Gaussian kernels.  ... 
arXiv:1912.01326v3 fatcat:j4lodzlpnfd5vjt63tshvt4lcq

2020 Index IEEE Transactions on Pattern Analysis and Machine Intelligence Vol. 42

2021 IEEE Transactions on Pattern Analysis and Machine Intelligence  
., +, TPAMI May 2020 1243-1256 Interpolation A Temporally-Aware Interpolation Network for Video Frame Inpainting.  ...  Yamasaki, R., +, TPAMI Sept. 2020 2273-2286 Extrapolation A Temporally-Aware Interpolation Network for Video Frame Inpainting.  ...  ., +, 2670 -2683 DART: Distribution Aware Retinal Transform for Event-Based Cameras.  ... 
doi:10.1109/tpami.2020.3036557 fatcat:3j6s2l53x5eqxnlsptsgbjeebe

LINDA: Multi-Agent Local Information Decomposition for Awareness of Teammates [article]

Jiahan Cao, Lei Yuan, Jianhao Wang, Shaowei Zhang, Chongjie Zhang, Yang Yu, De-Chuan Zhan
2022 arXiv   pre-print
To address this problem, we propose a novel framework, multi-agent Local INformation Decomposition for Awareness of teammates (LINDA), with which agents learn to decompose local information and build awareness  ...  Sufficient experiments show that the proposed framework learns informative awareness from local partial observations for better collaboration and significantly improves the learning performance, especially  ...  In the global mixing network, the action value functions of all the agents are gathered to estimate the global action value and compute the Temporal Difference (TD) error for optimization.  ... 
arXiv:2109.12508v3 fatcat:qvogorybrbbgvnwyxb2wzxewxi

Fast Learning of Temporal Action Proposal via Dense Boundary Generator [article]

Chuming Lin, Jian Li, Yabiao Wang, Ying Tai, Donghao Luo, Zhipeng Cui, Chengjie Wang, Jilin Li, Feiyue Huang, Rongrong Ji
2019 arXiv   pre-print
In particular, the DBG consists of two modules: Temporal boundary classification (TBC) and Action-aware completeness regression (ACR).  ...  boundary classification and action completeness regression for densely distributed proposals.  ...  for single RGB frame and temporal network for stacked optical flow field.  ... 
arXiv:1911.04127v1 fatcat:aibntdok6fc7biyzw3weqrd2tm

Fast Learning of Temporal Action Proposal via Dense Boundary Generator

Chuming Lin, Jian Li, Yabiao Wang, Ying Tai, Donghao Luo, Zhipeng Cui, Chengjie Wang, Jilin Li, Feiyue Huang, Rongrong Ji
2020 PROCEEDINGS OF THE THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE AND THE TWENTY-EIGHTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE  
In particular, the DBG consists of two modules: Temporal boundary classification (TBC) and Action-aware completeness regression (ACR).  ...  boundary classification and action completeness regression for densely distributed proposals.  ...  for single RGB frame and temporal network for stacked optical flow field.  ... 
doi:10.1609/aaai.v34i07.6815 fatcat:fyspfa3gbvcrpj63fjcxvhlctq

Interpreting Depression From Question-wise Long-term Video Recording of SDS Evaluation [article]

Wanqing Xie, Lizhong Liang, Yao Lu, Chen Wang, Jihong Shen, Hui Luo, Xiaofeng Liu
2021 arXiv   pre-print
Specifically, we resort to a hierarchical model which utilizes a 3D CNN for local temporal pattern exploration and a redundancy-aware self-attention (RAS) scheme for question-wise global feature aggregation  ...  Clinically, facial expression (FE) and actions play a vital role in clinician-administered evaluation, while FE and action are underexplored for self-administered evaluations.  ...  local temporal modeling, redundancy-aware RAS for global attention modeling, and SDS-score conditional question-wise fusion. • Our parametric redundancy-aware self-attention (RAS) scheme explicitly emphasizes  ... 
arXiv:2106.13393v1 fatcat:mtjfnhu7irhfde5fvnpfmch4pi

Improved Spatio-temporal Salient Feature Detection for Action Recognition

Amir-Hossein Shabani, David Clausi, John S. Zelek
2011 Procedings of the British Machine Vision Conference 2011  
Spatio-temporal salient features can localize the local motion events and are used to represent video sequences for many computer vision tasks such as action recognition.  ...  Based on the positive results of precision and reproducibility tests, we propose the use of temporally asymmetric filtering for robust motion feature detection and action recognition.  ...  Acknowledgment GEOIDE (Geomatics for Informed Decision), a Network for Centers of Excellence supported by the NSERC Canada, is thanked for the financial support of this project.  ... 
doi:10.5244/c.25.100 dblp:conf/bmvc/ShabaniCZ11 fatcat:qq45lyubnbgetkcdtx6ekomb4e

2020 Index IEEE Transactions on Image Processing Vol. 29

2020 IEEE Transactions on Image Processing  
., +, TIP 2020 5571-5583 Revisiting Anchor Mechanisms for Temporal Action Localization.  ...  Ao, W., +, TIP 2020 1944-1957 Occlusion-Aware Region-Based 3D Pose Tracking of Objects With Temporally Consistent Polar-Based Local Partitioning.  ... 
doi:10.1109/tip.2020.3046056 fatcat:24m6k2elprf2nfmucbjzhvzk3m

2021 Index IEEE Transactions on Multimedia Vol. 23

2021 IEEE transactions on multimedia  
The Author Index contains the primary entry for each item, listed under the first author's name.  ...  ., +, TMM 2021 2471-2480 Temporal Locality-Aware Network With Dual Structure for Accurate and Fast Action Detection.  ...  ., +, TMM 2021 1083-1094 AFNet: Temporal Locality-Aware Network With Dual Structure for Accurate and Fast Action Detection.  ... 
doi:10.1109/tmm.2022.3141947 fatcat:lil2nf3vd5ehbfgtslulu7y3lq

A Context-Aware Loss Function for Action Spotting in Soccer Videos

Anthony Cioppa, Adrien Deliege, Silvio Giancola, Bernard Ghanem, Marc Van Droogenbroeck, Rikke Gade, Thomas B. Moeslund
2020 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)  
In video understanding, action spotting consists in temporally localizing human-induced events annotated with single timestamps.  ...  Finally, we qualitatively illustrate how our loss induces a precise temporal understanding of actions and show how such semantic knowledge can be used for automatic highlights generation.  ...  temporal scale of each action proposal with Gaussian kernels.  ... 
doi:10.1109/cvpr42600.2020.01314 dblp:conf/cvpr/CioppaDGGDGM20 fatcat:md577nh6qva3pdxjveoiztj5ka

2021 Index IEEE Transactions on Image Processing Vol. 30

2021 IEEE Transactions on Image Processing  
The Author Index contains the primary entry for each item, listed under the first author's name.  ...  ., +, TIP 2021 5920-5932 Modeling Sub-Actions for Weakly Supervised Temporal Action Localization.  ...  ., +, TIP 2021 9332-9344 Multi-Scale Structure-Aware Network for Weakly Supervised Temporal Action Detection.  ... 
doi:10.1109/tip.2022.3142569 fatcat:z26yhwuecbgrnb2czhwjlf73qu

Identification and Prediction of Large Pedestrian Flow in Urban Areas Based on a Hybrid Detection Approach

Kaisheng Zhang, Mei Wang, Bangyang Wei, Daniel(Jian) Sun
2016 Sustainability  
With the hybrid model, the Log Distance Path Loss (LDPL) model was used to estimate the pedestrian density from raw network data, and retrieve information with the Gaussian Progress (GP) through supervised  ...  Temporal-spatial prediction of the pedestrian data was carried out with Machine Learning (ML) approaches.  ...  B.W. provided critical suggestions and inputs for the case study and helped with writing the manuscript.  ... 
doi:10.3390/su9010036 fatcat:e6gybjnxc5hxtdtmaqropvk7l4

Enhanced 3D convolutional networks for crowd counting [article]

Zhikang Zou, Huiliang Shao, Xiaoye Qu, Wei Wei, Pan Zhou
2019 arXiv   pre-print
Recently, convolutional neural networks (CNNs) are the leading defacto method for crowd counting.  ...  Specifically, we incorporate 3D convolution kernels to encode local spatio-temporal features.  ...  with local spatio-temporal features to boost the accuracy for crowd counting.  ... 
arXiv:1908.04121v1 fatcat:wpfdr3hy5fdi5gsp2lynkmaluy
« Previous Showing results 1 — 15 out of 14,073 results