A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2020; you can also visit the original URL.
The file type is application/pdf
.
Filters
Gaussian Temporal Awareness Networks for Action Localization
[article]
2019
arXiv
pre-print
Specifically, we present Gaussian Temporal Awareness Networks (GTAN) --- a new architecture that novelly integrates the exploitation of temporal structure into an one-stage action localization framework ...
Temporally localizing actions in a video is a fundamental challenge in video understanding. ...
Conclusions We have presented Gaussian Temporal Awareness Networks (GTAN) which aim to explore temporal structure of actions for temporal action localization. ...
arXiv:1909.03877v1
fatcat:t2anehy6pran7gp5tqejuegllm
Temporal Action Localization with Variance-Aware Networks
[article]
2020
arXiv
pre-print
This work addresses the problem of temporal action localization with Variance-Aware Networks (VAN), i.e., DNNs that use second-order statistics in the input and/or the output of regression tasks. ...
To train the network, we define a differentiable loss based on the KL-divergence between the predicted Gaussian and a Gaussian around the ground truth action borders, and use standard back-propagation. ...
Variance Aware Network (VAN) In this section we describe our Variance Aware Networks (VAN) for action localization. ...
arXiv:2008.11254v1
fatcat:vtxg7ia5wfbjtb2iopj7nc6muy
A Context-Aware Loss Function for Action Spotting in Soccer Videos
[article]
2020
arXiv
pre-print
In video understanding, action spotting consists in temporally localizing human-induced events annotated with single timestamps. ...
Finally, we qualitatively illustrate how our loss induces a precise temporal understanding of actions and show how such semantic knowledge can be used for automatic highlights generation. ...
temporal scale of each action proposal with Gaussian kernels. ...
arXiv:1912.01326v3
fatcat:j4lodzlpnfd5vjt63tshvt4lcq
2020 Index IEEE Transactions on Pattern Analysis and Machine Intelligence Vol. 42
2021
IEEE Transactions on Pattern Analysis and Machine Intelligence
., +, TPAMI May 2020 1243-1256 Interpolation A Temporally-Aware Interpolation Network for Video Frame Inpainting. ...
Yamasaki, R., +, TPAMI Sept. 2020 2273-2286
Extrapolation
A Temporally-Aware Interpolation Network for Video Frame Inpainting. ...
., +, 2670 -2683 DART: Distribution Aware Retinal Transform for Event-Based Cameras. ...
doi:10.1109/tpami.2020.3036557
fatcat:3j6s2l53x5eqxnlsptsgbjeebe
LINDA: Multi-Agent Local Information Decomposition for Awareness of Teammates
[article]
2022
arXiv
pre-print
To address this problem, we propose a novel framework, multi-agent Local INformation Decomposition for Awareness of teammates (LINDA), with which agents learn to decompose local information and build awareness ...
Sufficient experiments show that the proposed framework learns informative awareness from local partial observations for better collaboration and significantly improves the learning performance, especially ...
In the global mixing network, the action value functions of all the agents are gathered to estimate the global action value and compute the Temporal Difference (TD) error for optimization. ...
arXiv:2109.12508v3
fatcat:qvogorybrbbgvnwyxb2wzxewxi
Fast Learning of Temporal Action Proposal via Dense Boundary Generator
[article]
2019
arXiv
pre-print
In particular, the DBG consists of two modules: Temporal boundary classification (TBC) and Action-aware completeness regression (ACR). ...
boundary classification and action completeness regression for densely distributed proposals. ...
for single RGB frame and temporal network for stacked optical flow field. ...
arXiv:1911.04127v1
fatcat:aibntdok6fc7biyzw3weqrd2tm
Fast Learning of Temporal Action Proposal via Dense Boundary Generator
2020
PROCEEDINGS OF THE THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE AND THE TWENTY-EIGHTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE
In particular, the DBG consists of two modules: Temporal boundary classification (TBC) and Action-aware completeness regression (ACR). ...
boundary classification and action completeness regression for densely distributed proposals. ...
for single RGB frame and temporal network for stacked optical flow field. ...
doi:10.1609/aaai.v34i07.6815
fatcat:fyspfa3gbvcrpj63fjcxvhlctq
Interpreting Depression From Question-wise Long-term Video Recording of SDS Evaluation
[article]
2021
arXiv
pre-print
Specifically, we resort to a hierarchical model which utilizes a 3D CNN for local temporal pattern exploration and a redundancy-aware self-attention (RAS) scheme for question-wise global feature aggregation ...
Clinically, facial expression (FE) and actions play a vital role in clinician-administered evaluation, while FE and action are underexplored for self-administered evaluations. ...
local temporal modeling, redundancy-aware RAS for global attention modeling, and SDS-score conditional question-wise fusion. • Our parametric redundancy-aware self-attention (RAS) scheme explicitly emphasizes ...
arXiv:2106.13393v1
fatcat:mtjfnhu7irhfde5fvnpfmch4pi
Improved Spatio-temporal Salient Feature Detection for Action Recognition
2011
Procedings of the British Machine Vision Conference 2011
Spatio-temporal salient features can localize the local motion events and are used to represent video sequences for many computer vision tasks such as action recognition. ...
Based on the positive results of precision and reproducibility tests, we propose the use of temporally asymmetric filtering for robust motion feature detection and action recognition. ...
Acknowledgment GEOIDE (Geomatics for Informed Decision), a Network for Centers of Excellence supported by the NSERC Canada, is thanked for the financial support of this project. ...
doi:10.5244/c.25.100
dblp:conf/bmvc/ShabaniCZ11
fatcat:qq45lyubnbgetkcdtx6ekomb4e
2020 Index IEEE Transactions on Image Processing Vol. 29
2020
IEEE Transactions on Image Processing
., +, TIP 2020 5571-5583 Revisiting Anchor Mechanisms for Temporal Action Localization. ...
Ao, W., +, TIP 2020 1944-1957 Occlusion-Aware Region-Based 3D Pose Tracking of Objects With Temporally Consistent Polar-Based Local Partitioning. ...
doi:10.1109/tip.2020.3046056
fatcat:24m6k2elprf2nfmucbjzhvzk3m
2021 Index IEEE Transactions on Multimedia Vol. 23
2021
IEEE transactions on multimedia
The Author Index contains the primary entry for each item, listed under the first author's name. ...
., +, TMM 2021 2471-2480 Temporal Locality-Aware Network With Dual Structure for Accurate and Fast Action Detection. ...
., +, TMM 2021 1083-1094 AFNet: Temporal Locality-Aware Network With Dual Structure for Accurate and Fast Action Detection. ...
doi:10.1109/tmm.2022.3141947
fatcat:lil2nf3vd5ehbfgtslulu7y3lq
A Context-Aware Loss Function for Action Spotting in Soccer Videos
2020
2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
In video understanding, action spotting consists in temporally localizing human-induced events annotated with single timestamps. ...
Finally, we qualitatively illustrate how our loss induces a precise temporal understanding of actions and show how such semantic knowledge can be used for automatic highlights generation. ...
temporal scale of each action proposal with Gaussian kernels. ...
doi:10.1109/cvpr42600.2020.01314
dblp:conf/cvpr/CioppaDGGDGM20
fatcat:md577nh6qva3pdxjveoiztj5ka
2021 Index IEEE Transactions on Image Processing Vol. 30
2021
IEEE Transactions on Image Processing
The Author Index contains the primary entry for each item, listed under the first author's name. ...
., +, TIP 2021 5920-5932 Modeling Sub-Actions for Weakly Supervised Temporal Action Localization. ...
., +, TIP 2021 9332-9344 Multi-Scale Structure-Aware Network for Weakly Supervised Temporal Action Detection. ...
doi:10.1109/tip.2022.3142569
fatcat:z26yhwuecbgrnb2czhwjlf73qu
Identification and Prediction of Large Pedestrian Flow in Urban Areas Based on a Hybrid Detection Approach
2016
Sustainability
With the hybrid model, the Log Distance Path Loss (LDPL) model was used to estimate the pedestrian density from raw network data, and retrieve information with the Gaussian Progress (GP) through supervised ...
Temporal-spatial prediction of the pedestrian data was carried out with Machine Learning (ML) approaches. ...
B.W. provided critical suggestions and inputs for the case study and helped with writing the manuscript. ...
doi:10.3390/su9010036
fatcat:e6gybjnxc5hxtdtmaqropvk7l4
Enhanced 3D convolutional networks for crowd counting
[article]
2019
arXiv
pre-print
Recently, convolutional neural networks (CNNs) are the leading defacto method for crowd counting. ...
Specifically, we incorporate 3D convolution kernels to encode local spatio-temporal features. ...
with local spatio-temporal features to boost the accuracy for crowd counting. ...
arXiv:1908.04121v1
fatcat:wpfdr3hy5fdi5gsp2lynkmaluy
« Previous
Showing results 1 — 15 out of 14,073 results