Filters








2,236 Hits in 7.7 sec

Collaborative Distillation in the Parameter and Spectrum Domains for Video Action Recognition [article]

Haisheng Su, Jing Su, Dongliang Wang, Weihao Gan, Wei Wu, Mengmeng Wang, Junjie Yan, Yu Qiao
2020 arXiv   pre-print
Specifically, we propose two distillation strategies in the frequency domain, namely the feature spectrum and parameter distribution distillations respectively.  ...  Existing knowledge distillation methods are limited to the image-level spatial domain, ignoring the temporal and frequency information which provide structural knowledge and are important for video analysis  ...  To address the above issues, we propose two distillation strategies in the frequency domain for video action recognition, namely the feature spectrum distillation and parameter distribution distillation  ... 
arXiv:2009.06902v1 fatcat:ekr4p5r3hvgu3mfszugamlmq6m

TCGL: Temporal Contrastive Graph for Self-supervised Video Representation Learning [article]

Yang Liu, Keze Wang, Lingbo Liu, Haoyuan Lan, Liang Lin
2022 arXiv   pre-print
Experimental results demonstrate the superiority of our TCGL over the state-of-the-art methods on large-scale action recognition and video retrieval benchmarks.The code is publicly available at https:/  ...  However, existing methods fail to increase the temporal diversity of unlabeled videos and ignore elaborately modeling multi-scale temporal dependencies in an explicit way.  ...  domain for action recognition.  ... 
arXiv:2112.03587v3 fatcat:fgrz462zsrdt5ooprppcks4yim

Edge-Cloud Polarization and Collaboration: A Comprehensive Survey [article]

Jiangchao Yao, Shengyu Zhang, Yang Yao, Feng Wang, Jianxin Ma, Jianwei Zhang, Yunfei Chu, Luo Ji, Kunyang Jia, Tao Shen, Anpeng Wu, Fengda Zhang (+6 others)
2021 arXiv   pre-print
Specifically, we are the first to set up the collaborative learning mechanism for cloud and edge modeling with a thorough review of the architectures that enable such mechanism.  ...  In this survey, we conduct a systematic review for both cloud and edge AI.  ...  Image recognition involves analyzing images and identifying objects, actions, and other elements in order to draw conclusions.  ... 
arXiv:2111.06061v2 fatcat:qhbyomrom5ghvikjlqkqb7eayq

2021 Index IEEE Transactions on Multimedia Vol. 23

2021 IEEE transactions on multimedia  
Departments and other items may also be covered if they have been judged to have archival value. The Author Index contains the primary entry for each item, listed under the first author's name.  ...  -that appeared in this periodical during 2021, and items from previous years that were commented upon or corrected in 2021.  ...  ., +, TMM 2021 1640-1653 Adversarial Disentanglement Spectrum Variations and Cross-Modality Attention Networks for NIR-VIS Face Recognition.  ... 
doi:10.1109/tmm.2022.3141947 fatcat:lil2nf3vd5ehbfgtslulu7y3lq

Speech-gesture driven multimodal interfaces for crisis management

R. Sharma, M. Yeasin, N. Krahntoever, I. Rauschert, Guoray Cai, I. Brewer, A.M. Maceachren, K. Sengupta
2003 Proceedings of the IEEE  
The fourth part speculates on the short term and long term research directions that will help addressing the outstanding challenges in interfaces that support dialog and collaboration.  ...  In particular it describes, the evolution and implementation details of two representative systems, called crisis management (XISM) and Dialog Assisted Visual Environment for Geoinformation (DAVE_G).  ...  Tracking is commonly performed incrementally by adjusting the model parameters for a given video frame based on the parameters at earlier times, which improves the tracking accuracy and speed.  ... 
doi:10.1109/jproc.2003.817145 fatcat:flbaisvreresla7wufztzpnvfq

Transfer Learning for Future Wireless Networks: A Comprehensive Survey [article]

Cong T. Nguyen, Nguyen Van Huynh, Nam H. Chu, Yuris Mulya Saputra, Dinh Thai Hoang, Diep N. Nguyen, Quoc-Viet Pham, Dusit Niyato, Eryk Dutkiewicz, Won-Joo Hwang
2021 arXiv   pre-print
The issues include spectrum management, localization, signal recognition, security, human activity recognition and caching, which are all important to next-generation networks such as 5G and beyond.  ...  The core idea of TL is to leverage and synthesize distilled knowledge from similar tasks as well as from valuable experiences accumulated from the past to facilitate the learning of new problems.  ...  In this case, the TL approach can transfer the knowledge of DQN parameters and fine-tune them to map the video rate from source to target domain.  ... 
arXiv:2102.07572v2 fatcat:56si46duuvg55htquyhoawwg6m

Leveraging Unlabeled Data for Emotion Recognition With Enhanced Collaborative Semi-Supervised Learning

Zixing Zhang, Jing Han, Jun Deng, Xinzhou Xu, Fabien Ringeval, Bjorn Schuller
2018 IEEE Access  
We further exploit multiple modalities and models in the SSL system, by using collaborative SSL, where all modalities and models are considered simultaneously; samples are selected by means of minimising  ...  One of the major obstacles that has to be faced when applying automatic emotion recognition to realistic humanmachine interaction systems is the scarcity of labelled data for training a robust model.  ...  In this case, the parameter P in Algorithm 2 equals to two, and both audio and video feature vectors can serve as different 'views', i. e., x a ∈ X a = X 1 , and x v ∈ X v = X 2 .  ... 
doi:10.1109/access.2018.2821192 fatcat:fie6so25qrgp3goe5nlls64xhq

Video-Based Automatic Baby Motion Analysis for Early Neurological Disorder Diagnosis: State of the Art and Future Directions

Marco Leo, Giuseppe Massimo Bernava, Pierluigi Carcagnì, Cosimo Distante
2022 Sensors  
Besides, it gives a glimpse of the most promising techniques in computer vision, machine learning and pattern recognition which could be profitably exploited for children motion analysis in videos.  ...  Markerless approaches are easier to set up and maintain (without any human intervention) and they work well on non-collaborative users, making them the most suitable technologies for clinical applications  ...  Nevertheless, temporal modelling still remains challenging for action recognition in videos.  ... 
doi:10.3390/s22030866 pmid:35161612 pmcid:PMC8839211 fatcat:6rv7kiyj35hbrdurqmyfcufnwm

Table of Contents

2020 IEEE Signal Processing Letters  
Dytso, and M. Cardone 1909 A Novel Parameter Estimation for Polynomial Phase Signals Using the Spectrum Phase . . . . . . . . . . . . X. Jiang and S.  ...  Zhang 2129 Unsupervised Face Domain Transfer for Low-Resolution Face Recognition . X. Jiang, and C.  ...  Kim, and S.-J. Ko 530 Custom Domain Adaptation: A New Method for Cross-Subject, EEG-Based Cognitive Load Recognition .  ... 
doi:10.1109/lsp.2020.3040844 fatcat:xpovskhrvfgctk3hhufuvpyyne

Table of Contents

2020 IEEE Signal Processing Letters  
Custom Domain Adaptation: A New Method for Cross-Subject, EEG-Based Cognitive Load Recognition .  ...  Wang, and A. H. Sayed 730 On the Zeros of Ramanujan Filters. D. Hong, Y. Li, and P. Jing 740 Temporal Localization of Non-Static Digital Videos Using the Electrical Network Frequency .  ...  Vorobyov, and X. Yang 1495 Deformable 3D Convolution for Video Super-Resolution . . . . . . X. Ying, L. Wang, Y. Wang, W. Sheng, W. An, and Y.  ... 
doi:10.1109/lsp.2020.3040840 fatcat:ezrfzwo6tjbkfhohq2tgec4m6y

Sensing Technology for Human Activity Recognition: a Comprehensive Survey

Biying Fu, Naser Damer, Florian Kirchbuchner, Arjan Kuijper.
2020 IEEE Access  
To the best of our knowledge, there is no thorough sensor-driven survey that considers all sensor categories in the domain of human activity recognition with respect to the sampled physical properties,  ...  Finally, we conclude with general remarks and provide future research directions for human activity recognition within the presented sensor categorization.  ...  The usage of these sensor categories in the domain HAR are three-folds, 1) camera-based action recognition in public areas, 2) depth-based action recognition and tracking on embedded hardware platforms  ... 
doi:10.1109/access.2020.2991891 fatcat:ukkpc2gdkvd2dird52ttebe6sy

Computational Models for Intent Recognition in Robotic Systems

Michele Persiani, Thomas Hellström
2020 Zenodo  
Intent recognition relates to several system requirements, such as the need of an enhanced collaboration mechanism in human-machine interactions, the need for adversarial technology in competitive scenarios  ...  correspond to the action possibilities offered by an environment.  ...  This work has received funding from the European Union's Horizon 2020 research and innovation program under the Marie Sk lodowska-Curie grant agreement No 721619 for the SOCRATES project.  ... 
doi:10.5281/zenodo.4581003 fatcat:c5cxhty4enhedfda45qcfipgdi

2020 Index IEEE Signal Processing Letters Vol. 27

2020 IEEE Signal Processing Letters  
Zhou, L., +, LSP 2020 166-170 Periocular Recognition in the Wild With Generalized Label Smoothing Regularization. Jung, Y.G., +, LSP 2020 1455-1459 Self-Similarity Action Proposal.  ...  ., +, LSP 2020 51-55 Gesture recognition Fast Adaptive Reparametrization (FAR) With Application to Human Action Recognition. Ghorbel, E., +, LSP 2020 580-584 Self-Similarity Action Proposal.  ... 
doi:10.1109/lsp.2021.3055468 fatcat:wfdtkv6fmngihjdqultujzv4by

Learning Neural Textual Representations for Citation Recommendation

Binh Thanh Kieu, Inigo Jauregi Unanue, Son Bao Pham, Hieu Xuan Phan, Massimo Piccardi
2021 2020 25th International Conference on Pattern Recognition (ICPR)  
Stopping the Text Recognition in a Video DAY 3 -Jan 14, 2021 Gu, Chengyu; Wang, Shilin; Zhu, Yiwei; Huang, Zheng; Chen, Kai 391 Weakly Supervised Attention Rectification for Scene Text Recognition  ...  DAY 1 -Jan 12, 2021 Alibayev, Maxat; Paulius, David Andres; Sun, Yu 2072 Developing Motion Code Embedding for Action Recognition in Videos DAY 1 -Jan 12, 2021 Hu, Shengnan; Zhang, Yang; Laha  ... 
doi:10.1109/icpr48806.2021.9412725 fatcat:3vge2tpd2zf7jcv5btcixnaikm

AIBench Training: Balanced Industry-Standard AI Training Benchmarking [article]

Fei Tang, Wanling Gao, Jianfeng Zhan, Chuanxin Lan, Xu Wen, Lei Wang, Chunjie Luo, Jiahui Dai, Zheng Cao, Xingwang Xiong, Zihan Jiang, Tianshu Hao (+21 others)
2021 arXiv   pre-print
After performing an exhaustive survey on Internet service AI domains, we identify and implement nineteen representative AI tasks with state-of-the-art models.  ...  For repeatable performance ranking (RPR subset) and workload characterization (WC subset), we keep two subsets to a minimum for affordability.  ...  The target quality is 63.5% HR@10. Video Prediction Video prediction is to predict how its actions affect objects in its environment, which is a representative vido processing task.  ... 
arXiv:2004.14690v4 fatcat:34dn54tmjbhuhfcefsttf62ceu
« Previous Showing results 1 — 15 out of 2,236 results