
Generating Realistic Training Images Based on Tonality-Alignment Generative Adversarial Networks for Hand Pose Estimation [article]

Liangjian Chen, Shih-Yao Lin, Yusheng Xie, Hui Tang, Yufan Xue, Xiaohui Xie, Yen-Yu Lin, Wei Fan
2020 arXiv   pre-print
To this end, we develop tonality-alignment generative adversarial networks (TAGANs), which align the tonality and color distributions between synthetic hand poses and real backgrounds, and can generate  ...  In this work, we circumvent this problem by proposing an effective method for generating realistic hand poses and show that state-of-the-art algorithms for hand pose estimation can be greatly improved  ...  Thus, we present a GAN-based method, tonality-alignment generative adversarial networks (TAGANs), to generate the realistic training data.  ... 
arXiv:1811.09916v4 fatcat:yip4tjg4uveltldwm7kavulk3a

A Survey on GAN-Based Data Augmentation for Hand Pose Estimation Problem

Farnaz Farahanipad, Mohammad Rezaei, Mohammad Sadegh Nasr, Farhad Kamangar, Vassilis Athitsos
2022 Technologies  
In this study, we present a comprehensive study on effective hand pose estimation approaches, which are comprised of the leveraged generative adversarial network (GAN), providing a comprehensive training  ...  The quantitative and qualitative results indicate that the state-of-the-art hand pose estimators can be greatly improved with the aid of the training data generated by these GAN-based data augmentation  ...  The HPE is responsible for estimating the 3D hand pose based on the input depth map. During the training, these three networks are optimized to reduce the error of HPE.  ... 
doi:10.3390/technologies10020043 fatcat:6ljh7d4ijrfsdgigm5zn4lg54y

DGGAN: Depth-image Guided Generative Adversarial Networks for Disentangling RGB and Depth Images in 3D Hand Pose Estimation [article]

Liangjian Chen, Shih-Yao Lin, Yusheng Xie, Yen-Yu Lin, Wei Fan, Xiaohui Xie
2020 arXiv   pre-print
In this study, we propose a conditional generative adversarial network (GAN) model, called Depth-image Guided GAN (DGGAN), to generate realistic depth maps conditioned on the input RGB image, and use the  ...  , these estimators rely on both RGB images and the paired depth maps during training.  ...  [6] propose the tonality-alignment generative adversarial networks (TAGAN) for producing more realistic images from synthetic images for hand pose estimator training.  ... 
arXiv:2012.03197v1 fatcat:krmpzj3agjdknmy7qipranteli

A Survey on 3D Hand Skeleton and Pose Estimation by Convolutional Neural Network

Van-Hung Le, Hung-Cuong Nguyen
2020 Advances in Science, Technology and Engineering Systems  
The surveyed studies were divided based on the type of input data and publication time.  ...  Lastly, we also analyze some of the challenges of estimating 3D hand pose on the egocentric vision datasets.  ...  The title is "Using the Lie algebra, Lie group to improve the skeleton hand presentation".  ... 
doi:10.25046/aj050418 fatcat:tzpjnmpwtjbh7m6ld3nucyvxia

PIE: Portrait Image Embedding for Semantic Control [article]

Ayush Tewari, Mohamed Elgharib, Mallikarjun B R., Florian Bernard, Hans-Peter Seidel, Patrick Pérez, Michael Zollhöfer, Christian Theobalt
2020 arXiv   pre-print
We present the first approach for embedding real portrait images in the latent space of StyleGAN, which allows for intuitive editing of the head pose, facial expression, and scene illumination in the image  ...  Semantic editing in parameter space is achieved based on StyleRig, a pretrained neural network that maps the control space of a 3D morphable face model to the latent space of the GAN.  ...  We thank Rameen Abdal for kindly providing the Image2StyleGAN code, Jalees Nehvi for helping us with the comparisons, and Gereon Fox for the video voiceover.  ... 
arXiv:2009.09485v1 fatcat:twv7rw3w4baizj3nomqbvxlluq

A Survey of Machine Learning Techniques in Adversarial Image Forensics [article]

Ehsan Nowroozi, Ali Dehghantanha, Reza M. Parizi, Kim-Kwang Raymond Choo
2020 arXiv   pre-print
However, there are also a number of limitations and vulnerabilities associated with machine learning-based approaches, for example how to detect adversarial (image) examples, with real-world consequences  ...  Therefore, with a focus on image forensics, this paper surveys techniques that can be used to enhance the robustness of machine learning-based binary manipulation detectors in various adversarial scenarios  ...  Acknowledgements The first author thanks members of the Visual Information Processing and Protection (VIPP) group at the University of Siena, Italy for their suggestions.  ... 
arXiv:2010.09680v1 fatcat:qzvolq6kvrggfbyg23wrcnykza

Domain Adaptive Adversarial Learning Based on Physics Model Feedback for Underwater Image Enhancement [article]

Yuan Zhou, Kangming Yan
2020 arXiv   pre-print
To address this problem, we propose a new robust adversarial learning framework via physics model based feedback control and domain adaptation mechanism for enhancing underwater images to get realistic  ...  A new method for simulating underwater-like training dataset from RGB-D data by underwater image formation model is proposed.  ...  This physics model based module acts as the feedback controller of GAN based enhancement network, provides explicit constraints for this ill-posed problem, ensures that the estimated results should be  ... 
arXiv:2002.09315v1 fatcat:defskznpbjahlnjsuqi4qhgf3q

A survey of machine learning techniques in adversarial image forensics

Ehsan Nowroozi, Ali Dehghantanha, Reza M. Parizi, Kim-Kwang Raymond Choo
2020 Zenodo  
Therefore, with a focus on image forensics, this paper surveys techniques that can be used to enhance the robustness of machine learning-based binary manipulation detectors in various adversarial scenarios  ...  However, there are also a number of limitations and vulnerabilities associated with machine learning-based approaches (e.g., how to detect adversarial (image) examples), and there are associated real-world  ...  All authors thank the handling editor and the anonymous reviewers for their insightful critiques which help to improve the quality of this paper.  ... 
doi:10.5281/zenodo.4560205 fatcat:zuplnvtwhzhbnajteyunddphkq

Machine Learning Techniques for Image Forensics in Adversarial Setting

Ehsan Nowroozi, Mauro Barni, Benedetta Tondi
2020 Zenodo  
detectors based on machine learning in several adversarial scenarios.  ...  By focusing on Image Forensics and image manipulation detection, in particular, this thesis contributes to the above mission by developing novel techniques for enhancing the security of binary manipulation  ...  Thanks to my father (Esmaeil Nowroozi -he was passed away on 1986) that reminds us love, respect, care, shelter, support, sacrifices, and many more.  ... 
doi:10.5281/zenodo.4559666 fatcat:itun24lqq5blxfrvmg67pa7ddq

Transflower: probabilistic autoregressive dance generation with multimodal attention [article]

Guillermo Valle-Pérez, Gustav Eje Henter, Jonas Beskow, André Holzapfel, Pierre-Yves Oudeyer, Simon Alexanderson
2021 arXiv   pre-print
Formally, generating dance conditioned on a piece of music can be expressed as a problem of modelling a high-dimensional continuous motion signal, conditioned on an audio signal.  ...  First, we present a novel probabilistic autoregressive architecture that models the distribution over future poses with a normalizing flow conditioned on previous poses as well as music context, using  ...  A style-based generator architecture for generative adversarial networks.  ...  Conference (BMVC’18). BMVA Press, Durham, UK, 14 pages.  ... 
arXiv:2106.13871v1 fatcat:ofkengygabcfdlbcwsl6vk4rmq

Can We Read Speech Beyond the Lips? Rethinking RoI Selection for Deep Visual Speech Recognition [article]

Yuanhang Zhang, Shuang Yang, Jingyun Xiao, Shiguang Shan, Xilin Chen
2020 arXiv   pre-print
Furthermore, we introduce a simple yet effective method based on Cutout to learn more discriminative features for face-based VSR, hoping to maximise the utility of information encoded in different facial  ...  Experiments are conducted on both word-level and sentence-level benchmarks with different characteristics.  ...  Beyond VSR, our findings also have clear implications for other speech-related vision tasks, such as realistic talking face generation, face spoofing detection and audiovisual speech enhancement.  ... 
arXiv:2003.03206v2 fatcat:7gmyhyka55dq3gwa6cgaybjs6i

GELP: GAN-Excited Linear Prediction for Speech Synthesis from Mel-Spectrogram

Lauri Juvela, Bajibabu Bollepalli, Junichi Yamagishi, Paavo Alku
2019 Interspeech 2019  
Another family of neural networks utilized in this dissertation are generative adversarial networks (GANs) (Goodfellow et al., 2014), which likewise can provide realistic samples from a learned distribution  ...  This makes it possible to tune the resulting spectral envelope estimates for the application at hand.  ... 
doi:10.21437/interspeech.2019-2008 dblp:conf/interspeech/JuvelaBYA19 fatcat:bd6cc74arvb3joauc3wbeqnkba

Client Adaptation improves Federated Learning with Simulated Non-IID Clients [article]

Laura Rieger, Rasmus M. Th. Høegh, Lars K. Hansen
2020 arXiv   pre-print
audio and image domains.  ...  By simulating heterogeneous clients, we show that adding learned client-specific conditioning improves model performance, and the approach is shown to work on balanced and imbalanced data set from both  ...  et al., 2017) for evaluating the performance of generative adversarial networks.  ... 
arXiv:2007.04806v1 fatcat:dxvlieu2ezgorlcbhb6zgye6bm

Image color correction, enhancement, and editing [article]

Mahmoud Afifi
2021 arXiv   pre-print
In particular, we propose auto image recoloring methods to generate different realistic versions of the same camera-rendered image with new colors.  ...  As white balance (WB) is one of the major procedures applied by the ISP for color correction, this thesis presents two different methods for ISP white balancing.  ...  In this chapter, we propose a generative adversarial network (GAN)-based method for image recoloring.  ... 
arXiv:2107.13117v1 fatcat:uf6qv7iux5hqxmgxxb4olvvtzy

Smart Cameras [article]

David J. Brady, Minghao Hu, Chengyu Wang, Xuefei Yan, Lu Fang, Yiwnheng Zhu, Yang Tan, Ming Cheng, Zhan Ma
2020 arXiv   pre-print
Over the past 5 years, deep learning solutions have become superior to traditional algorithms for each of these functions.  ...  Here we review the state of the art of deep learning in camera operations and consider the impact of AI on the physical design of cameras.  ...  Such systems are an example of "generative networks," which can infill missing image data or even generate photorealistic "fake data."  ... 
arXiv:2002.04705v1 fatcat:277yq2oaujdoxbtqqsq6naodma