Human Face Expressions from Images - 2D Face Geometry and 3D Face Local Motion versus Deep Neural Features [article]

Rafal Pilarczyk and Xin Chang and Wladyslaw Skarbek
2019 arXiv   pre-print
We conclude that, contrary to CNN-based emotion classifiers, SVM-based emotion classifiers generalize poorly with respect to human head pose.  ...  For F-score, a large advantage of raw/CNN over geometric/CNN and geometric/SVM is observed as well.  ...  Core 3D points for global motion are selected: $J$. 2. Points for global estimation and individualization are selected from core 3D points: $J_g \in J$. 3.  ... 
arXiv:1901.11179v1 fatcat:un2qks6pp5cnplyluzxytcnyei

Analysis of Facial Information for Healthcare Applications: A Survey on Computer Vision-Based Approaches

Marco Leo, Pierluigi Carcagnì, Pier Luigi Mazzeo, Paolo Spagnolo, Dario Cazzato, Cosimo Distante
2020 Information  
For each facial feature, the computer vision-based tasks aiming at analyzing it and the related healthcare goals that could be pursued are detailed.  ...  The document is not limited to global face analysis but it also concentrates on methods related to local cues (e.g., the eyes).  ...  The possibility of tracking human gaze in an unconstrained environment for assistive applications has been proposed in [48]: authors employed an RGB-D device and a head pose estimation algorithm, proposing  ... 
doi:10.3390/info11030128 fatcat:yx7izg2jlvhsjpppf6ektkmlye

Deep Learning for Face Anti-Spoofing: A Survey [article]

Zitong Yu, Yunxiao Qin, Xiaobai Li, Chenxu Zhao, Zhen Lei, Guoying Zhao
2022 arXiv   pre-print
RGB camera, we summarize the deep learning applications under multi-modal (e.g., depth and infrared) or specialized (e.g., light field and flash) sensors.  ...  With the emergence of large-scale academic datasets in the recent decade, deep learning based FAS achieves remarkable performance and dominates this area.  ...  (No. 2020YFC2003901), and the National Natural Science Foundation of China (No. 61876178, 61872367, and 61806196).  ... 
arXiv:2106.14948v2 fatcat:wsheo7hbwvewhjoe6ykwjuqfii

2021 Index IEEE Transactions on Image Processing Vol. 30

2021 IEEE Transactions on Image Processing  
The primary entry includes the coauthors' names, the title of the paper or other item, and its location, specified by the publication abbreviation, year, month, and inclusive pagination.  ...  The Subject Index contains entries describing the item under all appropriate subject headings, plus the first author's name, the publication abbreviation, month, and year, and inclusive pages.  ...  ., +, TIP 2021 249-263 Learning Deep Global Multi-Scale and Local Attention Features for Facial Expression Recognition in the Wild.  ... 
doi:10.1109/tip.2022.3142569 fatcat:z26yhwuecbgrnb2czhwjlf73qu

Towards a complete 3D morphable model of the human head [article]

Stylianos Ploumpis, Evangelos Ververas, Eimear O' Sullivan, Stylianos Moschoglou, Haoyang Wang, Nick Pears, William A. P. Smith, Baris Gecer, Stefanos Zafeiriou
2020 arXiv   pre-print
Thus we build a new combined face-and-head shape model that blends the variability and facial detail of an existing face model (the LSFM) with the full head modelling capability of an existing head model  ...  We use our model to reconstruct full head representations from single, unconstrained images allowing us to parameterize craniofacial shape and texture, along with the ear shape, eye gaze and eye color.  ...  Then, keeping the head camera fixed, we optimize our statistical eye models based on two landmarks losses and a rendering loss.  ... 
arXiv:1911.08008v2 fatcat:ua6w5qivbvevbcvmvdzqvvi2ta

Temporal Head Pose Estimation From Point Cloud in Naturalistic Driving Conditions

Tiancheng Hu, Sumit Jha, Carlos Busso
2021 IEEE transactions on intelligent transportation systems (Print)  
Head pose estimation is an important problem as it facilitates tasks such as gaze estimation and attention modeling.  ...  While computer vision algorithms using RGB cameras are reliable in controlled environments, head pose estimation is a challenging problem in the car due to sudden illumination changes, occlusions and large  ...  For head pose estimation, it first utilizes convolutional experts constrained local model (CE-CLM) [56] to detect and track facial landmarks.  ... 
doi:10.1109/tits.2021.3075350 fatcat:57pcpuwbqjdobkpt3osx7dsr3a

An Intelligent and Low-cost Eye-tracking System for Motorized Wheelchair Control [article]

Mahmoud Dahmani, Muhammad E. H. Chowdhury, Amith Khandakar, Tawsifur Rahman, Khaled Al-Jayyousi, Abdalla Hefny, Serkan Kiranyaz
2020 arXiv   pre-print
CNN exhibited the best performance (i.e., 99.3% classification accuracy), and thus it was the model of choice for the gaze estimator, which commands the wheelchair motion.  ...  The system input was images of the user's eye that were processed to estimate the gaze direction and the wheelchair was moved accordingly.  ...  Funding: The publication of this article was funded by the Qatar National Library and Qatar National Research Foundation (QNRF), grant numbers NPRP12S-0227-190164 and UREP22-043-2-015.  ... 
arXiv:2005.02118v1 fatcat:pn74qg55mfgyjop35w3eaoncdy

CycleMorph: Cycle Consistent Unsupervised Deformable Image Registration [article]

Boah Kim, Dong Hwan Kim, Seong Ho Park, Jieun Kim, June-Goo Lee, Jong Chul Ye
2020 arXiv   pre-print
The proposed method is flexible enough to be applied to both 2D and 3D registration problems for various applications, and can be easily extended to multi-scale implementation to deal with the memory  ...  Recently, deep learning based image registration methods have been extensively investigated due to their excellent performance despite the ultra-fast computational time.  ...  The moving source images are in the first column, deformed images from the proposed CycleMorph (CM) are in the second column (global) and third column (multiscale), and the fixed target images are in the fourth column  ... 
arXiv:2008.05772v1 fatcat:xt67mgay3rebzc3d4p7bmorl5u

HUMBI: A Large Multiview Dataset of Human Body Expressions and Benchmark Challenge [article]

Jae Shin Yoon, Zhixuan Yu, Jaesik Park, Hyun Soo Park
2021 arXiv   pre-print
such as MPII-Gaze, Multi-PIE, Human3.6M, and Panoptic Studio datasets.  ...  The goal of HUMBI is to facilitate modeling view-specific appearance and geometry of five primary body signals including gaze, face, hand, body, and garment from assorted people. 107 synchronized HD cameras  ...  for the development and evaluation of gaze estimation algorithms from rgb and rgb-d cameras.  ...  Depth-based hand pose estimation: data, methods, and chal-  ... 
arXiv:2110.00119v2 fatcat:bakqd343fzfonl3sv2f4zw4rra

HeadFusion: 360° Head Pose tracking combining 3D Morphable Model and 3D Reconstruction

Yu Yu, Kenneth Funes Mora, Jean-Marc Odobez
2018 IEEE Transactions on Pattern Analysis and Machine Intelligence  
Head pose estimation is a fundamental task for face and social related research.  ...  Although 3D morphable model (3DMM) based methods relying on depth information usually achieve accurate results, they usually require frontal or mid-profile poses which preclude a large set of applications  ...  The development of consumer 3D RGB-D sensors offers an alternative solution.  ... 
doi:10.1109/tpami.2018.2841403 pmid:29993569 fatcat:hlungkksb5ff5n5omikvl3ef6q

Recognizing American Sign Language Nonmanual Signal Grammar Errors in Continuous Videos [article]

Elahe Vahdani, Longlong Jing, Yingli Tian, Matt Huenerfauth
2020 arXiv   pre-print
Our system is able to recognize grammatical elements on ASL-HW-RGBD from manual gestures, facial expressions, and head movements and successfully detect 8 ASL grammatical mistakes.  ...  We first recognize the ASL grammatical elements including both manual gestures and nonmanual signals independently from multiple modalities (i.e. hand gestures, facial expressions, and head movements)  ...  ACKNOWLEDGMENT This material is based upon work supported by the National Science Foundation under award numbers 1400802, 1400810, and 1462280.  ... 
arXiv:2005.00253v1 fatcat:6khl2yltfjfxnowqvqq56ul23m

ReenactGAN: Learning to Reenact Faces via Boundary Transfer [article]

Wayne Wu, Yunxuan Zhang, Cheng Li, Chen Qian, Chen Change Loy
2018 arXiv   pre-print
Thanks to the effective and reliable boundary-based transfer, our method can perform photo-realistic face reenactment.  ...  The proposed method, known as ReenactGAN, is capable of transferring facial movements and expressions from monocular video input of an arbitrary person to a target person.  ...  We would like to thank Kwan-Yee Lin for insightful discussion, and Tong Li, Yue He and Lichen Zhou for their exceptional support. This work is supported by SenseTime Research.  ... 
arXiv:1807.11079v1 fatcat:swc4ntb3obdixg2riykmlcymrm

Alzheimer's Disease Diagnosis Based on Cognitive Methods in Virtual Environments and Emotions Analysis [article]

Juan Manuel Fernández Montenegro
2018 arXiv   pre-print
based on origami crease pattern algorithm are proposed to enhance facial micro-expressions.  ...  EEG features are based on quaternions in order to keep the correlation information between sensors, whereas, for facial expression recognition, a preprocessing method for motion magnification and descriptors  ...  The different modalities from left to right in each case are EEG, gaze tracked heat map, RGB, facial landmarks, depth and IR.  ... 
arXiv:1810.10941v1 fatcat:lrqvy6gqkvhszkxffxbq5iyut4

An Intelligent and Low-Cost Eye-Tracking System for Motorized Wheelchair Control

Mahmoud Dahmani, Muhammad E. H. Chowdhury, Amith Khandakar, Tawsifur Rahman, Khaled Al-Jayyousi, Abdalla Hefny, Serkan Kiranyaz
2020 Sensors  
CNN exhibited the best performance (i.e., 99.3% classification accuracy), and thus it was the model of choice for the gaze estimator, which commands the wheelchair motion.  ...  The system input is images of the user's eye that are processed to estimate the gaze direction and the wheelchair was moved accordingly.  ...  through a facial landmark detector to find the eye corners and other fiducial points.  ... 
doi:10.3390/s20143936 pmid:32679779 fatcat:34uvg3jlgvd5dgihfsvprj55um

Symbolic Tensor Neural Networks for Digital Media - from Tensor Processing via BNF Graph Rules to CREAMS Applications [article]

Wladyslaw Skarbek
2018 arXiv   pre-print
, and data Security based on digital media objects.  ...  This tutorial material on Convolutional Neural Networks (CNN) and its applications in digital media research is based on the concept of Symbolic Tensor Neural Networks.  ...  denoted as $X_{\mathrm{land}} \in \mathbb{R}^{2 \times 42}$: 2D ground truth normalized facial landmarks which are compared with the above pixels. The still landmark is the neutral point with respect to possible facial expressions - its  ... 
arXiv:1809.06582v2 fatcat:v75cmkkwvfci5hq3g4awwko26u