35,395 Hits in 5.7 sec

Iterative Text-based Editing of Talking-heads Using Neural Retargeting [article]

Xinwei Yao, Ohad Fried, Kayvon Fatahalian, Maneesh Agrawala
2020 arXiv   pre-print
We present a text-based tool for editing talking-head video that enables an iterative editing workflow.  ...  Our approach is based on two key ideas. (1) We develop a fast phoneme search algorithm that can quickly identify phoneme-level subsequences of the source repository video that best match a desired edit  ... 
arXiv:2011.10688v1 fatcat:odu63nsc5bdyrenfp3hbrvin3u

Text2Video: Text-driven Talking-head Video Synthesis with Personalized Phoneme-Pose Dictionary [article]

Sibo Zhang, Jiahong Yuan, Miao Liao, Liangjun Zhang
2022 arXiv   pre-print
With the advance of deep learning technology, automatic video generation from audio or text has become an emerging and promising research topic.  ...  In this paper, we present a novel approach to synthesize video from the text.  ...  Compared to audio-based methods, text-based methods have advantages. We here define Text2Video as a task of synthesizing talking-head video from any text input.  ... 
arXiv:2104.14631v3 fatcat:urnkghg33bab5biwctbhe476aa

A genre shift in disseminating knowledge: Student teachers' experiences of communicating their master's theses as popular science

John Henriksson, Gunilla Eklund, Jessica Aspfors
2021 Nordisk tidsskrift for utdanning og praksis  
Data (logbooks, videos and text submissions) were collected from Finnish student teachers (n = 38) during a campus-based course from 2019 to 2020.  ...  This study aims to investigate student teachers' experiences of communicating their master's theses as popular science to schools and school communities.  ...  In contrast, those students making talking head videos had gone through a learning process regarding new tools for video recording and editing.  ... 
doi:10.23865/up.v15.3227 fatcat:5zomtn34iragdf4omqbsrmhjve

A deep bidirectional LSTM approach for video-realistic talking head

Bo Fan, Lei Xie, Shan Yang, Lijuan Wang, Frank K. Soong
2015 Multimedia tools and applications  
We then stitch the selected lower face image sequence back to a background face video of the same subject, resulting in a video-realistic talking head.  ...  This paper proposes a deep bidirectional long short-term memory approach in modeling the long contextual, nonlinear mapping between audio and visual streams for video-realistic talking head.  ...  [Figure and table captions: preference of the deep BLSTM-RNNS-based and HMM-based video-realistic talking heads; the percentage preference of the deep BLSTM-RNNS-based and original talking heads; Table 1: Network topologies tested]  ... 
doi:10.1007/s11042-015-2944-3 fatcat:umzguqikxvajlgzb7vgpnl246i

AnyoneNet: Synchronized Speech and Talking Head Generation for Arbitrary Person [article]

Xinsheng Wang, Qicong Xie, Jihua Zhu, Lei Xie, Odette Scharenborg
2021 arXiv   pre-print
state-of-the-art landmark-based method on generating natural talking head videos.  ...  In this paper, we present an automatic method to generate synchronized speech and talking-head videos on the basis of text and a single face image of an arbitrary person as input.  ...  Evaluation The goal of our task is to generate voiced talking head video with text and the face image as input.  ... 
arXiv:2108.04325v2 fatcat:64vpm5cz7za27ov6ltjl6dutui

Supplementary Evidence: Towards Higher Levels of Assurance in Remote Identity Proofing [article]

Jongkil Jeong, Syed Wajid Ali Shah, Ashish Nanda, Robin Doss
2022 figshare.com  
Supplementary evidence on the following topics: Quality Requirements for Identity Evidence; Strength of Methods Employed for Evidence Validation; Popular approaches for generating Replacement Deepfakes; Popular  ...  Reenactment (Mouth) Text Editing Talking head [11] Leverages neural face rendering to synthesize a realistic output video with dialogues changed as per the edits in the corresponding transcript.  ...  and head pose driven by source video.  ... 
doi:10.6084/m9.figshare.19119680.v2 fatcat:ijki7jkshzbrfhk7ufsfuh2ri4

Rendering a personalized photo-real talking head from short video footage

Lijuan Wang, Wei Han, Xiaojun Qian, Frank K. Soong
2010 2010 7th International Symposium on Chinese Spoken Language Processing  
For as short as 20 minutes recording of audio/video footage, the proposed system can synthesize a highly photo-real talking head in sync with the given speech signals (natural or TTS synthesized).  ...  The generated trajectory is then used as a guide to select, from the original training database, an optimal sequence of lips images which are then stitched back to a background head video.  ...  Lin Liang in Microsoft Research Asia, for their expertise on head pose tracking and normalization.  ... 
doi:10.1109/iscslp.2010.5684834 dblp:conf/iscslp/WangHQS10 fatcat:blnbbpgy4nakjmot2ypo76lwkm

Neural Voice Puppetry: Audio-driven Facial Reenactment [article]

Justus Thies, Mohamed Elgharib, Ayush Tewari, Christian Theobalt, Matthias Nießner
2020 arXiv   pre-print
Neural Voice Puppetry has a variety of use-cases, including audio-driven video avatars, video dubbing, and text-driven video synthesis of a talking head.  ...  We demonstrate the capabilities of our method in a series of audio- and text-based puppetry examples, including comparisons to state-of-the-art techniques and a user study.  ...  Text-driven Video Synthesis: Fried et al. presented 'Text-based Editing of Talking-head Video' [11] which provides a video editing tool that is based on the transcript of the video.  ... 
arXiv:1912.05566v2 fatcat:sazmvejurrbu7kadsdjgejc2am

Write-a-speaker: Text-based Emotional and Rhythmic Talking-head Generation [article]

Lincheng Li, Suzhen Wang, Zhimeng Zhang, Yu Ding, Yixing Zheng, Xin Yu, Changjie Fan
2021 arXiv   pre-print
In this paper, we propose a novel text-based talking-head video generation framework that synthesizes high-fidelity facial expressions and head motions in accordance with contextual sentiments as well  ...  of relying on long videos of specific individuals.  ...  Text-based Talking-head Generation Our framework takes the time-aligned text as input and outputs the photo-realistic talking-head video.  ... 
arXiv:2104.07995v2 fatcat:foltetmbjzhsdk2pgfs4x7j5ky

Talking Faces: Audio-to-Video Face Generation [chapter]

Yuxin Wang, Linsen Song, Wayne Wu, Chen Qian, Ran He, Chen Change Loy
2022 Advances in Computer Vision and Pattern Recognition  
In this chapter, we first discuss the definition and underlying challenges of the problem. Then, we present an overview of recent progress in talking face generation.  ...  The emergence of deep learning and cross-modality research has led to many interesting works that address talking face generation.  ...  The development of GAN-based human face generation and editing methods on head poses [83] and facial emotions [84] influences the research in talking face generation. For instance, Zhu et al.  ... 
doi:10.1007/978-3-030-87664-7_8 fatcat:5qh2bxrthrbthgjwjzlmm3je4i

B-Script

Bernd Huber, Hijung Valentina Shin, Bryan Russell, Oliver Wang, Gautham J. Mysore
2019 Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems - CHI '19  
We present B-Script, a system that supports B-roll video editing via interactive transcripts.  ...  Users found it easier and were faster to insert B-roll using the transcript-based interface, and they created more engaging videos when recommendations were provided.  ...  Experts had up to 30 minutes to edit one 3-minute talking-head video.  ... 
doi:10.1145/3290605.3300311 dblp:conf/chi/HuberSRWM19 fatcat:mewtpso5gbdofonbu3ee2owixy

Framework For Lithuanian Speech Animation

Romualdas Bausys, Ingrida Mazonaviciute
2010 Zenodo  
Publication in the conference proceedings of EUSIPCO, Aalborg, Denmark, 2010  ...  While text-driven talking heads employ both synthesized voices and head models, constituting text-to-audiovisual speech; speech-driven talking heads involve synthesizing visual speech information from  ...  Finally, Lithuanian speech audio file (.wav), 3D geometry head file (.msh) (editable in geometry and texture) and the animation script (.fml) are compound into iFACE to get video of Lithuanian talking  ... 
doi:10.5281/zenodo.42014 fatcat:avpkit4zlfc6zkoyifp2bznwue

Teaching psychology to student nurses: the use of 'Talking Head' videos

Sherrill Snelgrove, Desiree J. R. Tait, Michael Tait
2016 Research in Learning Technology  
We sought to strengthen first-year student nurses' application of psychology by developing a set of digital stories based around 'Talking Head' video clips where authentic patients relate their experiences  ...  It chronicles the development and evaluation of a Talking Head in a specific context but which may be useful across disciplines.  ...  to the Talking Head.  ... 
doi:10.3402/rlt.v24.30891 fatcat:5laxmkp5djh2posbxrp3gvtz7u

Translingual Visemes Mapping for Lithuanian Speech Animation

I. Mazonaviciute, R. Bausys
2011 Elektronika ir Elektrotechnika  
Text-driven talking heads employ synthesized voices and synthesized head models to represent text-to-audiovisual speech, while speech-driven models utilize acoustics (and phonetic alignment) of natural  ...  Framework for Lithuanian speech animation Talking heads can be driven by input text or input speech.  ... 
doi:10.5755/j01.eee.111.5.365 fatcat:ksqnt3sugfbpdipszfsiadluoi

Vlogcast yourself

Joan-Isaac Biel, Daniel Gatica-Perez
2010 International Conference on Multimodal Interfaces and the Workshop on Machine Learning for Multimodal Interaction on - ICMI-MLMI '10  
Based on works from social psychology and computing, we first propose robust audio and visual cues to measure the nonverbal behavior of vloggers in their videos, and we then study the relation between  ...  Our study shows significant correlations between some nonverbal behavioral cues and the average number of views per video.  ...  Acknowledgments: We thank the support of the Swiss National Center of Competence (NCCR) on Interactive Multimodal Information Management (IM)2 and the voluntary annotators.  ... 
doi:10.1145/1891903.1891964 dblp:conf/icmi/BielG10 fatcat:ibqghds7xjg2biz7qzt5ofydt4