Single-Shot Multi-Person 3D Pose Estimation From Monocular RGB [article]

Dushyant Mehta, Oleksandr Sotnychenko, Franziska Mueller, Weipeng Xu, Srinath Sridhar, Gerard Pons-Moll, Christian Theobalt
2018 arXiv   pre-print
We propose a new single-shot method for multi-person 3D pose estimation in general scenes from a monocular RGB camera.  ...  To further stimulate research in multi-person 3D pose estimation, we will make our new datasets, and associated code publicly available for research purposes.  ...  Supplementary Document: Single-Shot Multi-Person 3D Pose Estimation From Monocular RGB 1.  ... 
arXiv:1712.03453v3 fatcat:pnhwtrnqsbhelen47etpk4namm

Deep Learning Methods for 3D Human Pose Estimation under Different Supervision Paradigms: A Survey

Dejun Zhang, Yiqi Wu, Mingyue Guo, Yilin Chen
2021 Electronics  
The learning-based pose estimation is discussed from two categories: single-person and multi-person.  ...  Based on this literature survey, it can be concluded that each branch of 3D human pose estimation starts with fully-supervised methods, and there is still much room for multi-person pose estimation based  ...  Single-Person 3D Pose Estimation from Images Images include monocular images and multi-view images.  ... 
doi:10.3390/electronics10182267 fatcat:ajnizu776ncpto3jvyh3zye2si

HMOR: Hierarchical Multi-Person Ordinal Relations for Monocular Multi-Person 3D Pose Estimation [article]

Jiefeng Li, Can Wang, Wentao Liu, Chen Qian, Cewu Lu
2020 arXiv   pre-print
Remarkable progress has been made in 3D human pose estimation from a monocular RGB camera. However, only a few studies explored 3D multi-person cases.  ...  The proposed method significantly outperforms state-of-the-art methods on publicly available multi-person 3D pose datasets.  ...  from a monocular RGB input.  ... 
arXiv:2008.00206v2 fatcat:bgdxil5v45c5tf6ufuw7f5zl7u

Video Based Reconstruction of 3D People Models

Thiemo Alldieck, Marcus Magnor, Weipeng Xu, Christian Theobalt, Gerard Pons-Moll
2018 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition  
Abstract This paper describes a method to obtain accurate 3D body models and texture of arbitrary people from a single, monocular video in which a person is moving.  ...  Figure 1 : Our technique allows to extract for the first time accurate 3D human body models, including hair and clothing, from a single video sequence of the person moving in front of the camera such that  ...  Acknowledgments The authors gratefully acknowledge funding by the German Science Foundation from project DFG MA2555/12-1.  ... 
doi:10.1109/cvpr.2018.00875 dblp:conf/cvpr/AlldieckMXTP18 fatcat:o4kwdol2ojhunonfgsilkx573e

Real-Time Hybrid Mapping of Populated Indoor Scenes using a Low-Cost Monocular UAV [article]

Stuart Golodetz, Madhu Vankadari, Aluna Everitt, Sangyun Shin, Andrew Markham, Niki Trigoni
2022 arXiv   pre-print
In this paper, we present what is thus, to our knowledge, the first system to perform simultaneous mapping and multi-person 3D human pose estimation from a monocular camera mounted on a single UAV.  ...  However, despite many recent works on both marker-based and markerless multi-UAV single-person motion capture, markerless single-camera multi-person 3D human pose estimation remains a much earlier-stage  ...  Notably, whilst many of these works are extremely impressive, none of them currently attempts to perform both online mapping and markerless multi-person 3D human pose estimation from a single monocular  ... 
arXiv:2203.02453v1 fatcat:hs2ymgqzyfethl6eszh4iemhcm

Video Based Reconstruction of 3D People Models [article]

Thiemo Alldieck, Marcus Magnor, Weipeng Xu, Christian Theobalt, Gerard Pons-Moll
2018 arXiv   pre-print
This paper describes how to obtain accurate 3D body models and texture of arbitrary people from a single, monocular video in which a person is moving.  ...  This enables efficient estimation of a consensus 3D shape, texture and implanted animation skeleton based on a large number of frames.  ...  Acknowledgments The authors gratefully acknowledge funding by the German Science Foundation from project DFG MA2555/12-1.  ... 
arXiv:1803.04758v3 fatcat:6jwhxkpxi5bmtpccgzhgkcdopa

Gravity-Aware Monocular 3D Human-Object Reconstruction [article]

Rishabh Dabral, Soshi Shimada, Arjun Jain, Christian Theobalt, Vladislav Golyanik
2021 arXiv   pre-print
., a new approach for joint markerless 3D human motion capture and object trajectory estimation from monocular RGB videos. We focus on scenes with objects partially observed during a free flight.  ...  The proposed human-object interaction constraints ensure geometric consistency of the 3D reconstructions and improved physical plausibility of human poses compared to the unconstrained case.  ...  Introduction Markerless 3D human motion capture from a single monocular RGB camera has many open challenges.  ... 
arXiv:2108.08844v1 fatcat:ve7om7j4kfcfvp76f2vs2gqjqu

SMAP: Single-Shot Multi-Person Absolute 3D Pose Estimation [article]

Jianan Zhen, Qi Fang, Jiaming Sun, Wentao Liu, Wei Jiang, Hujun Bao, Xiaowei Zhou
2020 arXiv   pre-print
Recovering multi-person 3D poses with absolute scales from a single RGB image is a challenging problem due to the inherent depth and scale ambiguity from a single view.  ...  Such a single-shot bottom-up scheme allows the system to better learn and reason about the inter-person depth relationship, improving both 3D and 2D pose estimation.  ...  Conclusion We proposed a novel single-shot bottom-up framework to estimate absolute multi-person 3D poses from a single RGB image.  ... 
arXiv:2008.11469v1 fatcat:xmfqbsjwtfd7ddy3rdtdu53zwy

Synthetic Humans for Action Recognition from Unseen Viewpoints [article]

Gül Varol, Ivan Laptev, Cordelia Schmid, Andrew Zisserman
2020 arXiv   pre-print
We make use of the recent advances in monocular 3D human body reconstruction from real action sequences to automatically render synthetic training videos for the action labels.  ...  Although synthetic training data has been shown to be beneficial for tasks such as human pose estimation, its use for RGB human action recognition is relatively unexplored.  ...  : Synth (single-person), Test: Real (0°) multi-person categories Fig.  ... 
arXiv:1912.04070v2 fatcat:kgfumn26nvhw5brmqk3otb6nqu

Lightweight Multi-person Total Motion Capture Using Sparse Multi-view Cameras [article]

Yuxiang Zhang, Zhe Li, Liang An, Mengcheng Li, Tao Yu, Yebin Liu
2021 arXiv   pre-print
Multi-person total motion capture is extremely challenging when it comes to handle severe occlusions, different reconstruction granularities from body to face and hands, drastically changing observation  ...  To overcome these challenges above, we contribute a lightweight total motion capture system for multi-person interactive scenarios using only sparse multi-view cameras.  ...  Many works [47, 8, 48, 56, 65, 20, 37] that focused on 3D hand pose estimation from a single RGB image have been proposed.  ... 
arXiv:2108.10378v1 fatcat:3sqrnawevrcwfozjjyuidlz42u

Recent Advances of Monocular 2D and 3D Human Pose Estimation: A Deep Learning Perspective

Wu Liu, Tao Mei
2022 ACM Computing Surveys  
Recently, benefiting from the deep learning technologies, a significant amount of research efforts have advanced the monocular human pose estimation both in 2D and 3D areas.  ...  Estimation of the human pose from a monocular camera has been an emerging research topic in the computer vision community with many applications.  ...  Skeleton-based 3D Pose Estimation Categorized by different inputs, the Skeleton-based 3D pose estimation methods are divided into single-person and multi-person methods.  ... 
doi:10.1145/3524497 fatcat:4pbvntngrnfp7lqhcpjmy7p2fq

Recent Advances in Monocular 2D and 3D Human Pose Estimation: A Deep Learning Perspective [article]

Wu Liu, Qian Bao, Yu Sun, Tao Mei
2021 arXiv   pre-print
Recently, benefited from the deep learning technologies, a significant amount of research efforts have greatly advanced the monocular human pose estimation both in 2D and 3D areas.  ...  , and the complex multi-person scenarios.  ...  Multi-person Multi-stage methods • Scene constraints [187] ; • CRMH [188] . Single-shot methods • CenterHMR [44] . pose from monocular images, and 2) lifting the estimated 2D poses to 3D.  ... 
arXiv:2104.11536v1 fatcat:tdag2jq2vjdrjekwukm5nu7l6a

LyRN (Lyapunov Reaching Network): A Real-Time Closed Loop approach from Monocular Vision [article]

Zheyu Zhuang, Xin Yu, Robert Mahony
2020 arXiv   pre-print
(PBVS) grasping system adapted from a state-of-the-art single shot RGB 6D pose estimation algorithm.  ...  We demonstrate the proposed algorithm grasping mugs (textureless and symmetric objects) on a table-top from an over-the-shoulder monocular RGB camera.  ...  In this paper, we propose a YOLO-like [21] single-shot CNN architecture that takes a monocular RGB image and current manipulator joint angles as input to directly compute the joint velocity input for  ... 
arXiv:2005.12072v2 fatcat:x4fyhmzkmza3bnxdab4ywpnppm

Body Size and Depth Disambiguation in Multi-Person Reconstruction from Single Images [article]

Nicolas Ugrinovic, Adria Ruiz, Antonio Agudo, Alberto Sanfeliu, Francesc Moreno-Noguer
2021 arXiv   pre-print
We address the problem of multi-person 3D body pose and shape estimation from a single image.  ...  A thorough evaluation on MuPoTS-3D and 3DPW datasets demonstrates that our approach is able to robustly estimate the body translation and shape of multiple people while retrieving their spatial arrangement  ...  Single-shot multi-person 3d [47] G. Rogez, P. Weinzaepfel, and C. Schmid. Lcr-net: pose estimation from monocular rgb.  ... 
arXiv:2111.01884v2 fatcat:i35rjfb6g5dq7cxyilanljntq4

Putting People in their Place: Monocular Regression of 3D People in Depth [article]

Yu Sun, Wu Liu, Qian Bao, Yili Fu, Tao Mei, Michael J. Black
2022 arXiv   pre-print
BEV reasons simultaneously about body centers in the image and in depth and, by combining these, estimates 3D body position.  ...  To do so, we exploit a 3D body model space that lets BEV infer shapes from infants to adults. Third, to train BEV, we need a new dataset.  ...  BEV adopts a multi-head architecture. Given a single RGB image as input, BEV outputs 5 maps.  ... 
arXiv:2112.08274v3 fatcat:lqufxqkdazavll7cwxj2gvflbq