Human pose completion in partial body camera shots

Ruben Tous; Jordi Nin; Laura Igual

doi:10.1080/0952813X.2023.2241575

Human pose completion in partial body camera shots

Ruben Tous, Jordi Nin, Laura Igual

Departament d'Operacions, Innovació i Data Sciences

Producció científica: Article en revista indexada › Article › Avaluat per experts

Resum

Many actual images contain partial body camera shots, in which a significant part of the body is not visible. This issue is especially prevalent in film images, where less than 10% are full-body shots. Most 2D human pose estimation methods return incomplete poses when applied to partial body images. This lack of completeness becomes a problem in some situations, for example, when the 2D pose is converted to a 3D pose by a two-stage 3D human pose estimation method since most of these methods require a complete pose to work. This article proposes a new technique, called CompletePose, consisting of completing the missing keypoints when 2D human pose estimation methods are applied to images with partial body camera shots. A Conditional Generative Adversarial Network is used to obtain a complete and plausible pose, realistic enough to predict a 3D pose with a two-stage 3D human pose estimation method. A complete empirical validation has been carried out with the Human3.6 M dataset and a new dataset, called CHARADE, specially built and made public for reproducibility and benchmarking for this research. Quantitative evaluation employing the Fréchet distance shows that the approach manages to approximate the actual data distribution. The qualitative evaluation shows that the completed poses enable obtaining plausible 3D poses from images previously intractable.

Idioma original	Anglès
Nombre de pàgines	11
Revista	Journal of Experimental and Theoretical Artificial Intelligence
Data online anticipada	de jul. 2023
DOIs	https://doi.org/10.1080/0952813X.2023.2241575
Estat de la publicació	Publicació electrònica prèvia a la impressió - de jul. 2023

Accés al document

10.1080/0952813X.2023.2241575

Altres arxius i enllaços

Enllaç a la publicació de Scopus

Com citar-ho

@article{5a800571871c474ab66705b53913d8d6,

title = "Human pose completion in partial body camera shots",

abstract = "Many actual images contain partial body camera shots, in which a significant part of the body is not visible. This issue is especially prevalent in film images, where less than 10% are full-body shots. Most 2D human pose estimation methods return incomplete poses when applied to partial body images. This lack of completeness becomes a problem in some situations, for example, when the 2D pose is converted to a 3D pose by a two-stage 3D human pose estimation method since most of these methods require a complete pose to work. This article proposes a new technique, called CompletePose, consisting of completing the missing keypoints when 2D human pose estimation methods are applied to images with partial body camera shots. A Conditional Generative Adversarial Network is used to obtain a complete and plausible pose, realistic enough to predict a 3D pose with a two-stage 3D human pose estimation method. A complete empirical validation has been carried out with the Human3.6 M dataset and a new dataset, called CHARADE, specially built and made public for reproducibility and benchmarking for this research. Quantitative evaluation employing the Fr{\'e}chet distance shows that the approach manages to approximate the actual data distribution. The qualitative evaluation shows that the completed poses enable obtaining plausible 3D poses from images previously intractable.",

keywords = "deep learning, generative adversarial networks, human pose completion, Human pose estimation, movies",

author = "Ruben Tous and Jordi Nin and Laura Igual",

note = "Publisher Copyright: {\textcopyright} 2023 Informa UK Limited, trading as Taylor & Francis Group.",

year = "2023",

month = jul,

doi = "10.1080/0952813X.2023.2241575",

language = "English",

journal = "Journal of Experimental and Theoretical Artificial Intelligence",

issn = "0952-813X",

publisher = "Taylor and Francis Ltd.",

}

TY - JOUR

T1 - Human pose completion in partial body camera shots

AU - Tous, Ruben

AU - Nin, Jordi

AU - Igual, Laura

PY - 2023/7

Y1 - 2023/7

N2 - Many actual images contain partial body camera shots, in which a significant part of the body is not visible. This issue is especially prevalent in film images, where less than 10% are full-body shots. Most 2D human pose estimation methods return incomplete poses when applied to partial body images. This lack of completeness becomes a problem in some situations, for example, when the 2D pose is converted to a 3D pose by a two-stage 3D human pose estimation method since most of these methods require a complete pose to work. This article proposes a new technique, called CompletePose, consisting of completing the missing keypoints when 2D human pose estimation methods are applied to images with partial body camera shots. A Conditional Generative Adversarial Network is used to obtain a complete and plausible pose, realistic enough to predict a 3D pose with a two-stage 3D human pose estimation method. A complete empirical validation has been carried out with the Human3.6 M dataset and a new dataset, called CHARADE, specially built and made public for reproducibility and benchmarking for this research. Quantitative evaluation employing the Fréchet distance shows that the approach manages to approximate the actual data distribution. The qualitative evaluation shows that the completed poses enable obtaining plausible 3D poses from images previously intractable.

AB - Many actual images contain partial body camera shots, in which a significant part of the body is not visible. This issue is especially prevalent in film images, where less than 10% are full-body shots. Most 2D human pose estimation methods return incomplete poses when applied to partial body images. This lack of completeness becomes a problem in some situations, for example, when the 2D pose is converted to a 3D pose by a two-stage 3D human pose estimation method since most of these methods require a complete pose to work. This article proposes a new technique, called CompletePose, consisting of completing the missing keypoints when 2D human pose estimation methods are applied to images with partial body camera shots. A Conditional Generative Adversarial Network is used to obtain a complete and plausible pose, realistic enough to predict a 3D pose with a two-stage 3D human pose estimation method. A complete empirical validation has been carried out with the Human3.6 M dataset and a new dataset, called CHARADE, specially built and made public for reproducibility and benchmarking for this research. Quantitative evaluation employing the Fréchet distance shows that the approach manages to approximate the actual data distribution. The qualitative evaluation shows that the completed poses enable obtaining plausible 3D poses from images previously intractable.

KW - deep learning

KW - generative adversarial networks

KW - human pose completion

KW - Human pose estimation

KW - movies

UR - http://www.scopus.com/inward/record.url?scp=85175102867&partnerID=8YFLogxK

U2 - 10.1080/0952813X.2023.2241575

DO - 10.1080/0952813X.2023.2241575

M3 - Article

AN - SCOPUS:85175102867

SN - 0952-813X

JO - Journal of Experimental and Theoretical Artificial Intelligence

JF - Journal of Experimental and Theoretical Artificial Intelligence

ER -

Human pose completion in partial body camera shots

Resum

Accés al document

Altres arxius i enllaços

Fingerprint

Com citar-ho