Computer Science > Computer Vision and Pattern Recognition

arXiv:2206.06420 (cs)

[Submitted on 13 Jun 2022 (v1), last revised 21 Apr 2023 (this version, v3)]

Title:GraphMLP: A Graph MLP-Like Architecture for 3D Human Pose Estimation

Authors:Wenhao Li, Hong Liu, Tianyu Guo, Runwei Ding, Hao Tang

View PDF

Abstract:Modern multi-layer perceptron (MLP) models have shown competitive results in learning visual representations without self-attention. However, existing MLP models are not good at capturing local details and lack prior knowledge of human body configurations, which limits their modeling power for skeletal representation learning. To address these issues, we propose a simple yet effective graph-reinforced MLP-Like architecture, named GraphMLP, that combines MLPs and graph convolutional networks (GCNs) in a global-local-graphical unified architecture for 3D human pose estimation. GraphMLP incorporates the graph structure of human bodies into an MLP model to meet the domain-specific demand of the 3D human pose, while allowing for both local and global spatial interactions. Furthermore, we propose to flexibly and efficiently extend the GraphMLP to the video domain and show that complex temporal dynamics can be effectively modeled in a simple way with negligible computational cost gains in the sequence length. To the best of our knowledge, this is the first MLP-Like architecture for 3D human pose estimation in a single frame and a video sequence. Extensive experiments show that the proposed GraphMLP achieves state-of-the-art performance on two datasets, i.e., Human3.6M and MPI-INF-3DHP. Code and models are available at this https URL.

Comments:	Open Sourced
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2206.06420 [cs.CV]
	(or arXiv:2206.06420v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2206.06420

Submission history

From: Wenhao Li [view email]
[v1] Mon, 13 Jun 2022 18:59:31 UTC (10,392 KB)
[v2] Thu, 1 Sep 2022 07:22:39 UTC (10,027 KB)
[v3] Fri, 21 Apr 2023 13:45:17 UTC (10,066 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:GraphMLP: A Graph MLP-Like Architecture for 3D Human Pose Estimation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:GraphMLP: A Graph MLP-Like Architecture for 3D Human Pose Estimation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators