Computer Science > Computer Vision and Pattern Recognition

arXiv:2311.17135 (cs)

[Submitted on 28 Nov 2023 (v1), last revised 12 Dec 2023 (this version, v3)]

Title: TLControl: Trajectory and Language Control for Human Motion Synthesis

Authors: Weilin Wan, Zhiyang Dou, Taku Komura, Wenping Wang, Dinesh Jayaraman, Lingjie Liu

Abstract: Controllable human motion synthesis is essential for applications in AR/VR, gaming, movies, and embodied AI. Existing methods often focus solely on either language or full trajectory control, lacking precision in synthesizing motions aligned with user-specified trajectories, especially for multi-joint control. To address these issues, we present TLControl, a new method for realistic human motion synthesis, incorporating both low-level trajectory and high-level language semantics controls. Specifically, we first train a VQ-VAE to learn a compact latent motion space organized by body parts. We then propose a Masked Trajectories Transformer to make coarse initial predictions of full trajectories of joints based on the learned latent motion space, with user-specified partial trajectories and text descriptions as conditioning. Finally, we introduce an efficient test-time optimization to refine these coarse predictions for accurate trajectory control. Experiments demonstrate that TLControl outperforms the state-of-the-art in trajectory accuracy and time efficiency, making it practical for interactive and high-quality animation generation.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
Cite as:	arXiv:2311.17135 [cs.CV]
	(or arXiv:2311.17135v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2311.17135

Submission history

From: Weilin Wan
[v1] Tue, 28 Nov 2023 18:54:16 UTC (15,978 KB)
[v2] Thu, 30 Nov 2023 20:36:16 UTC (15,978 KB)
[v3] Tue, 12 Dec 2023 22:18:18 UTC (15,172 KB)
