Shape My Moves: Text-Driven Shape-Aware Synthesis of Human Motions

Liao, Ting-Hsuan; Zhou, Yi; Shen, Yu; Huang, Chun-Hao Paul; Mitra, Saayan; Huang, Jia-Bin; Bhattacharya, Uttaran

Computer Science > Computer Vision and Pattern Recognition

arXiv:2504.03639 (cs)

[Submitted on 4 Apr 2025]

Title:Shape My Moves: Text-Driven Shape-Aware Synthesis of Human Motions

Authors:Ting-Hsuan Liao, Yi Zhou, Yu Shen, Chun-Hao Paul Huang, Saayan Mitra, Jia-Bin Huang, Uttaran Bhattacharya

View PDF HTML (experimental)

Abstract:We explore how body shapes influence human motion synthesis, an aspect often overlooked in existing text-to-motion generation methods due to the ease of learning a homogenized, canonical body shape. However, this homogenization can distort the natural correlations between different body shapes and their motion dynamics. Our method addresses this gap by generating body-shape-aware human motions from natural language prompts. We utilize a finite scalar quantization-based variational autoencoder (FSQ-VAE) to quantize motion into discrete tokens and then leverage continuous body shape information to de-quantize these tokens back into continuous, detailed motion. Additionally, we harness the capabilities of a pretrained language model to predict both continuous shape parameters and motion tokens, facilitating the synthesis of text-aligned motions and decoding them into shape-aware motions. We evaluate our method quantitatively and qualitatively, and also conduct a comprehensive perceptual study to demonstrate its efficacy in generating shape-aware motions.

Comments:	CVPR 2025. Project page: this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2504.03639 [cs.CV]
	(or arXiv:2504.03639v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2504.03639

Submission history

From: Ting-Hsuan Liao [view email]
[v1] Fri, 4 Apr 2025 17:59:10 UTC (44,977 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Shape My Moves: Text-Driven Shape-Aware Synthesis of Human Motions

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Shape My Moves: Text-Driven Shape-Aware Synthesis of Human Motions

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators