Skip navigation
  • Home
  • Browse
    • Communities
      & Collections
    • Browse Items by:
    • Publication Date
    • Author
    • Title
    • Subject
    • Department
  • Sign on to:
    • My MacSphere
    • Receive email
      updates
    • Edit Profile


McMaster University Home Page
  1. MacSphere
  2. Open Access Dissertations and Theses Community
  3. Open Access Dissertations and Theses
Please use this identifier to cite or link to this item: http://hdl.handle.net/11375/30968
Title: Hierarchical Approaches for Generating Stylistic Human Motions from Audio
Authors: Cheng, Yanbo
Advisor: Wang, Yingying
Department: Computing and Software
Keywords: Character Animation;Motion Synthesis;Deep Learning;Motion Stylization;Multimodal Synchronization
Publication Date: 2025
Abstract: Generating realistic and plausible human motions driven by audio has been explored for decades and has a wide range of applications in real life, such as virtual reality, gaming, films, and human-computer interaction. However, the spatiotemporal complexity of human motion and the requirement for generated movements to align with audio features pose significant challenges to motion synthesis algorithms. In this thesis, we first explore the approaches to human motion learning and synthesis, with a particular focus on data-driven dance and gesture generation. Then we propose a two-level framework to synthesize dance and gesture motions based on the music and speech input respectively, where the high-level motion planner models the overall structure of the motion sequence and the low-level implementer incorporates audio influences to generate detailed movements with nuances. Moreover, we introduce a motion editing module that refines generated dance. Building on this, we propose a method for mapping user inputs to dance style, giving users further control over the generated dance, as well as a data-driven approach for mapping speech audio to gesture style. Extensive experiments conducted on two datasets demonstrate the superior effectiveness and efficiency of our proposed method in generating diverse, high-quality dance and gesture motions that are well-synchronized with audio inputs.
URI: http://hdl.handle.net/11375/30968
Appears in Collections:Open Access Dissertations and Theses

Files in This Item:
File Description SizeFormat 
Cheng_Yanbo_finalsubmission2024December_MSc.pdf
Embargoed until: 2025-12-14
8.35 MBAdobe PDFView/Open
Show full item record Statistics


Items in MacSphere are protected by copyright, with all rights reserved, unless otherwise indicated.

Sherman Centre for Digital Scholarship     McMaster University Libraries
©2022 McMaster University, 1280 Main Street West, Hamilton, Ontario L8S 4L8 | 905-525-9140 | Contact Us | Terms of Use & Privacy Policy | Feedback

Report Accessibility Issue