Semi-Automated Shape Model Fitting for HRTF Simulation using Mesh2HRTF

Quansah, Bodee

Please use this identifier to cite or link to this item: http://hdl.handle.net/11375/30699

Title:	Semi-Automated Shape Model Fitting for HRTF Simulation using Mesh2HRTF
Authors:	Quansah, Bodee
Advisor:	Zheng, Rong
Department:	Biomedical Engineering
Keywords:	HRTF;Acoustics;Graphics;Simulation
Publication Date:	2024
Abstract:	For rendering realistic binaural and spatial audio, it is important to have accurate Head Related Transfer Functions. HRTFs are directional filters that models how sound diffracts and reflects around a subject’s head and ears. Due to their dependence on a subject’s head and ear morphology, HRTFs are unique to the individual and should be measured on a per-subject basis. Simulation is an attractive alternative to measurement, because it does not require special facilities, only a 3D mesh. For simulation to work, the simulator needs a high quality mesh of the subject's head and ears as input, but 3D capture techniques produce meshes that have artifacts. This thesis proposes three semi-automated non-rigid registration pipelines that use both global and part-based approaches to generate meshes that are watertight and manifold and thus suitable for simulation. The pipelines are referred to as follows: the hybrid, global+ear-refine, and model-part pipelines. Each pipeline non-rigidly registers a template to an artifact-laden 3D scan and morphs the template mesh to resemble the 3D scan free of the artifacts that cause simulation to fail. All pipelines were tested on the scans of 15 subjects. The global+ear-refine pipeline was found to produce meshes with the lowest average vertex error. The maximum average vertex error across subjects was 0.8 mm. For the left ear the pipeline produced a maximum average landmark error of 3 mm and 2.5 mm for the right ear. The global+ear-refine pipeline was also found to produce the smoothest meshes in the forehead region with a maximum roughness of 17.28. The morphed template was used as input to the Mesh2HRTF simulator, to generate HRTFs. The simulations were compared to ground truth and were found to be comparable to the ground truth, up to 3 kHz, above which the simulations suffer from large discrepancies.
URI:	http://hdl.handle.net/11375/30699
Appears in Collections:	Open Access Dissertations and Theses

Files in This Item:

File	Description	Size	Format
Quansah_Bodee_A_2024Dec_MASc.pdf Open Access		9.53 MB	Adobe PDF	View/Open

Show full item record