Please use this identifier to cite or link to this item:
http://hdl.handle.net/11375/30699
Title: | Semi-Automated Shape Model Fitting for HRTF Simulation using Mesh2HRTF |
Authors: | Quansah, Bodee |
Advisor: | Zheng, Rong |
Department: | Biomedical Engineering |
Keywords: | HRTF;Acoustics;Graphics;Simulation |
Publication Date: | 2024 |
Abstract: | For rendering realistic binaural and spatial audio, it is important to have accurate Head Related Transfer Functions. HRTFs are directional filters that models how sound diffracts and reflects around a subject’s head and ears. Due to their dependence on a subject’s head and ear morphology, HRTFs are unique to the individual and should be measured on a per-subject basis. Simulation is an attractive alternative to measurement, because it does not require special facilities, only a 3D mesh. For simulation to work, the simulator needs a high quality mesh of the subject's head and ears as input, but 3D capture techniques produce meshes that have artifacts. This thesis proposes three semi-automated non-rigid registration pipelines that use both global and part-based approaches to generate meshes that are watertight and manifold and thus suitable for simulation. The pipelines are referred to as follows: the hybrid, global+ear-refine, and model-part pipelines. Each pipeline non-rigidly registers a template to an artifact-laden 3D scan and morphs the template mesh to resemble the 3D scan free of the artifacts that cause simulation to fail. All pipelines were tested on the scans of 15 subjects. The global+ear-refine pipeline was found to produce meshes with the lowest average vertex error. The maximum average vertex error across subjects was 0.8 mm. For the left ear the pipeline produced a maximum average landmark error of 3 mm and 2.5 mm for the right ear. The global+ear-refine pipeline was also found to produce the smoothest meshes in the forehead region with a maximum roughness of 17.28. The morphed template was used as input to the Mesh2HRTF simulator, to generate HRTFs. The simulations were compared to ground truth and were found to be comparable to the ground truth, up to 3 kHz, above which the simulations suffer from large discrepancies. |
URI: | http://hdl.handle.net/11375/30699 |
Appears in Collections: | Open Access Dissertations and Theses |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Quansah_Bodee_A_2024Dec_MASc.pdf | 9.53 MB | Adobe PDF | View/Open |
Items in MacSphere are protected by copyright, with all rights reserved, unless otherwise indicated.