Please use this identifier to cite or link to this item:
http://hdl.handle.net/11375/32571
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | Bruce, Ian | - |
dc.contributor.author | O'Shaughnessy, Kyle | - |
dc.date.accessioned | 2025-10-22T19:39:43Z | - |
dc.date.available | 2025-10-22T19:39:43Z | - |
dc.date.issued | 2025 | - |
dc.identifier.uri | http://hdl.handle.net/11375/32571 | - |
dc.description.abstract | In practical acoustic environments, reflections give rise to reverberation which makes speech perception more challenging, especially for individuals with hearing impairment. This creates a need for speech reproduction systems such as hearing aids to include strategies for reducing the perceptual impacts of reverberation (i.e., dereverberation algorithms). In this thesis, an evaluation of one of the most prevalent techniques, namely delay-and-predict dereverberation (Triki and Slock, 2006), is provided. Recent advancements in physiologically motivated predictors of speech intelligibility (SI) are leveraged to explain the complex impacts of reverberation/dereverberation on speech perception. In particular, the neurogram similarity index measure (NSIM) and the spectro-temporal modulation index (STMI) are utilized in addition to the well-known hearing aid speech perception index (HASPI) and short-time objective intelligibility (STOI). The results suggest that delay-and-predict dereverberation is relatively effective at reducing the earlier part of room impulse responses (RIRs), which provides sufficient restoration of temporal fine structure (TFS) and envelope (ENV) acoustic cues to reduce listening effort (LE) and compensate deficits in SI for normal-hearing and hearing-impaired listeners. The algorithm is incapable of cancelling the later part of RIRs, but by introducing a small amount of autocorrelation regularization to the algorithm, its impact on this late reverberation is shown to greatly improve. In practice however, delay-and-predict performance is shown to be limited by the number of microphones available, the need for large amounts of signal data, the presence of interfering acoustic signals, and potentially by time-varying acoustics. The evaluation also demonstrates that the NSIM and STMI provide a more complete picture of the perceptual impacts of reverberation than HASPI or STOI. However, the NSIM is found to be highly sensitive to phase distortions which may or may not reflect a realistic impact on speech perception, thus potentially limiting its usefulness in the evaluation of complex signal processing algorithms. | en_US |
dc.language.iso | en | en_US |
dc.subject | Audio Signal Processing, Dereverberation, Hearing Aids, Speech Perception | en_US |
dc.title | A Physiologically-Motivated Analysis of the Performance of Multichannel Linear Predictive Approaches to Dereverberation | en_US |
dc.title.alternative | A Perceptual Evaluation of Multichannel Linear Predictive Dereverberation | en_US |
dc.type | Thesis | en_US |
dc.contributor.department | Electrical and Computer Engineering | en_US |
dc.description.degreetype | Thesis | en_US |
dc.description.degree | Master of Applied Science (MASc) | en_US |
dc.description.layabstract | In practical acoustic environments, sound reflects off physical surfaces resulting in a sequence of echoes called reverberation. This acoustic phenomenon makes speech perception more challenging, especially for individuals with hearing impairment. It is therefore important for speech reproduction systems such as hearing aids to include techniques for managing the effects of reverberation. In this thesis one of the most prevalent signal processing algorithms for “dereverberation”, namely the delay-andpredict algorithm, is evaluated. To provide new insights into the complex impacts of reverberation/dereverberation on speech perception, recent advancements in numerical methods for predicting speech intelligibility are leveraged. The results suggest that the delay-and-predict algorithm provides a distinct perceptual benefit under ideal conditions, but its performance is limited in many practical environments. Additionally, the results highlight potential advantages and disadvantages of different types of intelligibility predictors in the context of evaluating complex signal processing algorithms. | en_US |
Appears in Collections: | Open Access Dissertations and Theses |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
O'Shaughnessy_Kyle_J_2025Sept_MASc.pdf | 12.96 MB | Adobe PDF | View/Open |
Items in MacSphere are protected by copyright, with all rights reserved, unless otherwise indicated.