A Physiologically-Motivated Analysis of the Performance of Multichannel Linear Predictive Approaches to Dereverberation

O'Shaughnessy, Kyle

Please use this identifier to cite or link to this item: http://hdl.handle.net/11375/32571

Full metadata record

DC Field	Value	Language
dc.contributor.advisor	Bruce, Ian	-
dc.contributor.author	O'Shaughnessy, Kyle	-
dc.date.accessioned	2025-10-22T19:39:43Z	-
dc.date.available	2025-10-22T19:39:43Z	-
dc.date.issued	2025	-
dc.identifier.uri	http://hdl.handle.net/11375/32571	-
dc.description.abstract	In practical acoustic environments, reflections give rise to reverberation which makes speech perception more challenging, especially for individuals with hearing impairment. This creates a need for speech reproduction systems such as hearing aids to include strategies for reducing the perceptual impacts of reverberation (i.e., dereverberation algorithms). In this thesis, an evaluation of one of the most prevalent techniques, namely delay-and-predict dereverberation (Triki and Slock, 2006), is provided. Recent advancements in physiologically motivated predictors of speech intelligibility (SI) are leveraged to explain the complex impacts of reverberation/dereverberation on speech perception. In particular, the neurogram similarity index measure (NSIM) and the spectro-temporal modulation index (STMI) are utilized in addition to the well-known hearing aid speech perception index (HASPI) and short-time objective intelligibility (STOI). The results suggest that delay-and-predict dereverberation is relatively effective at reducing the earlier part of room impulse responses (RIRs), which provides sufficient restoration of temporal fine structure (TFS) and envelope (ENV) acoustic cues to reduce listening effort (LE) and compensate deficits in SI for normal-hearing and hearing-impaired listeners. The algorithm is incapable of cancelling the later part of RIRs, but by introducing a small amount of autocorrelation regularization to the algorithm, its impact on this late reverberation is shown to greatly improve. In practice however, delay-and-predict performance is shown to be limited by the number of microphones available, the need for large amounts of signal data, the presence of interfering acoustic signals, and potentially by time-varying acoustics. The evaluation also demonstrates that the NSIM and STMI provide a more complete picture of the perceptual impacts of reverberation than HASPI or STOI. However, the NSIM is found to be highly sensitive to phase distortions which may or may not reflect a realistic impact on speech perception, thus potentially limiting its usefulness in the evaluation of complex signal processing algorithms.	en_US
dc.language.iso	en	en_US
dc.subject	Audio Signal Processing, Dereverberation, Hearing Aids, Speech Perception	en_US
dc.title	A Physiologically-Motivated Analysis of the Performance of Multichannel Linear Predictive Approaches to Dereverberation	en_US
dc.title.alternative	A Perceptual Evaluation of Multichannel Linear Predictive Dereverberation	en_US
dc.type	Thesis	en_US
dc.contributor.department	Electrical and Computer Engineering	en_US
dc.description.degreetype	Thesis	en_US
dc.description.degree	Master of Applied Science (MASc)	en_US
dc.description.layabstract	In practical acoustic environments, sound reflects off physical surfaces resulting in a sequence of echoes called reverberation. This acoustic phenomenon makes speech perception more challenging, especially for individuals with hearing impairment. It is therefore important for speech reproduction systems such as hearing aids to include techniques for managing the effects of reverberation. In this thesis one of the most prevalent signal processing algorithms for “dereverberation”, namely the delay-andpredict algorithm, is evaluated. To provide new insights into the complex impacts of reverberation/dereverberation on speech perception, recent advancements in numerical methods for predicting speech intelligibility are leveraged. The results suggest that the delay-and-predict algorithm provides a distinct perceptual benefit under ideal conditions, but its performance is limited in many practical environments. Additionally, the results highlight potential advantages and disadvantages of different types of intelligibility predictors in the context of evaluating complex signal processing algorithms.	en_US
Appears in Collections:	Open Access Dissertations and Theses

Files in This Item:

File	Description	Size	Format
O'Shaughnessy_Kyle_J_2025Sept_MASc.pdf Open Access		12.96 MB	Adobe PDF	View/Open

Show simple item record