Skip navigation
  • Home
  • Browse
    • Communities
      & Collections
    • Browse Items by:
    • Publication Date
    • Author
    • Title
    • Subject
    • Department
  • Sign on to:
    • My MacSphere
    • Receive email
      updates
    • Edit Profile


McMaster University Home Page
  1. MacSphere
  2. Open Access Dissertations and Theses Community
  3. Open Access Dissertations and Theses
Please use this identifier to cite or link to this item: http://hdl.handle.net/11375/11255
Title: Fourier & Wavelet Methods for Finding Speech Onset Latencies
Authors: Horbatiuk, Ian
Advisor: Watter, Scott
Humphreys, Karin
Yoon, Tae-Jin
Department: Psychology
Keywords: Fourier;Wavelet;Speech;Onset;Latencies;VAD;Other Computer Sciences;Psychology;Quantitative Psychology;Other Computer Sciences
Publication Date: Oct-2011
Abstract: <p>Localization of speech onsets to determine onset latencies is a complicated problem with as many different methods for finding them as there are different areas which use such measurements. A majority of research performed in cognition uses a standard amplitude threshold voice key for estimating the speech onset latencies but a number of studies have shown that this method is incredibly inaccurate and can bias data or produce contradictory results. A number of alternative methods based on modifications to traditional voice-keys have been proposed to deal with the inconsistency although still show a number of deficiencies. Previous work has suggested that switching from the amplitude domain of a signal to the frequency domain a number of the issues present with voice keys can be overcome and when used in conjunction with a number of highly sensitive heuristics highly accurate onset latencies can be produced reliably under ideal conditions. This research is refined and paired with a new user interface to improve the ease of use and increase the adoption rate of this type of analysis. Recent work in the telecommunications industry also suggests that wavelet-based algorithms in conjunction with the Teager Energy Operator (TEO) can accurately detect speech even in the presence of noise. Four wavelet-based methods are investigated and tested; a simple wavelet transform test, and three methods using wavelet-packet transforms in conjunction with the TEO. Although these methods do not perform very well compared to traditional methods a number of potential issues with the implementation are discussed.</p>
URI: http://hdl.handle.net/11375/11255
Identifier: opendissertations/6236
7265
2256364
Appears in Collections:Open Access Dissertations and Theses

Files in This Item:
File SizeFormat 
fulltext.pdf
Open Access
524.81 kBAdobe PDFView/Open
Show full item record Statistics


Items in MacSphere are protected by copyright, with all rights reserved, unless otherwise indicated.

Sherman Centre for Digital Scholarship     McMaster University Libraries
©2022 McMaster University, 1280 Main Street West, Hamilton, Ontario L8S 4L8 | 905-525-9140 | Contact Us | Terms of Use & Privacy Policy | Feedback

Report Accessibility Issue