Abstract / Description of output
There has been consistent interest among speech signal processing researchers in the accurate estimation of the fundamental frequency (F-0) of speech signals. This study examines ten F-0 estimation algorithms (some well-established and some proposed more recently) to determine which of these algorithms is, on average, better able to estimate F-0 in the sustained vowel /a/. Moreover, a robust method for adaptively weighting the estimates of individual F-0 estimation algorithms based on quality and performance measures is proposed, using an adaptive Kalman filter (KF) framework. The accuracy of the algorithms is validated using (a) a database of 117 synthetic realistic phonations obtained using a sophisticated physiological model of speech production and (b) a database of 65 recordings of human phonations where the glottal cycles are calculated from electroglottograph signals. On average, the sawtooth waveform inspired pitch estimator and the nearly defect-free algorithms provided the best individual F-0 estimates, and the proposed KF approach resulted in a similar to 16% improvement in accuracy over the best single F-0 estimation algorithm. These findings may be useful in speech signal processing applications where sustained vowels are used to assess vocal quality, when very accurate F-0 estimation is required. (C) 2014 Acoustical Society of America.
Original language | English |
---|---|
Pages (from-to) | 2885-2901 |
Number of pages | 17 |
Journal | The Journal of the Acoustical Society of America |
Volume | 135 |
Issue number | 5 |
DOIs | |
Publication status | Published - 9 May 2014 |
Keywords / Materials (for Non-textual outputs)
- PITCH DETECTION ALGORITHMS
- PARKINSONS-DISEASE
- ELECTROGLOTTOGRAPHIC SIGNALS
- PERTURBATION MEASUREMENTS
- VOCAL FOLDS
- SPEECH
- VOICE
- MODEL
- PHONATION
- MUSIC
Fingerprint
Dive into the research topics of 'Robust fundamental frequency estimation in sustained vowels: Detailed algorithmic comparisons and information fusion with adaptive Kalman filtering'. Together they form a unique fingerprint.Profiles
-
Thanasis Tsanas
- Deanery of Molecular, Genetic and Population Health Sciences - Personal Chair in Digital Health and Data Science
- Usher Institute
- Centre for Medical Informatics
- Edinburgh Neuroscience
- Data Science and Artificial Intelligence
Person: Academic: Research Active