Investigating Speaker Features From Very Short Speech Records

Berg, Brian LaRoy

Investigating Speaker Features From Very Short Speech Records

Files

afrif_berg.pdf (9.84 MB)

Downloads: 322

Date

2001-07-23

Authors

Berg, Brian LaRoy

Publisher

Virginia Tech

Abstract

A procedure is presented that is capable of extracting various speaker features, and is of particular value for analyzing records containing single words and shorter segments of speech. By taking advantage of the fast convergence properties of adaptive filtering, the approach is capable of modeling the nonstationarities due to both the vocal tract and vocal cord dynamics.

Specifically, the procedure extracts the vocal tract estimate from within the closed glottis interval and uses it to obtain a time-domain glottal signal. This procedure is quite simple, requires minimal manual intervention (in cases of inadequate pitch detection), and is particularly unique because it derives both the vocal tract and glottal signal estimates directly from the time-varying filter coefficients rather than from the prediction error signal. Using this procedure, several glottal signals are derived from human and synthesized speech and are analyzed to demonstrate the glottal waveform modeling performance and kind of glottal characteristics obtained therewith. Finally, the procedure is evaluated using automatic speaker identity verification.

Keywords

Speaker Recognition, Speech Synthesis, Speech Analysis, Digital Signal Processing, Speaker Identity Verification, Speech Processing

Persistent link

http://hdl.handle.net/10919/28691

Collections

Doctoral Dissertations

Full item page

Investigating Speaker Features From Very Short Speech Records

Files

TR Number

Date

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Description

Keywords

Citation

Persistent link

Collections