Speech Coder using Line Spectral Frequencies of Cascaded Second Order Predictors

Namburu, Visala

Speech Coder using Line Spectral Frequencies of Cascaded Second Order Predictors

dc.contributor.author	Namburu, Visala	en
dc.contributor.committeechair	Beex, A. A. Louis	en
dc.contributor.committeemember	Baumann, William T.	en
dc.contributor.committeemember	Woerner, Brian D.	en
dc.contributor.department	Electrical and Computer Engineering	en
dc.date.accessioned	2014-03-14T20:47:46Z	en
dc.date.adate	2001-11-14	en
dc.date.available	2014-03-14T20:47:46Z	en
dc.date.issued	2001-11-09	en
dc.date.rdate	2002-11-14	en
dc.date.sdate	2001-11-12	en
dc.description.abstract	A major objective in speech coding is to represent speech with as few bits as possible. Usual transmission parameters include auto regressive parameters, pitch parameters, excitation signals and excitation gains. The pitch predictor makes these coders sensitive to channel errors. Aiming for robustness to channel errors, we do not use pitch prediction and compensate for its lack with a better representation of the excitation signal. We propose a new speech coding approach, Vector Sum Excited Cascaded Linear Prediction (VSECLP), based on code excited linear prediction. We implement forward linear prediction using five cascaded second order sections - parameterized in terms of line spectral frequency - in place of the conventional tenth order filter. The line spectral frequency parameters estimated by the Direct Line Spectral Frequency (DLSF) adaptation algorithm are closer to the true values than those estimated by the Cascaded Recursive Least Squares - Subsection algorithm. A simplified version of DLSF is proposed to further reduce computational complexity. Split vector quantization is used to quantize the line spectral frequency parameters and vector sum codebooks to quantize the excitation signals. The effect on reconstructed speech quality and transmission rate, of an increased number of bits and differently split combinations, is analyzed by testing VSECLP on the TIMIT database. The quantization of the excitation vectors using the discrete cosine transform resulted in segmental signal to noise ratio of 4 dB at 20.95 kbps, whereas the same quality was obtained at 9.6 kbps using vector sum codebooks.	en
dc.description.degree	Master of Science	en
dc.identifier.other	etd-11122001-094938	en
dc.identifier.sourceurl	http://scholar.lib.vt.edu/theses/available/etd-11122001-094938/	en
dc.identifier.uri	http://hdl.handle.net/10919/35670	en
dc.publisher	Virginia Tech	en
dc.relation.haspart	VN_etd.pdf	en
dc.rights	In Copyright	en
dc.rights.uri	http://rightsstatements.org/vocab/InC/1.0/	en
dc.subject	Vector Quantization	en
dc.subject	Speech Coding	en
dc.subject	Cascaded Second Order Predictors	en
dc.subject	Linear Prediction	en
dc.subject	Line Spectral Frequencies	en
dc.title	Speech Coder using Line Spectral Frequencies of Cascaded Second Order Predictors	en
dc.type	Thesis	en
thesis.degree.discipline	Electrical and Computer Engineering	en
thesis.degree.grantor	Virginia Polytechnic Institute and State University	en
thesis.degree.level	masters	en
thesis.degree.name	Master of Science	en

Files

Original bundle

Now showing 1 - 1 of 1

Name:: VN_etd.pdf
Size:: 1.08 MB
Format:: Adobe Portable Document Format

Download

Collections

Masters Theses