Patent10.1121/1.423211
Speech parameter encoder.
14
TL;DR: In this paper, a speech parameter encoder capable of encoding spectrum parameters at a bit rate of 1 kb/s or less with comparatively small amount of operations and memory capacity is presented.
read more
Abstract: A speech parameter encoder capable of encoding spectrum parameters at a bit rate of 1 kb/s or less with comparatively small amount of operations and memory capacity. A spectrum parameter calculation unit (130) derives a spectrum parameter representing the spectrum envelope of a discrete input speech signal through division thereof into frames each having a predetermined time length. A weighted coefficient calculation unit (150) derives a weighted coefficient corresponding to an auditory masking threshold value through derivation thereof from the speech signal. A spectrum parameter quantization unit (160) receives the weighted coefficient and the spectrum parameter and centeses the spectrum parameter through search of a codebook such as to minimize the weighting distortion based on the weighted coefficient.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Patent
Audio bandwidth extending system and method
Masayuki Nishiguchi,Shiro Ohmori +1 more
- 17 Oct 1997
TL;DR: In this paper, an autocorrelation is used on the parameters of the code books, and a signal obtained by up-sampling an linear predictive code residual is used as an exciting source at the time of audio synthesis.
66
Patent
Compound word recognition
Stijn Van Even
- 24 Sep 1999
TL;DR: In this paper, a text string is improved by analyzing the text string with respect to information about expected patterns of the parts of speech of words in text string and by modifying the text text based on the analysis.
61
Patent
Audio signal coding method, decoding method, audio signal coding apparatus, and decoding apparatus where first vector quantization is performed on a signal and second vector quantization is performed on an error component resulting from the first vector quantization
Takeshi Norimatsu,Shuji Miyasaka,Yoshihisa Nakato,Mineo Tsushima,Tomokazu Ishikawa +4 more
- 01 Jul 1997
TL;DR: In this article, a vector quantization method is used to reduce the quantity of data in an audio signal by using a minimum distance among auditive distances between sub-vectors produced by dividing an input vector and audio codes in a transmission-side code book.
45
Patent
Coding/decoding of digital audio signals
Stéphane Ragot,Cyril Gukllaume +1 more
- 30 Jan 2008
TL;DR: In this paper, a method of hierarchical coding of a digital audio frequency input signal into several frequency sub-bands, including a core coding of the input signal according to a first throughput and at least one enhancement coding of higher throughput, of a residual signal, is presented.
43
Patent
Multistage inverse quantization having the plurality of frequency bands
Takeshi Norimatsu,Shuji Miyasaka,Yoshihisa Nakatoh,Mineo Tsushima,Tomokazu Ishikawa +4 more
- 01 Oct 2004
TL;DR: In this article, the authors provided a coding apparatus that enables a decoding apparatus to reproduce an audio signal even through it does not use all of the data from the coding apparatus, and corresponding decoding apparatus corresponding to the decoding apparatus.
35
References
An Algorithm for Vector Quantizer Design
Y. Linde,A. Buzo,Robert M. Gray +2 more
TL;DR: An efficient and intuitive algorithm is presented for the design of vector quantizers based either on a known probabilistic model or on a long training sequence of data.
Perceptual linear predictive (PLP) analysis of speech
TL;DR: A new technique for the analysis of speech, the perceptual linear predictive (PLP) technique, which uses three concepts from the psychophysics of hearing to derive an estimate of the auditory spectrum, and yields a low-dimensional representation of speech.
3.1K
Transform coding of audio signals using perceptual noise criteria
TL;DR: A 4-b/sample transform coder is designed using a psychoacoustically derived noise-making threshold that is based on the short-term spectrum of the signal, and tested in a formal subjective test involving a wide selection of monophonic audio inputs.
Efficient vector quantization of LPC parameters at 24 bits/frame
Kuldip K. Paliwal,B. Atal +1 more
TL;DR: It is shown that the split vector quantizer can quantize LPC information in 24 bits/frame with an average spectral distortion of 1 dB and less than 2% of the frames having spectral distortion greater than 2 dB.
760
Linear prediction on a warped frequency scale
TL;DR: In this paper, a predictor can be computed from a frequency-warped autocorrelation function obtained from the power spectrum or by a direct linear transformation of the original acf.
252
Related Papers (5)
Keiichi C,Kazunori C +1 more
- 09 Dec 1994
Walter Etter
- 30 Sep 2003
Yang Gao,Eyal Shlomot,ガオ,ヤン,シュロモット,エヤル +3 more
- 01 Mar 2010