Applied Speech and Audio Processing: With matlab examples

bet	77/170
Sana	18.10.2023
Hajmi	2,66 Mb.
	#1708320

1 ... 73 74 75 76 77 78 79 80 ... 170

Bog'liq
Applied Speech and Audio Processing With MATLAB Examples ( PDFDrive )

5.2. Parameterisation
95
Figure 5.5
Illustration of an audio waveform being quantised to 16 adaptive step levels. The
ﬁgure shows that both the absolute placement and the size of the quantisation steps is
determined dynamically at each sample point based upon the previous quantised sample values.
Remember the masking properties of the human auditory system (Section 4.2.8)?
This states that loud tones will mask nearby, but quieter frequencies. Similarly ADPCM
tends to match the loudest sounds by varying its quantisation step to match them, leaving
quieter sounds to be lost in the quantisation noise. Since this follows in many ways the
characteristics of the human auditory system, it is not a great disadvantage – except
when the quiet sound is far away in frequency from the loud sound. In humans such a
sound would no longer be masked (since it would now be in a different critical band –
see Section 4.3.2), but ADPCM has no concept of frequency bands and would probably
lose the sound. For this reason, SB-ADPCM, being able to simultaneously code one
very loud and one very quiet sound in different frequency ranges, is perceived as having
much higher quality than the ADPCM equivalent.
5.2
Parameterisation
Coding techniques that follow, or try to predict, a waveform shape tend to be relatively
simple and consequently achieve limited results. These techniques typically assume very
little about the waveform being coded – except perhaps maximum extents and slew rate.
There is a trade-off between coding quality and bitrate, and very little room to manoeuvre
toward the ideal of a high-ﬁdelity coding scheme with very low bitrate.
Instead of coding the physical waveform directly, researchers hit upon the idea of
parameterising the sound in some way: several values are chosen to represent important
aspects of the speech signal. Whatever parameters are chosen to represent the waveform

Download 2,66 Mb.

Do'stlaringiz bilan baham:

1 ... 73 74 75 76 77 78 79 80 ... 170