Applied Speech and Audio Processing: With matlab examples

partial correlation, 103, 104

bet	169/170
Sana	18.10.2023
Hajmi	2,66 Mb.
	#1708320

1 ... 162 163 164 165 166 167 168 169 170

Bog'liq
Applied Speech and Audio Processing With MATLAB Examples ( PDFDrive )

partial correlation, 103, 104
pause(),
9
PCM, see pulse coded modulation
perceptual error, 125
perceptual weighting, 123, 168
perfect pitch, 67
PESQ, 49
phase locking, 63
phone, 40, 173, 183
phoneme, 40, 46, 47, 52, 148, 170, 177, 181
phonetic spelling, 181
phonology, of languages, 172
pitch doubling, 121, 122
pitch lag, fractional, 119
pitch perception, 63, 67, 70, 74
pitch synchronous overlap and add, 194
plain old telephone services, 1
play(),
9
plosive, 39
plot(),
10, 16
plotting, 26
poly-phones, 173
POTS, see plain old telephone services
precedence effect, 70
private mobile radio, 5
pronunciation
in speaker classiﬁcation, 171
in speech synthesis, 170, 181, 183
of phonemes, 40
proximity, 77
pseudo-stationarity, 24, 47, 97, 103
PSOLA, see pitch synchronous overlap and add
PSQM, 49
psychoacoustics, 60, 64, 74, 160–164, 167, 168
puberty, 170
pulse coded modulation, 5, 8, 10, 59, 90, 93
standards, 96
quadraphonic sound, 188
quantisation, 90
of audio samples, 9, 12
of LPC parameters, 101, 112
of LSPs, 112, 113, 115
of speech, 98
of stereo audio, 188
split vector, 116
vector, 116

Index
205
record(),
8
redundancy, 53
reﬂection coefﬁcients, 103, 105
regular pulse excitation, 118, 123, 131
reshape(),
71
resume(),
9
roots(),
108
RPE, see regular pulse excitation
sample rate, 5, 12, 89
sampling, 4
scaling of amplitude, 13
SD, see spectral distortion
segmental signal-to-noise ratio, 50, 112
segmentation, 17, 18, 21, 23, 24, 40, 118, 179
in CELP coder, 123
SEGSNR, see segmental signal-to-noise ratio
short-time Fourier transform, 25, 149
SIFT, see simpliﬁed inverse ﬁltering technique
signal-to-noise ratio, 50
simpliﬁed inverse ﬁltering technique, 149
SNR, see signal-to-noise ratio
sone, 72
sound perception, 63
sound pressure level, 42
sound strengthening, 74
sound(),
9
soundsc(),
9, 22, 184
spatial placement, in stereo, 185
speaker
classiﬁcation, 169
identiﬁcation, 169
veriﬁcation, 169
speaker identiﬁcation, 30
spectral distortion, 51, 123
spectrogram, 22, 26
spectrogram(),
26, 32
speech
amplitude, 42, 43, 171
articulation, 44, 47
cadence, 157
characteristics, 41
classiﬁcation, 41, 148
codec, 89
coding algorithms, 96
compression of, 89, 98, 123, 131, 161, 168
energy, 41, 45
formants, 41, 45, 46, 74, 75, 140, 166, 168, 192
frequency distribution, 45
intelligibility, 41, 45, 53, 69, 161, 166
intelligibility testing, 51
intelligibility vs. quality, 47
perception, 71
pitch changer, 193
pitch contour, 41
pitch extraction, 119, 120, 149
pitch models, 117
pitch period, 122, 142, 194
power, 45
production, 38
quality testing, 49
recognition, 30, 74, 170
shouting, 41
spectrum, 140
synthesis, 180
unvoiced, 44
voiced, 44
voicing, 156
speech recognition, 174
speech recognition, automatic, 174
speech recognition, continuous, 174
Sphinx, 178
SPL, see sound pressure level
split vector quantisation, 116
stationarity, 24, 47
steganography, 161
stereo, 3, 69, 184
stereo encoding, joint, 188
STFT, see short-time Fourier transform
stop(),
9
stress, on words, 179
surround sound, 188
swapbytes(),
12
syllabic rate, 47, 156, 171
syllable, 40, 46, 47, 52, 156
syntax, of languages, 172, 175
synthesiser, of speech, 180
temporal integration in hearing, 63
temporary threshold shift, 64, 75
TETRA, see trans-European trunked radio
text-to-speech, 180
TFD, see time-frequency distribution
threshold crossing rate, 137
timbre, 68
time-frequency distribution, 149, 151
toll-quality, 4
tone
generation of, 30, 31
tone induction, 74
tonegen(),
30, 65, 67, 77, 184
tongue, placement of, 171
trans-European trunked radio, 131
transcription systems, 174
TTS, see temporary threshold shift or
text-to-speech
Turk, the, 180
VAD, see voice activity detection
vector quantisation, 116, 170
vector sum excited linear prediction, 128
velum, 39

206
Index
violin, analysis of, 151
visualisation, 25, 32
vocal chord, see glottis
vocal tract, 40, 41, 44, 117, 118, 170
parameters, 125
resonances, 123
voice activity detection, 179
voice operated switch, 179
voice scrambler, 193
VOS, see voice operated switch
vowel, 40, 44, 171
waterfall(),
32
wavrecord(),
8
white noise, 30
Wigner–Ville distribution, 149
windowing, 19, 105
in CELP coder, 123
window functions, 21
window size, 24, 25
xcorr(),
27
ZCR, see zero-crossing rate
zero-crossing rate, 136, 147, 149

Download 2,66 Mb.

Do'stlaringiz bilan baham:

1 ... 162 163 164 165 166 167 168 169 170