Applied Speech and Audio Processing: With MATLAB Examples
partial correlation, 103, 104
pause(), 9
PCM, see pulse coded modulation
perceptual error, 125
perceptual weighting, 123, 168
perfect pitch, 67
PESQ, 49
phase locking, 63
phone, 40, 173, 183
phoneme, 40, 46, 47, 52, 148, 170, 177, 181
phonetic spelling, 181
phonology, of languages, 172
pitch doubling, 121, 122
pitch lag, fractional, 119
pitch perception, 63, 67, 70, 74
pitch synchronous overlap and add, 194
plain old telephone services, 1
play(), 9
plosive, 39
plot(), 10, 16
plotting, 26
poly-phones, 173
POTS, see plain old telephone services
precedence effect, 70
private mobile radio, 5
pronunciation
    in speaker classification, 171
    in speech synthesis, 170, 181, 183
    of phonemes, 40
proximity, 77
pseudo-stationarity, 24, 47, 97, 103
PSOLA, see pitch synchronous overlap and add
PSQM, 49
psychoacoustics, 60, 64, 74, 160–164, 167, 168
puberty, 170
pulse coded modulation, 5, 8, 10, 59, 90, 93
    standards, 96
quadraphonic sound, 188
quantisation, 90
    of audio samples, 9, 12
    of LPC parameters, 101, 112
    of LSPs, 112, 113, 115
    of speech, 98
    of stereo audio, 188
    split vector, 116
    vector, 116
record(), 8
redundancy, 53
reflection coefficients, 103, 105
regular pulse excitation, 118, 123, 131
reshape(), 71
resume(), 9
roots(), 108
RPE, see regular pulse excitation
sample rate, 5, 12, 89
sampling, 4
scaling of amplitude, 13
SD, see spectral distortion
segmental signal-to-noise ratio, 50, 112
segmentation, 17, 18, 21, 23, 24, 40, 118, 179
    in CELP coder, 123
SEGSNR, see segmental signal-to-noise ratio
short-time Fourier transform, 25, 149
SIFT, see simplified inverse filtering technique
signal-to-noise ratio, 50
simplified inverse filtering technique, 149
SNR, see signal-to-noise ratio
sone, 72
sound perception, 63
sound pressure level, 42
sound strengthening, 74
sound(), 9
soundsc(), 9, 22, 184
spatial placement, in stereo, 185
speaker
    classification, 169
    identification, 169
    verification, 169
speaker identification, 30
spectral distortion, 51, 123
spectrogram, 22, 26
spectrogram(), 26, 32
speech
    amplitude, 42, 43, 171
    articulation, 44, 47
    cadence, 157
    characteristics, 41
    classification, 41, 148
    codec, 89
    coding algorithms, 96
    compression of, 89, 98, 123, 131, 161, 168
    energy, 41, 45
    formants, 41, 45, 46, 74, 75, 140, 166, 168, 192
    frequency distribution, 45
    intelligibility, 41, 45, 53, 69, 161, 166
    intelligibility testing, 51
    intelligibility vs. quality, 47
    perception, 71
    pitch changer, 193
    pitch contour, 41
    pitch extraction, 119, 120, 149
    pitch models, 117
    pitch period, 122, 142, 194
    power, 45
    production, 38
    quality testing, 49
    recognition, 30, 74, 170
    shouting, 41
    spectrum, 140
    synthesis, 180
    unvoiced, 44
    voiced, 44
    voicing, 156
speech recognition, 174
speech recognition, automatic, 174
speech recognition, continuous, 174
Sphinx, 178
SPL, see sound pressure level
split vector quantisation, 116
stationarity, 24, 47
steganography, 161
stereo, 3, 69, 184
stereo encoding, joint, 188
STFT, see short-time Fourier transform
stop(), 9
stress, on words, 179
surround sound, 188
swapbytes(), 12
syllabic rate, 47, 156, 171
syllable, 40, 46, 47, 52, 156
syntax, of languages, 172, 175
synthesiser, of speech, 180
temporal integration in hearing, 63
temporary threshold shift, 64, 75
TETRA, see trans-European trunked radio
text-to-speech, 180
TFD, see time-frequency distribution
threshold crossing rate, 137
timbre, 68
time-frequency distribution, 149, 151
toll-quality, 4
tone
    generation of, 30, 31
tone induction, 74
tonegen(), 30, 65, 67, 77, 184
tongue, placement of, 171
trans-European trunked radio, 131
transcription systems, 174
TTS, see temporary threshold shift or text-to-speech
Turk, the, 180
VAD, see voice activity detection
vector quantisation, 116, 170
vector sum excited linear prediction, 128
velum, 39
violin, analysis of, 151
visualisation, 25, 32
vocal chord, see glottis
vocal tract, 40, 41, 44, 117, 118, 170
    parameters, 125
    resonances, 123
voice activity detection, 179
voice operated switch, 179
voice scrambler, 193
VOS, see voice operated switch
vowel, 40, 44, 171
waterfall(), 32
wavrecord(), 8
white noise, 30
Wigner–Ville distribution, 149
windowing, 19, 105
    in CELP coder, 123
    window functions, 21
    window size, 24, 25
xcorr(), 27
ZCR, see zero-crossing rate
zero-crossing rate, 136, 147, 149