Applied Speech and Audio Processing: With matlab examples
Download 2.66 Mb. Pdf ko'rish
|
Applied Speech and Audio Processing With MATLAB Examples ( PDFDrive )
Speech
Figure 3.2 Spectrum plot of a 20 ms recording of voiced speech, showing three distinct formant peaks. The pitch contour (often called f0 – note the lower case notation) is the parameter that describes the tone of the voice (the perceived frequency), and is in effect the funda- mental vocal frequency. Again, pitch frequencies contain energy but contribute little to intelligibility for English and other European languages [6]. It is, however, a very dif- ferent matter in a tonal language such as Mandarin Chinese which is totally dependent on tone for conveying meaning [7,8]. As an example, in Chinese the single word ‘ma’ can mean one of five things depending on which tone it is spoken with: mother, horse, scold, question, etc. and this is not an isolated example since all single Chinese word sounds have multiple meanings differentiated by tone. 3.2.2 Amplitude distribution of speech The overall amplitude distribution of speech depends upon the speaker’s personality and mood (every reader is likely to have endured monotonous talks on occasion – literally meaning ‘single tone’ speech), environmental noise, infection, and so on. Also feedback from a listener, either verbal ‘speak up please’ or non-verbal, such as cupping a hand around an ear, can prompt a speaker to alter their vocal characteristics. However despite this variability, it is interesting to determine average speech levels in different environments, as shown in Table 3.1, reproduced from [9], where the sound amplitudes are listed in dB SPL . 1 Note the wide range in speech level, and the relationship between that and the location. In a train and in an aeroplane are the only situations listed where a negative 1 dB (decibel) is a base 10 logarithmic measure of amplitude, with the SPL subscript in dB SPL referring to sound pressure level, which is referenced so that 0 dB is the quietest average audible sound at 1 kHz. In terms of measurable pressure, 74 dB SPL is 1 µbar, 1 dyne cm −2 or 0.1 Pa in different units. |
Ma'lumotlar bazasi mualliflik huquqi bilan himoyalangan ©fayllar.org 2024
ma'muriyatiga murojaat qiling
ma'muriyatiga murojaat qiling