7.10
Summary
This chapter has presented a collection of several methods and techniques that have,
in the most part, built upon the speech and audio analysis foundations laid in previous
chapters. A complete and workable psychoacoustic model was developed and perceptual
weighting discussed (also implemented later using LSP adjustment) along with several
discussions relating to the perception of speech and sound. Recent advances in speech
analysis and recognition were outlined, along with speech synthesis. Finally, the inter-
esting application of voice masking or pitch changing was discussed, along with two
alternative Matlab implementations of such a system.
7.10. Summary
199
Bibliography
• Psychoacoustics: Facts and Models
Eds. H. Fastl and E. Zwicker (Springer, 3rd edition 2006)
• An Introduction to the Psychology of Hearing
B. C. J. Moore (Academic Press, 4th edition 1997)
• Acoustics and Psychoacoustics
D. Howard and J. Angus (Focal Press, 3rd edition 2006)
• Hearing (Handbook of Perception and Cognition)
B. C. J. Moore (Academic Press, 2nd edition 1995)
• Speech Enhancement
Ed. J. S. Lim (Prentice-Hall, 1983)
• Music, Cognition and Computerized Sound: An Introduction to Psychoacoustics
P. R. Cook (MIT Press, 2001)
• Speech Communications, Human and Machine
D. O’Shaughnessy (Addison-Wesley, 1987)
A rather expensive book, but one with over 500 pages describing the speech communications
field, from basic topics extending to more state-of-the-art coverage of speech enhancement,
speech recognition and even a final chapter dedicated to speaker recognition.
• Survey of the State of the Art in Human Language Technology
Eds. R. Cole, J. Mariani, H. Uszkoreit, G. Batista Varile, A. Zaenen, A. Zampolli and
V. Zue (Cambridge University Press and Giardini, 1997)
Also available online from www.dfki.de/ hansu/HLT-Survey.pdf.
This is a published book, also available online as the result of a joint project between the
European Commission and the National Science Foundation of the USA. As the name implies,
it describes the state of the art in the human language fields, including speech recognition,
speaker recognition and so on.
Do'stlaringiz bilan baham: |