Springer Handbook of Speech Processing

Springer Handbook of Speech Processing

  • ただいまウェブストアではご注文を受け付けておりません。 ⇒古書を探す
  • 商品コード 9783540491279

Full Description


This handbook plays a fundamental role in sustainable progress in speech research and development. With an accessible format and with accompanying DVD-Rom, it targets three categories of readers: graduate students, professors and active researchers in academia, and engineers in industry who need to understand or implement some specific algorithms for their speech-related products. It is a superb source of application-oriented, authoritative and comprehensive information about these technologies, this work combines the established knowledge derived from research in such fast evolving disciplines as Signal Processing and Communications, Acoustics, Computer Science and Linguistics.

Contents

Foreword by J. L. Flanagan Chap. 1 Introduction to Speech ProcessingPart A: Production, Perception, and Modeling of Speech (M. M. Sondhi)Part A describes the contemporary views on phonatory and articulatory mechanisms of humans to illustrate the physiological processes of speech production. It also describes the nonlinear cochlear speech processing in auditory masking, the perception of speech and sound by humans, and various methods for speech quality assessment with a focus on standardized methods. Chap. 2 Physiological Processes of Speech ProductionChap. 3 Nonlinear Cochlear Signal Processing and Masking in Speech PerceptionChap. 4 Perception of Speech and SoundChap. 5 Speech Quality Estimation Part B: Signal Processing for Speech (Y. Huang, J. Benesty)Part B gives a large number of signal processing concepts and algorithms that are widely used in speech processing and in the applications of speech. Chap. 6 Wiener and Adaptive FiltersChap. 7 Linear PredictionChap. 8 Kalman FilterChap. 9 Homomorphic Systems and Cepstrum Analysis of SpeechChap. 10 Pitch and Voicing Determination of Speech with an Extension Toward Music SignalsChap. 11 Formant Estimation and TrackingChap. 12 The STFT, Sinusoidal Models, and Speech ModificationChap. 13 Adaptive Blind Multichannel Identification Part C: Speech Coding (W. B. Kleijn)Part C discusses the attributes of speech coders as well as the underlying principles that determine their behavior and architecture. Coders for both traditional and packet networks are discussed, as well as low-bit-rate speech coding, various speech coding standards, and perceptual audio coders. Chap. 14 Principles of Speech CodingChap. 15 Voice over IP: Speech Transmission over Packet NetworksChap. 16 Low-Bit-Rate Speech CodingChap. 17 Analysis-by-Synthesis Speech CodingChap. 18 Perceptual Audio Coding of Speech Signals Part D: Text-to-Speech Synthesis (S. Narayanan)Part D presents different techniques for speech synthesis, including rule-based, corpus-based, and a combination of both. Linguistic analysis and prosodic processing, which are important parts of a text-to-speech (TTS) system, are reviewed. Other aspects of interest for TTS such as voice transformation and synthesis of expressive speech are also discussed. Chap. 19 Basic Principles of Speech SynthesisChap. 20 Rule-Based Speech SynthesisChap. 21 Corpus-Based Speech SynthesisChap. 22 Linguistic Processing for Speech SynthesisChap. 23 Prosodic ProcessingChap. 24 Voice TransformationChap. 25 Expressive/Affective Speech Synthesis Part E: Speech Recognition (L. Rabiner, B.-H. Juang)Part E describes the most important speech recognition technologies. The approach based on the powerful hidden Markov models is generously presented and some other promising approaches are outlined. The robustness issues concerning the acoustical environment are studied. Finally, several fundamental applications are also discussed. Chap. 26 Historical Perspective of the Field of ASR/NLUChap. 27 HMMs and Related Speech TechnologiesChap. 28 Speech Recognition with Weighted Finite-State TransducersChap. 29 A Machine Learning Framework for Spoken-Dialog ClassificationChap. 30 Towards Superhuman Speech RecognitionChap. 31 Natural Language UnderstandingChap. 32 Transcription and Distillation of Spontaneous SpeechChap. 33 Environmental RobustnessChap. 34 The Business of Speech TechnologiesChap. 35 Spoken Dialog Systems Part F: Speaker Recognition (S. Parthasarathy)Part F develops the field of speaker recognition. It covers text-dependent and text-independent speaker recognition and their applications. Chap. 36 Overview of Speaker RecognitionChap. 37 Text-Dependent Speaker RecognitionChap. 38 Text-Independent Speaker Recognition Part G: Language Recognition (C.-H. Lee)Part G provides an overview on principles of state-of-the-art language recognition approaches. Language characterization, identification, and modeling are addressed. Vector space characterization approaches to converting speech utterances into spoken document vectors for modeling and classification are also presented. Chap. 39 Principle of Spoken Language RecognitionChap. 40 Spoken Language CharacterizationChap. 41 Automatic Language Recognition via Spectral and Token Based ApproachesChap. 42 Vector Based Spoken Language Classification Part H: Speech Enhancement (J. Chen, S. Gannot, J. Benesty)Part H develops all classical aspects of speech enhancement: noise reduction, dereverberation, echo cancellation, feedback control, and active noise control. Chap. 43 Fundamentals of Noise ReductionChap. 44 Spectral Enhancement methodsChap. 45 Echo CancellationChap. 46 DereverberationChap. 47 Adaptive Beamforming and PostfilteringChap. 48 Feedback Control in Hearing AidsChap. 49 Active Noise Control Part I: Multichannel Speech Processing (J. Benesty, I. Cohen, Y. Huang)Part I presents modern aspects of multichannel processing, for acoustic scene analysis, speech acquisition and presentation, when a large number of microphones and loudspeakers are available. Chap. 50 Microphone ArraysChap. 51 Time Delay Estimation and Source LocalizationChap. 52 Convolutive Blind Source Separation MethodsChap. 53 Sound Field Reproduction About the Authors Subject Index

最近チェックした商品