音声・音響処理(テキスト)<br>Speech and Audio Processing : A MATLAB-Based Approach

個数:
電子版価格 ¥6,872
  • 電書あり

音声・音響処理(テキスト)
Speech and Audio Processing : A MATLAB-Based Approach

  • 提携先の海外書籍取次会社に在庫がございます。通常2週間で発送いたします。
    重要ご説明事項
    1. 納期遅延や、ご入手不能となる場合が若干ございます。
    2. 複数冊ご注文の場合、分割発送となる場合がございます。
    3. 美品のご指定は承りかねます。
  • 製本 Hardcover:ハードカバー版/ページ数 386 p.
  • 言語 ENG
  • 商品コード 9781107085466
  • DDC分類 006.45

Full Description


With this comprehensive and accessible introduction to the field, you will gain all the skills and knowledge needed to work with current and future audio, speech, and hearing processing technologies. Topics covered include mobile telephony, human-computer interfacing through speech, medical applications of speech and hearing technology, electronic music, audio compression and reproduction, big data audio systems and the analysis of sounds in the environment. All of this is supported by numerous practical illustrations, exercises, and hands-on MATLAB (R) examples on topics as diverse as psychoacoustics (including some auditory illusions), voice changers, speech compression, signal analysis and visualisation, stereo processing, low-frequency ultrasonic scanning, and machine learning techniques for big data. With its pragmatic and application driven focus, and concise explanations, this is an essential resource for anyone who wants to rapidly gain a practical understanding of speech and audio processing and technology.

Table of Contents

Preface                                            ix
Book features xii
Acknowledgements xv
1 Introduction 1 (8)
1.1 Computers and audio 1 (2)
1.2 Digital audio 3 (1)
1.3 Capturing and converting sound 4 (1)
1.4 Sampling 5 (1)
1.5 Summary 6 (3)
Bibliography 7 (2)
2 Basic audio processing 9 (45)
2.1 Sound in MATLAB 10 (8)
2.2 Normalisation 18 (2)
2.3 Continuous audio processing 20 (4)
2.4 Segmentation 24 (8)
2.5 Analysis window sizing 32 (5)
2.6 Visualisation 37 (7)
2.7 Sound generation 44 (6)
2.8 Summary 50 (4)
Bibliography 50 (2)
Questions 52 (2)
3 The human voice 54 (31)
3.1 Speech production 55 (2)
3.2 Characteristics of speech 57 (10)
3.3 Types of speech 67 (4)
3.4 Speech understanding 71 (11)
3.5 Summary 82 (3)
Bibliography 83 (1)
Questions 83 (2)
4 The human auditory system 85 (24)
4.1 Physical processes 85 (2)
4.2 Perception 87 (16)
4.3 Amplitude and frequency models 103(4)
4.4 Summary 107(2)
Bibliography 107(1)
Questions 108(1)
5 Psychoacoustics 109(31)
5.1 Psychoacoustic processing 109(3)
5.2 Auditory scene analysis 112(9)
5.3 Psychoacoustic modelling 121(11)
5.4 Hermansky-style model 132(2)
5.5 MFCC model 134(3)
5.6 Masking effect of speech 137(1)
5.7 Summary 138(2)
Bibliography 138(1)
Questions 139(1)
6 Speech communications 140(55)
6.1 Quantisation 140(8)
6.2 Parameterisation 148(28)
6.3 Pitch models 176(6)
6.4 Analysis-by-synthesis 182(9)
6.5 Perceptual weighting 191(1)
6.6 Summary 192(3)
Bibliography 192(1)
Questions 193(2)
7 Audio analysis 195(28)
7.1 Analysis toolkit 196(12)
7.2 Speech analysis and classification 208(3)
7.3 Some examples of audio analysis 211(2)
7.4 Statistics and classification 213(3)
7.5 Analysing other signals 216(4)
7.6 Summary 220(3)
Bibliography 220(1)
Questions 221(2)
8 Big data 223(44)
8.1 The rationale behind big data 225(1)
8.2 Obtaining big data 226(1)
8.3 Classification and modelling 227(7)
8.4 Summary of techniques 234(29)
8.5 Big data applications 263(1)
8.6 Summary 264(3)
Bibliography 264(1)
Questions 265(2)
9 Speech recognition 267(47)
9.1 What is speech recognition? 267(8)
9.2 Voice activity detection and 275(7)
segmentation
9.3 Current speech recognition research 282(6)
9.4 Hidden Markov models 288(10)
9.5 ASR in practice 298(4)
9.6 Speaker identification 302(3)
9.7 Language identification 305(3)
9.8 Diarization 308(1)
9.9 Related topics 309(2)
9.10 Summary 311(3)
Bibliography 311(1)
Questions 312(2)
10 Advanced topics 314(52)
10.1 Speech synthesis 314(10)
10.2 Stereo encoding 324(10)
10.3 Formant strengthening and steering 334(4)
10.4 Voice and pitch changer 338(8)
10.5 Statistical voice conversion 346(1)
10.6 Whisper-to-speech conversion 347(7)
10.7 Whisperisation 354(3)
10.8 Super-audible speech 357(6)
10.9 Summary 363(3)
Bibliography 364(1)
Questions 365(1)
11 Conclusion 366(4)
References 370(9)
Index 379