Speech Processing in Embedded Systems - Priyabrata Sinha

Speech Processing in Embedded Systems (eBook)

Priyabrata Sinha (Author)

eBook Download: PDF

2009
XI, 171 pages
Springer US (Publisher)
9780387755816 (ISBN)

'Speech Processing in Embedded Systems' describes several areas of speech processing, and the various algorithms and industry standards that address each of these areas. The topics covered include different types of Speech Compression, Echo Cancellation, Noise Suppression, Speech Recognition and Speech Synthesis. In addition, this book explores various issues and considerations related to efficient implementation of these algorithms on real-time embedded systems, including the role played by processor CPU and peripheral functionality.

Speech Processing has rapidly emerged as one of the most widespread and well-understood application areas in the broader discipline of Digital Signal Processing. Besides the telecommunications applications that have hitherto been the largest users of speech processing algorithms, several non-traditional embedded processor applications are enhancing their functionality and user interfaces by utilizing various aspects of speech processing.

Preface
Contents
Introduction
  Digital vs. Analog Systems
  Embedded Systems Overview
  Speech Processing in Everyday Life
  Common Speech Processing Tasks
  Summary
  References
Signal Processing Fundamentals
  Signals and Systems [1]
  Sampling and Quantization [1, 2]
  Convolution and Correlation [2, 3]
  Frequency Transformations and FFT [1]
  Introduction to Filters [1]
  FIR and IIR Filters [1, 2]
  Interpolation and Decimation [1, 2]
  Summary
  References
Basic Speech Processing Concepts
  Mechanism of Human Speech Production [1, 3]
  Types of Speech Signals [1, 2]
  Digital Models for the Speech Production System [1, 2]
  Alternative Filtering Methodologies Used in Speech Processing [2, 4]
  Some Basic Speech Processing Operations [1, 2]
  Key Characteristics of the Human Auditory System [3]
  Evaluation of Speech Quality [2]
  Summary
  References
CPU Architectures for Speech Processing
  The Microprocessor Concept
  Microcontroller Units Architecture Overview [3–5, 9, 10]
  Digital Signal Processor Architecture Overview [3–5, 7, 8]
  Digital Signal Controller Architecture Overview [3–5]
  Fixed-Point and Floating-Point Processors [1, 2, 6]
  Accumulators and MAC Operations [2, 3]
  Multiplication, Division, and 32-Bit Operations [3, 4]
  Program Flow Control [3, 4]
  Special Addressing Modes [3, 4]
  Data Scaling, Normalization, and Bit Manipulation Support [2–4]
  Other Architectural Considerations [5, 6]
  Summary
  References
Peripherals for Speech Processing
  Speech Sampling Using Analog-to-Digital Converters [2, 3, 8–10]
  Speech Playback Using Digital-to-Analog Converters [2, 3]
  Speech Playback Using Pulse Width Modulation [3, 4]
  Interfacing with Audio Codec Devices [3, 8–10]
  Communication Peripherals [3, 5–7]
  Other Peripheral Features [1, 8–10]
  Summary
  References
Speech Compression Overview
  Speech Compression and Embedded Applications [2, 3, 5]
  Types of Speech Compression Techniques [1–4]
  Comparison of Speech Coders [5]
  Summary
  References
Waveform Coders
  Introduction to Scalar Quantization
  Uniform Quantization [1–4]
  Logarithmic Quantization [1–4]
  ITU-T G.711 Speech Coder [5]
  ITU-T G.726 and G.726A Speech Coders [7, 8]
  ITU-T G.722 Speech Coder [6]
  Summary
  References
Voice Coders
  Linear Predictive Coding [1–4]
  Vector Quantization [1–4]
  Speex Speech Coder [8]
  ITU-T G.728 Speech Coder [5]
  ITU-T G.729 Speech Coder [6]
  ITU-T G.723.1 Speech Coder [7]
  Summary
  References
Noise and Echo Cancellation
  Benefits and Applications of Noise Suppression [4–8]
  Noise Cancellation Algorithms for 2-Microphone Systems
  Noise Suppression Algorithms for 1-Microphone Systems [4]
  Active Noise Cancellation Systems [1, 7]
  Benefits and Applications of Echo Cancellation [1, 5, 6]
  Acoustic Echo Cancellation Algorithms [2, 5]
  Line Echo Cancellation Algorithms [3, 6]
  Computational Resource Requirements
  Summary
  References
Speech Recognition
  Benefits and Applications of Speech Recognition [2, 3]
  Speech Recognition Using Template Matching [1, 3]
  Speech Recognition Using Hidden Markov Models [1, 3]
  Viterbi Algorithm [1, 3]
  Front-End Analysis [1, 3]
  Other Practical Considerations [1, 3]
  Performance Assessment of Speech Recognizers [2, 3]
  Computational Resource Requirements [3]
  Summary
  References
Speech Synthesis
  Benefits and Applications of Concatenative Speech Synthesis [2]
  Benefits and Applications of Text-to-Speech Systems [2]
  Speech Synthesis by Concatenation of Words and Subwords [1]
  Speech Synthesis by Concatenating Waveform Segments [1]
  Speech Synthesis by Conversion from Text (TTS) [1]
  Computational Resource Requirements [1]
  Summary
  References
Conclusion
  References
Index

Publication date (per publisher)	1 December 2009
Additional info	XI, 171 p.
Place of publication	New York
Language	English
Subject area	Computer Science ► Theory / Studies ► Artificial Intelligence / Robotics
Subject area	Engineering ► Electrical Engineering / Power Engineering
Keywords	algorithms • digital signal processing • Embedded System • Embedded Systems • Interface • Signal Processing • Speech processing • Speech Recognition • Speech Synthesis • Standards • telecommunications
ISBN-13	9780387755816

PDF (watermark)

DRM: Digital watermark
This eBook contains a digital watermark and is thereby personalized for you. If the eBook is improperly passed on to third parties, it can be traced back to the source.

File format: PDF (Portable Document Format)
With its fixed page layout, PDF is particularly suited to technical books with columns, tables, and figures. A PDF can be displayed on almost any device but is of only limited use on small displays (smartphone, eReader).

System requirements:
PC/Mac: You can read this eBook on a PC or Mac. You need a PDF viewer, e.g. Adobe Reader or Adobe Digital Editions.
eReader: This eBook can be read on (almost) all eBook readers. However, it is not compatible with the Amazon Kindle.
Smartphone/Tablet: Whether Apple or Android, you can read this eBook. You need a PDF viewer, e.g. the free Adobe Digital Editions app.

Buying eBooks from abroad
For tax law reasons, we can sell eBooks only within Germany and Switzerland. Regrettably, we cannot fulfill eBook orders from other countries.