Speech Signal Processing Based on Deep Learning in Complex Acoustic Environments
Elsevier - Health Sciences Division (Verlag)
978-0-443-24856-6 (ISBN)
Xiao-Lei Zhang received his Ph.D. degree with honors from Tsinghua University, China, in 2012. He was a postdoctoral researcher with the Department of Electronic Engineering at Tsinghua University from 2012 to 2014. He was a visiting scholar at The Ohio State University, USA, from 2013 to 2014 and a postdoctoral researcher with the Department of Computer Science and Engineering, The Ohio State University, from 2014 to 2016. Since 2016 he has been a full professor at the Northwestern Polytechnical University, Xi'an, China. His research interests are the topics in speech processing, machine learning, statistical signal processing, and artificial intelligence.
1. Introduction
2. Fundamentals of Deep Learning
3. Voice Activity Detection
4. Single-Channel Speech Enhancement
5. Multi-Channel Speech Enhancement
6. Multi-Speaker Speech Separation
7. Speaker Recognition
8. Speech Recognition
Erscheinungsdatum | 17.09.2024 |
---|---|
Verlagsort | Philadelphia |
Sprache | englisch |
Maße | 152 x 229 mm |
Gewicht | 450 g |
Themenwelt | Informatik ► Theorie / Studium ► Künstliche Intelligenz / Robotik |
ISBN-10 | 0-443-24856-7 / 0443248567 |
ISBN-13 | 978-0-443-24856-6 / 9780443248566 |
Zustand | Neuware |
Haben Sie eine Frage zum Produkt? |
aus dem Bereich