Nicht aus der Schweiz? Besuchen Sie lehmanns.de
Deep Learning Based Speech Quality Prediction - Gabriel Mittag

Deep Learning Based Speech Quality Prediction

(Autor)

Buch | Softcover
XIV, 165 Seiten
2023 | 1st ed. 2022
Springer International Publishing (Verlag)
978-3-030-91481-3 (ISBN)
CHF 149,75 inkl. MwSt
  • Versand in 15-20 Tagen
  • Versandkostenfrei
  • Auch auf Rechnung
  • Artikel merken
This book presents how to apply recent machine learning (deep learning) methods for the task of speech quality prediction. The author shows how recent advancements in machine learning can be leveraged for the task of speech quality prediction and provides an in-depth analysis of the suitability of different deep learning architectures for this task. The author then shows how the resulting model outperforms traditional speech quality models and provides additional information about the cause of a quality impairment through the prediction of the speech quality dimensions of noisiness, coloration, discontinuity, and loudness.

Gabriel Mittag received his B.Sc. and M.Sc. degree in electrical and electronic engineering at the Technische Universität Berlin. During his master's degree he spent two semesters at the RMIT University in Melbourne, Australia and focused primarily on biomedical and speech signal processing. From 2016 he was employed as research assistant at the Quality and Usability Lab at the TU Berlin, where he finished his PhD on the machine learning based prediction of speech quality. In May 2021, Gabriel Mittag started as Machine Learning Scientist at Microsoft in Redmond, WA, USA.

1. Introduction.- 2. Quality Assessment of Transmitted Speech.- 3. Neural Network Architectures for Speech Quality Prediction.- 4. Double-Ended Speech Quality Prediction Using Siamese Networks.- 5. Prediction of Speech Quality Dimensions With Multi-Task Learning.- 6. Bias-Aware Loss for Training From Multiple Datasets.- 7. NISQA - A Single-Ended Speech Quality Model.- 8. Conclusions.- A. Dataset Condition Tables.- B. Train and Validation Dataset Dimension Histograms.- References.

Erscheinungsdatum
Reihe/Serie T-Labs Series in Telecommunication Services
Zusatzinfo XIV, 165 p. 58 illus., 54 illus. in color.
Verlagsort Cham
Sprache englisch
Maße 155 x 235 mm
Gewicht 282 g
Themenwelt Informatik Software Entwicklung User Interfaces (HCI)
Informatik Theorie / Studium Künstliche Intelligenz / Robotik
Technik Elektrotechnik / Energietechnik
Schlagworte Deep learning • machine learning • quality of experience • Quality of Service • Speech Quality
ISBN-10 3-030-91481-X / 303091481X
ISBN-13 978-3-030-91481-3 / 9783030914813
Zustand Neuware
Haben Sie eine Frage zum Produkt?
Mehr entdecken
aus dem Bereich
Aus- und Weiterbildung nach iSAQB-Standard zum Certified Professional …

von Mahbouba Gharbi; Arne Koschel; Andreas Rausch; Gernot Starke

Buch | Hardcover (2023)
dpunkt Verlag
CHF 48,85
Wissensverarbeitung - Neuronale Netze

von Uwe Lämmel; Jürgen Cleve

Buch | Hardcover (2023)
Carl Hanser (Verlag)
CHF 48,95