Quality of Synthetic Speech
Perceptual Dimensions, Influencing Factors, and Instrumental Assessment
Seiten
2017
|
1st ed. 2017
Springer Verlag, Singapore
978-981-10-3733-7 (ISBN)
Springer Verlag, Singapore
978-981-10-3733-7 (ISBN)
This book reviews research towards perceptual quality dimensions of synthetic speech, compares these findings with the state of the art, and derives a set of five universal perceptual quality dimensions for TTS signals. In addition, different techniques for the instrumental quality assessment of TTS signals are introduced, reviewed and tested.
This book reviews research towards perceptual quality dimensions of synthetic speech, compares these findings with the state of the art, and derives a set of five universal perceptual quality dimensions for TTS signals. They are: (i) naturalness of voice, (ii) prosodic quality, (iii) fluency and intelligibility, (iv) absence of disturbances, and (v) calmness. Moreover, a test protocol for the efficient indentification of those dimensions in a listening test is introduced. Furthermore, several factors influencing these dimensions are examined. In addition, different techniques for the instrumental quality assessment of TTS signals are introduced, reviewed and tested. Finally, the requirements for the integration of an instrumental quality measure into a concatenative TTS system are examined.
This book reviews research towards perceptual quality dimensions of synthetic speech, compares these findings with the state of the art, and derives a set of five universal perceptual quality dimensions for TTS signals. They are: (i) naturalness of voice, (ii) prosodic quality, (iii) fluency and intelligibility, (iv) absence of disturbances, and (v) calmness. Moreover, a test protocol for the efficient indentification of those dimensions in a listening test is introduced. Furthermore, several factors influencing these dimensions are examined. In addition, different techniques for the instrumental quality assessment of TTS signals are introduced, reviewed and tested. Finally, the requirements for the integration of an instrumental quality measure into a concatenative TTS system are examined.
Introduction.- Speech Synthesis.- Auditory and Instrumental Quality Evaluation Metrics.- Perceptual Quality Dimensions.- Influencing Factors on Perceptual Quality.- Instrumental Quality Assessment.- Requirements for the Integration of an Instrumental Quality Measure into a Concatenative TTS System.- Conclusions.
Erscheinungsdatum | 28.04.2017 |
---|---|
Reihe/Serie | T-Labs Series in Telecommunication Services |
Zusatzinfo | 29 Illustrations, black and white; XVI, 157 p. 29 illus. |
Verlagsort | Singapore |
Sprache | englisch |
Maße | 155 x 235 mm |
Themenwelt | Informatik ► Software Entwicklung ► User Interfaces (HCI) |
Informatik ► Theorie / Studium ► Künstliche Intelligenz / Robotik | |
Technik ► Elektrotechnik / Energietechnik | |
Schlagworte | Auditory Quality Evaluation • Influencing Factors on Perceptual Quality • Instrumental Quality Evaluation • MaryTTS-Unit-Selection • Perceptual Quality Dimensions • Text-to-Speech TTS Synthesis |
ISBN-10 | 981-10-3733-7 / 9811037337 |
ISBN-13 | 978-981-10-3733-7 / 9789811037337 |
Zustand | Neuware |
Haben Sie eine Frage zum Produkt? |
Mehr entdecken
aus dem Bereich
aus dem Bereich
Lean UX und Design Thinking: Teambasierte Entwicklung …
Buch | Hardcover (2022)
dpunkt (Verlag)
CHF 48,85
Aus- und Weiterbildung nach iSAQB-Standard zum Certified Professional …
Buch | Hardcover (2023)
dpunkt Verlag
CHF 48,85
Wissensverarbeitung - Neuronale Netze
Buch | Hardcover (2023)
Carl Hanser (Verlag)
CHF 48,95