Time Domain Representation of Speech Sounds (eBook)
XVI, 154 Seiten
Springer Singapore (Verlag)
978-981-13-2303-4 (ISBN)
The book presents the history of time-domain representation and the extent of its development along with that of spectral domain representation in the cognitive and technology domains. It discusses all the cognitive experiments related to this development, along with details of technological developments related to both automatic speech recognition (ASR) and text to speech synthesis (TTS), and introduces a viable time-domain representation for both objective and subjective analysis, as an alternative to the well-known spectral representation.
The book also includes a new cohort study on the use of lexical knowledge in ASR.
India has numerous official dialects, and spoken-language technology development is a burgeoning area. In fact TTS and ASR taken together constitute the most important technology for empowering people. As such, the book describes time domain representation in such a way that it can be easily and seamlessly incorporated into ASR and TTS research and development. In short, it is a valuable guidebook for the development of ASR and TTS in all the Indian Standard Dialects using signal domain parameters.
Prof. Asoke Kumar Datta, an MSc. (Pure Math), worked at the Indian Statistical Institute from 1955-1994. He retired from the HOD Electronics and Communication Sciences Department, and is an ISI Visiting Professor. He is President, BOM-BOM, Kolkata; Senior Guest Researcher, Sir C V Raman Centre for Physics and Music, JU; Executive Member, Society for Natural Language Technology Research, Kolkata; Life Member, Acoustical Society of India. He received the J C Bose Memorial Award, 1969; Sir C V Raman Award, 1982-83 & 1998-99; S K Mitra Memorial Award, 1984; and the Sri C AchyutMenon Prize, 2001. His areas of academic interest include pattern recognition, AI, speech, music and consciousness.
The book presents the history of time-domain representation and the extent of its development along with that of spectral domain representation in the cognitive and technology domains. It discusses all the cognitive experiments related to this development, along with details of technological developments related to both automatic speech recognition (ASR) and text to speech synthesis (TTS), and introduces a viable time-domain representation for both objective and subjective analysis, as an alternative to the well-known spectral representation.The book also includes a new cohort study on the use of lexical knowledge in ASR. India has numerous official dialects, and spoken-language technology development is a burgeoning area. In fact TTS and ASR taken together constitute the most important technology for empowering people. As such, the book describes time domain representation in such a way that it can be easily and seamlessly incorporated into ASR and TTS research and development. In short, it is a valuable guidebook for the development of ASR and TTS in all the Indian Standard Dialects using signal domain parameters.
Prof. Asoke Kumar Datta, an MSc. (Pure Math), worked at the Indian Statistical Institute from 1955-1994. He retired from the HOD Electronics and Communication Sciences Department, and is an ISI Visiting Professor. He is President, BOM-BOM, Kolkata; Senior Guest Researcher, Sir C V Raman Centre for Physics and Music, JU; Executive Member, Society for Natural Language Technology Research, Kolkata; Life Member, Acoustical Society of India. He received the J C Bose Memorial Award, 1969; Sir C V Raman Award, 1982-83 & 1998-99; S K Mitra Memorial Award, 1984; and the Sri C AchyutMenon Prize, 2001. His areas of academic interest include pattern recognition, AI, speech, music and consciousness.
Chapter 1. Introduction.- Chapter 2. Spectral Domain.- Chapter 3. Cognition of Phones.- Chapter 4. Signal Processing.- Chapter 5. Time Domain Representation of Phones.- Chapter 6. Role of Lexical Knowledge in ASR.- Chapter 7. Random Perturbations.- Chapter 8. Non linearity in Speech signal.
Erscheint lt. Verlag | 3.11.2018 |
---|---|
Zusatzinfo | XVI, 154 p. 117 illus., 27 illus. in color. |
Verlagsort | Singapore |
Sprache | englisch |
Themenwelt | Mathematik / Informatik ► Informatik ► Betriebssysteme / Server |
Informatik ► Software Entwicklung ► User Interfaces (HCI) | |
Informatik ► Theorie / Studium ► Künstliche Intelligenz / Robotik | |
Technik ► Elektrotechnik / Energietechnik | |
Schlagworte | Automatic speech recognition (ASR) • Cognitive Development • lexical knowledge • perturbations • Signal Processing • Speech processing • Text to Speech Synthesis (TTS) |
ISBN-10 | 981-13-2303-8 / 9811323038 |
ISBN-13 | 978-981-13-2303-4 / 9789811323034 |
Haben Sie eine Frage zum Produkt? |
Größe: 7,9 MB
DRM: Digitales Wasserzeichen
Dieses eBook enthält ein digitales Wasserzeichen und ist damit für Sie personalisiert. Bei einer missbräuchlichen Weitergabe des eBooks an Dritte ist eine Rückverfolgung an die Quelle möglich.
Dateiformat: PDF (Portable Document Format)
Mit einem festen Seitenlayout eignet sich die PDF besonders für Fachbücher mit Spalten, Tabellen und Abbildungen. Eine PDF kann auf fast allen Geräten angezeigt werden, ist aber für kleine Displays (Smartphone, eReader) nur eingeschränkt geeignet.
Systemvoraussetzungen:
PC/Mac: Mit einem PC oder Mac können Sie dieses eBook lesen. Sie benötigen dafür einen PDF-Viewer - z.B. den Adobe Reader oder Adobe Digital Editions.
eReader: Dieses eBook kann mit (fast) allen eBook-Readern gelesen werden. Mit dem amazon-Kindle ist es aber nicht kompatibel.
Smartphone/Tablet: Egal ob Apple oder Android, dieses eBook können Sie lesen. Sie benötigen dafür einen PDF-Viewer - z.B. die kostenlose Adobe Digital Editions-App.
Buying eBooks from abroad
For tax law reasons we can sell eBooks just within Germany and Switzerland. Regrettably we cannot fulfill eBook-orders from other countries.
aus dem Bereich