Deep Learning Methods for Processing Endoscopic High-Speed Video and Laryngeal Parameter Estimation
Seiten
- Keine Verlagsinformationen verfügbar
- Artikel merken
Deep learning methods have had tremendous impact in computer vision, image processing and all areas that relate to these fields. This dissertation explores the application of these methods to the enhancement and processing of endoscopic high-speed video (HSV).
HSV is one of the main technique used in voice research as the small-scale, rapid oscillation of the vocal folds requires sophisticated recording techniques. As voice disorders have been shown to have a tremendous negative impact on the quality of life of the affected and society in general, a new generation of more objective diagnostic techniques is required. This dissertation features several contributions towards this goal:
- An innovative method to enhance low-light HSV using an improved U-Net convolutional neural network
- A robust and fast deep-learning-based automatic method for the segmentation of the glottis in HSV data
- Development of an improved two-mass-model of the vocal folds
- Proof of concept of estimating ex-vivo subglottal pressure validated on experimental data
- Proof of concept of estimating subglottal pressure with a recurrent neural network trained on a numerical model
After a thorough introduction to the field of voice research and deep learning the dissertation describes the developed methods and results in detail. The dissertation describes signifcant improvements in regard to low-light image enhancement, automatic glottis segmentation physical voice parameter inference.
HSV is one of the main technique used in voice research as the small-scale, rapid oscillation of the vocal folds requires sophisticated recording techniques. As voice disorders have been shown to have a tremendous negative impact on the quality of life of the affected and society in general, a new generation of more objective diagnostic techniques is required. This dissertation features several contributions towards this goal:
- An innovative method to enhance low-light HSV using an improved U-Net convolutional neural network
- A robust and fast deep-learning-based automatic method for the segmentation of the glottis in HSV data
- Development of an improved two-mass-model of the vocal folds
- Proof of concept of estimating ex-vivo subglottal pressure validated on experimental data
- Proof of concept of estimating subglottal pressure with a recurrent neural network trained on a numerical model
After a thorough introduction to the field of voice research and deep learning the dissertation describes the developed methods and results in detail. The dissertation describes signifcant improvements in regard to low-light image enhancement, automatic glottis segmentation physical voice parameter inference.
Erscheinungsdatum | 09.08.2019 |
---|---|
Reihe/Serie | Kommunikationsstörungen - Berichte aus Phoniatrie und Pädandiologie ; 27 |
Verlagsort | Düren |
Sprache | englisch |
Maße | 148 x 210 mm |
Gewicht | 219 g |
Themenwelt | Medizin / Pharmazie ► Medizinische Fachgebiete ► HNO-Heilkunde |
Schlagworte | Automatic segmentation • Deep learning • High-speed Videoendoscopy • High-Speed Videoendoskopie • Image Processing • Neuronale Netzwerke • Phoniatrie • Recurrent Neural Network • Vocal Fold Models • Voice Parameter Estimation |
ISBN-10 | 3-8440-6845-7 / 3844068457 |
ISBN-13 | 978-3-8440-6845-0 / 9783844068450 |
Zustand | Neuware |
Haben Sie eine Frage zum Produkt? |
Mehr entdecken
aus dem Bereich
aus dem Bereich
ein Kompendium von Praktikern für Praktiker
Buch (2023)
Thieme (Verlag)
CHF 459,95
Differenzierte Diagnostik und Therapie
Buch | Hardcover (2021)
Springer (Verlag)
CHF 249,95
Buch | Softcover (2022)
Median-Verlag von Killisch-Horn GmbH
CHF 74,20