Book, English, 136 pages, Format (W × H): 155 mm × 235 mm, Weight: 236 g
Series: SpringerBriefs in Speech Technology
ISBN: 978-3-030-02758-2
Publisher: Springer International Publishing
This book presents a statistical parametric speech synthesis (SPSS) framework for building a speech synthesis system in which the desired speech is generated from vocal tract and excitation source parameters. Throughout the book, the authors discuss novel source modeling techniques that enhance the naturalness and overall intelligibility of the SPSS system, and they provide several methods and models for generating the excitation source parameters to improve the overall quality of synthesized speech. The contents are useful for both researchers and system developers. Researchers can use the book to learn about current state-of-the-art excitation source models for SPSS and to refine those source models further to incorporate the realistic semantics present in the text. System developers can use it to integrate the sophisticated excitation source models it describes into the latest models of mobile phones and smartphones.
Target audience
Research
Authors/Editors
Subject areas
- Engineering Sciences Other Technologies | Applied Engineering Signal Processing, Image Processing, Scanning
- Mathematics | Computer Science IT | Computer Science Computer Science Audio Signal Processing
- Mathematics | Computer Science IT | Computer Science Computer Science Artificial Intelligence Speech Recognition, Speech Processing
Further Information & Material
Chapter 1. Introduction.- Chapter 2. Background and literature review.- Chapter 3. Robust voicing detection and F0 estimation method.- Chapter 4. Parametric approach of modeling the source signal.- Chapter 5. Hybrid approach of modeling the source signal.- Chapter 6. Generation of creaky voice.- Chapter 7. Summary and conclusions.