HNM-based DSP (Digital Signal Processing) module implementation of a TTS system

URI: https://www.openarchives.gr/aggregator-openarchives/edm/nemertes/000009-10889_159

This item is provided by the institution :
University of Patras

Repository :
Nemertes


Title

Υλοποίηση βαθμίδας ΨΕΣ (Ψηφιακής Επεξεργασίας Σήματος) συστήματος σύνθεσης ομιλίας με βάση τον αλγόριθμο ΗΝΜ (EL)

HNM-based DSP (Digital Signal Processing) module implementation of a TTS system (EN)

Creator

Βασιλόπουλος, Ιωάννης

Vasilopoulos, Ioannis (EN)

Contributor

Φακωτάκης, Νίκος

Στουραίτης, Αθανάσιος

Μουρτζόπουλος, Ιωάννης

Issued

2005-02-27

2007-05-16T11:27:32Z

Year

2005 (EN)

Description

A TTS (Text-To-Speech) system converts any given text to the corresponding speech with natural characteristics. A TTS system consists of two modules: the Natural Language Processing (NLP) module and the Digital Signal Processing (DSP) module. The NLP module analyses the input text and supplies the DSP module with the appropriate phonemes and prosodic modifications, specifying the pitch, duration and volume of each phoneme. The DSP module then synthesizes speech with the target prosody, using speech analysis-synthesis algorithms such as HNM. The HNM (Harmonic plus Noise Model) algorithm models the speech signal as the sum of two parts, the harmonic part and the noise part. Speech analysis and speech synthesis, with or without modifications, are achieved using the harmonic and the noise parts. (EN)

A TTS (Text-To-Speech) system converts any given text into the corresponding speech, which has natural characteristics. The TTS system consists of two modules, the Natural Language Processing (NLP) module and the Digital Signal Processing (DSP) module. The NLP module is responsible for correctly analysing the input text into phonemes and for determining the desired prosodic characteristics, such as the pitch, duration and intensity of each phoneme. The DSP module synthesizes the speech with the desired prosodic characteristics supplied by the NLP module. One way to achieve this is to use speech analysis and synthesis algorithms such as HNM (Harmonic plus Noise Model). HNM models the speech signal as the sum of two parts, one with harmonic characteristics and one with noise characteristics. Using this model, the speech signal is analysed and synthesized with or without prosodic modifications.
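
The harmonic-plus-noise decomposition described above can be illustrated with a minimal sketch. This is not the thesis implementation: it assumes a single stationary frame, constant harmonic amplitudes and plain white Gaussian noise, whereas a full HNM system estimates time-varying parameters and shapes the noise spectrally. The function name `synthesize_hnm_frame` and the parameters `pitch_scale`, `duration_scale` and `gain` are hypothetical, standing in for the pitch, duration and volume targets that the NLP module would supply.

```python
import math
import random


def synthesize_hnm_frame(f0, harmonic_amps, noise_gain,
                         pitch_scale=1.0, duration_scale=1.0, gain=1.0,
                         sample_rate=16000, duration=0.02, seed=0):
    """Return (harmonic, noise, total) sample lists for one frame.

    Simplifying assumptions (not from the thesis): stationary frame,
    zero-phase cosines, unfiltered white noise.
    """
    rng = random.Random(seed)
    # Duration modification: stretch or shrink the number of samples.
    n = round(sample_rate * duration * duration_scale)
    # Pitch modification: move the fundamental, harmonics follow it.
    target_f0 = f0 * pitch_scale

    # Harmonic part: sum of cosines at integer multiples of the fundamental.
    harmonic = [
        gain * sum(a * math.cos(2 * math.pi * (k + 1) * target_f0 * t / sample_rate)
                   for k, a in enumerate(harmonic_amps))
        for t in range(n)
    ]
    # Noise part: white Gaussian noise scaled by noise_gain.
    noise = [gain * noise_gain * rng.gauss(0.0, 1.0) for _ in range(n)]
    # The modelled signal is the sample-wise sum of the two parts.
    total = [h + w for h, w in zip(harmonic, noise)]
    return harmonic, noise, total


if __name__ == "__main__":
    h, w, s = synthesize_hnm_frame(120.0, [1.0, 0.5, 0.25], 0.05,
                                   pitch_scale=1.2, duration_scale=2.0)
    print(len(s), "samples synthesized")
```

Setting `pitch_scale`, `duration_scale` and `gain` to 1.0 corresponds to synthesis without prosodic modification; other values change the fundamental frequency, the frame length and the overall level respectively.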

Scientific field

Natural Sciences
Computer and Information Sciences (EN)

Subject

621.382 23

TTS (EN)

Text To Speech (EN)

Speech analysis (EN)

HNM (EN)

Harmonic plus Noise Model (EN)

Speech synthesis (EN)

Speech analysis (Ανάλυση ομιλίας)

Speech synthesis (Σύνθεση ομιλίας)

School / Department / Institute

University of Patras ▶ School of Engineering
Department of Computer Engineering and Informatics

Provider

University of Patras

Repository / collection

Nemertes

Subcollections

Postgraduate Theses

Department of Computer Engineering and Informatics (MSc Theses)

1. Διατριβές & Εργασίες | Theses & Dissertations
