Bookbot

Springer Series in Information Sciences - 3: Pitch Determination of Speech Signals

Algorithms and Devices

En savoir plus sur le livre

Pitch, encompassing fundamental frequency (FO) and fundamental period (TO), is crucial in the acoustic speech signal, primarily determining prosodic information. The ear is significantly more sensitive to changes in fundamental frequency than to other speech parameters. Consequently, the quality of vocoded speech relies heavily on accurate pitch measurement, underscoring the need for reliable methods. Although it may seem straightforward to detect the fundamental frequency or period of a quasi-periodic signal, pitch determination is one of the most challenging problems in speech analysis for several reasons. Firstly, speech is inherently nonstationary; the vocal tract's position can change abruptly, causing significant variations in the signal's temporal structure, making the assumption of a quasi-periodic signal often unrealistic. Secondly, the flexibility of the human vocal tract and the diversity of voices contribute to a wide range of possible temporal structures. Additionally, narrow-band formants at low harmonics, particularly at the second or third harmonic, complicate the analysis further. Lastly, for any given speech signal from an unknown speaker, the fundamental frequency can fluctuate across a range of nearly four octaves (50 to 800 Hz), adding to the complexity of accurate pitch determination.

Achat du livre

Springer Series in Information Sciences - 3: Pitch Determination of Speech Signals, Wolfgang Heß

Langue
Année de publication
1983
product-detail.submit-box.info.binding
(rigide)
Nous vous informerons par e-mail dès que nous l’aurons retrouvé.

Modes de paiement

Personne n'a encore évalué .Évaluer

Titre
Springer Series in Information Sciences - 3: Pitch Determination of Speech Signals
Sous-titre
Algorithms and Devices
Langue
Anglais
Éditeur
Springer
Publié
1983
Format
rigide
Pages
700
ISBN10
3540119337
ISBN13
9783540119333
Séries
Description
Pitch, encompassing fundamental frequency (FO) and fundamental period (TO), is crucial in the acoustic speech signal, primarily determining prosodic information. The ear is significantly more sensitive to changes in fundamental frequency than to other speech parameters. Consequently, the quality of vocoded speech relies heavily on accurate pitch measurement, underscoring the need for reliable methods. Although it may seem straightforward to detect the fundamental frequency or period of a quasi-periodic signal, pitch determination is one of the most challenging problems in speech analysis for several reasons. Firstly, speech is inherently nonstationary; the vocal tract's position can change abruptly, causing significant variations in the signal's temporal structure, making the assumption of a quasi-periodic signal often unrealistic. Secondly, the flexibility of the human vocal tract and the diversity of voices contribute to a wide range of possible temporal structures. Additionally, narrow-band formants at low harmonics, particularly at the second or third harmonic, complicate the analysis further. Lastly, for any given speech signal from an unknown speaker, the fundamental frequency can fluctuate across a range of nearly four octaves (50 to 800 Hz), adding to the complexity of accurate pitch determination.