RESEARCH PRODUCTS CATALOG

Speaker adaptive voice source modeling with applications to speech coding and processing

Drioli, Carlo; Calanca, Andrea

2014-01-01

Abstract

We discuss the use of low-dimensional physical models of the voice source for speech coding and processing applications. We illustrate a class of waveform-adaptive dynamic glottal models together with procedures for identifying their parameters. The models and the identification procedures are assessed through signal transformations on recorded speech, obtained by fitting the model to the data and then acting on the physically oriented parameters of the voice source. In principle, the proposed class of models provides a tool both for estimating glottal source signals and for encoding the speech signal for transformation purposes. The application of the model to time stretching and to fundamental frequency control (pitch shifting) is also illustrated. The experiments show that copy synthesis is perceptually very similar to the target, and that time stretching and "pitch extrapolation" effects can be obtained with simple control strategies.

Year: 2014
Keywords: Glottal modeling, Model inversion, Model-based transformations, Speech synthesis and processing
Appears in types: 01.01 Journal article

Files in this product:

File	Type	License	Size	Format
AS967468478668801400077654900_content_1.pdf (not available)	Pre-print document	Restricted access	2.97 MB	Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11562/977000
