Perceptual equivalence of the Liljencrants–Fant and linear-filter glottal flow models

Tools

Perrotin, Olivier, Feugère, Lionel ORCID: https://orcid.org/0000-0003-0883-5224 and d'Alessandro, Christophe (2021) Perceptual equivalence of the Liljencrants–Fant and linear-filter glottal flow models. The Journal of the Acoustical Society of America, 150 (2):1273. ISSN 0001-4966 (doi:10.1121/10.0005879)

[thumbnail of Author's published manuscript]

Preview

PDF (Author's published manuscript)
33628_FEUGERE_Perceptual_equivalence_of_the_liljencrants_fant.pdf - Published Version
Available under License Creative Commons Attribution.
Download (3MB) | Preview

Official URL: https://doi.org/10.1121/10.0005879

Abstract

Speech glottal flow has been predominantly described in the time-domain in past decades, the Liljencrants–Fant (LF) model being the most widely used in speech analysis and synthesis, despite its computational complexity. The causal/ anti-causal linear model (LFCALM) was later introduced as a digital filter implementation of LF, a mixed-phase spectral model including both anti-causal and causal filters to model the vocal-fold open and closed phases, respectively. To further simplify computation, a causal linear model (LFLM) describes the glottal flow with a fully causal set of filters. After expressing these three models under a single analytic formulation, we assessed here their perceptual consistency, when driven by a single parameter Rd related to voice quality. All possible paired combinations of signals generated using six Rd levels for each model were presented to subjects who were asked whether the two signals in each pair differed. Model pairs LFLM–LFCALM were judged similar when sharing the same Rd value, and LF was considered the same as LFLM and LFCALM given a consistent shift in Rd. Overall, the similarity between these models encourages the use of the simpler and more computationally efficient models LFCALM and LFLM in speech synthesis applications.

Item Type:	Article
Uncontrolled Keywords:	glottal flow model, speech synthesis, singing synthesis, voice perception
Subjects:	Q Science > QA Mathematics > QA75 Electronic computers. Computer science Q Science > QM Human anatomy
Faculty / School / Research Centre / Research Group:	Faculty of Engineering & Science Faculty of Engineering & Science > Natural Resources Institute Faculty of Engineering & Science > Natural Resources Institute > Agriculture, Health & Environment Department Faculty of Engineering & Science > Natural Resources Institute > Pest Behaviour Research Group Faculty of Engineering & Science > Natural Resources Institute > Centre for Sustainable Agriculture 4 One Health Faculty of Engineering & Science > Natural Resources Institute > Centre for Sustainable Agriculture 4 One Health > Behavioural Ecology
Last Modified:	27 Nov 2024 14:29
URI:	https://gala.gre.ac.uk/id/eprint/33628

Actions (login required)

View Item

Downloads

Downloads per month over past year

View more statistics

Altmetric