Investigating the robustness of a methodology for determining the base value of the fundamental frequency

Pablo Arantes; Maria Érica do Nascimento Linhares

doi:10.17851/2237-2083.26.2.535-570


Abstract

Abstract (translated from the Portuguese): This paper tests the robustness of a methodology proposed by the Swedish phoneticians Traunmüller and Eriksson for determining the base value, a statistical estimator of a speaker's typical fundamental frequency (F0) computed from the mean and standard deviation of F0. The methodology consists of estimating a constant, k, that indicates how many standard deviations below the speaker's mean F0 the base value lies. The method for estimating the constant was developed and tested on samples of acted speech. Here we check whether applying the same technique to samples of non-acted speech yields results comparable to those reported by Traunmüller and Eriksson. The study used speech samples produced by native speakers of German, Estonian, French, British English, Italian, Brazilian Portuguese, and Swedish in three speaking styles: interview, sentence reading, and word reading. The results indicate that the F0 variability induced by speaking style makes it possible to apply the methodology to non-acted speech. The values of the constant derived from the non-acted data are close to those reported by the Swedish authors, which indicates that the methodology is robust across both speakers and languages.

Keywords: intonation; base value; fundamental frequency.

Abstract: This paper probes the robustness of Traunmüller and Eriksson’s methodology for determining the base value of the fundamental frequency (F0) of speech, an estimator of a speaker’s typical F0 value. The methodology entails estimating a constant, k, that indicates how many F0 standard deviations below the speaker’s F0 mean the base value lies. The methodology was originally developed from acted speech samples. Here we test whether k values can be successfully obtained from non-acted samples and how they compare to the ones reported by Traunmüller and Eriksson. We used a corpus of speech samples in three speaking styles (spontaneous interview, sentence reading, word list reading) from seven languages (English, Estonian, French, German, Italian, Brazilian Portuguese, Swedish). Results show that k values estimated from non-acted speech are roughly the same as those reported in Traunmüller and Eriksson’s original paper. We speculate that deviations can be explained by the fact that some speakers make extensive use of non-modal register.

Keywords: intonation; base value; fundamental frequency.
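
The relation described in the abstract, a base value lying k standard deviations below a speaker's mean F0, can be illustrated with a minimal Python sketch. The function names, the sample F0 values, and the value of k used here are hypothetical and chosen only for illustration; they are not taken from the paper or from Traunmüller and Eriksson. The sketch also assumes the sample (n - 1) standard deviation and F0 in Hz, neither of which the abstract specifies.

```python
from statistics import mean, stdev


def base_value(f0_values, k):
    """Return mean(F0) - k * SD(F0) for one speaker's F0 samples.

    Assumption: the sample standard deviation (n - 1 denominator) is used;
    the abstract does not say which variant the methodology relies on.
    """
    if len(f0_values) < 2:
        raise ValueError("need at least two F0 samples to compute a standard deviation")
    return mean(f0_values) - k * stdev(f0_values)


def implied_k(f0_values, base):
    """Invert the relation: how many SDs below the mean a given base value lies."""
    sd = stdev(f0_values)
    if sd == 0:
        raise ValueError("F0 samples have zero variance; k is undefined")
    return (mean(f0_values) - base) / sd


# Hypothetical F0 samples in Hz (illustrative only, not data from the study).
f0_hz = [110.0, 120.0, 130.0, 140.0, 150.0]

# Hypothetical constant; the values actually estimated are reported in the paper.
K_ILLUSTRATIVE = 1.5

base = base_value(f0_hz, K_ILLUSTRATIVE)
print(f"mean = {mean(f0_hz):.1f} Hz, SD = {stdev(f0_hz):.2f} Hz, base value = {base:.2f} Hz")
print(f"k recovered from the base value: {implied_k(f0_hz, base):.2f}")
```

The second function is just the algebraic inverse of the first, so it recovers k only when a base value is already known; it is not the estimation procedure that the paper evaluates.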



References

ARANTES, Pablo; LINHARES, Maria E. N. Efeito da língua, estilo de elocução e sexo do falante sobre medidas globais da frequência fundamental. Letras de Hoje, PUCRS, v. 52, n. 1, p. 26-39, 2017. Doi: http://dx.doi.org/10.15448/1984-7726.2017.1.25419

BOERSMA, Paul. Praat, a system for doing phonetics by computer. Glot International, Elsevier, v. 5, n. 9/10, p. 341-345, 2001.

BOERSMA, Paul; KOVACIC, Gordana. Spectral characteristics of three styles of Croatian folk singing. Journal of the Acoustical Society of America, Acoustical Society of America, v. 119, p. 1805-1816, 2006. Doi: https://doi.org/10.1121/1.2168549

DE LOOZE, Céline; HIRST, Daniel J. The OMe (Octave-Median) scale: A natural scale for speech melody. 2014, Dublin: [s.n.], 2014. p. 910-914.

ERIKSSON, Anders. Aural/Acoustic vs. Automatic Methods in Forensic Phonetic Case Work. In: NEUSTEIN, A.; PATIL, H. A. (Org.). Forensic Speaker Recognition: Law Enforcement and Counter-terrorism. [S.l.]: Springer, 2011. p. 41-70.

ESKÉNAZI, Maxine. Trends in Speaking Styles Research. 1993, Berlin: ISCA, 1993. p. 501-509.

FUJISAKI, H.; HIROSE, K. Analysis of voice fundamental frequency contours for declarative sentences of Japanese. Journal of the Acoustical Society of Japan, Acoustical Society of Japan, v. 5, n. 4, p. 233-242, 1984. Doi: 10.1250/ast.5.233

GÅRDING, Eva. A Generative Model of Intonation. In: CUTLER, A.; LADD, D. R. (Org.). Prosody: Models and Measurements. Berlin: Springer-Verlag, 1983. p. 11-25.

HIRST, Daniel J. Prosodic aspects of speech and language. In: BROWN, K. (Org.). Encyclopedia of Language and Linguistics. [S.l.]: Elsevier Science, 2005. v. X. p. 167-178.

HIRST, Daniel J. The Analysis by Synthesis of Speech Melody: from Data to Models. Journal of Speech Sciences, Unicamp, v. 1, n. 1, p. 55-83, 2011.

HOLLIEN, Harry; HOLLIEN, Patricia; JONG, Gea De. Effects of three parameters on speaking fundamental frequency. Journal of the Acoustical Society of America, Acoustical Society of America, v. 102, n. 5, p. 2984-2992, 1997. Doi: https://doi.org/10.1121/1.420353

JASSEM, Wiktor. Normalisation of F0 curves. In: FANT, G.; TATHAM, M. A. A. (Org.). Auditory Analysis and Perception of Speech. London: Academic Press, 1975. p. 523-530.

JESSEN, Michael. Forensic Phonetics. Language and Linguistics Compass, Wiley Online Library, v. 2, n. 4, p. 671-711, 2008. Doi: 10.1111/j.1749-818X.2008.00066.x

JESSEN, Michael; KÖSTER, Olaf; GFROERER, Stefan. Influence of vocal effort on average and variability of fundamental frequency. Speech, Language and the Law, Equinox Publishing, v. 12, n. 2, p. 174-213, 2005. Doi: 10.1558/sll.2005.12.2.174

KENNEY, J. F.; KEEPING, E. S. Mathematics of Statistics. [S.l.]: Van Nostrand, 1962. p. 50-54.

LINDH, Jonas; ERIKSSON, Anders. Robustness of Long Time Measures of Fundamental Frequency. 2007, Antwerp, Belgium: [s.n.], 2007. p. 2025-2028.

LLISTERRI, Joaquim. Speaking styles in speech research. 1992, Dublin, Ireland: [s.n.], 1992.

MAIDMENT, J. A.; LECUMBERRI, M. L. Pitch analysis methods for cross-speaker comparison. 1996, Delaware: [s.n.], 1996.

MIXDORFF, Hansjörg. Extraction, Analysis and Synthesis of Fujisaki Model Parameters. In: HIROSE, K.; TAO, J. (Org.). Speech Prosody in Speech Synthesis: Modeling and generation of prosody for high quality and flexible speech synthesis. Berlin: Springer, 2015. p. 35-47.

NOLAN, Francis. Intonation in speaker identification: an experiment on pitch alignment features. Forensic Linguistics, International Association for Forensic Phonetics and Acoustics, v. 9, n. 1, p. 3-21, 2002. Doi: 10.1558/sll.2002.9.1.1

NOLAN, Francis. The Phonetic Bases of Speaker Recognition. Cambridge, UK: Cambridge University Press, 1993.

ROSE, Philip. How effective are long term mean and standard deviation as normalisation parameters for tonal fundamental frequency? Speech Communication, Elsevier, v. 10, n. 3, p. 229-247, 1991. Doi: https://doi.org/10.1016/0167-6393(91)90014-K

SCHULTZ, Tanja. Speaker Characteristics. In: MÜLLER, C. (Org.). Speaker Classification I: Fundamentals, Features, and Methods. [S.l.]: Springer, 2007. p. 47-74.

STEVENS, S. S. On the theory of scales of measurement. Science, American Association for the Advancement of Science, v. 103, n. 2684, p. 677-680, 1946. Doi: 10.1126/science.103.2684.677

TITZE, Ingo. Principles of voice production. Englewood Cliffs: Prentice Hall, 1994.

TRAUNMÜLLER, Hartmut. Conventional, biological and environmental factors in speech communication: a modulation theory. Phonetica, Karger Publishers, v. 51, p. 170-183, 1994. Doi: 10.1159/000261968

TRAUNMÜLLER, Hartmut; ERIKSSON, Anders. The frequency range of the voice fundamental in the speech of male and female adults. [s.d.].

VAISSIÈRE, J. Language-Independent Prosodic Features. In: CUTLER, A.; LADD, D. R. (Org.). Prosody: Models and Measurements. Berlin: Springer-Verlag, 1983. p. 53-66.
