Perception of emotional prosody: investigating the relation between the discrete and dimensional approaches to emotions

Wellington da Silva, Plinio Almeida Barbosa

Abstract


Abstract: Emotional phenomena can be described according to various psychological approaches, the most widely adopted being the discrete (basic) and the dimensional ones. This study aimed to investigate the relation between the perception of some basic emotions and of emotional dimensions in speech, and to determine which acoustic cues are related to their perception. We conducted two perception experiments with utterances selected from a foreign language (Swedish) of which the listeners had no knowledge. In the first, Brazilian subjects rated on 5-point scales the expressivity of four basic emotions: joy, anger, sadness, and calmness. In the second, a distinct group of Brazilian subjects rated the expressivity of five emotional dimensions: activation, fairness, valence, motivation, and involvement. The perception of the basic emotions and of the emotional dimensions was then compared using Spearman's correlation coefficient. All five emotional dimensions correlated significantly, to some degree, with the basic emotions, and these correlations were, in general, consistent with the literature and with the hypotheses that guided this study. We also performed an acoustic analysis, in which twelve acoustic parameters were automatically computed for the utterances evaluated by the listeners. The parameters that correlated best with the listeners' judgments were fundamental frequency (median, interquantile semi-amplitude, 99.5% quantile), spectral tilt (mean and standard deviation), and LTAS slope. We concluded that it is possible to describe the perception of basic emotions in speech as a combination of emotional dimensions, and that emotional dimensions may be better suited to describing the expression of emotions in speech.

Keywords: emotional prosody; basic emotions; emotional dimensions; perception test.
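The comparison step described in the abstract, correlating ratings of a basic emotion with ratings of an emotional dimension across utterances, can be illustrated with a minimal sketch of Spearman's rank correlation. This is not the authors' analysis script: the function names `average_ranks` and `spearman_rho` are hypothetical, the rating values are invented for illustration, and the use of one mean rating per utterance is an assumption rather than something the abstract states.

```python
from statistics import mean


def average_ranks(values):
    """Rank values from 1 to n, giving tied values the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        # Extend the run while the next sorted value ties with the current one.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        tied_rank = (start + end) / 2 + 1  # mean of 1-based positions in the run
        for k in range(start, end + 1):
            ranks[order[k]] = tied_rank
        start = end + 1
    return ranks


def spearman_rho(x, y):
    """Spearman's rho: Pearson correlation computed on the ranks of x and y."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    rx, ry = average_ranks(x), average_ranks(y)
    mx, my = mean(rx), mean(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    if var_x == 0 or var_y == 0:
        raise ValueError("rho is undefined when one variable is constant")
    return cov / (var_x * var_y) ** 0.5


# Hypothetical ratings on a 5-point scale, one value per utterance
# (invented for illustration; not data from the study).
anger_ratings = [4.2, 1.1, 3.5, 1.8, 2.9, 1.3]
activation_ratings = [4.5, 1.6, 3.9, 1.5, 3.1, 2.0]

rho = spearman_rho(anger_ratings, activation_ratings)
print(f"rho(anger, activation) = {rho:.3f}")
```

Because the coefficient operates on ranks rather than raw scores, it suits ordinal data such as 5-point ratings; the sketch averages tied ranks, which matters when many utterances receive the same score.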









DOI: http://dx.doi.org/10.17851/2237-2083.25.3.1075-1103




Copyright (c) 2017 Wellington da Silva, Plinio Almeida Barbosa

Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 International License.

e - ISSN 2237-2083 
