Automatic Classification and Prediction of Attitudes: Audio - Visual Analysis of Video Blogs

  • Conference paper
Speech and Computer (SPECOM 2015)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 9319)

This paper reports a study of automatic attitude recognition on a collection of over 500 segments from our video blog data. We annotated and analysed three attitudinal states of the speakers, then extracted and analysed the prosodic and visual features relevant to the classification task. We used machine learning methods to gain a better understanding of the feature sets and their contribution to the prediction model.



This work is supported by the English Language and Literature Department, UPSI, Ministry of Education Malaysia, Center for Global Intelligent Content (CNGL) at TCD and the Speech Communication Laboratory at TCD.

Author information

Corresponding author

Correspondence to Yuyun Huang.

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Madzlan, N.A., Huang, Y., Campbell, N. (2015). Automatic Classification and Prediction of Attitudes: Audio - Visual Analysis of Video Blogs. In: Ronzhin, A., Potapova, R., Fakotakis, N. (eds) Speech and Computer. SPECOM 2015. Lecture Notes in Computer Science, vol 9319. Springer, Cham.

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-23131-0

  • Online ISBN: 978-3-319-23132-7

  • eBook Packages: Computer Science, Computer Science (R0)
