How do you feel? Using natural language processing to automatically rate emotion in psychotherapy

Michael J Tanana; Christina S Soma; Patty B Kuo; Nicolas M Bertagnolli; Aaron Dembe; Brian T Pace; Vivek Srikumar; David C Atkins; Zac E Imel

doi:10.3758/s13428-020-01531-z

How do you feel? Using natural language processing to automatically rate emotion in psychotherapy

Behav Res Methods. 2021 Oct;53(5):2069-2082. doi: 10.3758/s13428-020-01531-z. Epub 2021 Mar 22.

Authors

Michael J Tanana¹, Christina S Soma², Patty B Kuo³, Nicolas M Bertagnolli⁴, Aaron Dembe³, Brian T Pace⁵, Vivek Srikumar⁶, David C Atkins⁷, Zac E Imel³

Affiliations

¹ Social Research Institute, University of Utah, Salt Lake City, UT, USA.
² Department of Educational Psychology, University of Utah, Salt Lake City, UT, USA. tsoma15@gmail.com.
³ Department of Educational Psychology, University of Utah, Salt Lake City, UT, USA.
⁴ https://www.empathy.rocks/, Seattle, WA, USA.
⁵ Lyssn.io, Seattle, WA, USA.
⁶ School of Computing, University of Utah, Salt Lake City, UT, USA.
⁷ Department of Psychiatry and Behavioral Sciences, University of Washington, Seattle, WA, USA.

Abstract

Emotional distress is a common reason for seeking psychotherapy, and sharing emotional material is central to the process of psychotherapy. However, systematic research examining patterns of emotional exchange that occur during psychotherapy sessions is often limited in scale. Traditional methods for identifying emotion in psychotherapy rely on labor-intensive observer ratings, client or therapist ratings obtained before or after sessions, or involve manually extracting ratings of emotion from session transcripts using dictionaries of positive and negative words that do not take the context of a sentence into account. However, recent advances in technology in the area of machine learning algorithms, in particular natural language processing, have made it possible for mental health researchers to identify sentiment, or emotion, in therapist-client interactions on a large scale that would be unattainable with more traditional methods. As an attempt to extend prior findings from Tanana et al. (2016), we compared their previous sentiment model with a common dictionary-based psychotherapy model, LIWC, and a new NLP model, BERT. We used the human ratings from a database of 97,497 utterances from psychotherapy to train the BERT model. Our findings revealed that the unigram sentiment model (kappa = 0.31) outperformed LIWC (kappa = 0.25), and ultimately BERT outperformed both models (kappa = 0.48).

Keywords: Emotion; Emotion coding; Natural language processing; Psychotherapy process; Sentiment analysis.

Publication types

Research Support, N.I.H., Extramural

MeSH terms

Emotions
Humans
Language
Machine Learning
Natural Language Processing*
Psychotherapy*

Abstract

Publication types

MeSH terms

Grants and funding