A Weakly Supervised Learning Framework for Detecting Social Anxiety and Depression
Asif Salekin,Jeremy W. Eberle,Jeffrey J. Glenn,Bethany A. Teachman,John A. Stankovic +4 more
- 05 Jul 2018
- Vol. 2, Iss: 2, pp 81
TL;DR: A novel feature modeling technique named NN2Vec is presented that identifies and exploits the inherent relationship between speakers' vocal states and symptoms/affective states and achieves F-1 scores 17% and 13% higher than those of the best available baselines.
read more
Abstract: Although social anxiety and depression are common, they are often underdiagnosed and undertreated, in part due to difficulties identifying and accessing individuals in need of services. Current assessments rely on client self-report and clinician judgment, which are vulnerable to social desirability and other subjective biases. Identifying objective, nonburdensome markers of these mental health problems, such as features of speech, could help advance assessment, prevention, and treatment approaches. Prior research examining speech detection methods has focused on fully supervised learning approaches employing strongly labeled data. However, strong labeling of individuals high in symptoms or state affect in speech audio data is impractical, in part because it is not possible to identify with high confidence which regions of a long speech indicate the person's symptoms or affective state. We propose a weakly supervised learning framework for detecting social anxiety and depression from long audio clips. Specifically, we present a novel feature modeling technique named NN2Vec that identifies and exploits the inherent relationship between speakers' vocal states and symptoms/affective states. Detecting speakers high in social anxiety or depression symptoms using NN2Vec features achieves F-1 scores 17% and 13% higher than those of the best available baselines. In addition, we present a new multiple instance learning adaptation of a BLSTM classifier, named BLSTM-MIL. Our novel framework of using NN2Vec features with the BLSTM-MIL classifier achieves F-1 scores of 90.1% and 85.44% in detecting speakers high in social anxiety and depression symptoms.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Machine Learning in Mental Health: A Systematic Review of the HCI Literature to Support the Development of Effective and Implementable ML Systems
TL;DR: This article presents an introduction to, and a systematic review of, current ML work regarding psycho-socially based mental health conditions from the computing and HCI literature, and reflects on the current state-of-the-art of ML work for mental health.
279
MFCC-based Recurrent Neural Network for automatic clinical depression recognition and assessment from speech
TL;DR: In this paper , a deep recurrent neural network-based framework is presented to detect depression and to predict its severity level from speech, which has several advantages such as fastness, non-invasiveness, and non-intrusion.
150
•Posted Content
MFCC-based Recurrent Neural Network for Automatic Clinical Depression Recognition and Assessment from Speech
TL;DR: A deep recurrent neural network-based framework is presented to detect depression and to predict its severity level from speech and promising results are obtained.
140
A survey on big data-driven digital phenotyping of mental health
TL;DR: The vision of digital phenotyping of mental health (DPMH) is outlined by fusing the enriched data from ubiquitous sensors, social media and healthcare systems, and a broad overview of DPMH from sensing and computing perspectives is presented.
139
AudVowelConsNet: A phoneme-level based deep CNN architecture for clinical depression diagnosis
Muhammad Muzammel,Hanan Salam,Yann Hoffmann,Mohamed Chetouani,Alice Othmani +4 more
- 01 Dec 2020
TL;DR: This paper investigates the acoustic characteristics of phoneme units, specifically vowels and consonants for depression recognition via Deep Learning, and presents and compares three spectrogram-based Deep Neural Network architectures, trained on phoneme consonant and vowel units and their fusion respectively.
75
References
Diagnostic and Statistical Manual of Mental Disorders
Vijay A. Mittal,Elaine F. Walker +1 more
TL;DR: An issue concerning the criteria for tic disorders is highlighted, and how this might affect classification of dyskinesias in psychotic spectrum disorders.
Glove: Global Vectors for Word Representation
Jeffrey Pennington,Richard Socher,Christopher D. Manning +2 more
- 01 Oct 2014
TL;DR: A new global logbilinear regression model that combines the advantages of the two major model families in the literature: global matrix factorization and local context window methods and produces a vector space with meaningful substructure.
A rating scale for depression
TL;DR: The present scale has been devised for use only on patients already diagnosed as suffering from affective disorder of depressive type, used for quantifying the results of an interview, and its value depends entirely on the skill of the interviewer in eliciting the necessary information.
32.8K
•Journal Article
Diagnostic and Statistical Manual of Mental Disorders (DSM-5)
TL;DR: Diagnostic and statistical manual of mental disorders (DSM-5) was translated by psychiatrists and psychologists, mainly from the University psychiatric hospital Vrapce and published by the Naklada Slap publisher.
15.8K
•Posted Content
Empirical evaluation of gated recurrent neural networks on sequence modeling
TL;DR: These advanced recurrent units that implement a gating mechanism, such as a long short-term memory (LSTM) unit and a recently proposed gated recurrent unit (GRU), are found to be comparable to LSTM.
14.1K