Lasse Hansen
Master student of Cognitive Science at Aarhus University, Denmark
Intern at Data Science 1 since late August 2020
Supervised by Yan-Ping Zhang, Detlef Wolf, and Riccardo Fusaroli (AU)
Feeling sad
Loss of interest and energy
Difficulty concentrating
Fatigue
Stomach aches
Psychomotor retardation
Slowing of thought and speech
Increased tension in the vocal tract
→ Subtle changes in voice quality
Removed audio for data privacy
Use emotion recognition model to predict depression
Controls and depression at 2 visits 6 months
apart
Only those in remission at 6 month follow up
Diagnosis | Gender | N | Hamilton mean | Hamilton SD | Age mean |
---|---|---|---|---|---|
Visit 1 | |||||
Controls | f | 33 | 1.6 | 1.4 | 32.3 |
Controls | m | 9 | 1.8 | 1.1 | 36.3 |
Depression | f | 31 | 22.1 | 3.6 | 32.0 |
Depression | m | 9 | 21.8 | 3.3 | 34.0 |
Visit 2 | |||||
Controls | f | 20 | 1.5 | 1.8 | 37.8 |
Controls | m | 5 | 3.0 | 1.6 | 35.0 |
Depression | f | 20 | 3.8 | 3.0 | 29.9 |
Depression | m | 5 | 4.8 | 3.7 | 34.9 |
Can we predict depression based on how happy/sad their voice sounds?
Do patients in remission sound like depressed individuals or healthy controls?
Can we predict prognosis based on voice?
Noise removal → Speaker diarization → VAD
Noise removal → Speaker diarization → VAD
Noise removal → Speaker diarization → VAD
Extract MFCCs each 10 ms.
Summarize in bins of 30 seconds
Depression vs controls at visit 1
precision (tp / tp + fp) (ie. proportion pred dep actually dep) = 75%
Depression vs controls at visit 1
precision (tp / tp + fp) (ie. proportion pred dep actually dep) = 75%
Effect of preprocessing
Effect of preprocessing
Bayesian Estimation Superseedes the T Test (Kruschke, 2012)
Provides complete information on parameters of interest
in the form of posterior distributions
Can accept the null
Handles extreme values better
Easy to incorporate mixed effects
"Bayesian inference is just counting"
A naïve emotion classifier can distinguish patients with
depression and healthy controls reliably above chance level
Around 50% of depressed patients show marked symptoms
in the emotional content of their voice
Patients who enter remission sound similar to controls
Bayesian methods provide a richer representation of your
data and its uncertainty
missing that those not in remission did not come back :(
high P(happy) does not mean not depression, just that voice does not show the phenotype
for diarization say how standard tests would say no difference, but with bayes we can see that it exists
Lasse Hansen
Master student of Cognitive Science at Aarhus University, Denmark
Intern at Data Science 1 since late August 2020
Supervised by Yan-Ping Zhang, Detlef Wolf, and Riccardo Fusaroli (AU)
Keyboard shortcuts
↑, ←, Pg Up, k | Go to previous slide |
↓, →, Pg Dn, Space, j | Go to next slide |
Home | Go to first slide |
End | Go to last slide |
Number + Return | Go to specific slide |
b / m / f | Toggle blackout / mirrored / fullscreen mode |
c | Clone slideshow |
p | Toggle presenter mode |
t | Restart the presentation timer |
?, h | Toggle this help |
o | Tile View: Overview of Slides |
Esc | Back to slideshow |
Lasse Hansen
Master student of Cognitive Science at Aarhus University, Denmark
Intern at Data Science 1 since late August 2020
Supervised by Yan-Ping Zhang, Detlef Wolf, and Riccardo Fusaroli (AU)
Feeling sad
Loss of interest and energy
Difficulty concentrating
Fatigue
Stomach aches
Psychomotor retardation
Slowing of thought and speech
Increased tension in the vocal tract
→ Subtle changes in voice quality
Removed audio for data privacy
Use emotion recognition model to predict depression
Controls and depression at 2 visits 6 months
apart
Only those in remission at 6 month follow up
Diagnosis | Gender | N | Hamilton mean | Hamilton SD | Age mean |
---|---|---|---|---|---|
Visit 1 | |||||
Controls | f | 33 | 1.6 | 1.4 | 32.3 |
Controls | m | 9 | 1.8 | 1.1 | 36.3 |
Depression | f | 31 | 22.1 | 3.6 | 32.0 |
Depression | m | 9 | 21.8 | 3.3 | 34.0 |
Visit 2 | |||||
Controls | f | 20 | 1.5 | 1.8 | 37.8 |
Controls | m | 5 | 3.0 | 1.6 | 35.0 |
Depression | f | 20 | 3.8 | 3.0 | 29.9 |
Depression | m | 5 | 4.8 | 3.7 | 34.9 |
Can we predict depression based on how happy/sad their voice sounds?
Do patients in remission sound like depressed individuals or healthy controls?
Can we predict prognosis based on voice?
Noise removal → Speaker diarization → VAD
Noise removal → Speaker diarization → VAD
Noise removal → Speaker diarization → VAD
Extract MFCCs each 10 ms.
Summarize in bins of 30 seconds
Depression vs controls at visit 1
precision (tp / tp + fp) (ie. proportion pred dep actually dep) = 75%
Depression vs controls at visit 1
precision (tp / tp + fp) (ie. proportion pred dep actually dep) = 75%
Effect of preprocessing
Effect of preprocessing
Bayesian Estimation Superseedes the T Test (Kruschke, 2012)
Provides complete information on parameters of interest
in the form of posterior distributions
Can accept the null
Handles extreme values better
Easy to incorporate mixed effects
"Bayesian inference is just counting"
A naïve emotion classifier can distinguish patients with
depression and healthy controls reliably above chance level
Around 50% of depressed patients show marked symptoms
in the emotional content of their voice
Patients who enter remission sound similar to controls
Bayesian methods provide a richer representation of your
data and its uncertainty
missing that those not in remission did not come back :(
high P(happy) does not mean not depression, just that voice does not show the phenotype
for diarization say how standard tests would say no difference, but with bayes we can see that it exists