Automatic Depression Level Detection via lp-norm Pooling
Mingyue Niu; Jianhua Tao; Bin Liu; Cunhang Fan
2019
会议日期2019-9
会议地点奥地利格拉茨
英文摘要

Related physiological studies have shown that Mel-frequency cepstral coefficient (MFCC) is a discriminative acoustic feature for depression detection. This fact has led to some works using MFCCs to identify individual depression degree. However, they rarely adopt neural network to capture high-level feature associated with depression detection. And the suitable feature pooling parameter for depression detection has not been optimized. For these reasons, we propose a hybrid network and lp-norm pooling combined with least absolute shrinkage and selection operator (LASSO) to improve the accuracy of depression detection. Firstly, the MFCCs of the original speech are divided into many segments. Then, we extract the segment-level feature using the proposed hybrid network, which investigates the depression-related information in the spatial structure, temporal changes and discriminative representation of short-term MFCC segments. Thirdly, lp-norm pooling combined with LASSO is adopted to find the optimal pooling parameter for depression detection to generate the utterance-level feature. Finally, depression level prediction is accomplished using support vector regression (SVR). Experiments are conducted on AVEC2013 and AVEC2014. The results demonstrate that our proposed method achieves better performance than the previous algorithms.

内容类型会议论文
源URL[http://ir.ia.ac.cn/handle/173211/44404]  
专题模式识别国家重点实验室_智能交互
通讯作者Mingyue Niu
作者单位1.School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing, China
2.National Laboratory of Pattern Recognition, CASIA, Beijing, China
推荐引用方式
GB/T 7714
Mingyue Niu,Jianhua Tao,Bin Liu,et al. Automatic Depression Level Detection via lp-norm Pooling[C]. 见:. 奥地利格拉茨. 2019-9.
个性服务
查看访问统计
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。


©版权所有 ©2017 CSpace - Powered by CSpace