Integration of multilayer regression analysis with structure-based pronunciation assessment
Masayuki Suzuki; Yu Qiao; Nobuaki Minematsu; Keikichi Hirose
2010
会议名称11th Annual Conference of the International Speech Communication Association: Spoken Language Processing for All, INTERSPEECH 2010
英文摘要Automatic pronunciation assessment has several difficulties. Adequacy in controlling the vocal organs is often estimated from the spectral envelopes of input utterances but the envelope patterns are also affected by other factors such as speaker identity. Recently, a new method of speech representation was proposed where these non-linguistic variations are effectively removed through modeling only the contrastive aspects of speech features. This speech representation is called speech structure. However, the often excessively high dimensionality of the speech structure can degrade the performance of structure-based pronunciation assessment. To deal with this problem, we integrate multilayer regression analysis with the structure-based assessment. The results show higher correlation between human and machine scores and also show much higher robustness to speaker differences compared to widely used GOP-based analysis
收录类别EI
语种英语
内容类型会议论文
源URL[http://ir.siat.ac.cn:8080/handle/172644/2764]  
专题深圳先进技术研究院_集成所
作者单位2010
推荐引用方式
GB/T 7714
Masayuki Suzuki,Yu Qiao,Nobuaki Minematsu,et al. Integration of multilayer regression analysis with structure-based pronunciation assessment[C]. 见:11th Annual Conference of the International Speech Communication Association: Spoken Language Processing for All, INTERSPEECH 2010.
个性服务
查看访问统计
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。


©版权所有 ©2017 CSpace - Powered by CSpace