Integration of multilayer regression analysis with structure-based pronunciation assessment | |
Masayuki Suzuki; Yu Qiao; Nobuaki Minematsu; Keikichi Hirose | |
2010 | |
Conference name | 11th Annual Conference of the International Speech Communication Association: Spoken Language Processing for All, INTERSPEECH 2010 |
Abstract | Automatic pronunciation assessment has several difficulties. Adequacy in controlling the vocal organs is often estimated from the spectral envelopes of input utterances, but the envelope patterns are also affected by other factors such as speaker identity. Recently, a new method of speech representation was proposed in which these non-linguistic variations are effectively removed by modeling only the contrastive aspects of speech features. This speech representation is called speech structure. However, the often excessively high dimensionality of the speech structure can degrade the performance of structure-based pronunciation assessment. To deal with this problem, we integrate multilayer regression analysis with the structure-based assessment. The results show higher correlation between human and machine scores, and much higher robustness to speaker differences, compared to the widely used GOP-based analysis. |
Indexed by | EI |
Language | English |
Content type | Conference paper |
Source URL | [http://ir.siat.ac.cn:8080/handle/172644/2764] |
Collection | Shenzhen Institutes of Advanced Technology_Institute of Advanced Integration Technology (深圳先进技术研究院_集成所) |
Author affiliation | 2010 |
Recommended citation (GB/T 7714) | Masayuki Suzuki, Yu Qiao, Nobuaki Minematsu, et al. Integration of multilayer regression analysis with structure-based pronunciation assessment[C]. In: 11th Annual Conference of the International Speech Communication Association: Spoken Language Processing for All, INTERSPEECH 2010. |