CORC  > 清华大学
TTS语音单元边界的自动切分
王丽娟 ; 曹志刚 ; WANG Li-juan ; CAO Zhi-gang
2010-06-09 ; 2010-06-09
关键词前后音素相关 边界模型 分类与衰退树 自动切分 TTS Context-dependent boundary model, CART, Automatic segmentation, TTS TN912.3
其他题名Automatic Segmentation for TTS Units
中文摘要语音单元边界的准确切分对基于波形拼接的语音合成系统至关重要。文章采用了两步切分方法,第一步中先由基于HMM模型的强制对齐方法得到初始的边界,在第二步中提出用基于前后音素的边界模型来修正初始边界。为解决训练数据不足的问题,提出用分类与衰退树将前后因素发音相近的边界模型进行聚类。这样可以根据训练数据的多少,动态调节边界模型的数目,以保证模型训练的可靠性。在对中文语音库的实验中,自动切分的准确度由78.7%提高到91.5%。; Correct unit segmentation are, though laborsome, very crucial to the performance of a concatenation based TTS system. This paper suggests a two-step procedure for automatic unit segmentation, which coarsely segments speech data in the first step and refines segment boundaries in the secord step. A new Context-Dependent Boundary Model (CDBM) to describe the evolution across the segment boundary is proposed. To reduce manual segmentation, Classification and Regression Tree(CART) is used to structure the available data into a more efficient usage. Acoustically similar boundaries are clustered together and corresponding tied CDBM models are thus trained and used for boundary refinement during the secord step. After a series of experiments, the optimal CDBM parameters and the training conditions are found. The segmentation accuracy is raised from 78.7% to 91.5% in Mandarin syllable segmentation with about 1,000 manually segmented sentences as CDBM training data.
语种中文 ; 中文
内容类型期刊论文
源URL[http://hdl.handle.net/123456789/54646]  
专题清华大学
推荐引用方式
GB/T 7714
王丽娟,曹志刚,WANG Li-juan,等. TTS语音单元边界的自动切分[J],2010, 2010.
APA 王丽娟,曹志刚,WANG Li-juan,&CAO Zhi-gang.(2010).TTS语音单元边界的自动切分..
MLA 王丽娟,et al."TTS语音单元边界的自动切分".(2010).
个性服务
查看访问统计
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。


©版权所有 ©2017 CSpace - Powered by CSpace