TTS语音单元边界的自动切分

CORC > 清华大学

	TTS语音单元边界的自动切分
	王丽娟 ; 曹志刚 ; WANG Li-juan ; CAO Zhi-gang
	2010-06-09 ; 2010-06-09
关键词	前后音素相关边界模型分类与衰退树自动切分 TTS Context-dependent boundary model, CART, Automatic segmentation, TTS TN912.3
其他题名	Automatic Segmentation for TTS Units
中文摘要	语音单元边界的准确切分对基于波形拼接的语音合成系统至关重要。文章采用了两步切分方法,第一步中先由基于HMM模型的强制对齐方法得到初始的边界,在第二步中提出用基于前后音素的边界模型来修正初始边界。为解决训练数据不足的问题,提出用分类与衰退树将前后因素发音相近的边界模型进行聚类。这样可以根据训练数据的多少,动态调节边界模型的数目,以保证模型训练的可靠性。在对中文语音库的实验中,自动切分的准确度由78.7%提高到91.5%。; Correct unit segmentation are, though laborsome, very crucial to the performance of a concatenation based TTS system. This paper suggests a two-step procedure for automatic unit segmentation, which coarsely segments speech data in the first step and refines segment boundaries in the secord step. A new Context-Dependent Boundary Model (CDBM) to describe the evolution across the segment boundary is proposed. To reduce manual segmentation, Classification and Regression Tree(CART) is used to structure the available data into a more efficient usage. Acoustically similar boundaries are clustered together and corresponding tied CDBM models are thus trained and used for boundary refinement during the secord step. After a series of experiments, the optimal CDBM parameters and the training conditions are found. The segmentation accuracy is raised from 78.7% to 91.5% in Mandarin syllable segmentation with about 1,000 manually segmented sentences as CDBM training data.
语种	中文 ; 中文
内容类型	期刊论文
源URL	[http://hdl.handle.net/123456789/54646]
专题	清华大学
推荐引用方式 GB/T 7714	王丽娟,曹志刚,WANG Li-juan,等. TTS语音单元边界的自动切分[J],2010, 2010.
APA	王丽娟,曹志刚,WANG Li-juan,&CAO Zhi-gang.(2010).TTS语音单元边界的自动切分..
MLA	王丽娟,et al."TTS语音单元边界的自动切分".(2010).

个性服务

查看访问统计

相关权益政策

暂无数据

收藏/分享

所有评论 (0)

[发表评论/异议/意见]

暂无评论

评论
权益异议
反馈意见

评注功能仅针对注册用户开放，请您登录

您对该条目有什么异议，请向管理员反馈。
内容：
Email：	*
单位:
验证码：	刷新

您在知识库使用过程中有什么好的想法或者建议可以反馈给我们。
标题：	*
内容：
Email：	*
验证码：	刷新

相关链接

CORC

联系我们