基于交叉熵顺序统计滤波的语音端点检测算法 | |
钱彦旻 ; 刘加 ; QIAN Yanmin ; LIU Jia | |
2010-06-09 ; 2010-06-09 | |
关键词 | 语音信号处理 端点检测 子带交叉熵 顺序统计滤波(OSF) speech signal processing voice activity detection sub-band cross-entropy order statistics filter(OSF) TN912.3 |
其他题名 | Voice activity detection algorithm based on cross-entropy order statistics filter |
中文摘要 | 为提高语音端点检测在强噪声环境下的准确率,提出了一种基于交叉熵顺序统计滤波(OSF)的语音端点检测算法。该算法以子带交叉熵为语音/非语音的区分特征,首先将每帧语音的频谱划分成若干个子带,估计出每个子带能量与背景噪声之间的交叉熵,然后把相继若干帧的子带能量交叉熵经过一组顺序统计滤波器,最后根据各帧交叉熵的值对输入的语音进行分类。实验结果表明:该算法能够有效地区分语音和非语音。特别是在强噪声环境下依然能够保持很高的检测率,具有鲁棒性。通过实验结果比较,该算法在性能上优于最近提出的基于能量顺序统计滤波和单纯交叉熵判别的两种方法。; Voice activity detection in strong noise environments is improved by an algorithm based on the cross-entropy with an order statistics filter(OSF).The algorithm makes use of the sub-band cross-entropy as the speech/non-speech discrimination feature.The analyses first divides the speech spectrum into several sub-bands and then estimates the cross-entropy between the speech signal and the non-speech signal.An order statistics filter is applied to a sequence of the sub-band cross-entropies to obtain the cross-entropy of each frame.The speech and non-speech signal are classified based on the cross-entropy.Tests show that the algorithm effectively distinguishes speech from non-speech,even in high noise environments.Thus the algorithm outperforms two other the recently reported algorithms.; 国家自然科学基金资助项目(60776800); 国家“八六三”高技术项目(2006AA0101012007AA04Z2232008AA02Z414) |
语种 | 中文 ; 中文 |
内容类型 | 期刊论文 |
源URL | [http://hdl.handle.net/123456789/54158] |
专题 | 清华大学 |
推荐引用方式 GB/T 7714 | 钱彦旻,刘加,QIAN Yanmin,等. 基于交叉熵顺序统计滤波的语音端点检测算法[J],2010, 2010. |
APA | 钱彦旻,刘加,QIAN Yanmin,&LIU Jia.(2010).基于交叉熵顺序统计滤波的语音端点检测算法.. |
MLA | 钱彦旻,et al."基于交叉熵顺序统计滤波的语音端点检测算法".(2010). |
个性服务 |
查看访问统计 |
相关权益政策 |
暂无数据 |
收藏/分享 |
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论