基于声纹特征与语义信息的身份认证系统研究

CORC > 自动化研究所 > 中国科学院自动化研究所 > 毕业生 > 硕士学位论文

题名	基于声纹特征与语义信息的身份认证系统研究
作者	刘洋
学位类别	工学硕士
答辩日期	2002-05-01
授予单位	中国科学院研究生院
授予地点	中国科学院自动化研究所
导师	徐波
关键词	身份认证说话人识别语义信息认证分组高斯混合模型非标准支持向量机 Authentication Speaker Verification Verbal Information Verification CG-GMM nonstandard SVM
其他题名	National Laboratory of Pattern Recogntion Institute of Automation Chinese Academy of Sciences
学位专业	模式识别与智能系统
中文摘要	身份认证技术在近年来得到了广泛关注和重点研究。做为语音处理技术的一个分支，基于语音的身份认证技术从语音识别等其他技术中汲取了大量的有益经验并发展出了有自己特色的研究成果；与此同时，这一技术也存在着许多问题，集中于怎样能从语音信号中提取尽量多与身份相关且比较稳定的特征信息，以及怎样改进训练方法使其针对现有特征能得到推广性更好的模型。本文将详细介绍作者攻读硕士学位期间在基于语音的身份认证技术这些方面的一点儿工作。我们首先研究了说话人识别(声纹识别)技术，特别是与文本无关的说话人识别。在详尽分析声学建模的各个环节对说话人辨认效果之影响的基础上，提出了一个识别能力更高的后处理方法；提出了一种新的分组高斯混合模型方法来提高声学建模的精度，有效的抑制了特征向量中各个分量之间在物理意义和量纲等方面的不一致性。在说话人确认平台，提出速度更快的反模型构建方法以及更为灵活的判决门限设置。此后，为了寻找分类能力更强的后端处理方法，我们深入比较了高斯混合模型和支持向量机技术两种声学建模的方法；在分析支持向量机基线系统的基础上提出用内嵌高斯混合的方法来修改核函数，并改进支持向量机分类器的训练过程。其次，我们深入研究了可以满足更高安全性要求的语义信息认证技术。实验表明，语义信息用于身份认证与声纹相比具有更好的鲁棒性。我们实现了语句切分、子词级假设检验和整句可信度认证的各个功能模块，并提出了子词级假设检验模块中核心部分反模型的几种训练方法，有效的提高了系统的认证性能。再次，在以上两部分研究的基础上我们提出用支持向量机分类器来融合声纹识别和语义确认两种技术，充分利用了声纹信息和用户个人信息，取得了很好的效果；同时，针对传统支持向量机的训练算法无法改变两类错误率的问题，提出了非标准支持向量机方法来进一步提高融合认证系统的灵活性。本文中提出的各种算法及改进都通过大量实验来验证其效果，并在实验后进行了有针对性的分析。
英文摘要	Authentication system has been a promising field and been studied for a long time. Speech-based authentication system, part of speech processing technology, benefits a lot from the research in the field of Automatic Speech Recognition ,and is becoming some reasonable selection for information security. However, this technology still has some Achilles' Heel which focuse on how to get more identity-related and more robust information, and how to improve training method to provide models with higher generalization. This paper presents in detail the author's research work on those problems as a master candidate. Firstly, we do some deep research on Speaker Recognition (Voiceprint Recognition). especially Text-Independent Speaker Recognition. After detailedly analyzing the influence of each module of acoustic modeling, we proposed a novel method, namely Compo- nent-Group GMM, to improve the acoustic modeling, which effectively restrains the discrepancy between physical meaning and units in the elements of feature vectors. On the platform of Speaker Verification, we proposed quicker method to train anti-models and more flexible threshold setting. Hereafter, in order to find some stronger classifer, we compared GMM and Support Vector Machine to model acoustic features. We proposed embedded GMM to modify kernel functions and improve the training procedure of SVM. Secondly, we studied Verbal Information Verification which is argued that it's more robust in the comparison with voiceprint verification. We implemented Utterance Segmentation, Sub-word Hypothesis Testing and Sentence Confidence Measure. We proposed four methods to train sub-word anti-model and compared their performance in detailed testing. Thirdly, we proposed to apply SVM to integrate Speaker Verification and Verbal Information Verification, which benefis from voiceprint and user profile information and achieve satisfying performance. After that, we proposed nonstandard SVM training method to tackle the difficulty of adjusting two kinds of error-FA and FR. The new method enjoys more con- figurable authentication system.
语种	中文
其他标识符	626
内容类型	学位论文
源URL	[http://ir.ia.ac.cn/handle/173211/6775]
专题	毕业生_硕士学位论文
推荐引用方式 GB/T 7714	刘洋. 基于声纹特征与语义信息的身份认证系统研究[D]. 中国科学院自动化研究所. 中国科学院研究生院. 2002.

个性服务

查看访问统计

相关权益政策

暂无数据

收藏/分享

所有评论 (0)

[发表评论/异议/意见]

暂无评论

评论
权益异议
反馈意见

评注功能仅针对注册用户开放，请您登录

您对该条目有什么异议，请向管理员反馈。
内容：
Email：	*
单位:
验证码：	刷新

您在知识库使用过程中有什么好的想法或者建议可以反馈给我们。
标题：	*
内容：
Email：	*
验证码：	刷新

相关链接

CORC

联系我们