光学 精密工程, 2013, 21 (6): 1598, 网络出版: 2013-07-01   

基于自适应高斯混合模型与静动态听觉特征融合的说话人识别

Speaker recognition based on adapted Gaussian mixture model and static and dynamic auditory feature fusion
吴迪 1,*曹洁 1,2王进花 1
作者单位
1 兰州理工大学 电气工程与信息工程学院,甘肃 兰州 730050
2 兰州理工大学 计算机与通信学院,甘肃 兰州 730050
摘要
对特征参数和高斯混合模型进行改进,提出了一种特征域和模型域混合补偿的方法用于解决说话人识别特征受噪声影响较大以及高斯混合模型随训练样本长度减小而性能下降的问题。通过模拟人耳听觉,给出了基于伽马通滤波器的伽马通滤波倒谱系数;考虑其只反映了语音的静态特征,提取了能够反映语音动态特征的伽马通滑动差分倒谱系数。基于因子分析技术,利用移动因子表示高斯混合模型的自适应过程,通过训练语料较充分的说话人模型中的均值向量补偿受训练语料长度影响较大的分量的均值向量。仿真实验表明:在纯净背景下,本文方法的识别率达到了98.46%;在不同噪声环境下,本文提出的混合补偿方法能有效提高说话人识别系统的性能。
Abstract
By optimizing the feature vectors and Gaussian Mixture Models(GMMs), a hybrid compensation method in model and feature domains is proposed. With the method, the speaker recognition features effected by the noise and the declined performance of GMM with reducing length of the training data under different unexpected noise environments are improved. By emulating human auditory, Gammatone Filter Cepstral Coefficients(GFCC) is given out based on Gammatone Filter bank models. As the GFCC only reflects the static properties, the Gammatone Filter Shifted Delta Cepstral Coefficients(GFSDCC) is extracted based on Shifted Delta Cepstral. Then, the adaptive process for each GMM model with sufficient training data is transformed to the shift factor based on factor analysis. Furthermore, when the training data are insufficient, the coordinate of the shift factor is learned from the GMM mixtures of insensitive to the training data and then it is adapted to compensate other GMM mixtures. The experiment result shows that the recognition rate of the method proposed is 98.46% . The conclusion is that the performance of speaker recognition system is improved under several kinds of noise environments.

吴迪, 曹洁, 王进花. 基于自适应高斯混合模型与静动态听觉特征融合的说话人识别[J]. 光学 精密工程, 2013, 21(6): 1598. WU Di, CAO Jie, WANG Jin-hua. Speaker recognition based on adapted Gaussian mixture model and static and dynamic auditory feature fusion[J]. Optics and Precision Engineering, 2013, 21(6): 1598.

本文已被 1 篇论文引用
被引统计数据来源于中国光学期刊网
引用该论文: TXT   |   EndNote

相关论文

加载中...

关于本站 Cookie 的使用提示

中国光学期刊网使用基于 cookie 的技术来更好地为您提供各项服务,点击此处了解我们的隐私策略。 如您需继续使用本网站,请您授权我们使用本地 cookie 来保存部分信息。
全站搜索
您最值得信赖的光电行业旗舰网络服务平台!