Feature transformation is achieved by FastICA algorithm. Speaker clustering is implemented by Gaussian Mixture Model which can make system applied to a wider group of people and speaker adaption is achieved by Maximum Likelihood Linear Regression algorithm.
此外,本文在提高系统鲁棒性和识别速度方面做了新的尝试:应用FastICA算法对特征变换和降维;实现了说话人分类和说话人自适应基本算法,说话人分类由混合高斯模型实现,可以扩大应用人群,提高识别率,说话人自适应由最大似然线性回归算法实现;在提高系统识别速度方面采用高斯选择法。
参考来源 - 电话信道自然语音关键词检测·2,447,543篇论文数据,部分数据来源于NoteExpress
应用推荐