首站-论文投稿智能助手
典型文献
Optimizing the Perceptual Quality of Time-Domain Speech Enhancement with Reinforcement Learning
文献摘要:
In neural speech enhancement,a mismatch exists between the training objective,i.e.,Mean-Square Error(MSE),and perceptual quality evaluation metrics,i.e.,perceptual evaluation of speech quality and short-time objective intelligibility.We propose a novel reinforcement learning algorithm and network architecture,which incorporate a non-differentiable perceptual quality evaluation metric into the objective function using a dynamic filter module.Unlike the traditional dynamic filter implementation that directly generates a convolution kernel,we use a filter generation agent to predict the probability density function of a multivariate Gaussian distribution,from which we sample the convolution kernel.Experimental results show that the proposed reinforcement learning method clearly improves the perceptual quality over other supervised learning methods with the MSE objective function.
文献关键词:
作者姓名:
Xiang Hao;Chenglin Xu;Lei Xie;Haizhou Li
作者机构:
School of Computer Science,Northwestern Polytechnical University,Xi'an 710000,China;Department of Electrical and Computer Engineering,National University of Singapore,Singapore 710129,Singapore
引用格式:
[1]Xiang Hao;Chenglin Xu;Lei Xie;Haizhou Li-.Optimizing the Perceptual Quality of Time-Domain Speech Enhancement with Reinforcement Learning)[J].清华大学学报自然科学版(英文版),2022(06):939-947
A类:
B类:
Optimizing,Perceptual,Quality,Time,Domain,Speech,Enhancement,Reinforcement,Learning,In,neural,speech,enhancement,mismatch,exists,between,training,objective,Mean,Square,Error,MSE,perceptual,quality,evaluation,metrics,short,intelligibility,We,novel,reinforcement,learning,algorithm,network,architecture,which,incorporate,differentiable,into,function,using,dynamic,filter,module,Unlike,traditional,implementation,that,directly,generates,convolution,kernel,use,generation,agent,predict,probability,density,multivariate,Gaussian,distribution,from,sample,Experimental,results,show,proposed,clearly,improves,over,other,supervised,methods
AB值:
0.659301
相似文献
Accumulative Time Based Ranking Method to Reputation Evaluation in Information Networks
Hao Liao;Qi-Xin Liu;Ze-Cheng Huang;Ke-Zhong Lu;Chi Ho Yeung;Yi-Cheng Zhang-National Engineering Laboratory on Big Data System Computing Technology,College of Computer Science and Software Engineering,Shenzhen University,Shenzhen 518060,China;Guangdong Province Key Laboratory of Popular High Performance Computers,Shenzhen University Shenzhen 518060,China;Guangdong Province Engineering Center of China-Made High Performance Data Computing System,College of Computer Science and Software Engineering,Shenzhen University,Shenzhen 518060,China;Institute of Big Data Intelligent Management and Decision,Shenzhen University,Shenzhen 518060,China;Department of Science and Environmental Studies,The Education University of Hong Kong,Hong Kong 999077,China;Department of Physics,University of Fribourg,Fribourg 1700,Switzerland
机标中图分类号,由域田数据科技根据网络公开资料自动分析生成,仅供学习研究参考。