Optimizing the Perceptual Quality of Time-Domain Speech Enhancement with Reinforcement Learning|Xiang Hao;Chenglin Xu;Lei Xie;Haizhou Li|Department of Electrical and Computer Engineering,National University of Singapore,Singapore 710129,Singapore - 期刊导航|首站-论文投稿智能助手|论文发表|论文智能投稿|期刊自助发表推荐|杂志社快速发表|查同导刊-域田数据官方网站

典型文献

Optimizing the Perceptual Quality of Time-Domain Speech Enhancement with Reinforcement Learning

文献摘要：

In neural speech enhancement,a mismatch exists between the training objective,i.e.,Mean-Square Error(MSE),and perceptual quality evaluation metrics,i.e.,perceptual evaluation of speech quality and short-time objective intelligibility.We propose a novel reinforcement learning algorithm and network architecture,which incorporate a non-differentiable perceptual quality evaluation metric into the objective function using a dynamic filter module.Unlike the traditional dynamic filter implementation that directly generates a convolution kernel,we use a filter generation agent to predict the probability density function of a multivariate Gaussian distribution,from which we sample the convolution kernel.Experimental results show that the proposed reinforcement learning method clearly improves the perceptual quality over other supervised learning methods with the MSE objective function.

文献关键词：

中图分类号：

[1] 自动化技术、计算机技术（TP） / 计算技术、计算机技术（TP3） / 计算机的应用（TP39） / 信息处理(信息加工)（TP391）

[2] 数理科学和化学（O） / 力学（O3） / 振动理论（O32） / 非线性振动（O322）

[3] 医药、卫生（R） / 基础医学（R3） / 病理学（R36） / 病理过程（R364）

作者姓名：

Xiang Hao;Chenglin Xu;Lei Xie;Haizhou Li

作者机构：

School of Computer Science,Northwestern Polytechnical University,Xi'an 710000,China;Department of Electrical and Computer Engineering,National University of Singapore,Singapore 710129,Singapore

文献出处：

清华大学学报自然科学版（英文版）

引用格式：

[1]Xiang Hao;Chenglin Xu;Lei Xie;Haizhou Li-.Optimizing the Perceptual Quality of Time-Domain Speech Enhancement with Reinforcement Learning)[J].清华大学学报自然科学版（英文版）,2022(06):939-947

A类：

B类：

Optimizing,Perceptual,Quality,Time,Domain,Speech,Enhancement,Reinforcement,Learning,In,neural,speech,enhancement,mismatch,exists,between,training,objective,Mean,Square,Error,MSE,perceptual,quality,evaluation,metrics,short,intelligibility,We,novel,reinforcement,learning,algorithm,network,architecture,which,incorporate,differentiable,into,function,using,dynamic,filter,module,Unlike,traditional,implementation,that,directly,generates,convolution,kernel,use,generation,agent,predict,probability,density,multivariate,Gaussian,distribution,from,sample,Experimental,results,show,proposed,clearly,improves,over,other,supervised,methods

AB值：

0.659301

相似文献

Multi-Objective Deep Reinforcement Learning Based Time-Frequency Resource Allocation for Multi-Beam Satellite Communications

Yuanzhi He;Biao Sheng;Hao Yin;Di Yan;Yingchao Zhang-School of systems science and engineering,Sun Yat-Sen University,Guangzhou 100876,China;Institute of Systems Engineering,AMS,PLA,Beijing 100141,China

Cooperative Caching for Scalable Video Coding Using Value-Decomposed Dimensional Networks

Youjia Chen;Yuekai Cai;Haifeng Zheng;Jinsong Hu;Jun Li-Fujian Key Lab for Intelligent Processing and Wireless Transmission of Media Information,College of Physics and Information Engineering,Fuzhou University,Fuzhou 350000,China;School of Electronic and Optical Engineering,Nanjing University of Science and Technology,Nanjing 210000,China

Robotic Knee Tracking Control to Mimic the Intact Human Knee Profile Based on Actor-Critic Reinforcement Learning

Ruofan Wu-School of Electrical,Computer and Energy Engineering,Arizona State University,Tempe,AZ 85287 USA;Department of Biomedical Engineering,North Carolina State University,Raleigh,NC 27695 USA;University of North Carolina at Chapel Hill,Chapel Hill,NC 27599 USA

Scribble-Supervised Video Object Segmentation

Peiliang Huang;Junwei Han;Nian Liu;Jun Ren;Dingwen Zhang-Zhang are with the Brain and Artificial Intelligence Laboratory,School of Automation,Northwestern Polytechnical University,Xi'an 710072,China;Department of Engagement Services,Mohamed Bin Zayed University of Artificial Intelligence,AbuDhabi,United Arab Emirate;Science and Technology on Complex System Control and Intelligent Agent Cooperation Laboratory,Beijing,China

BaMBNet: A Blur-Aware Multi-Branch Network for Dual-Pixel Defocus Deblurring