Representative paper
Enhancing Speech Recognition for Parkinson's Disease Patient Using Transfer Learning Technique
Abstract:
Parkinson's disease patients suffer from disorders of speech. The most frequently reported speech problems are weak, hoarse, nasal or monotonous voice, imprecise articulation, slow or fast speech, difficulty starting speech, impaired stress or rhythm, stuttering, and tremor. To improve the speech quality and assist the patient with speech rehabilitation therapy, we have proposed the speech recognition model for Parkinson's disease patients using transfer learning technique (PSTL), where we have pre-trained the long short-term memory (LSTM) neural network model with our developed publicly available dataset that has been obtained from healthy people through the social media platform. Then, we applied the transfer learning technique to improve the performance of the PSTL framework. The frequency spectrogram masking data augmentation method has been used to alleviate the over-fitting problem so that the word error rate (WER) is further reduced. Even with a limited dataset, our proposed model has effectively reduced the WER from 58% to 44.5% on the original speech dataset and 53.1% to 43% on the denoised speech dataset, which demonstrated the feasibility of our framework.
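The abstract reports its results as word error rate (WER). As a minimal sketch of how that metric is conventionally computed (word-level edit distance divided by the number of reference words), the following may help; the function name `word_error_rate` and the whitespace tokenization are assumptions for illustration and are not taken from the paper.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Conventional WER: (substitutions + deletions + insertions) / reference length.

    Assumption: words are separated by whitespace; the paper's own
    tokenization and scoring tool are not stated in the abstract.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] = edit distance between the reference words processed so far
    # (one fewer than the current row) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # deleting all i reference words to match an empty hypothesis
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1      # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)
```

For instance, scoring the hypothesis "the cat sit on mat" against the reference "the cat sat on the mat" counts one substitution and one deletion over six reference words.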
Keywords:
CLC number:
Authors:
YU Qing; MA Yi
Affiliations:
Department of Micro-Nano Electronics, Shanghai Jiao Tong University, Shanghai 200240, China; MoE Key Lab of Artificial Intelligence, Shanghai Jiao Tong University, Shanghai 200240, China
Source:
Citation format:
[1] YU Qing; MA Yi. Enhancing Speech Recognition for Parkinson's Disease Patient Using Transfer Learning Technique [J]. Journal of Shanghai Jiao Tong University (English Edition), 2022(01): 90-98
Class A:
hoarse,stuttering,PSTL
Class B:
Enhancing,Speech,Recognition,Parkinson,Disease,Patient,Using,Transfer,Learning,Technique,disease,patients,suffer,from,disorders,speech,most,frequently,reported,problems,are,weak,nasal,monotonous,voice,imprecise,articulation,slow,fast,difficulty,starting,impaired,stress,rhythm,tremor,To,improve,quality,assist,rehabilitation,therapy,have,proposed,recognition,model,using,transfer,learning,technique,where,trained,long,short,term,memory,neural,network,our,developed,publicly,available,dataset,that,has,been,obtained,healthy,people,through,social,media,platform,Then,applied,performance,framework,frequency,spectrogram,masking,augmentation,method,used,alleviate,over,fitting,word,error,WER,further,reduced,Even,limited,effectively,original,denoised,which,demonstrated,feasibility
AB value:
0.593866