典型文献
Enhancing N-Gram Based Metrics with Semantics for Better Evaluation of Abstractive Text Summarization
文献摘要:
Text summarization is an important task in natural language processing and it has been applied in many applications.Recently,abstractive summarization has attracted many attentions.However,the traditional evaluation metrics that consider little semantic information,are unsuitable for evaluating the quality of deep learning based abstractive summarization models,since these models may generate new words that do not exist in the original text.Moreover,the out-of-vocabulary(OOV)problem that affects the evaluation results,has not been well solved yet.To address these issues,we propose a novel model called ENMS,to enhance existing N-gram based evaluation metrics with semantics.To be specific,we present two types of methods:N-gram based Semantic Matching(NSM for short),and N-gram based Semantic Similarity(NSS for short),to improve several widely-used evaluation metrics including ROUGE(Recall-Oriented Understudy for Gisting Evaluation),BLEU(Bilingual Evaluation Understudy),etc.NSM and NSS work in different ways.The former calculates the matching degree directly,while the latter mainly improves the similarity measurement.Moreover we propose an N-gram representation mechanism to explore the vector representation of N-grams(including skip-grams).It serves as the basis of our ENMS model,in which we exploit some simple but effective integration methods to solve the OOV problem efficiently.Experimental results over the TAC AESOP dataset show that the metrics improved by our methods are well correlated with human judgements and can be used to better evaluate abstractive summarization methods.
文献关键词:
中图分类号:
作者姓名:
Jia-Wei He;Wen-Jun Jiang;Guo-Bang Chen;Yu-Quan Le;Xiao-Fei Ding
作者机构:
College of Information Science and Electronic Engineering,Hunan University,Changsha 410082,China
文献出处:
引用格式:
[1]Jia-Wei He;Wen-Jun Jiang;Guo-Bang Chen;Yu-Quan Le;Xiao-Fei Ding-.Enhancing N-Gram Based Metrics with Semantics for Better Evaluation of Abstractive Text Summarization)[J].计算机科学技术学报(英文版),2022(05):1118-1133
A类:
Abstractive,Summarization,abstractive,ENMS,Understudy,Gisting,Bilingual,AESOP,judgements
B类:
Enhancing,Gram,Based,Metrics,Semantics,Better,Evaluation,Text,summarization,important,task,natural,language,processing,has,been,applied,many,applications,Recently,attracted,attentions,However,traditional,evaluation,metrics,that,consider,little,information,are,unsuitable,evaluating,quality,deep,learning,models,since,these,may,generate,new,words,do,not,original,text,Moreover,out,vocabulary,OOV,problem,affects,results,well,solved,yet,To,address,issues,propose,novel,called,enhance,existing,semantics,specific,two,types,methods,Matching,NSM,short,Similarity,NSS,several,widely,used,including,ROUGE,Recall,Oriented,BLEU,etc,work,different,ways,former,calculates,matching,degree,directly,while,latter,mainly,improves,similarity,measurement,representation,mechanism,explore,vector,grams,skip,It,serves,basis,our,which,exploit,some,simple,but,effective,integration,efficiently,Experimental,TAC,dataset,show,improved,by,correlated,human,can,better,evaluate
AB值:
0.52885
相似文献
机标中图分类号,由域田数据科技根据网络公开资料自动分析生成,仅供学习研究参考。