典型文献
Leveraging Document-Level and Query-Level Passage Cumulative Gain for Document Ranking
文献摘要:
Document ranking is one of the most studied but challenging problems in information retrieval(IR).More and more studies have begun to address this problem from fine-grained document modeling.However,most of them focus on context-independent passage-level relevance signals and ignore the context information.In this paper,we investigate how information gain accumulates with passages and propose the context-aware Passage Cumulative Gain(PCG).The fine-grained PCG avoids the need to split documents into independent passages.We investigate PCG patterns at the document level(DPCG)and the query level(QPCG).Based on the patterns,we propose a BERT-based sequential model called Passage-level Cumulative Gain Model(PCGM)and show that PCGM can effectively predict PCG sequences.Finally,we apply PCGM to the document ranking task using two approaches.The first one is leveraging DPCG sequences to estimate the gain of an individual document.Experimental results on two public ad hoc retrieval datasets show that PCGM outperforms most existing ranking models.The second one considers the cross-document effects and leverages QPCG sequences to estimate the marginal relevance.Experimental results show that predicted results are highly consistent with users'preferences.We believe that this work contributes to improving ranking performance and providing more explainability for document ranking.
文献关键词:
中图分类号:
作者姓名:
Zhi-Jing Wu;Yi-Qun Liu;Jia-Xin Mao;Min Zhang;Shao-Ping Ma
作者机构:
Department of Computer Science and Technology,Tsinghua University,Beijing 100084,China;Beijing National Research Center for Information Science and Technology,Tsinghua University,Beijing 100084,China;Gaoling School of Artificial Intelligence,Renmin University of China,Beijing 100084,China
文献出处:
引用格式:
[1]Zhi-Jing Wu;Yi-Qun Liu;Jia-Xin Mao;Min Zhang;Shao-Ping Ma-.Leveraging Document-Level and Query-Level Passage Cumulative Gain for Document Ranking)[J].计算机科学技术学报(英文版),2022(04):814-838
A类:
DPCG,QPCG,PCGM,explainability
B类:
Leveraging,Document,Level,Query,Passage,Cumulative,Gain,Ranking,ranking,one,most,studied,challenging,problems,information,retrieval,More,more,studies,have,begun,address,this,from,fine,grained,modeling,However,them,focus,context,independent,level,relevance,signals,ignore,In,paper,investigate,gain,accumulates,passages,propose,aware,avoids,need,split,documents,into,We,patterns,query,Based,BERT,sequential,called,Model,show,that,can,effectively,sequences,Finally,apply,task,using,two,approaches,first,leveraging,estimate,individual,Experimental,results,public,hoc,datasets,outperforms,existing,models,second,considers,cross,effects,leverages,marginal,predicted,highly,consistent,users,preferences,believe,work,contributes,improving,performance,providing
AB值:
0.461546
相似文献
机标中图分类号,由域田数据科技根据网络公开资料自动分析生成,仅供学习研究参考。