首站-论文投稿智能助手
典型文献
Leveraging Document-Level and Query-Level Passage Cumulative Gain for Document Ranking
文献摘要:
Document ranking is one of the most studied but challenging problems in information retrieval(IR).More and more studies have begun to address this problem from fine-grained document modeling.However,most of them focus on context-independent passage-level relevance signals and ignore the context information.In this paper,we investigate how information gain accumulates with passages and propose the context-aware Passage Cumulative Gain(PCG).The fine-grained PCG avoids the need to split documents into independent passages.We investigate PCG patterns at the document level(DPCG)and the query level(QPCG).Based on the patterns,we propose a BERT-based sequential model called Passage-level Cumulative Gain Model(PCGM)and show that PCGM can effectively predict PCG sequences.Finally,we apply PCGM to the document ranking task using two approaches.The first one is leveraging DPCG sequences to estimate the gain of an individual document.Experimental results on two public ad hoc retrieval datasets show that PCGM outperforms most existing ranking models.The second one considers the cross-document effects and leverages QPCG sequences to estimate the marginal relevance.Experimental results show that predicted results are highly consistent with users'preferences.We believe that this work contributes to improving ranking performance and providing more explainability for document ranking.
文献关键词:
作者姓名:
Zhi-Jing Wu;Yi-Qun Liu;Jia-Xin Mao;Min Zhang;Shao-Ping Ma
作者机构:
Department of Computer Science and Technology,Tsinghua University,Beijing 100084,China;Beijing National Research Center for Information Science and Technology,Tsinghua University,Beijing 100084,China;Gaoling School of Artificial Intelligence,Renmin University of China,Beijing 100084,China
引用格式:
[1]Zhi-Jing Wu;Yi-Qun Liu;Jia-Xin Mao;Min Zhang;Shao-Ping Ma-.Leveraging Document-Level and Query-Level Passage Cumulative Gain for Document Ranking)[J].计算机科学技术学报(英文版),2022(04):814-838
A类:
DPCG,QPCG,PCGM,explainability
B类:
Leveraging,Document,Level,Query,Passage,Cumulative,Gain,Ranking,ranking,one,most,studied,challenging,problems,information,retrieval,More,more,studies,have,begun,address,this,from,fine,grained,modeling,However,them,focus,context,independent,level,relevance,signals,ignore,In,paper,investigate,gain,accumulates,passages,propose,aware,avoids,need,split,documents,into,We,patterns,query,Based,BERT,sequential,called,Model,show,that,can,effectively,sequences,Finally,apply,task,using,two,approaches,first,leveraging,estimate,individual,Experimental,results,public,hoc,datasets,outperforms,existing,models,second,considers,cross,effects,leverages,marginal,predicted,highly,consistent,users,preferences,believe,work,contributes,improving,performance,providing
AB值:
0.461546
相似文献
Annotating TSSs in Multiple Cell Types Based on DNA Sequence and RNA-seq Data via DeeReCT-TSS
Juexiao Zhou;Bin Zhang;Haoyang Li;Longxi Zhou;Zhongxiao Li;Yongkang Long;Wenkai Han;Mengran Wang;Huanhuan Cui;Jingjing Li;Wei Chen;Xin Gao-Computer Science Program,Computer,Electrical and Mathematical Sciences and Engineering Division,King Abdullah University of Science and Technology,Thuwal 23955-6900,Saudi Arabia;Computational Bioscience Research Center,King Abdullah University of Science and Technology,Thuwal 23955-6900,Saudi Arabia;Department of Biology,School of Life Sciences,Southern University of Science and Technology,Shenzhen 518055,China;Shenzhen Key Laboratory of Gene Regulation and Systems Biology,School of Life Sciences,Southern University of Science and Technology,Shenzhen 518055,China;Academy for Advanced Interdisciplinary Studies,Southern University of Science and Technology,Shenzhen 518055,China
机标中图分类号,由域田数据科技根据网络公开资料自动分析生成,仅供学习研究参考。