Representative Literature
Improving Parameter Estimation and Defensive Ability of Latent Dirichlet Allocation Model Training Under Rényi Differential Privacy
Abstract:
Latent Dirichlet allocation (LDA) is a topic model widely used for discovering hidden semantics in massive text corpora. Collapsed Gibbs sampling (CGS), a widely used algorithm for learning the parameters of LDA, carries a risk of privacy leakage. Specifically, the word count statistics and latent topic updates in CGS, which are essential for parameter estimation, can be exploited by adversaries to conduct effective membership inference attacks (MIAs). To date, two kinds of methods have been used in CGS to defend against MIAs: adding noise to word count statistics and utilizing inherent privacy. Both have limitations. Noise sampled from the Laplacian distribution sometimes produces negative word count statistics, which severely degrade parameter estimation in CGS, while utilizing inherent privacy provides only a weak privacy guarantee against MIAs. It is therefore desirable to have an effective framework that obtains accurate parameter estimates with guaranteed differential privacy. The key to accurate parameter estimation when introducing differential privacy into CGS is making good use of the privacy budget so that a precise noise scale can be derived. For the first time, Rényi differential privacy (RDP) is introduced into CGS: we propose RDP-LDA, an effective framework for analyzing the privacy loss of any differentially private CGS. RDP-LDA can derive a tighter upper bound on privacy loss than the overestimated results that existing differentially private CGS obtains under ε-DP. Within RDP-LDA, we propose a novel truncated-Gaussian mechanism that keeps word count statistics non-negative, and a distribution-perturbation method that provides a more rigorous privacy guarantee than utilizing inherent privacy. Experiments validate that our proposed methods produce more accurate parameter estimates under the JS-divergence metric, and that attackers obtain lower precision and recall when our methods defend against MIAs.
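The abstract does not spell out the truncated-Gaussian mechanism's exact definition; a minimal sketch of the general idea, assuming rejection sampling as the truncation method and a hypothetical noise scale `sigma`, is:

```python
import random

def truncated_gaussian_count(count, sigma):
    """Add Gaussian noise to a word count, resampling until the
    result is non-negative (avoids the negative counts that
    Laplacian noise can produce)."""
    while True:
        noisy = count + random.gauss(0.0, sigma)
        if noisy >= 0:
            return noisy

# Example: perturb a vector of word counts.
counts = [5, 0, 12]
noisy_counts = [truncated_gaussian_count(c, sigma=1.0) for c in counts]
assert all(v >= 0 for v in noisy_counts)
```

Rejection sampling is only one way to realize truncation (clipping at zero is another); the privacy accounting in the paper depends on the exact choice, which this sketch does not attempt to reproduce.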
Authors:
Tao Huang; Su-Yun Zhao; Hong Chen; Yi-Xuan Liu
Author Affiliations:
Key Laboratory of Data Engineering and Knowledge Engineering (Renmin University of China), Ministry of Education, Beijing 100087, China; School of Information, Renmin University of China, Beijing 100087, China
Citation:
[1] Tao Huang, Su-Yun Zhao, Hong Chen, Yi-Xuan Liu. Improving Parameter Estimation and Defensive Ability of Latent Dirichlet Allocation Model Training Under Rényi Differential Privacy [J]. Journal of Computer Science and Technology, 2022(06): 1382-1397.