FAILED
首站-论文投稿智能助手
典型文献
One-against-all-based Hellinger distance decision tree for multiclass imbalanced learning
文献摘要:
Since traditional machine learning methods are sensitive to skewed distribution and do not consider the characteristics in multiclass imbalance problems, the skewed distribution of multiclass data poses a major challenge to machine learning algorithms. To tackle such issues, we propose a new splitting criterion of the decision tree based on the one-against-all-based Hellinger distance (OAHD). Two crucial elements are included in OAHD. First, the one-against-all scheme is integrated into the process of computing the Hellinger distance in OAHD, thereby extending the Hellinger distance decision tree to cope with the multiclass imbalance problem. Second, for the multiclass imbalance problem, the distribution and the number of distinct classes are taken into account, and a modified Gini index is designed. Moreover, we give theoretical proofs for the properties of OAHD, including skew insensitivity and the ability to seek a purer node in the decision tree. Finally, we collect 20 public real-world imbalanced data sets from the Knowledge Extraction based on Evolutionary Learning (KEEL) repository and the University of California, Irvine (UCI) repository. Experimental and statistical results show that OAHD significantly improves the performance compared with the five other well-known decision trees in terms of Precision, F-measure, and multiclass area under the receiver operating characteristic curve (MAUC). Moreover, through statistical analysis, the Friedman and Nemenyi tests are used to prove the advantage of OAHD over the five other decision trees.
文献关键词:
作者姓名:
Minggang DONG;Ming LIU;Chao JING
作者机构:
School of Information Science and Engineering,Guilin University of Technology,Guilin 541004,China;Guangxi Key Laboratory of Embedded Technology and Intelligent System,Guilin 541004,China;Guangxi Key Laboratory of Trusted Software,Guilin University of Electronic Technology,Guilin 541004,China
引用格式:
[1]Minggang DONG;Ming LIU;Chao JING-.One-against-all-based Hellinger distance decision tree for multiclass imbalanced learning)[J].信息与电子工程前沿(英文),2022(02):278-290
A类:
OAHD,purer,MAUC
B类:
One,against,Hellinger,distance,decision,multiclass,imbalanced,learning,Since,traditional,machine,methods,sensitive,skewed,distribution,do,not,consider,characteristics,problems,data,poses,major,challenge,algorithms,To,tackle,such,issues,propose,new,splitting,criterion,one,Two,crucial,elements,included,First,scheme,integrated,into,process,computing,thereby,extending,cope,Second,number,distinct,classes,taken,account,modified,Gini,designed,Moreover,give,theoretical,proofs,properties,including,insensitivity,ability,seek,node,Finally,collect,public,real,world,sets,from,Knowledge,Extraction,Evolutionary,Learning,KEEL,repository,University,California,Irvine,UCI,Experimental,statistical,results,show,that,significantly,improves,performance,compared,five,other,well,known,trees,terms,Precision,measure,area,under,receiver,operating,curve,through,analysis,Friedman,Nemenyi,tests,used,advantage
AB值:
0.509525
相似文献
Machine learning-based classification of rock discontinuity trace:SMOTE oversampling integrated with GBT ensemble learning
Jiayao Chen;Hongwei Huang;Anthony G.Cohn;Dongming Zhang;Mingliang Zhou-Key Laboratory of Geotechnical and Underground Engineering of Ministry of Education,Department of Geotechnical Engineering,Tongji University,Shanghai 200092,China;School of Computing,University of Leeds,LS2 9JT Leeds,United Kingdom;Department of Computer Science and Technology,Tongji University,Shanghai 211985,China;School of Civil Engineering,Shandong University,Jinan 250061,China;Luzhong Institute of Safety,Environmental Protection Engineering and Materials,Qingdao University of Science and Technology,Zibo 255000,China;School of Mechanical and Electrical Engineering,Qingdao University of Science and Technology,Qingdao 260061,China
机标中图分类号,由域田数据科技根据网络公开资料自动分析生成,仅供学习研究参考。