Investigating the Relevance of Arabic Text Classification Datasets Based on Supervised Learning|Ahmad Hussein Ababneh - 期刊导航|首站-论文投稿智能助手|论文发表|论文智能投稿|期刊自助发表推荐|杂志社快速发表|查同导刊-域田数据官方网站

典型文献

Investigating the Relevance of Arabic Text Classification Datasets Based on Supervised Learning

文献摘要：

Training and testing different models in the field of text classification mainly depend on the pre-classified text document datasets. Recently, seven datasets have emerged for Arabic text classification, including Single-Label Arabic News Articles Dataset (SANAD), Khaleej, Arabiya, Akhbarona, KALIMAT, Waten2004, and Khaleej2004. This study investigates which of these datasets can provide significant training and fair evaluation for text classification (TC). In this investigation, well-known and accurate learning models are used, including naive Bayes (NB), random forest (RF), K-nearest neighbor (KNN), support vector machines (SVM), and logistic regression (LR) models. We present relevance and time measures of training the models with these datasets to enable Arabic language researchers to select the appropriate dataset to use based on a solid basis of comparison. The performances of the five learning models across the seven datasets are measured and compared with the performances of the same models trained on a well-known English language dataset. The analysis of the relevance and time scores shows that training the SVM model on Khaleej and Arabiya obtained the most significant results in the shortest amount of time, with the accuracy of 82％.

文献关键词：

中图分类号：

[1] 天文学、地球科学（P） / 地球物理学（P3） / 空间物理（P35）

[2] 自动化技术、计算机技术（TP） / 计算技术、计算机技术（TP3） / 计算机的应用（TP39） / 信息处理(信息加工)（TP391）

[3] 医药、卫生（R） / 基础医学（R3） / 病理学（R36） / 病理过程（R364）

作者姓名：

Ahmad Hussein Ababneh

作者机构：

Computer Science Department,American University of Madaba,Madaba 2882

文献出处：

电子科技学刊

引用格式：

[1]Ahmad Hussein Ababneh-.Investigating the Relevance of Arabic Text Classification Datasets Based on Supervised Learning)[J].电子科技学刊,2022(02):187-208

A类：

SANAD,Khaleej,Arabiya,Akhbarona,KALIMAT,Waten2004,Khaleej2004

B类：

Investigating,Relevance,Arabic,Text,Classification,Datasets,Based,Supervised,Learning,Training,testing,different,models,field,text,classification,mainly,depend,classified,document,datasets,Recently,seven,have,emerged,including,Single,Label,News,Articles,This,study,investigates,which,these,provide,significant,training,fair,evaluation,this,investigation,well,known,accurate,learning,used,naive,Bayes,NB,random,forest,RF,nearest,neighbor,KNN,support,vector,machines,logistic,regression,LR,We,present,relevance,measures,enable,language,researchers,select,appropriate,solid,basis,comparison,performances,five,across,measured,compared,same,trained,English,analysis,scores,shows,that,obtained,most,results,shortest,amount,accuracy

AB值：

0.521328

相似文献

Scribble-Supervised Video Object Segmentation

Peiliang Huang;Junwei Han;Nian Liu;Jun Ren;Dingwen Zhang-Zhang are with the Brain and Artificial Intelligence Laboratory,School of Automation,Northwestern Polytechnical University,Xi'an 710072,China;Department of Engagement Services,Mohamed Bin Zayed University of Artificial Intelligence,AbuDhabi,United Arab Emirate;Science and Technology on Complex System Control and Intelligent Agent Cooperation Laboratory,Beijing,China

Belief Combination of Classifiers for Incomplete Data

Zuowei Zhang;Songtao Ye;Yiru Zhang;Weiping Ding;Hao Wang-Research Center for Optical Fiber Sensing,Zhejiang Laboratory,Hangzhou 310000;School of Automation,Northwestern Polytechnical University,Xi'an 710072,China;Research Center for Optical Fiber Sensing,Zhejiang Laboratory,Hangzhou 310000,China;School of Cyber Engineering,Xidian University,Xi'an 710000,China;Department of Computer Science,CY Cergy Paris Université,2 Av.Adolphe Chauvin,95302 Cedex,France;School of Information Science and Technology,Nantong University,Nantong 226019,China;School with of Cyber Engineering,Xidian University,Xi'an 710000,China;Department of Computer Science,Norwegian University of Science and Technology,Trondheim 7491,Norway

Meta Ordinal Regression Forest for Medical Image Classification With Ordinal Labels

Yiming Lei;Haiping Zhu;Junping Zhang;Hongming Shan-Shanghai Key Laboratory of Intelligent Information Processing,the School of Computer Science,Fudan University,Shanghai 200433,China;Shanghai Key Laboratory of Intelligent Information Processing,the School of Computer Science,Fudan University,Shanghai 200433,Huawei,Shanghai 200120,China;Institute of Science and Technology for Brain-Inspired Intelligence and MOE Frontiers Center for Brain Science,Fudan University,Shanghai 200433;Shanghai Center for Brain Science and Brain-Inspired Technology,Shanghai 200031,China

Visuals to Text:A Comprehensive Review on Automatic Image Captioning

Yue Ming;Nannan Hu;Chunxiao Fan;Fan Feng;Jiangwan Zhou;Hui Yu-Beijing University of Posts and Telecommunications,Beijing 100876,China;School of Creative Technologies,University of Ports-mouth,Portsmouth PO1 2DJ,UK

Self-Supervised Graph Neural Networks for Accurate Prediction of Néel Temperature