Determining an Optimal Data Classification Model for Credibility-Based Fake News Detection

Author(s):  
Amit Neil Ramkissoon ◽  
Shareeda Mohammed ◽  
Wayne Goodridge

Symmetry ◽  
2021 ◽  
Vol 13 (4) ◽  
pp. 556
Author(s):  
Thaer Thaher ◽  
Mahmoud Saheb ◽  
Hamza Turabieh ◽  
Hamouda Chantar

Fake or false information on social media platforms is a significant challenge: it deliberately misleads users through rumors, propaganda, or deceptive claims about a person, organization, or service. Twitter is one of the most widely used social media platforms, especially in the Arab region, where the number of users is steadily increasing, accompanied by an increase in the rate of fake news. This has drawn researchers' attention to providing a safe online environment free of misleading information. This paper proposes a smart classification model for the early detection of fake news in Arabic tweets utilizing Natural Language Processing (NLP) techniques, Machine Learning (ML) models, and the Harris Hawks Optimizer (HHO) as a wrapper-based feature selection approach. An Arabic Twitter corpus composed of 1862 previously annotated tweets was used to assess the efficiency of the proposed model. The Bag of Words (BoW) model with different term-weighting schemes is used for feature extraction. Eight well-known learning algorithms are investigated with varying combinations of features, including user-profile, content-based, and word features. Reported results show that Logistic Regression (LR) with Term Frequency-Inverse Document Frequency (TF-IDF) features achieves the best rank. Moreover, feature selection based on the binary HHO algorithm plays a vital role in reducing dimensionality, thereby enhancing the learning model's performance for fake news detection. Interestingly, the proposed BHHO-LR model yields an improvement of about 5% compared with previous works on the same dataset.
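As a rough illustration of the TF-IDF plus Logistic Regression baseline described above, the sketch below uses scikit-learn. It is not the authors' exact pipeline: the corpus loading, preprocessing, and the HHO feature-selection step are omitted, and the inputs are assumed to be raw tweet texts with binary labels (1 = fake, 0 = credible).

```python
# Minimal sketch of a TF-IDF + Logistic Regression tweet classifier.
# Corpus loading, Arabic-specific preprocessing, and BHHO feature selection
# are assumed to happen elsewhere.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.metrics import classification_report


def train_tfidf_lr(tweets, labels):
    """Train and evaluate a TF-IDF + LR fake-news classifier."""
    X_train, X_test, y_train, y_test = train_test_split(
        tweets, labels, test_size=0.2, random_state=42, stratify=labels)

    # Bag-of-Words representation weighted with TF-IDF
    vectorizer = TfidfVectorizer(max_features=5000)
    X_train_vec = vectorizer.fit_transform(X_train)
    X_test_vec = vectorizer.transform(X_test)

    clf = LogisticRegression(max_iter=1000)
    clf.fit(X_train_vec, y_train)
    print(classification_report(y_test, clf.predict(X_test_vec)))
    return vectorizer, clf
```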


Author(s):  
Sakshi Kaushal ◽  
Bala Buksh

Cloud computing is one of the most popular terms among enterprises and in the news. The concept has become practical because of fast Internet bandwidth and advanced cooperation technology. Resources on the cloud can be accessed through the Internet without building one's own infrastructure, so cloud computing must manage security in cloud applications effectively. Data classification is a machine learning technique used to predict the class of unclassified data. Data mining uses different tools to discover unknown, valid patterns and relationships in a dataset; these tools include mathematical algorithms, statistical models, and Machine Learning (ML) algorithms. In this paper, the authors use an improved Bayesian technique to classify the data and encrypt the sensitive data using hybrid steganography. The encrypted and non-encrypted sensitive data are sent to the cloud environment, and the parameters are evaluated with different encryption algorithms.
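A minimal sketch of the classify-then-protect idea is given below, using a standard (not the paper's "improved") Naive Bayes classifier to flag records as sensitive before upload. The feature representation, labels, and the hybrid steganography step are assumptions and are only stubbed here.

```python
# Sketch: Naive Bayes flags sensitive records; only those would be
# encrypted/hidden before being sent to the cloud.
from sklearn.naive_bayes import GaussianNB


def split_sensitive(features, labels, new_records):
    """Train on labelled records (1 = sensitive, 0 = non-sensitive) and
    partition new records accordingly."""
    model = GaussianNB()
    model.fit(features, labels)
    predictions = model.predict(new_records)

    sensitive = [r for r, p in zip(new_records, predictions) if p == 1]
    public = [r for r, p in zip(new_records, predictions) if p == 0]
    # The sensitive partition would then go through the protection step
    # (e.g. the hybrid steganography described above) before upload.
    return sensitive, public
```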


In recent years, deep learning models have become a significant research area because of their applicability in diverse domains. In this paper, we employ an optimal deep neural network (DNN) based model for classifying diabetes disease. The DNN diagnoses patient disease effectively with better performance. To further improve classifier efficiency, a multilayer perceptron (MLP) is employed to remove misclassified instances from the dataset. The processed data is then provided again as input to the DNN-based classification model. The use of the MLP significantly helps to remove the misclassified instances. The presented optimal data classification model is evaluated on the PIMA Indians Diabetes dataset, which holds the medical details of 768 patients with 8 attributes per record. The obtained simulation results verify the superiority of the presented model over the compared methods.
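The two-stage idea above can be sketched roughly as follows: an MLP is trained as a filter, instances it misclassifies are dropped, and a deeper network is trained on the cleaned data. The architectures, hyperparameters, and dataset loading are assumptions; the abstract does not specify them.

```python
# Sketch of "filter with an MLP, then train the DNN on the cleaned data".
# X is a NumPy feature matrix (e.g. the 8 PIMA attributes), y the labels.
import numpy as np
from sklearn.neural_network import MLPClassifier


def filter_then_classify(X, y):
    """Remove instances misclassified by an MLP, then train a deeper model."""
    # Stage 1: small MLP used as a noise filter
    mlp = MLPClassifier(hidden_layer_sizes=(16,), max_iter=1000, random_state=0)
    mlp.fit(X, y)
    keep = mlp.predict(X) == y          # keep only correctly classified rows
    X_clean, y_clean = X[keep], y[keep]

    # Stage 2: deeper network trained on the filtered data
    dnn = MLPClassifier(hidden_layer_sizes=(64, 32, 16), max_iter=2000,
                        random_state=0)
    dnn.fit(X_clean, y_clean)
    return dnn
```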


2021 ◽  
Vol 2136 (1) ◽  
pp. 012057
Author(s):  
Han Zhou

Abstract: With the widespread adoption of network technical services and database construction systems, more and more data are used by enterprises and individuals, and existing technology struggles to meet the analysis requirements of the big data era. New technologies and methods must therefore be explored in practice so that big data can be used reasonably. On the basis of a review of current big data technology and the operating status of its systems, this paper designs algorithms according to a big data classification model and verifies the effectiveness of the analysis model's algorithms in practice.


Author(s):  
Aung Myo Thaw ◽  
Nataly Zhukova ◽  
Tin Tun Aung ◽  
Vladimir Chernokulsky

2020 ◽  
Vol 34 (04) ◽  
pp. 6680-6687
Author(s):  
Jian Yin ◽  
Chunjing Gan ◽  
Kaiqi Zhao ◽  
Xuan Lin ◽  
Zhe Quan ◽  
...  

Recently, imbalanced data classification has received much attention due to its wide applications. In the literature, existing studies have attempted to improve classification performance by considering various factors such as the imbalanced distribution, cost-sensitive learning, data space improvement, and ensemble learning. Nevertheless, most existing methods focus on only part of these main aspects/factors. In this work, we propose a novel imbalanced data classification model that considers all of these main aspects. To evaluate the performance of our proposed model, we conducted experiments on 14 public datasets. The results show that our model outperforms state-of-the-art methods in terms of recall, G-mean, F-measure, and AUC.
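For reference, the evaluation metrics reported above can be computed as in the sketch below, using scikit-learn only. The inputs (true labels, predicted labels, and predicted scores for the positive class) are assumed to come from any fitted binary classifier; this is not the authors' evaluation code.

```python
# Sketch: recall, G-mean, F-measure and AUC for a binary imbalanced problem.
import numpy as np
from sklearn.metrics import recall_score, f1_score, roc_auc_score


def imbalance_metrics(y_true, y_pred, y_score):
    """Return recall (positive class), G-mean, F-measure and AUC."""
    per_class_recall = recall_score(y_true, y_pred, average=None)
    # G-mean: geometric mean of the per-class recalls
    g_mean = float(np.sqrt(np.prod(per_class_recall)))
    return {
        "recall": recall_score(y_true, y_pred, pos_label=1),
        "g_mean": g_mean,
        "f_measure": f1_score(y_true, y_pred, pos_label=1),
        "auc": roc_auc_score(y_true, y_score),
    }
```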

