Modelo de classificação de dados não estruturados para análise da competitividade de mercado / Unstructured data classification model for competitiveness analysis competitiveness

Cloud computing is the most popular term among enterprises and news. The concepts come true because of fast internet bandwidth and advanced cooperation technology. Resources on the cloud can be accessed through internet without self built infrastructure. Cloud computing is effectively manage the security in the cloud applications. Data classification is a machine learning technique used to predict the class of the unclassified data. Data mining uses different tools to know the unknown, valid patterns and relationships in the dataset. These tools are mathematical algorithms, statistical models and Machine Learning (ML) algorithms. In this paper author uses improved Bayesian technique to classify the data and encrypt the sensitive data using hybrid stagnography. The encrypted and non encrypted sensitive data is sent to cloud environment and evaluate the parameters with different encryption algorithms.

Download Full-text

Optimal Deep Learning based Data Classification Model for Type-2 Diabetes Mellitus Diagnosis and Prediction System

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.c8656.019320 ◽

2020 ◽

Vol 9 (3) ◽

pp. 1596-1604

Keyword(s):

Deep Learning ◽

Data Classification ◽

Research Area ◽

Classification Model ◽

Prediction System ◽

Pima Indians ◽

Significant Research ◽

Diabetes Mellitus Diagnosis ◽

Simulation Results

In recent days, deep learning models become a significant research area because of its applicability in diverse domains. In this paper, we employ an optimal deep neural network (DNN) based model for classifying diabetes disease. The DNN is employed for diagnosing the patient diseases effectively with better performance. To further improve the classifier efficiency, multilayer perceptron (MLP) is employed to remove the misclassified instance in the dataset. Then, the processed data is again provided as input to the DNN based classification model. The use of MLP significantly helps to remove the misclassified instances. The presented optimal data classification model is experimented on the PIMA Indians Diabetes dataset which holds the medical details of 768 patients under the presence of 8 attributes for every record. The obtained simulation results verified the superior nature of the presented model over the compared methods.

Download Full-text

Research on the Construction of Big Data Classification System Based on Distributed Data Flow

Journal of Physics Conference Series ◽

10.1088/1742-6596/2136/1/012057 ◽

2021 ◽

Vol 2136 (1) ◽

pp. 012057

Author(s):

Han Zhou

Keyword(s):

Big Data ◽

New Technologies ◽

Data Classification ◽

Classification Model ◽

Distributed Data ◽

Analysis Model ◽

Database Construction ◽

Big Data Classification ◽

Operation Status ◽

Model Algorithm

Abstract In the context of the comprehensive popularization of network technical services and database construction system, more and more data are used by enterprises or individuals. It is difficult for the existing technology to meet the technical analysis requirements of the development of the era of big data. Therefore, in the development of practice, we should continue to explore new technologies and methods to reasonably use big data. Therefore, on the basis of understanding the current big data technology and its system operation status, this paper designs relevant algorithms according to the big data classification model, and verifies the effectiveness of the analysis model algorithm based on practice.

Download Full-text

Data Classification Model for Fog-Enabled Mobile IoT Systems

Advances in Intelligent Systems and Computing - Congress on Intelligent Systems ◽

10.1007/978-981-33-6984-9_11 ◽

2021 ◽

pp. 125-138

Author(s):

Aung Myo Thaw ◽

Nataly Zhukova ◽

Tin Tun Aung ◽

Vladimir Chernokulsky

Keyword(s):

Data Classification ◽

Classification Model

Download Full-text

An Online Education Data Classification Model Based on Tr_MAdaBoost Algorithm

Chinese Journal of Electronics ◽

10.1049/cje.2018.06.006 ◽

2019 ◽

Vol 28 (1) ◽

pp. 21-28 ◽

Cited By ~ 1

Author(s):

Lasheng Yu ◽

Xu Wu ◽

Yu Yang

Keyword(s):

Online Education ◽

Data Classification ◽

Classification Model ◽

Model Based ◽

Education Data

Download Full-text

A Novel Model for Imbalanced Data Classification

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i04.6145 ◽

2020 ◽

Vol 34 (04) ◽

pp. 6680-6687

Author(s):

Jian Yin ◽

Chunjing Gan ◽

Kaiqi Zhao ◽

Xuan Lin ◽

Zhe Quan ◽

...

Keyword(s):

Imbalanced Data ◽

Data Classification ◽

Classification Performance ◽

Classification Model ◽

Proposed Model ◽

Imbalanced Data Classification ◽

Public Datasets ◽

Distribution Cost ◽

Novel Model ◽

Learning Data

Recently, imbalanced data classification has received much attention due to its wide applications. In the literature, existing researches have attempted to improve the classification performance by considering various factors such as the imbalanced distribution, cost-sensitive learning, data space improvement, and ensemble learning. Nevertheless, most of the existing methods focus on only part of these main aspects/factors. In this work, we propose a novel imbalanced data classification model that considers all these main aspects. To evaluate the performance of our proposed model, we have conducted experiments based on 14 public datasets. The results show that our model outperforms the state-of-the-art methods in terms of recall, G-mean, F-measure and AUC.

Download Full-text

Sentiment Analysis towards Actionable Intelligence via Deep Learning

TEM Journal ◽

10.18421/tem94-44 ◽

2020 ◽

pp. 1663-1668

Author(s):

Shorouq Fathi Eletter

Keyword(s):

Deep Learning ◽

Sentiment Analysis ◽

Service Providers ◽

Unstructured Data ◽

Classification Model ◽

Online Review ◽

Support Vector ◽

Competitive Advantages ◽

Accuracy Rate ◽

Vector Machines

The exponential growth of unstructured data and the ability of businesses to utilize such data in decision-making have led to competitive advantages. The knowledge provided by analyzing unstructured data is crucial for product developers or service providers because it might affect the sustainability of the business. Sentiment analysis is used to gain an understanding of the attitudes, opinions, and emotions expressed within an online review. Naïve Bayes (NB), logistic regression (LR), decision trees (DT), deep learning (DL), and support vector machines (SVM) were used to build a classification model. In the data mining settings, the classification accuracy is the best metric to highlight the best classifier. The DL classifier outperformed other models in terms of accuracy rate. Classifying customers' feelings toward a product or service is critical for providing actionable insights. Utilizing such models will help to analyze huge volumes of reviews, saving both time and costs.

Download Full-text

MapReduce-based big data classification model using feature subset selection and hyperparameter tuned deep belief network

Scientific Reports ◽

10.1038/s41598-021-03019-y ◽

2021 ◽

Vol 11 (1) ◽

Author(s):

Surendran Rajendran ◽

Osamah Ibrahim Khalaf ◽

Youseef Alotaibi ◽

Saleh Alghamdi

Keyword(s):

Feature Selection ◽

Big Data ◽

Selection Process ◽

Data Classification ◽

Deep Belief Network ◽

Feature Subset Selection ◽

Classification Model ◽

Feature Subset ◽

Belief Network ◽

Big Data Classification

AbstractIn recent times, big data classification has become a hot research topic in various domains, such as healthcare, e-commerce, finance, etc. The inclusion of the feature selection process helps to improve the big data classification process and can be done by the use of metaheuristic optimization algorithms. This study focuses on the design of a big data classification model using chaotic pigeon inspired optimization (CPIO)-based feature selection with an optimal deep belief network (DBN) model. The proposed model is executed in the Hadoop MapReduce environment to manage big data. Initially, the CPIO algorithm is applied to select a useful subset of features. In addition, the Harris hawks optimization (HHO)-based DBN model is derived as a classifier to allocate appropriate class labels. The design of the HHO algorithm to tune the hyperparameters of the DBN model assists in boosting the classification performance. To examine the superiority of the presented technique, a series of simulations were performed, and the results were inspected under various dimensions. The resultant values highlighted the supremacy of the presented technique over the recent techniques.

Download Full-text