Artificial Intelligent Machine Learning and Big Data Mining of Desert Geothermal Heat Pump: Analysis, Design and Control

Morbidity and mortality caused by infectious diseases rank first among all human illnesses. Many pathogenic mechanisms remain unclear, while misuse of antibiotics has led to the emergence of drug-resistant strains. Infectious diseases spread rapidly and pathogens mutate quickly, posing new threats to human health. However, with the increasing use of high-throughput screening of pathogen genomes, research based on big data mining and visualization analysis has gradually become a hot topic for studies of infectious disease prevention and control. In this paper, the framework was performed on four infectious pathogens (Fusobacterium, Streptococcus, Neisseria, and Streptococcus salivarius) through five functions: 1) genome annotation, 2) phylogeny analysis based on core genome, 3) analysis of structure differences between genomes, 4) prediction of virulence genes/factors with their pathogenic mechanisms, and 5) prediction of resistance genes/factors with their signaling pathways. The experiments were carried out from three angles: phylogeny (macro perspective), structure differences of genomes (micro perspective), and virulence and drug-resistance characteristics (prediction perspective). Therefore, the framework can not only provide evidence to support the rapid identification of new or unknown pathogens and thus plays a role in the prevention and control of infectious diseases, but also help to recommend the most appropriate strains for clinical and scientific research. This paper presented a new genome information visualization analysis process framework based on big data mining technology with the accommodation of the depth and breadth of pathogens in molecular level research.

Download Full-text

Trends of Evolutionary Machine Learning to Address Big Data Mining

10.1007/978-3-030-85977-0_7 ◽

2021 ◽

pp. 85-99

Author(s):

Sana Ben Hamida ◽

Ghita Benjelloun ◽

Hmida Hmida

Keyword(s):

Machine Learning ◽

Data Mining ◽

Big Data ◽

Big Data Mining

Download Full-text

Big Data Mining Algorithms

Encyclopedia of Information Science and Technology, Fifth Edition - Advances in Information Quality and Management ◽

10.4018/978-1-7998-3479-3.ch052 ◽

2021 ◽

pp. 768-777

Author(s):

M. Govindarajan

Keyword(s):

Machine Learning ◽

Data Mining ◽

Big Data ◽

Unsupervised Learning ◽

Supervised Learning ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Data Sets ◽

Big Data Mining ◽

Supervised Learning Algorithms

Big data mining involves knowledge discovery from these large data sets. The purpose of this chapter is to provide an analysis of different machine learning algorithms available for performing big data analytics. The machine learning algorithms are categorized in three key categories, namely, supervised, unsupervised, and semi-supervised machine learning algorithm. The supervised learning algorithms are trained with a complete set of data, and thus, the supervised learning algorithms are used to predict/forecast. Example algorithms include logistic regression and the back propagation neural network. The unsupervised learning algorithms starts learning from scratch, and therefore, the unsupervised learning algorithms are used for clustering. Example algorithms include: the Apriori algorithm and K-Means. The semi-supervised learning combines both supervised and unsupervised learning algorithms. The semi-supervised algorithms are trained, and the algorithms also include non-trained learning.

Download Full-text

Research on personalized referral service and big data mining for e-commerce with machine learning

2018 4th International Conference on Computer and Technology Applications (ICCTA) ◽

10.1109/cata.2018.8398652 ◽

2018 ◽

Cited By ~ 3

Author(s):

Hui-ke Rao ◽

Zhi Zeng ◽

Ai-ping Liu

Keyword(s):

Machine Learning ◽

Data Mining ◽

Big Data ◽

Big Data Mining ◽

Referral Service

Download Full-text

Accurate computation: COVID-19 rRT-PCR positive test dataset using stages classification through textual big data mining with machine learning

The Journal of Supercomputing ◽

10.1007/s11227-020-03586-3 ◽

2021 ◽

Author(s):

Shalini Ramanathan ◽

Mohan Ramasundaram

Keyword(s):

Machine Learning ◽

Data Mining ◽

Big Data ◽

Positive Test ◽

Test Dataset ◽

Big Data Mining ◽

Accurate Computation

Download Full-text

Dynamic Distributed and Parallel Machine Learning algorithms for big data mining processing

Data Technologies and Applications ◽

10.1108/dta-06-2021-0153 ◽

2021 ◽

Vol ahead-of-print (ahead-of-print) ◽

Author(s):

Laouni Djafri

Keyword(s):

Machine Learning ◽

Data Mining ◽

Big Data ◽

Random Sampling ◽

Sampling Method ◽

Parallel Machine ◽

Machine Learning Algorithms ◽

Content Type ◽

Big Data Mining ◽

Partial Learning

PurposeThis work can be used as a building block in other settings such as GPU, Map-Reduce, Spark or any other. Also, DDPML can be deployed on other distributed systems such as P2P networks, clusters, clouds computing or other technologies.Design/methodology/approachIn the age of Big Data, all companies want to benefit from large amounts of data. These data can help them understand their internal and external environment and anticipate associated phenomena, as the data turn into knowledge that can be used for prediction later. Thus, this knowledge becomes a great asset in companies' hands. This is precisely the objective of data mining. But with the production of a large amount of data and knowledge at a faster pace, the authors are now talking about Big Data mining. For this reason, the authors’ proposed works mainly aim at solving the problem of volume, veracity, validity and velocity when classifying Big Data using distributed and parallel processing techniques. So, the problem that the authors are raising in this work is how the authors can make machine learning algorithms work in a distributed and parallel way at the same time without losing the accuracy of classification results. To solve this problem, the authors propose a system called Dynamic Distributed and Parallel Machine Learning (DDPML) algorithms. To build it, the authors divided their work into two parts. In the first, the authors propose a distributed architecture that is controlled by Map-Reduce algorithm which in turn depends on random sampling technique. So, the distributed architecture that the authors designed is specially directed to handle big data processing that operates in a coherent and efficient manner with the sampling strategy proposed in this work. This architecture also helps the authors to actually verify the classification results obtained using the representative learning base (RLB). In the second part, the authors have extracted the representative learning base by sampling at two levels using the stratified random sampling method. This sampling method is also applied to extract the shared learning base (SLB) and the partial learning base for the first level (PLBL1) and the partial learning base for the second level (PLBL2). The experimental results show the efficiency of our solution that the authors provided without significant loss of the classification results. Thus, in practical terms, the system DDPML is generally dedicated to big data mining processing, and works effectively in distributed systems with a simple structure, such as client-server networks.FindingsThe authors got very satisfactory classification results.Originality/valueDDPML system is specially designed to smoothly handle big data mining classification.

Download Full-text

Research on the Application of Machine Learning Big Data Mining Algorithms in Digital Signal Processing

2021 IEEE Asia-Pacific Conference on Image Processing, Electronics and Computers (IPEC) ◽

10.1109/ipec51340.2021.9421229 ◽

2021 ◽

Author(s):

Simin Niu

Keyword(s):

Machine Learning ◽

Data Mining ◽

Signal Processing ◽

Big Data ◽

Digital Signal Processing ◽

Digital Signal ◽

Big Data Mining ◽

Data Mining Algorithms ◽

Mining Algorithms

Download Full-text