Applying Improved Apriori Algorithm in Figuring out the Relation between Weather Factors and Rainfall

Siti Zulaikha; Martaleli Bettiza; Nola Ritha

doi:10.31629/jit.v1i1.2133

Journal of Innovation and Technology ◽

10.31629/jit.v1i1.2133 ◽

2020 ◽

Vol 1 (1) ◽

pp. 23-26

Author(s):

Siti Zulaikha ◽

Martaleli Bettiza ◽

Nola Ritha

Keyword(s):

Data Mining ◽

Big Data ◽

Light Intensity ◽

Apriori Algorithm ◽

Weather Factors ◽

Factors Affecting ◽

Data Store ◽

Repeated Pattern ◽

Major Factors ◽

Sun Light

Rainfall data are worth studying because rainfall is one of the major factors affecting the weather in a region, as well as many aspects of life. Rainfall is generally predicted by analyzing historical data with particular methods. Rainfall tends to follow repeated patterns over time. Big data mining is expected to reveal valuable information that was previously hidden in large data stores. Methods used in data mining include the Apriori algorithm and the Improved Apriori algorithm. Improved Apriori represents the database as a matrix that describes the relations among its items. The data used in this research are the rainfall factors recorded in 2016 in Tanjungpinang city. Testing the Improved Apriori algorithm yielded the following relations between rainfall and weather factors. With 2-item sets: if the temperature is low (24.0–26.0) and the humidity is high (85–100), then the rainfall is mild; if the temperature is low (24.0–26.0) and the light intensity is low (0–3), then the rainfall is heavy. With 3-item sets: if the temperature is low (24.0–26.0), the humidity is high (85–100), and the sunlight intensity is low (0–3), then the rainfall is medium.
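To make the form of these rules concrete, the sketch below discretizes a few weather readings with the bin ranges quoted in the abstract and computes the support and confidence of one "if temperature is low and humidity is high, then rainfall is mild" rule. It is only an illustration: the readings in `raw`, the `other` labels, and the function names `discretize`, `support`, and `confidence` are assumptions, not the paper's data or code.

```python
def discretize(temperature, humidity, light, rain_label):
    """Map raw readings to categorical items.

    The three ranges come from the abstract; every value outside them is
    lumped into a hypothetical "other" label.
    """
    items = {"rain=" + rain_label}
    items.add("temp=low" if 24.0 <= temperature <= 26.0 else "temp=other")
    items.add("humidity=high" if 85 <= humidity <= 100 else "humidity=other")
    items.add("light=low" if 0 <= light <= 3 else "light=other")
    return items


# Hypothetical toy readings (NOT the 2016 Tanjungpinang data):
# (temperature, humidity, light intensity, rainfall class)
raw = [
    (25.0, 90, 5, "mild"),
    (24.5, 88, 6, "mild"),
    (25.5, 70, 2, "heavy"),
    (29.0, 60, 8, "none"),
]
records = [discretize(*row) for row in raw]


def support(itemset, rows):
    """Fraction of rows that contain every item in itemset."""
    itemset = set(itemset)
    return sum(itemset <= row for row in rows) / len(rows)


def confidence(antecedent, consequent, rows):
    """support(antecedent and consequent together) / support(antecedent)."""
    base = support(antecedent, rows)
    if base == 0:
        return 0.0
    return support(set(antecedent) | set(consequent), rows) / base


rule_support = support({"temp=low", "humidity=high", "rain=mild"}, records)
rule_confidence = confidence({"temp=low", "humidity=high"}, {"rain=mild"}, records)
```

A rule is normally reported only when both values clear user-chosen minimum thresholds; the abstract does not state which thresholds the authors used.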

Research of Association Rules Algorithm Based on Matrix under Cloud Computing

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.568-570.798 ◽

2014 ◽

Vol 568-570 ◽

pp. 798-801

Author(s):

Ye Qing Xiong ◽

Shu Dong Zhang

Keyword(s):

Data Mining ◽

Cloud Computing ◽

Big Data ◽

Association Rules ◽

Apriori Algorithm ◽

Computing Technology ◽

Binary Matrix ◽

Transaction Data ◽

Big Data Mining ◽

Parallel Mining

Traditional association rule algorithms run into time and space performance bottlenecks when applied to big data mining. This paper proposes a matrix-based parallel algorithm under cloud computing to improve the Apriori algorithm. The algorithm stores transaction data in a binary matrix, replaces the join between itemsets with a matrix "and" operation, and uses cloud computing technology to mine frequent itemsets in parallel. Simulations under different conditions show that it improves efficiency, relieves the performance bottleneck, and can be widely used in big data mining with strong scalability and stability.
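The binary-matrix idea can be sketched on a single machine as follows. Each item gets one bit-vector column (bit t is 1 when transaction t contains the item), and the support of an itemset is the number of 1 bits left after AND-ing its columns. This is a simplified, non-parallel illustration: the toy transactions and the names `columns`, `support_count`, and `frequent_itemsets` are assumptions, and the candidate generation is plainer than full Apriori pruning.

```python
from itertools import combinations

# Hypothetical toy transactions, not data from the paper.
transactions = [
    {"A", "B", "C"},
    {"A", "C"},
    {"B", "C"},
    {"A", "B", "C"},
]
items = sorted(set().union(*transactions))

# One column of the binary matrix per item, packed into a Python int:
# bit t is set when transaction t contains the item.
columns = {
    item: sum(1 << t for t, row in enumerate(transactions) if item in row)
    for item in items
}


def support_count(itemset):
    """AND the columns of the items and count the surviving 1 bits."""
    mask = (1 << len(transactions)) - 1  # start with all transactions
    for item in itemset:
        mask &= columns[item]
    return bin(mask).count("1")


def frequent_itemsets(min_count):
    """Level-wise search; only items still frequent at level k feed level k+1."""
    frequent = {}
    current_items = items
    k = 1
    while True:
        level = {}
        for candidate in combinations(current_items, k):
            count = support_count(candidate)
            if count >= min_count:
                level[candidate] = count
        if not level:
            break
        frequent.update(level)
        current_items = sorted({i for itemset in level for i in itemset})
        k += 1
    return frequent


result = frequent_itemsets(min_count=3)
```

Because each support count is a chain of bitwise ANDs rather than a scan of the transaction list, the columns could be split across workers, which is the part the paper delegates to cloud computing.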

Analysis of Tumor Disease Patterns Based on Medical Big Data

Journal of Medical Imaging and Health Informatics ◽

10.1166/jmihi.2021.3306 ◽

2021 ◽

Vol 11 (2) ◽

pp. 478-486

Author(s):

Jing Zheng ◽

Zhongjun Gao ◽

Lixin Pu ◽

Mingjie He ◽

Jipeng Fan ◽

...

Keyword(s):

Data Mining ◽

Big Data ◽

Business Process ◽

Data Science ◽

Process Analysis ◽

Practical Significance ◽

Apriori Algorithm ◽

Cancer Disease ◽

Analysis And Design ◽

Medical Big Data

Using medical big data mining technology, a model of tumor disease was analyzed and studied. With data science methods as the guiding approach, a big-data-based medical service model for oncology diseases was analyzed and constructed, and its development strategy was explored. Business process analysis was used to analyze and map the business processes of cancer medical services, and a service-oriented architecture analysis and design methodology was used to build a highly flexible, configurable, and easily scalable precision medical big data platform. After analyzing the characteristics of medical big data and the shortcomings of the traditional Apriori algorithm, the Hadoop platform was used to improve and optimize the Apriori algorithm. The results show that the improved Apriori algorithm is markedly more efficient and performs better, and that it can be adapted to mining medical big data. The data mining experiments indicate an association between tumors and smoking, chronic infection, occupational pathogenic factors, and other factors. These findings offer guidance for the prevention and treatment of tumors, and they demonstrate that the improved Apriori algorithm has practical significance for clinical research on lung tumors.

A Combined Horizontal Parallel Apriori Algorithm and Adaptive Frequent Pattern Growth Algorithm for Big Data Mining

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.b1133.1292s219 ◽

2019 ◽

Vol 9 (2S2) ◽

pp. 859-863

Keyword(s):

Data Mining ◽

Big Data ◽

Frequent Itemsets ◽

Frequent Pattern ◽

Apriori Algorithm ◽

Distributed Computing Systems ◽

Big Data Mining ◽

Pattern Growth ◽

Mining Algorithms ◽

Combined Algorithm

Because of the massive size and complexity of the data, big data mining on a single computer is problematic. As databases grow rapidly, parallel and distributed computing systems can deliver greater benefits in data mining applications. Parallelizing Association Rule Mining (ARM) algorithms is an important task for effectively mining frequent itemsets from large databases. These mining algorithms partition the database horizontally or increase the number of processors to reduce the overall time needed to mine the frequent itemsets. In this paper, a combined Horizontal Parallel-Apriori (HP-Apriori) and Adaptive Frequent Pattern (FP) Growth algorithm is proposed that divides the database both horizontally and vertically into four sub-processes, so that all four tasks are processed in parallel. The Horizontal Parallel-Apriori algorithm speeds up the mining process by using an index file. Adaptive Binomial Distribution (ABD) is applied to the Frequent Pattern Growth algorithm to find the minimum support for mining the optimal frequent itemsets. Experimental analysis showed that the combined algorithm reduces the overall execution time and increases computational speed while remaining highly scalable.
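The horizontal partitioning that this abstract builds on can be illustrated with a minimal sketch: the transactions are split row-wise into partitions, each partition counts its candidate itemsets independently, and the local counts are summed. This shows only that generic idea, not the paper's HP-Apriori index file or its ABD-based minimum support; the toy rows and the names `horizontal_partitions` and `local_counts` are assumptions.

```python
from collections import Counter
from itertools import combinations


def horizontal_partitions(rows, n_parts):
    """Split the transaction list row-wise into n_parts slices."""
    return [rows[i::n_parts] for i in range(n_parts)]


def local_counts(partition, k):
    """Count every k-itemset that occurs in one partition."""
    counts = Counter()
    for row in partition:
        for combo in combinations(sorted(row), k):
            counts[combo] += 1
    return counts


# Hypothetical toy transactions, not data from the paper.
rows = [{"A", "B"}, {"A", "C"}, {"A", "B", "C"}, {"B", "C"}]

parts = horizontal_partitions(rows, 2)

# Each call below is independent of the others, so in a real system the
# partitions could be handed to separate processes or machines.
partial = [local_counts(part, 2) for part in parts]

# Merge step: global support count = sum of the local counts.
total = sum(partial, Counter())
```

The merged counts equal those obtained by scanning the whole database once, which is what makes the row-wise split safe to run in parallel.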

Bibliometric Knowledge Mapping of E-Commerce Platform Operation on Data Mining

10.20944/preprints202012.0529.v1 ◽

2020 ◽

Author(s):

Min Ye ◽

Hongxia Li

Keyword(s):

Data Mining ◽

Big Data ◽

Quantitative Research ◽

Common Knowledge ◽

Development Trend ◽

Research Field ◽

Knowledge Mapping ◽

Structure Relationship ◽

Major Factors ◽

Research Frontiers

In the digital economy era, the e-commerce platform has evolved into a data platform ecosystem built around data resources and data mining technology. The most typical applications of big data are also concentrated in e-commerce. E-commerce companies should first grasp the interaction among three major factors: data, technology, and innovation. E-commerce platform operation is a multidisciplinary research field, and it is not easy for researchers to obtain a panoramic view of its knowledge structure. A knowledge graph is a graph that takes a field of knowledge as its object and shows the development process and structural relationships of that knowledge. It is both a visual knowledge map and a serialized knowledge pedigree, and it gives researchers a quantitative method for studying development trends and academic status. The purpose of this research is to help researchers understand the key knowledge, evolutionary trends, and research frontiers of current research. This study uses Citespace bibliometric analysis to analyze data from the Science Net database and finds that: 1) the research field has developed through three stages, and some representative key scholars and key documents have been identified; 2) knowledge mapping of literature co-citation and keyword co-occurrence shows the research hotspots; 3) the results of burst detection and central node analysis reveal research frontiers and development trends. Today, the visualization of big data brings different challenges. The abstraction between the world and today's data visualization occurs when the data is captured. Every user sees their own visualization, generated by standardized calculations. At the same time, there are still many controversies over the theoretical model, structure, and structural dimensions. These are directions for future research.

Challenges and Cloud Computing Environments Towards Big Data

International Journal of Scientific Research in Science Engineering and Technology ◽

10.32628/ijsrset207277 ◽

2014 ◽

pp. 203-208

Author(s):

Kiran Kumar S V N Madupu

Keyword(s):

Data Mining ◽

Cloud Computing ◽

Big Data ◽

Technology Development ◽

Computing Environments ◽

Modern Technologies

Big Data has a tremendous influence on scientific discovery and on value creation. This paper presents data mining approaches and modern technologies for Big Data. It discusses the difficulties of data mining in general and of data mining with big data, and it also presents some technology developments in both areas.

Major Factors Affecting Waste Generation on Construction Sites in Iran

Proceedings of the 2015 (6th) International Conference on Engineering, Project, and Production Management ◽

10.32738/ceppm.201509.0051 ◽

2015 ◽

Cited By ~ 1

Author(s):

Bahareh Nikmehr ◽

M Reza Hosseini ◽

Mehran Oraee ◽

Nicholas Chileshe ◽

...

Keyword(s):

Waste Generation ◽

Construction Sites ◽

Factors Affecting ◽

Major Factors

Major factors affecting the severity of motor vehicle accidents in Sri Lanka

Proceedings of The 4th International Conference on Applied Research in Science, Technology and Knowledge ◽

10.33422/4th.stk.2019.11.674 ◽

2019 ◽

Author(s):

N.A.M.R Senaviratna ◽

T.M.J.A Cooray

Keyword(s):

Sri Lanka ◽

Motor Vehicle ◽

Motor Vehicle Accidents ◽

Factors Affecting ◽

Vehicle Accidents ◽

Major Factors

Analytical Study on Big Data

International Journal of Advanced Research in Computer Science and Software Engineering ◽

10.23956/ijarcsse.v8i5.668 ◽

2018 ◽

Vol 8 (5) ◽

pp. 75

Author(s):

Vivek Raich ◽

Pankaj Maurya

Keyword(s):

Information Technology ◽

Big Data ◽

Decision Maker ◽

Analytical Study ◽

Large Data ◽

Decision Makers ◽

Continuous Increase ◽

Analytic Methods ◽

Data Store ◽

Business Engineering

In the age of information technology, ever more data is being stored. As a result, huge amounts of data are available to decision makers, which has driven the progress of information technology and its wide growth in many areas of business, engineering, medicine, and scientific study. Big data is not only large in size but also varied in type, which makes it hard to handle without dedicated technology. Because data keeps increasing in this way, it is important to study and manage these datasets according to requirements so that the necessary information can be obtained. The aim of this paper is to analyze some of the analytic methods and tools that can be applied to large data. In addition, applications of Big Data are analyzed, with decision makers working on big data and using the resulting insights for different applications.

Retrieving Information and Discovering Knowledge from Unstructured Data Using Big Data Mining Technique: Heavy Oil Fields Example

10.2523/17805-ms ◽

2014 ◽

Cited By ~ 1

Author(s):

Wenkuang Wu ◽

Xiaoguang Lu ◽

Ben Cox ◽

Guoqiang Li ◽

Lihua Lin ◽

...

Keyword(s):

Data Mining ◽

Big Data ◽

Heavy Oil ◽

Oil Fields ◽

Unstructured Data ◽

Data Mining Technique ◽

Big Data Mining ◽

Mining Technique

Implementation Of Data Mining Association Methods With Apriori Algorithm For Determining The Key Players Of Football Club

International Journal of Computer Techniques ◽

10.29126/23942231/ijct-v7i2p13 ◽

2020 ◽

Vol 7 (2) ◽

Author(s):

Ari Zakaria ◽

Arief Wibowo

Keyword(s):

Data Mining ◽

Apriori Algorithm ◽

Football Club ◽

Key Players
