Genomic Anomaly Searching with BLAST Algorithm using MapReduce Framework in Big Data Platform

Big data is featured by multiple sources and heterogeneity. Based on the big data platform of Hadoop and spark, a hybrid analysis on forest fire is built in this study. This platform combines the big data analysis and processing technology, and learns from the research results of different technical fields, such as forest fire monitoring. In this system, HDFS of Hadoop is used to store all kinds of data, spark module is used to provide various big data analysis methods, and visualization tools are used to realize the visualization of analysis results, such as Echarts, ArcGIS and unity3d. Finally, an experiment for forest fire point detection is designed so as to corroborate the feasibility and effectiveness, and provide some meaningful guidance for the follow-up research and the establishment of forest fire monitoring and visualized early warning big data platform. However, there are two shortcomings in this experiment: more data types should be selected. At the same time, if the original data can be converted to XML format, the compatibility is better. It is expected that the above problems can be solved in the follow-up research.

Download Full-text

Research on the Construction of Smart Cities by the Big Data Platform of the Blockchain

Journal of Physics Conference Series ◽

10.1088/1742-6596/1883/1/012144 ◽

2021 ◽

Vol 1883 (1) ◽

pp. 012144

Author(s):

Qianwei Ma ◽

Yanxia Yang

Keyword(s):

Big Data ◽

Smart Cities ◽

Data Platform

Download Full-text

A monitoring framework for transparency and fairness in big data platform

Concurrency and Computation Practice and Experience ◽

10.1002/cpe.6069 ◽

2021 ◽

Author(s):

Karima Aslaoui Mokhtari ◽

Salima Benbernou ◽

Mourad Ouziri ◽

Hakim Lahmar ◽

Muhammad Younas

Keyword(s):

Big Data ◽

Data Platform ◽

Monitoring Framework

Download Full-text

Research on the Design of Intelligent Energy Efficiency Management System for Ships Based on Computer Big Data Platform

Journal of Physics Conference Series ◽

10.1088/1742-6596/1744/2/022026 ◽

2021 ◽

Vol 1744 (2) ◽

pp. 022026

Author(s):

Fangxuan Li ◽

Wenxue Gao

Keyword(s):

Energy Efficiency ◽

Big Data ◽

Management System ◽

Data Platform

Download Full-text

Big Data Platform for Intelligence Industrial IoT Sensor Monitoring System Based on Edge Computing and AI

2021 International Conference on Artificial Intelligence in Information and Communication (ICAIIC) ◽

10.1109/icaiic51459.2021.9415189 ◽

2021 ◽

Author(s):

Sothearin Ren ◽

Jae-Sung Kim ◽

Wan-Sup Cho ◽

Saravit Soeng ◽

Sovanreach Kong ◽

...

Keyword(s):

Big Data ◽

Monitoring System ◽

Edge Computing ◽

Industrial Iot ◽

Data Platform ◽

Sensor Monitoring

Download Full-text

Efficient indexing and retrieval of patient information from the big data using MapReduce framework and optimisation

Journal of Information Science ◽

10.1177/01655515211013708 ◽

2021 ◽

pp. 016555152110137

Author(s):

N.R. Gladiss Merlin ◽

Vigilson Prem. M

Keyword(s):

Big Data ◽

Similarity Measure ◽

Patient Information ◽

Complex Data ◽

Mapreduce Framework ◽

Maximum Value ◽

User Query ◽

Indexing And Retrieval ◽

Sine Cosine Algorithm ◽

Disparate Source

Large and complex data becomes a valuable resource in biomedical discovery, which is highly facilitated to increase the scientific resources for retrieving the helpful information. However, indexing and retrieving the patient information from the disparate source of big data is challenging in biomedical research. Indexing and retrieving the patient information from big data is performed using the MapReduce framework. In this research, the indexing and retrieval of information are performed using the proposed Jaya-Sine Cosine Algorithm (Jaya–SCA)-based MapReduce framework. Initially, the input big data is forwarded to the mapper randomly. The average of each mapper data is calculated, and these data are forwarded to the reducer, where the representative data are stored. For each user query, the input query is matched with the reducer, and thereby, it switches over to the mapper for retrieving the matched best result. The bilevel matching is performed while retrieving the data from the mapper based on the distance between the query. The similarity measure is computed based on the parametric-enabled similarity measure (PESM), cosine similarity and the proposed Jaya–SCA, which is the integration of the Jaya algorithm and the SCA. Moreover, the proposed Jaya–SCA algorithm attained the maximum value of F-measure, recall and precision of 0.5323, 0.4400 and 0.6867, respectively, using the StatLog Heart Disease dataset.

Download Full-text