Research on Insurance Data Analysis Platform Based on the Hadoop Framework

AbstractBackgroundExploration and processing of FASTQ files are the first steps in state-of-the-art data analysis workflows of Next Generation Sequencing (NGS) platforms. The large amount of data generated by these technologies has put a challenge in terms of rapid analysis and visualization of sequencing information. Recent integration of the R data analysis platform with web visual frameworks has stimulated the development of user-friendly, powerful, and dynamic NGS data analysis applications.ResultsThis paper presents FastqCleaner, a Bioconductor visual application for both quality-control (QC) and pre-processing of FASTQ files. The interface shows diagnostic information for the input and output data and allows to select a series of filtering and trimming operations in an interactive framework. FastqCleaner combines the technology of Bioconductor for NGS data analysis with the data visualization advantages of a web environment.ConclusionsFastqCleaner is an user-friendly, offline-capable tool that enables access to advanced Bioconductor infrastructure. The novel concept of a Bioconductor interactive application that can be used without the need for programming skills, makes FastqCleaner a valuable resource for NGS data analysis.

Download Full-text

Multi-dimensional Data Analysis Platform (MuDAP): A Versatile Analysis Toolbox for Multi-dimensional Perception Data

2021 IEEE 10th Data Driven Control and Learning Systems Conference (DDCLS) ◽

10.1109/ddcls52934.2021.9455457 ◽

2021 ◽

Author(s):

Yiyang Chen ◽

Lin Qiu ◽

Haojiang Ying

Keyword(s):

Data Analysis ◽

Analysis Platform

Download Full-text

Research on Big Data Analysis Platform of Power Grid Enterprise Accounting Based on Cloud Computing

IOP Conference Series Materials Science and Engineering ◽

10.1088/1757-899x/677/4/042110 ◽

2019 ◽

Vol 677 ◽

pp. 042110

Author(s):

Jia Tian ◽

Dan Xu ◽

Meng Cui

Keyword(s):

Cloud Computing ◽

Big Data ◽

Data Analysis ◽

Power Grid ◽

Big Data Analysis ◽

Analysis Platform ◽

Power Grid Enterprise

Download Full-text

A Microservice-Based Big Data Analysis Platform for Online Educational Applications

Scientific Programming ◽

10.1155/2020/6929750 ◽

2020 ◽

Vol 2020 ◽

pp. 1-13

Author(s):

Kehua Miao ◽

Jie Li ◽

Wenxing Hong ◽

Mingtao Chen

Keyword(s):

Big Data ◽

Data Analysis ◽

Data Science ◽

Modular Design ◽

Science Research ◽

Big Data Analysis ◽

Research Field ◽

Traditional Work ◽

Educational Applications ◽

Analysis Platform

The booming development of data science and big data technology stacks has inspired continuous iterative updates of data science research or working methods. At present, the granularity of the labor division between data science and big data is more refined. Traditional work methods, from work infrastructure environment construction to data modelling and analysis of working methods, will greatly delay work and research efficiency. In this paper, we focus on the purpose of the current friendly collaboration of the data science team to build data science and big data analysis application platform based on microservices architecture for education or nonprofessional research field. In the environment based on microservices that facilitates updating the components of each component, the platform has a personal code experiment environment that integrates JupyterHub based on Spark and HDFS for multiuser use and a visualized modelling tools which follow the modular design of data science engineering based on Greenplum in-database analysis. The entire web service system is developed based on spring boot.

Download Full-text