Multi-Source and Heterogeneous Data Integration Model for Big Data Analytics in Power DCS

Author(s):  
Wengang Chen ◽  
Ruijie Wang ◽  
Runze Wu ◽  
Liangrui Tang ◽  
Junli Fan
2014 ◽  
Vol 912-914 ◽  
pp. 1201-1204
Author(s):  
Gang Huang ◽  
Xiu Ying Wu ◽  
Man Yuan

This paper proposes an ontology-based distributed heterogeneous data integration framework (ODHDIF). The framework resolves the problem of semantic interoperability between heterogeneous data sources at the semantic level. Metadata specify the distributed, heterogeneous data and describe the semantic information of each data source; with an ontology as the common semantic model, semantic matches are established through ontology mapping between heterogeneous data sources, and semantic differences are shielded, so that the semantic heterogeneity problem of heterogeneous data sources can be effectively solved. The framework provides an effective technical means for an enterprise's internal information to be shared accurately and in a timely manner.
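The mapping idea in this abstract can be sketched in a few lines. This is a minimal illustration, not the ODHDIF implementation: the ontology concepts and per-source field names below are hypothetical, standing in for the paper's unspecified power-system schemas.

```python
# Minimal sketch of ontology-based schema mapping between two
# heterogeneous sources. All field names are hypothetical; the
# paper's actual ontology is not given in the abstract.

# Shared "ontology": canonical concept names for the common semantic model.
ONTOLOGY = {"device_id", "timestamp", "power_kw"}

# Per-source mappings: local field name -> ontology concept.
SOURCE_A_MAP = {"dev": "device_id", "ts": "timestamp", "p": "power_kw"}
SOURCE_B_MAP = {"unit_no": "device_id", "time": "timestamp", "load_kw": "power_kw"}

def to_common_model(record, mapping):
    """Translate a source-local record into the shared ontology vocabulary,
    dropping fields that have no ontology counterpart."""
    return {mapping[k]: v for k, v in record.items() if k in mapping}

rec_a = to_common_model({"dev": "T1", "ts": 100, "p": 3.2}, SOURCE_A_MAP)
rec_b = to_common_model({"unit_no": "T1", "time": 100, "load_kw": 3.3, "note": "x"}, SOURCE_B_MAP)
# Both records now share one vocabulary and can be matched on device_id/timestamp.
```

Once both records speak the ontology's vocabulary, cross-source matching reduces to joining on the shared concept names, which is the "semantic match" the abstract describes.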


Author(s):  
Guowei Cai ◽  
Sankaran Mahadevan

This manuscript explores the application of big data analytics to online structural health monitoring. As smart sensor technology progresses and low-cost online monitoring becomes increasingly feasible, large quantities of highly heterogeneous data can be acquired during monitoring, exceeding the capacity of traditional data analytics techniques. This paper investigates big data techniques for handling the high-volume data obtained in structural health monitoring, in particular the analysis of infrared thermal images for structural damage diagnosis. We explore the MapReduce technique to parallelize the data analytics and efficiently handle the high volume, high velocity, and high variety of information. In our study, MapReduce is implemented on the Spark platform, and image processing functions such as the uniform filter and Sobel filter are wrapped in the mappers. The methodology is illustrated with concrete slabs, using actual experimental data with induced damage.
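The "filter wrapped in a mapper" pattern can be sketched without Spark. In the sketch below, Python's builtin `map()` stands in for a Spark RDD's `map`, and the image tiles are toy grayscale grids; the paper's actual Spark pipeline and thermal-image data are not given in the abstract.

```python
# Pure-Python sketch of wrapping an image filter in a "mapper", in the
# spirit of the paper's Spark/MapReduce setup. The builtin map() stands
# in for an RDD; each element is one image tile (a 2-D list of pixels).

SOBEL_X = [[-1, 0, 1], [-2, 0, 2], [-1, 0, 1]]
SOBEL_Y = [[-1, -2, -1], [0, 0, 0], [1, 2, 1]]

def sobel_magnitude(img):
    """Approximate gradient magnitude |Gx| + |Gy| for interior pixels;
    border pixels are left at 0."""
    h, w = len(img), len(img[0])
    out = [[0] * w for _ in range(h)]
    for i in range(1, h - 1):
        for j in range(1, w - 1):
            gx = sum(SOBEL_X[a][b] * img[i - 1 + a][j - 1 + b]
                     for a in range(3) for b in range(3))
            gy = sum(SOBEL_Y[a][b] * img[i - 1 + a][j - 1 + b]
                     for a in range(3) for b in range(3))
            out[i][j] = abs(gx) + abs(gy)
    return out

# A vertical edge: left half dark (0), right half bright (9).
tile = [[0, 0, 9, 9] for _ in range(4)]
tiles = [tile, tile]                       # two "partitions" of images
edges = list(map(sobel_magnitude, tiles))  # the mapper step
```

In Spark the last line would be something like `rdd.map(sobel_magnitude)`, letting the cluster apply the same filter to many thermal images in parallel.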


Author(s):  
Richard Kumaradjaja

This chapter describes data integration issues in big data analytics and proposes an integrated data integration framework for big data analytics. Its main focus is to address the issues of data integration from the architectural point of view, which leads to a better understanding of the current situation and to better-constructed solutions, since an architectural approach gives a holistic and comprehensive view of the problems. The chapter also discusses future research directions for the proposed integrated data architecture framework.


2020 ◽  
Vol 17 (8) ◽  
pp. 3798-3803
Author(s):  
M. D. Anto Praveena ◽  
B. Bharathi

Big data analytics has become a growing field and plays a pivotal role in healthcare and research practices. Big data analytics in healthcare covers the integration and analysis of vast amounts of dynamic heterogeneous data. Patients' medical records include diverse data such as medical conditions, medications, and test findings. One of the major challenges of analytics and prediction in healthcare is data preprocessing, in which outlier identification and correction is an important step. Outliers are extreme values that deviate from the other values of an attribute; they may simply be experimental errors, or they may be genuine novelties. Outlier identification is the method of finding data objects whose behavior differs from expectations. Detecting outliers in time series data differs from doing so in ordinary data: time series data are recorded over a sequence of time periods, and such outliers must be identified and cleaned to produce a quality dataset. In this proposed work, a hybrid outlier detection algorithm, extended LSTM-GAN, is used to recognize outliers in time series data. The proposed extended algorithm achieved better performance in time series analysis on ECG dataset processing compared with traditional methodologies.
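The preprocessing step the abstract describes, flagging time-series points that deviate from expectation, can be illustrated with a much simpler detector. The rolling z-score below is a stand-in, not the paper's extended LSTM-GAN, and the signal and threshold are invented for illustration.

```python
# Stand-in for the paper's LSTM-GAN detector: a rolling z-score flags
# points that deviate from the trailing window by many standard deviations.
import statistics

def rolling_outliers(series, window=5, threshold=3.0):
    """Return indices whose value deviates from the trailing-window mean
    by more than `threshold` standard deviations."""
    flagged = []
    for i in range(window, len(series)):
        win = series[i - window:i]
        mu = statistics.mean(win)
        sd = statistics.pstdev(win) or 1e-9  # guard against a flat window
        if abs(series[i] - mu) / sd > threshold:
            flagged.append(i)
    return flagged

# Toy "ECG-like" signal with one spike at index 6.
signal = [1.0, 1.1, 0.9, 1.0, 1.05, 1.0, 9.0, 1.0, 0.95]
flagged = rolling_outliers(signal)  # -> [6]
```

A learned model such as an LSTM-GAN replaces the trailing-window mean with a prediction of what the next value should look like, which matters when the normal pattern itself varies over time, as in ECG waveforms.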


Author(s):  
Jaimin Navinchandra Undavia ◽  
Atul Manubhai Patel

Technological advancement has opened up various ways to collect data through automatic mechanisms. One such mechanism collects a huge amount of data without any further maintenance or human intervention. The healthcare sector has been confronted by the need to manage the big data produced by various sources, which are well known for generating high volumes of heterogeneous data. A high level of sophistication has been incorporated into almost every industry, and healthcare is one of them. The article shows that a huge amount of data exists in the healthcare industry and that the data generated there is neither homogeneous nor of a simple type. The various sources and objectives of the data are also highlighted and discussed. As the data come from various sources, they are versatile in nature in all aspects. So, rightly and meaningfully, big data analytics has penetrated the healthcare industry, and its impact is also highlighted.


Author(s):  
Sai Hanuman Akundi ◽  
Soujanya R ◽  
Madhuri PM

In recent years, vast quantities of data have been managed in various medical applications, and multiple organizations worldwide have generated this type of data; together, these heterogeneous data are called big data. Data characterized by volume, velocity, and variety are termed big data. The healthcare sector has faced the need to handle large data from different sources, being renowned for generating large amounts of heterogeneous data. We can use big data analysis to make proper decisions in the health system by tweaking some of the current machine learning algorithms. If we have a large amount of data from which we want to predict or identify patterns, machine learning is the way forward. In this article, a brief overview of big data and of the functionality and methods of big data analytics is presented, which play an important role in and significantly affect healthcare information technology. Within this paper we present a comparative study of machine learning algorithms. We need to make effective use of the current machine learning algorithms to predict accurate outcomes in the world of nursing.


2020 ◽  
Vol 12 (4) ◽  
pp. 132-146
Author(s):  
Gabriel Kabanda

Big data is the process of managing large volumes of data obtained from several heterogeneous data types (e.g., internal, external, structured, and unstructured) that can be used for collecting and analyzing enterprise data. The purpose of the paper is to conduct an evaluation of big data analytics projects, discussing why such projects fail and explaining why and how the Project Predictive Analytics (PPA) approach may make a difference with respect to future methods based on data mining, machine learning, and artificial intelligence. A qualitative research methodology was used. The research design was discourse analysis supported by document analysis. Laclau and Mouffe's discourse theory, the most thoroughly poststructuralist approach, was adopted.


Author(s):  
Richard Kumaradjaja

This paper describes data integration issues in big data analytics and proposes an integrated data integration framework for big data analytics. Its main focus is to address the issues of data integration from the architectural point of view, which leads to a better understanding of the current situation and to better-constructed solutions, since an architectural approach gives a holistic and comprehensive view of the problems. The paper also discusses future research directions for the proposed integrated data architecture framework.


2015 ◽  
Vol 2015 ◽  
pp. 1-8 ◽  
Author(s):  
Li Jiang ◽  
Hao Chen ◽  
Yueqi Ouyang ◽  
Canbing Li

With the rapid development of information technology and the coming of the era of big data, various data are constantly emerging and exhibit the characteristics of autonomy and heterogeneity. How to optimize data quality and evaluate the effect has become a challenging problem. Firstly, a heterogeneous data integration model based on retrospective audit is proposed to locate the original data source and match the data. Secondly, to improve the integrated data quality, a retrospective audit model and associative audit rules are proposed to fix incomplete and incorrect data from multiple heterogeneous data sources. The heterogeneous data integration model based on retrospective audit is divided into four modules: original heterogeneous data, data structure, data processing, and data retrospective audit. Finally, assessment criteria such as redundancy, sparsity, and accuracy are defined to evaluate the optimized data quality. Experimental results show that the quality of the integrated data is significantly higher than that of the original data.
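The three assessment criteria named in the abstract can be computed on a toy integrated table. The formulas and field names below are illustrative assumptions, not the paper's definitions: redundancy as the share of duplicate rows, sparsity as the share of missing cells, and accuracy as the share of rows passing an associative audit rule.

```python
# Toy integrated table with one duplicate, one missing cell, and one
# row that violates a (hypothetical) associative audit rule.
rows = [
    {"id": 1, "voltage": 220, "current": 5},
    {"id": 1, "voltage": 220, "current": 5},   # duplicate record
    {"id": 2, "voltage": None, "current": 4},  # missing cell
    {"id": 3, "voltage": 110, "current": -2},  # fails the audit rule
]

def redundancy(rows):
    """Fraction of rows that are exact duplicates of an earlier row."""
    unique = {tuple(sorted(r.items())) for r in rows}
    return 1 - len(unique) / len(rows)

def sparsity(rows):
    """Fraction of cells with a missing value."""
    cells = [v for r in rows for v in r.values()]
    return sum(v is None for v in cells) / len(cells)

def accuracy(rows, rule):
    """Fraction of rows satisfying an associative audit rule."""
    return sum(rule(r) for r in rows) / len(rows)

audit_rule = lambda r: r["voltage"] is not None and r["current"] >= 0

scores = (redundancy(rows), sparsity(rows), accuracy(rows, audit_rule))
# scores -> (0.25, 1/12, 0.5)
```

A retrospective audit would then trace the failing rows back to their original sources and repair them, after which the same three scores quantify the improvement.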

