massive data processing
Recently Published Documents


TOTAL DOCUMENTS

64
(FIVE YEARS 20)

H-INDEX

4
(FIVE YEARS 2)

2021 ◽  
Author(s):  
QIN Jun ◽  
SONG Yanyan ◽  
ZONG Ping

With the rapid development and popularization of information technology, cloud computing provides a good environment for massive data processing. Hadoop is an open-source implementation of MapReduce and can process large amounts of data. To address the shortcomings of the fault-tolerance mechanism in the MapReduce programming model, this paper proposes a reliability-aware task scheduling strategy that introduces a failure recovery mechanism: it evaluates the trustworthiness of resource nodes in the cloud environment, establishes a trustworthiness model, and avoids assigning tasks to low-reliability nodes, which would otherwise force re-execution and waste time and resources. Finally, simulations on the CloudSim platform verify the validity and stability of the proposed task scheduling algorithm and scheduling model.
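As a rough illustration of the idea (not the paper's actual trust model), the sketch below estimates each node's trust from its success/failure history and dispatches tasks only to nodes that clear a threshold; the smoothing rule, threshold value, and node names are all assumptions:

```python
# Illustrative reliability-aware scheduling: each resource node carries a
# trust score derived from its task history; the scheduler skips
# low-reliability nodes that would force re-execution.

class Node:
    def __init__(self, name):
        self.name = name
        self.successes = 0
        self.failures = 0

    @property
    def trust(self):
        # Laplace-smoothed success ratio as a simple trust estimate
        # (an assumed update rule, not the paper's model).
        return (self.successes + 1) / (self.successes + self.failures + 2)

    def record(self, ok):
        if ok:
            self.successes += 1
        else:
            self.failures += 1


def schedule(nodes, threshold=0.5):
    # Dispatch to the most trusted node above the threshold; return None
    # if no node is reliable enough.
    candidates = [n for n in nodes if n.trust >= threshold]
    return max(candidates, key=lambda n: n.trust) if candidates else None


nodes = [Node("n1"), Node("n2")]
nodes[0].record(True); nodes[0].record(True)    # reliable node
nodes[1].record(False); nodes[1].record(False)  # repeatedly failing node
chosen = schedule(nodes)
```

With this history, `n1` has trust 0.75 and `n2` only 0.25, so the scheduler picks `n1` and never hands work to the failing node.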


2021 ◽  
Author(s):  
Mengxi Tan ◽  
Xingyuan Xu ◽  
David Moss

Abstract Optical artificial neural networks (ONNs) have significant potential for ultra-high computing speed and energy efficiency. We report a novel approach to ONNs that uses integrated Kerr optical micro-combs. This approach is programmable and scalable and is capable of reaching ultra-high speeds. We demonstrate the basic building block of ONNs, a single-neuron perceptron, by mapping synapses onto 49 wavelengths to achieve an operating speed of 11.9 × 10⁹ operations per second (Giga-OPS) at 8 bits per operation, which equates to 95.2 gigabits/s (Gbps). We test the perceptron on handwritten-digit recognition and cancer-cell detection, achieving over 90% and 85% accuracy, respectively. By scaling the perceptron to a deep learning network using off-the-shelf telecom technology, we can achieve high-throughput matrix multiplication for real-time massive data processing.
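The quoted figures are internally consistent, as a quick arithmetic check shows:

```python
# Check: 11.9 giga-operations per second at 8 bits per operation
# should equal the quoted 95.2 Gbps line rate.
ops_per_second = 11.9e9
bits_per_op = 8
throughput_gbps = ops_per_second * bits_per_op / 1e9  # 95.2
```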


Author(s):  
Mengxi Tan ◽  
Xingyuan Xu ◽  
David Moss

Optical artificial neural networks (ONNs) have significant potential for ultra-high computing speed and energy efficiency. We report a novel approach to ONNs that uses integrated Kerr optical micro-combs. This approach is programmable and scalable and is capable of reaching ultra-high speeds. We demonstrate the basic building block of ONNs, a single-neuron perceptron, by mapping synapses onto 49 wavelengths to achieve an operating speed of 11.9 × 10⁹ operations per second (Giga-OPS) at 8 bits per operation, which equates to 95.2 gigabits/s (Gbps). We test the perceptron on handwritten-digit recognition and cancer-cell detection, achieving over 90% and 85% accuracy, respectively. By scaling the perceptron to a deep learning network using off-the-shelf telecom technology, we can achieve high-throughput matrix multiplication for real-time massive data processing.


2021 ◽  
Author(s):  
David Moss

Optical artificial neural networks (ONNs) have significant potential for ultra-high computing speed and energy efficiency. We report a new approach to ONNs based on integrated Kerr micro-combs that is programmable, highly scalable, and capable of reaching ultra-high speeds. We demonstrate the building block of the ONN, a single-neuron perceptron, by mapping synapses onto 49 wavelengths to achieve a single-unit throughput of 11.9 Giga-OPS at 8 bits per OP, or 95.2 Gbps. We test the perceptron on handwritten-digit recognition and cancer-cell detection, achieving over 90% and 85% accuracy, respectively. By scaling the perceptron to a deep learning network using off-the-shelf telecom technology, we can achieve high-throughput matrix multiplication for real-time massive data processing.


IEEE Access ◽  
2021 ◽  
pp. 1-1
Author(s):  
Hua Shen ◽  
Mingwu Zhang ◽  
Hao Wang ◽  
Fuchun Guo ◽  
Willy Susilo

2021 ◽  
Vol 314 ◽  
pp. 06003
Author(s):  
Aniss Moumen ◽  
Hajar Slimani ◽  
Nezha Mejjad ◽  
Mohamed Ben-Daoud

Nowadays, big data technologies are becoming increasingly important in the modernization of organizations' information systems. Indeed, producers and users of water and climatology data deal with massive data processing daily, and these actors need new technology to overcome difficulties in data integration, processing, and visualization. This paper presents an exploratory study of the intention of water stakeholders in Morocco to use big data technology; we also present an exploratory review of technology acceptance model theory, a theoretical framework that explains the factors in users' adoption of new technologies.


2020 ◽  
Vol 6 ◽  
pp. e321
Author(s):  
Mozamel M. Saeed ◽  
Zaher Al Aghbari ◽  
Mohammed Alsharidah

A popular unsupervised learning method known as clustering is extensively used in data mining, machine learning, and pattern recognition. The procedure groups data points so that points in the same cluster are similar to one another and dissimilar to points in other clusters. Traditional clustering methods are greatly challenged by the recent massive growth of data. Therefore, several research works have proposed novel designs for clustering methods that leverage the benefits of Big Data platforms, such as Apache Spark, which is designed for fast, distributed massive data processing. However, Spark-based clustering research is still in its early days. In this systematic survey, we investigate the existing Spark-based clustering methods in terms of their support for the characteristics of Big Data. Moreover, we propose a new taxonomy for Spark-based clustering methods. To the best of our knowledge, no survey has been conducted on Spark-based clustering of Big Data. Therefore, this survey aims to present a comprehensive summary of previous studies in the field of Big Data clustering using Apache Spark during the span of 2010–2020. It also highlights new research directions in the field of clustering massive data.
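To make the connection between clustering and Spark's execution model concrete, the toy sketch below mimics the map/reduce-by-key pattern that distributed k-means (e.g. in Spark MLlib) parallelizes; plain Python lists stand in for RDDs, and the 1-D data and starting centers are invented for illustration:

```python
# Toy k-means step in map/reduce style: "map" assigns each point to its
# nearest center, "reduce by key" averages the points in each cluster.

def assign(point, centers):
    # map phase: emit the index of the nearest center for this point
    return min(range(len(centers)), key=lambda i: abs(point - centers[i]))

def kmeans_step(points, centers):
    # reduce phase: group points by assigned center, average each group
    buckets = {i: [] for i in range(len(centers))}
    for p in points:
        buckets[assign(p, centers)].append(p)
    return [sum(b) / len(b) if b else centers[i] for i, b in buckets.items()]

points = [1.0, 1.2, 0.8, 9.0, 9.5, 8.5]   # two obvious 1-D clusters
centers = [0.0, 10.0]                      # arbitrary initial centers
for _ in range(5):
    centers = kmeans_step(points, centers)
```

In a real Spark job the map phase runs on partitioned data across executors and the averaging is a `reduceByKey`; here the same two-phase logic converges the centers to roughly 1.0 and 9.0.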


Author(s):  
J. Chen ◽  
W. Feng ◽  
Y. Huang

Abstract. Optimal discretization of continuously valued attributes is an uncertainty problem. The uncertainty of discretization is propagated and accumulated during data mining, which directly influences the usability and operation of the mining results. To address the limitations of existing discretization evaluation indices in describing accuracy and operational efficiency, this work proposes a discretization uncertainty index based on individuals. The method takes the local standard score as a general similarity measure within and between intervals and evaluates discretization reliability according to the relative position of individuals in each interval. Experiments show the new evaluation index is consistent with commonly used metrics. While guaranteeing the validity of the discretization evaluation, the proposed method offers greater descriptive accuracy and operational efficiency than extant approaches; it is also better suited to massive data processing and special distribution detection.
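Reading "local standard score" as a per-interval z-score (an assumption on our part, since the abstract does not give the formula), the relative position of each individual within its discretization interval can be sketched as follows; the interval values are invented for illustration:

```python
# Per-interval z-scores: each value's relative position inside its own
# discretization interval, measured against that interval's mean and
# population standard deviation.
from statistics import mean, pstdev

def local_scores(values):
    mu, sigma = mean(values), pstdev(values)
    return [(v - mu) / sigma for v in values]

interval = [2.0, 4.0, 6.0]      # values falling inside one interval
scores = local_scores(interval) # symmetric around the interval mean
```

A value at the interval mean scores 0, and values equally far above and below it get scores of equal magnitude and opposite sign, which is the kind of relative-position information the index aggregates.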


Author(s):  
Ammar Odeh

Objective: In the last decade, with the advancement of big data technology and the Internet of Things, Wireless Sensor Networks (WSNs) have become fundamental to the success of a wide range of applications, especially those demanding massive data processing. Methods: This paper investigates several tracking methods to introduce a novel cluster-based target tracking analysis model. Results: Some crucial factors of cluster-based routing protocols are demonstrated, and the different methods are compared according to our taxonomy: cluster formation, predictive/proactive behavior, target speed, single- or multi-object tracking, the boundary problem, scalability, energy efficiency, and communication cost. This can help the research community by providing clear information for further study. Conclusion: The paper compares the differences and similarities between the available approaches across categories in terms of cluster construction, clustering method, object speed, number of objects, the boundary problem, and scalability. Finally, we identify some open issues that have so far received little attention or remain unexplored.

