GENERALIZED PARTICLE MODEL USED FOR DATA CLUSTERING

Machine learning (ML), neural network (NN), evolutionary algorithm (EA), fuzzy systems (FSs), as well as computer science have been very famous and very significant for many years. They have been applied to many different areas. They have contributed much to developments of many large-scale corporations, massive organizations, etc. Lots of information and massive data sets (MDSs) have been generated from these big corporations, organizations, etc. These big data sets (BDSs) have been the challenges of many commercial applications, researches, etc. Therefore, there have been many algorithms of the ML, the NN, the EA, the FSs, as well as computer science which have been developed to handle these massive data sets successfully. To support for this process, the authors have displayed all the possible algorithms of the NN for the large-scale data sets (LSDSs) successfully in this chapter. Finally, they have presented a novel model of the NN for the BDS in a sequential environment (SE) and a distributed network environment (DNE).

Download Full-text

Understanding Large-Scale Structure in Massive Data Sets

Encyclopedia of Quantitative Risk Analysis and Assessment ◽

10.1002/9780470061596.risk0672 ◽

2008 ◽

Cited By ~ 1

Author(s):

Amy J. Braverman

Keyword(s):

Large Scale ◽

Large Scale Structure ◽

Scale Structure ◽

Massive Data ◽

Data Sets ◽

Massive Data Sets

Download Full-text

Multifidelity Information Fusion Algorithms for High-Dimensional Systems and Massive Data sets

SIAM Journal on Scientific Computing ◽

10.1137/15m1055164 ◽

2016 ◽

Vol 38 (4) ◽

pp. B521-B538 ◽

Cited By ~ 30

Author(s):

Paris Perdikaris ◽

Daniele Venturi ◽

George Em Karniadakis

Keyword(s):

Information Fusion ◽

High Dimensional ◽

Massive Data ◽

Data Sets ◽

Massive Data Sets

Download Full-text

Understanding Large-Scale Structure in Massive Data Sets

Wiley StatsRef: Statistics Reference Online ◽

10.1002/9781118445112.stat03685 ◽

2014 ◽

Author(s):

Amy J. Braverman

Keyword(s):

Large Scale ◽

Large Scale Structure ◽

Scale Structure ◽

Massive Data ◽

Data Sets ◽

Massive Data Sets

Download Full-text

Neural Network for Big Data Sets

Computational Intelligence in the Internet of Things - Advances in Computational Intelligence and Robotics ◽

10.4018/978-1-5225-7955-7.ch012 ◽

2019 ◽

pp. 271-303

Author(s):

Vo Ngoc Phu ◽

Vo Thi Ngoc Tran

Keyword(s):

Neural Network ◽

Big Data ◽

Computer Science ◽

Large Scale ◽

Massive Data ◽

Data Sets ◽

Massive Data Sets ◽

Large Scale Data ◽

Commercial Applications ◽

Novel Model

Machine learning (ML), neural network (NN), evolutionary algorithm (EA), fuzzy systems (FSs), as well as computer science have been very famous and very significant for many years. They have been applied to many different areas. They have contributed much to developments of many large-scale corporations, massive organizations, etc. Lots of information and massive data sets (MDSs) have been generated from these big corporations, organizations, etc. These big data sets (BDSs) have been the challenges of many commercial applications, researches, etc. Therefore, there have been many algorithms of the ML, the NN, the EA, the FSs, as well as computer science which have been developed to handle these massive data sets successfully. To support for this process, the authors have displayed all the possible algorithms of the NN for the large-scale data sets (LSDSs) successfully in this chapter. Finally, they have presented a novel model of the NN for the BDS in a sequential environment (SE) and a distributed network environment (DNE).

Download Full-text

The Data Big Bang and the Expanding Digital Universe: High-Dimensional, Complex and Massive Data Sets in an Inflationary Epoch

Advances in Astronomy ◽

10.1155/2010/350891 ◽

2010 ◽

Vol 2010 ◽

pp. 1-16 ◽

Cited By ~ 6

Author(s):

Meyer Z. Pesenson ◽

Isaac Z. Pesenson ◽

Bruce McCollum

Keyword(s):

Applied Mathematics ◽

Big Bang ◽

The Other ◽

High Dimensional ◽

Massive Data ◽

Data Sets ◽

Massive Data Sets ◽

Astronomical Data ◽

Illustrative Visualization ◽

The One

Recent and forthcoming advances in instrumentation, and giant new surveys, are creating astronomical data sets that are not amenable to the methods of analysis familiar to astronomers. Traditional methods are often inadequate not merely because of the size in bytes of the data sets, but also because of the complexity of modern data sets. Mathematical limitations of familiar algorithms and techniques in dealing with such data sets create a critical need fornew paradigmsfor the representation, analysis and scientific visualization (as opposed to illustrative visualization) of heterogeneous, multiresolution data across application domains. Some of the problems presented by the new data sets have been addressed by other disciplines such as applied mathematics, statistics and machine learning and have been utilized by other sciences such as space-based geosciences. Unfortunately, valuable results pertaining to these problems are mostly to be found in publications outside of astronomy. Here we offer brief overviews of a number of concepts, techniques and developments that are vital to the analysis and visualization of complex datasets and images. One of the goals of this paper is to help bridge the gap between applied mathematics and artificial intelligence on the one side and astronomy on the other.

Download Full-text

Fundamental resource trade-offs for encoded distributed optimization

Information and Inference A Journal of the IMA ◽

10.1093/imaiai/iaaa026 ◽

2020 ◽

Author(s):

A Salman Avestimehr ◽

Seyed Mohammadreza Mousavi Kalan ◽

Mahdi Soltanolkotabi

Keyword(s):

Computational Time ◽

Massive Data ◽

Data Sets ◽

Massive Data Sets ◽

Computational Framework ◽

Data Set ◽

Trade Offs ◽

Major Bottleneck ◽

Computing Environments ◽

Analyze Data

Abstract Dealing with the shear size and complexity of today’s massive data sets requires computational platforms that can analyze data in a parallelized and distributed fashion. A major bottleneck that arises in such modern distributed computing environments is that some of the worker nodes may run slow. These nodes a.k.a. stragglers can significantly slow down computation as the slowest node may dictate the overall computational time. A recent computational framework, called encoded optimization, creates redundancy in the data to mitigate the effect of stragglers. In this paper, we develop novel mathematical understanding for this framework demonstrating its effectiveness in much broader settings than was previously understood. We also analyze the convergence behavior of iterative encoded optimization algorithms, allowing us to characterize fundamental trade-offs between convergence rate, size of data set, accuracy, computational load (or data redundancy) and straggler toleration in this framework.

Download Full-text

A methodology for supporting collaborative exploratory analysis of massive data sets in tele-immersive environments

Proceedings. The Eighth International Symposium on High Performance Distributed Computing (Cat. No.99TH8469) ◽

10.1109/hpdc.1999.805283 ◽

2003 ◽

Cited By ~ 8

Author(s):

J. Leigh ◽

A.E. Johnson ◽

T.A. DeFanti ◽

S. Bailey ◽

R. Grossman

Keyword(s):

Exploratory Analysis ◽

Massive Data ◽

Data Sets ◽

Massive Data Sets ◽

Immersive Environments

Download Full-text

Massive Data Sets Issues in Earth Observing

Massive Computing - Handbook of Massive Data Sets ◽

10.1007/978-1-4615-0005-6_29 ◽

2002 ◽

pp. 1093-1140 ◽

Cited By ~ 3

Author(s):

Ruixin Yang ◽

Menas Kafatos

Keyword(s):

Massive Data ◽

Data Sets ◽

Massive Data Sets

Download Full-text

BSO-MV: An Optimized Multiview Clustering Approach for Items Recommendation in Social Networks

JUCS - Journal of Universal Computer Science ◽

10.3897/jucs.70341 ◽

2021 ◽

Vol 27 (7) ◽

pp. 667-692

Author(s):

Lamia Berkani ◽

Lylia Betit ◽

Louiza Belarif

Keyword(s):

Social Networks ◽

Large Scale ◽

Data Sets ◽

Large Scale Data ◽

Recommendation Algorithms ◽

Clustering Approach ◽

Real World Datasets ◽

Multiview Clustering ◽

Improving Accuracy

Clustering-based approaches have been demonstrated to be efficient and scalable to large-scale data sets. However, clustering-based recommender systems suffer from relatively low accuracy and coverage. To address these issues, we propose in this article an optimized multiview clustering approach for the recommendation of items in social networks. First, the selection of the initial medoids is optimized using the Bees Swarm optimization algorithm (BSO) in order to generate better partitions (i.e. refining the quality of medoids according to the objective function). Then, the multiview clustering (MV) is applied, where users are iteratively clustered from the views of both rating patterns and social information (i.e. friendships and trust). Finally, a framework is proposed for testing the different alternatives, namely: (1) the standard recommendation algorithms; (2) the clustering-based and the optimized clustering-based recommendation algorithms using BSO; and (3) the MV and the optimized MV (BSO-MV) algorithms. Experimental results conducted on two real-world datasets demonstrate the effectiveness of the proposed BSO-MV algorithm in terms of improving accuracy, as it outperforms the existing related approaches and baselines.

Download Full-text