COMMUNITY-CURATED DATA RESOURCES AND LARGE-SCALE DATA-MODEL SYNTHESES: THE CHILDREN OF COHMAP

STANDARDS-BASED SERVICES FOR BIG SPATIO-TEMPORAL DATA

ISPRS - International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences ◽

10.5194/isprs-archives-xli-b4-691-2016 ◽

2016 ◽

Vol XLI-B4 ◽

pp. 691-699

Author(s):

P. Baumann ◽

V. Merticariu ◽

A. Dumitru ◽

D. Misev

Keyword(s):

Data Model ◽

Large Scale ◽

Point Clouds ◽

Current Status ◽

Temporal Data ◽

Large Scale Data ◽

Spatio Temporal ◽

High Level ◽

Immense Potential ◽

Scale Data

With the unprecedented availability of continuously updated measured and generated data there is an immense potential for getting new and timely insights – yet, the value is not fully leveraged as of today. The quest is up for high-level service interfaces for dissecting datasets and rejoining them with other datasets – ultimately, to allow users to ask "any question, anytime, on any size" enabling them to "build their own product on the go". <br><br> With OGC Coverages, a concrete, interoperable data model has been established which unifies n-D spatio-temporal regular and irregular grids, point clouds, and meshes. The Web Coverage Service (WCS) suite provides versatile streamlined coverage functionality ranging from simple access to flexible spatio-temporal analytics. Flexibility and scalability of the WCS suite has been demonstrated in practice through massive services run by large-scale data centers. We present the current status in OGC Coverage data and service models, contrast them to related work, and describe a scalable implementation based on the rasdaman array engine.

Download Full-text

STANDARDS-BASED SERVICES FOR BIG SPATIO-TEMPORAL DATA

ISPRS - International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences ◽

10.5194/isprsarchives-xli-b4-691-2016 ◽

2016 ◽

Vol XLI-B4 ◽

pp. 691-699 ◽

Cited By ~ 1

Author(s):

P. Baumann ◽

V. Merticariu ◽

A. Dumitru ◽

D. Misev

Keyword(s):

Data Model ◽

Large Scale ◽

Point Clouds ◽

Current Status ◽

Temporal Data ◽

Large Scale Data ◽

Spatio Temporal ◽

High Level ◽

Immense Potential ◽

Scale Data

With the unprecedented availability of continuously updated measured and generated data there is an immense potential for getting new and timely insights – yet, the value is not fully leveraged as of today. The quest is up for high-level service interfaces for dissecting datasets and rejoining them with other datasets – ultimately, to allow users to ask "any question, anytime, on any size" enabling them to "build their own product on the go". <br><br> With OGC Coverages, a concrete, interoperable data model has been established which unifies n-D spatio-temporal regular and irregular grids, point clouds, and meshes. The Web Coverage Service (WCS) suite provides versatile streamlined coverage functionality ranging from simple access to flexible spatio-temporal analytics. Flexibility and scalability of the WCS suite has been demonstrated in practice through massive services run by large-scale data centers. We present the current status in OGC Coverage data and service models, contrast them to related work, and describe a scalable implementation based on the rasdaman array engine.

Download Full-text

Large-Scale Data Learning Method for Anomaly Detection using Machine Learning for Monitoring Vibration in Vehicle Equipment

IEEJ Transactions on Industry Applications ◽

10.1541/ieejias.140.480 ◽

2020 ◽

Vol 140 (6) ◽

pp. 480-487

Author(s):

Minoru Kondo

Keyword(s):

Machine Learning ◽

Anomaly Detection ◽

Large Scale ◽

Learning Method ◽

Large Scale Data ◽

Scale Data

Download Full-text

Faculty Opinions recommendation of Comparative assessment of large-scale data sets of protein-protein interactions.

Faculty Opinions – Post-Publication Peer Review of the Biomedical Literature ◽

10.3410/f.1006598.82257 ◽

2002 ◽

Author(s):

Rob Russell

Keyword(s):

Protein Interactions ◽

Large Scale ◽

Comparative Assessment ◽

Data Sets ◽

Protein Protein Interactions ◽

Large Scale Data ◽

Scale Data ◽

Large Scale Data Sets

Download Full-text

ProGen:Provenance database generator for large-scale data set

Journal of Computer Applications ◽

10.3724/sp.j.1087.2008.02737 ◽

2009 ◽

Vol 28 (11) ◽

pp. 2737-2740

Author(s):

Xiao ZHANG ◽

Shan WANG ◽

Na LIAN

Keyword(s):

Large Scale ◽

Data Set ◽

Large Scale Data ◽

Scale Data

Download Full-text

Construction of integrated particle rendering environment for large scale data visualization

Impact ◽

10.21820/23987073.2018.11.9 ◽

2018 ◽

Vol 2018 (11) ◽

pp. 9-11

Author(s):

Koji Koyamada

Keyword(s):

Data Visualization ◽

Large Scale ◽

Large Scale Data ◽

Scale Data

Download Full-text

Local and global approaches of affinity propagation clustering for large scale data

Journal of Zhejiang University SCIENCE A ◽

10.1631/jzus.a0720058 ◽

2008 ◽

Vol 9 (10) ◽

pp. 1373-1381 ◽

Cited By ~ 28

Author(s):

Ding-yin Xia ◽

Fei Wu ◽

Xu-qing Zhang ◽

Yue-ting Zhuang

Keyword(s):

Large Scale ◽

Affinity Propagation ◽

Large Scale Data ◽

Affinity Propagation Clustering ◽

Scale Data

Download Full-text

Towards Large-Scale Data Annotation of Audio from Wearables: Validating Zooniverse Annotations of Infant Vocalization Types

2021 IEEE Spoken Language Technology Workshop (SLT) ◽

10.1109/slt48900.2021.9383511 ◽

2021 ◽

Author(s):

Chiara Semenzin ◽

Lisa Hamrick ◽

Amanda Seidl ◽

Bridgette Kelleher ◽

Alejandrina Cristia

Keyword(s):

Large Scale ◽

Data Annotation ◽

Large Scale Data ◽

Infant Vocalization ◽

Scale Data

Download Full-text

A Framework for International Collaboration on ITER Using Large-Scale Data Transfer to Enable Near-Real-Time Analysis

Fusion Science & Technology ◽

10.1080/15361055.2020.1851073 ◽

2021 ◽

Vol 77 (2) ◽

pp. 98-108

Author(s):

R. M. Churchill ◽

C. S. Chang ◽

J. Choi ◽

J. Wong ◽

S. Klasky ◽

...

Keyword(s):

Real Time ◽

International Collaboration ◽

Large Scale ◽

Data Transfer ◽

Time Analysis ◽

Real Time Analysis ◽

Large Scale Data ◽

Scale Data

Download Full-text

Multi-GPU approach to global induction of classification trees for large-scale data mining

Applied Intelligence ◽

10.1007/s10489-020-01952-5 ◽

2021 ◽

Author(s):

Krzysztof Jurczuk ◽

Marcin Czajkowski ◽

Marek Kretowski

Keyword(s):

Data Mining ◽

Large Scale ◽

Real Life ◽

Population Based ◽

Tree Structure ◽

Global Approach ◽

Data Parallel ◽

Large Scale Data ◽

The Impact ◽

Scale Data

AbstractThis paper concerns the evolutionary induction of decision trees (DT) for large-scale data. Such a global approach is one of the alternatives to the top-down inducers. It searches for the tree structure and tests simultaneously and thus gives improvements in the prediction and size of resulting classifiers in many situations. However, it is the population-based and iterative approach that can be too computationally demanding to apply for big data mining directly. The paper demonstrates that this barrier can be overcome by smart distributed/parallel processing. Moreover, we ask the question whether the global approach can truly compete with the greedy systems for large-scale data. For this purpose, we propose a novel multi-GPU approach. It incorporates the knowledge of global DT induction and evolutionary algorithm parallelization together with efficient utilization of memory and computing GPU’s resources. The searches for the tree structure and tests are performed simultaneously on a CPU, while the fitness calculations are delegated to GPUs. Data-parallel decomposition strategy and CUDA framework are applied. Experimental validation is performed on both artificial and real-life datasets. In both cases, the obtained acceleration is very satisfactory. The solution is able to process even billions of instances in a few hours on a single workstation equipped with 4 GPUs. The impact of data characteristics (size and dimension) on convergence and speedup of the evolutionary search is also shown. When the number of GPUs grows, nearly linear scalability is observed what suggests that data size boundaries for evolutionary DT mining are fading.

Download Full-text