ConnectIt

Connected components is a fundamental kernel in graph applications. The fastest existing multicore algorithms for solving graph connectivity are based on some form of edge sampling and/or linking and compressing trees. However, many combinations of these design choices have been left unexplored. In this paper, we design the ConnectIt framework, which provides different sampling strategies as well as various tree linking and compression schemes. ConnectIt enables us to obtain several hundred new variants of connectivity algorithms, most of which extend to computing spanning forest. In addition to static graphs, we also extend ConnectIt to support mixes of insertions and connectivity queries in the concurrent setting. We present an experimental evaluation of ConnectIt on a 72-core machine, which we believe is the most comprehensive evaluation of parallel connectivity algorithms to date. Compared to a collection of state-of-the-art static multicore algorithms, we obtain an average speedup of 12.4x (2.36x average speedup over the fastest existing implementation for each graph). Using ConnectIt, we are able to compute connectivity on the largest publicly-available graph (with over 3.5 billion vertices and 128 billion edges) in under 10 seconds using a 72-core machine, providing a 3.1x speedup over the fastest existing connectivity result for this graph, in any computational setting. For our incremental algorithms, we show that our algorithms can ingest graph updates at up to several billion edges per second. To guide the user in selecting the best variants in ConnectIt for different situations, we provide a detailed analysis of the different strategies. Finally, we show how the techniques in ConnectIt can be used to speed up two important graph applications: approximate minimum spanning forest and SCAN clustering.
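As a rough illustration of the "linking and compressing trees" building block mentioned in the abstract, the following is a minimal sequential union-find sketch. It is not ConnectIt's implementation, which is parallel and covers many sampling, linking, and compression variants. The function names `find`, `union`, and `connected_components`, and the choice of path halving with link-by-smaller-id, are assumptions made for illustration only.

```python
def find(parent, x):
    """Return the root of x, shortening the path with path halving."""
    while parent[x] != x:
        parent[x] = parent[parent[x]]  # point x at its grandparent
        x = parent[x]
    return x


def union(parent, u, v):
    """Link the trees containing u and v; return True if they were separate."""
    ru, rv = find(parent, u), find(parent, v)
    if ru == rv:
        return False
    # Illustrative linking rule: the root with the larger id points to the smaller.
    if ru < rv:
        parent[rv] = ru
    else:
        parent[ru] = rv
    return True


def connected_components(n, edges):
    """Label each of n vertices with the root (smallest id) of its component."""
    parent = list(range(n))
    for u, v in edges:
        union(parent, u, v)
    return [find(parent, v) for v in range(n)]


labels = connected_components(6, [(0, 1), (1, 2), (3, 4)])
print(labels)  # [0, 0, 0, 3, 3, 5]
```

The edges for which `union` returns True form a spanning forest, which hints at why many connectivity variants extend to that problem.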


Beyond Synchronous: New Techniques for External-Memory Graph Connectivity and Minimum Spanning Forest

Experimental Algorithms - Lecture Notes in Computer Science ◽

10.1007/978-3-319-07959-2_11 ◽

2014 ◽

pp. 123-137 ◽

Cited By ~ 4

Author(s):

Aapo Kyrola ◽

Julian Shun ◽

Guy Blelloch

Keyword(s):

Graph Connectivity ◽

External Memory ◽

New Techniques ◽

Spanning Forest ◽

Minimum Spanning Forest


Rewriting Minimizations for Efficient Query Answering over Ontologies

International Journal of Artificial Intelligence Tools ◽

10.1142/s0218213017600247 ◽

2017 ◽

Vol 26 (05) ◽

pp. 1760024 ◽

Cited By ~ 1

Author(s):

Tassos Venetis ◽

Giorgos Stoilos ◽

Vasilis Vassalos

Keyword(s):

Real World ◽

Experimental Evaluation ◽

State Of The Art ◽

Query Answering ◽

Current Paper ◽

Query Rewriting ◽

Speed Up ◽

Using Data ◽

The Given ◽

Data Constraints

Computing a Union of Conjunctive Queries (UCQ) rewriting ℛ for an input query and ontology and evaluating it over the given dataset is a prominent approach to query answering over ontologies. However, ℛ can be large and complex in structure, so additional techniques, such as query subsumption and data constraints, must be employed to minimize ℛ and enable efficient evaluation. Although these techniques are sound in theory, implementing many of them efficiently and effectively in practice can be challenging. For example, many systems do not implement query subsumption. In the current paper we present several practical techniques for UCQ rewriting minimization. First, we present an optimized algorithm for eliminating redundant (w.r.t. subsumption) queries as well as a novel framework for rewriting minimization using data constraints. Second, we show how these techniques can also be used to speed up the computation of ℛ in the first place. Third, we integrated all our techniques into our query rewriting system IQAROS and conducted an extensive experimental evaluation using many artificial as well as challenging real-world ontologies, obtaining encouraging results: in the vast majority of cases, our system is more efficient than the two most popular state-of-the-art systems.


Review on biomass feedstocks, pyrolysis mechanism and physicochemical properties of biochar: State-of-the-art framework to speed up vision of circular bioeconomy

Journal of Cleaner Production ◽

10.1016/j.jclepro.2021.126645 ◽

2021 ◽

Vol 297 ◽

pp. 126645

Author(s):

Gajanan Sampatrao Ghodake ◽

Surendra Krushna Shinde ◽

Avinash Ashok Kadam ◽

Rijuta Ganesh Saratale ◽

Ganesh Dattatraya Saratale ◽

...

Keyword(s):

Physicochemical Properties ◽

State Of The Art ◽

Pyrolysis Mechanism ◽

Biomass Feedstocks ◽

Speed Up


All-gather Algorithms Resilient to Imbalanced Process Arrival Patterns

ACM Transactions on Architecture and Code Optimization ◽

10.1145/3460122 ◽

2021 ◽

Vol 18 (4) ◽

pp. 1-22

Author(s):

Jerzy Proficz

Keyword(s):

Experimental Evaluation ◽

Data Exchange ◽

State Of The Art ◽

Monitoring And Evaluation ◽

The Other ◽

Early Data ◽

Cluster Architecture ◽

Novel Algorithms

Two novel algorithms for the all-gather operation resilient to imbalanced process arrival patterns (PAPs) are presented. The first one, Background Disseminated Ring (BDR), is based on the regular parallel ring algorithm often supplied in MPI implementations and exploits an auxiliary background thread for early data exchange from faster processes to accelerate the all-gather operation. The other algorithm, Background Sorted Linear synchronized tree with Broadcast (BSLB), is built upon the existing PAP-aware gather algorithm, Background Sorted Linear Synchronized tree (BSLS), followed by a regular broadcast that distributes the gathered data to all participating processes. The background of the imbalanced PAP subject is described, along with the PAP monitoring and evaluation topics. An experimental evaluation of the algorithms based on a proposed mini-benchmark is presented. The mini-benchmark was performed over 2,000 times in a typical HPC cluster architecture with homogeneous compute nodes. The obtained results are analyzed according to different PAPs, data sizes, and process numbers, showing that the proposed optimization works well for various configurations, is scalable, and can significantly reduce all-gather elapsed times, in our case by up to a factor of 1.9 (47%) compared with the best state-of-the-art solution.
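To make the "regular parallel ring algorithm" that BDR builds on more concrete, here is a small single-process simulation of a plain ring all-gather. It is only a sketch of the baseline pattern: it does not model the background thread, the early data exchange, or imbalanced arrivals, and it uses no MPI calls. The function name `ring_allgather` and the list-of-buffers representation are assumptions for illustration.

```python
def ring_allgather(local_blocks):
    """Simulate a ring all-gather over p processes.

    local_blocks[i] is the block initially owned by process i.
    Returns buffers, where buffers[i][j] is block j as held by process i.
    """
    p = len(local_blocks)
    buffers = [[None] * p for _ in range(p)]
    for i, block in enumerate(local_blocks):
        buffers[i][i] = block

    for step in range(p - 1):
        # Collect all messages first so that the sends of one step
        # behave as if they happened simultaneously.
        messages = []
        for i in range(p):
            idx = (i - step) % p          # block forwarded in this step
            dest = (i + 1) % p            # right-hand neighbour on the ring
            messages.append((dest, idx, buffers[i][idx]))
        for dest, idx, block in messages:
            buffers[dest][idx] = block
    return buffers


result = ring_allgather(["a", "b", "c", "d"])
print(result[0])  # ['a', 'b', 'c', 'd']
```

Each process forwards one block per step, so p - 1 steps are needed. In a synchronous ring, a late process can delay the data it is supposed to forward, which is the kind of sensitivity to arrival imbalance that the abstract's algorithms target.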


Cache-efficient sweeping-based interval joins for extended Allen relation predicates

The VLDB Journal ◽

10.1007/s00778-020-00650-5 ◽

2021 ◽

Author(s):

Danila Piatov ◽

Sven Helmer ◽

Anton Dignös ◽

Fabio Persia

Keyword(s):

Data Structure ◽

Experimental Evaluation ◽

State Of The Art ◽

Temporal Databases ◽

Access Method ◽

Wide Range ◽

Interval Relation ◽

Cache Efficient ◽

Join Algorithms ◽

Better Than

We develop a family of efficient plane-sweeping interval join algorithms for evaluating a wide range of interval predicates such as Allen’s relationships and parameterized relationships. Our technique is based on a framework, components of which can be flexibly combined in different manners to support the required interval relation. In temporal databases, our algorithms can exploit a well-known and flexible access method, the Timeline Index, thus expanding the set of operations it supports even further. Additionally, employing a compact data structure, the gapless hash map, we utilize the CPU cache efficiently. In an experimental evaluation, we show that our approach is several times faster and scales better than state-of-the-art techniques, while being much better suited for real-time event processing.
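The general idea of a plane-sweep interval join can be sketched for the simplest predicate, plain overlap. This is a textbook-style endpoint sweep, not the paper's algorithm: it uses ordinary Python sets rather than the gapless hash map or the Timeline Index, and it does not cover the other Allen relations. The function name `overlap_join` and the half-open interval convention are assumptions.

```python
def overlap_join(r, s):
    """Return sorted pairs (i, j) such that r[i] and s[j] overlap.

    Intervals are half-open (start, end) tuples with start < end,
    so intervals that merely touch at an endpoint do not match.
    """
    START, END = 1, 0  # END sorts first at equal timestamps
    events = []
    for i, (start, end) in enumerate(r):
        events.append((start, START, "r", i))
        events.append((end, END, "r", i))
    for j, (start, end) in enumerate(s):
        events.append((start, START, "s", j))
        events.append((end, END, "s", j))
    events.sort()

    active_r, active_s = set(), set()
    result = []
    for _, kind, side, idx in events:
        if kind == END:
            (active_r if side == "r" else active_s).discard(idx)
        elif side == "r":
            # A starting r-interval overlaps every currently active s-interval.
            result.extend((idx, j) for j in active_s)
            active_r.add(idx)
        else:
            result.extend((i, idx) for i in active_r)
            active_s.add(idx)
    return sorted(result)


pairs = overlap_join([(1, 5), (6, 9)], [(4, 7), (9, 12)])
print(pairs)  # [(0, 0), (1, 0)]
```

Each matching pair is reported exactly once, when the later of the two intervals starts, so the work is proportional to the sorting cost plus the output size.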


Segmentation and Classification of Hyperspectral Images Using Minimum Spanning Forest Grown From Automatically Selected Markers

IEEE Transactions on Systems Man and Cybernetics Part B (Cybernetics) ◽

10.1109/tsmcb.2009.2037132 ◽

2010 ◽

Vol 40 (5) ◽

pp. 1267-1279 ◽

Cited By ~ 196

Author(s):

Y Tarabalka ◽

J Chanussot ◽

J A Benediktsson

Keyword(s):

Hyperspectral Images ◽

Spanning Forest ◽

Minimum Spanning Forest


Persistent memory hash indexes

Proceedings of the VLDB Endowment ◽

10.14778/3446095.3446101 ◽

2021 ◽

Vol 14 (5) ◽

pp. 785-798

Author(s):

Daokun Hu ◽

Zhiwen Chen ◽

Jianbing Wu ◽

Jianhua Sun ◽

Hao Chen

Keyword(s):

Future Development ◽

High Performance ◽

Performance Metrics ◽

Comprehensive Evaluation ◽

State Of The Art ◽

Hash Tables ◽

Trade Offs ◽

Depth Analysis ◽

Persistent Memory ◽

Memory Modules

Persistent memory (PM) is increasingly being leveraged to build hash-based indexing structures featuring cheap persistence, high performance, and instant recovery, especially with the recent release of Intel Optane DC Persistent Memory Modules. However, most of them are evaluated on DRAM-based emulators with unrealistic assumptions, or focus on the evaluation of specific metrics with important properties sidestepped. Thus, it is essential to understand how well the proposed hash indexes perform on real PM and how they differ from each other if a wider range of performance metrics is considered. To this end, this paper provides a comprehensive evaluation of persistent hash tables. In particular, we focus on the evaluation of six state-of-the-art hash tables including Level hashing, CCEH, Dash, PCLHT, Clevel, and SOFT, with real PM hardware. Our evaluation was conducted using a unified benchmarking framework and representative workloads. Besides characterizing common performance properties, we also explore how hardware configurations (such as PM bandwidth, CPU instructions, and NUMA) affect the performance of PM-based hash tables. With our in-depth analysis, we identify design trade-offs and good paradigms in prior art, and suggest desirable optimizations and directions for the future development of PM-based hash tables.


An Efficient Transaction-Based GPU Implementation of Minimum Spanning Forest Algorithm

2017 International Conference on High Performance Computing & Simulation (HPCS) ◽

10.1109/hpcs.2017.100 ◽

2017 ◽

Cited By ~ 1

Author(s):

Shayan Manoochehri ◽

Bahareh Goodarzi ◽

Dhrubajyoti Goswami

Keyword(s):

Spanning Forest ◽

Minimum Spanning Forest ◽

Gpu Implementation


A minimum spanning forest based hyperspectral image classification method for cancerous tissue detection

10.1117/12.2043848 ◽

2014 ◽

Cited By ~ 3

Author(s):

Robert Pike ◽

Samuel K. Patton ◽

Guolan Lu ◽

Luma V. Halig ◽

Dongsheng Wang ◽

...

Keyword(s):

Image Classification ◽

Hyperspectral Image ◽

Classification Method ◽

Hyperspectral Image Classification ◽

Cancerous Tissue ◽

Spanning Forest ◽

Minimum Spanning Forest


Data-Efficient Sensor Upgrade Path Using Knowledge Distillation

Sensors ◽

10.3390/s21196523 ◽

2021 ◽

Vol 21 (19) ◽

pp. 6523

Author(s):

Pieter Van Molle ◽

Cedric De Boom ◽

Tim Verbelen ◽

Bert Vankeirsbilck ◽

Jonas De Vylder ◽

...

Keyword(s):

Deep Neural Networks ◽

State Of The Art ◽

Original Data ◽

Radar Data ◽

Teacher Supervision ◽

Multispectral Images ◽

Test Set ◽

Time To Market ◽

Speed Up ◽

Knowledge Distillation

Deep neural networks have achieved state-of-the-art performance in image classification. Due to this success, deep learning is now also being applied to other data modalities such as multispectral images, lidar and radar data. However, successfully training a deep neural network requires a large dataset. Therefore, transitioning to a new sensor modality (e.g., from regular camera images to multispectral camera images) might result in a drop in performance, due to the limited availability of data in the new modality. This might hinder the adoption rate and time to market for new sensor technologies. In this paper, we present an approach to leverage the knowledge of a teacher network that was trained using the original data modality to improve the performance of a student network on a new data modality: a technique known in the literature as knowledge distillation. By applying knowledge distillation to the problem of sensor transition, we can greatly speed up this process. We validate this approach using a multimodal version of the MNIST dataset. Especially when little data is available in the new modality (i.e., 10 images), training with additional teacher supervision results in increased performance, with the student network scoring a test set accuracy of 0.77, compared to an accuracy of 0.37 for the baseline. We also explore two extensions to the default method of knowledge distillation, which we evaluate on a multimodal version of the CIFAR-10 dataset: an annealing scheme for the hyperparameter α and selective knowledge distillation. Of these two, the first yields the best results. Choosing the optimal annealing scheme results in an increase in test set accuracy of 6%. Finally, we apply our method to the real-world use case of skin lesion classification.
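For readers unfamiliar with knowledge distillation, the sketch below shows one common textbook formulation of the loss for a single example, with α weighting the hard-label term against the teacher term, plus a simple linear schedule for α. The abstract does not give the paper's exact loss, the role of α in it, or its annealing schemes, so the weighting convention, the temperature scaling, the linear schedule, and all function names here are assumptions.

```python
import math


def softmax(logits, temperature=1.0):
    """Numerically stable softmax over a list of logits."""
    scaled = [z / temperature for z in logits]
    m = max(scaled)
    exps = [math.exp(z - m) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(student_logits, teacher_logits, label, alpha, temperature=2.0):
    """alpha * cross-entropy(label) + (1 - alpha) * T^2 * KL(teacher || student)."""
    # Hard-label term: cross-entropy of the student against the true class.
    hard = -math.log(softmax(student_logits)[label])
    # Soft term: KL divergence between temperature-softened distributions.
    p_teacher = softmax(teacher_logits, temperature)
    p_student = softmax(student_logits, temperature)
    soft = sum(t * math.log(t / s) for t, s in zip(p_teacher, p_student) if t > 0)
    return alpha * hard + (1 - alpha) * temperature ** 2 * soft


def linear_alpha(epoch, num_epochs, start=0.1, end=0.9):
    """Hypothetical annealing scheme: move alpha linearly from start to end."""
    if num_epochs <= 1:
        return end
    return start + (end - start) * epoch / (num_epochs - 1)


loss = distillation_loss([1.0, 2.0, 3.0], [0.5, 2.5, 3.5], label=2,
                         alpha=linear_alpha(0, 10))
```

With this convention, a small α early in training leans on teacher supervision, and a larger α later shifts the weight toward the ground-truth labels; other schedules or the opposite direction are equally possible.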
