Multi-GPU Support on Single Node Using Directive-Based Programming Model

Scientific Programming ◽

10.1155/2015/621730 ◽

2015 ◽

Vol 2015 ◽

pp. 1-15 ◽

Cited By ~ 5

Author(s):

Rengan Xu ◽

Xiaonan Tian ◽

Sunita Chandrasekaran ◽

Barbara Chapman

Keyword(s):

Hybrid Model ◽

Programming Model ◽

Heterogeneous Systems ◽

Single Node ◽

Heterogeneous Processors ◽

Multiple Gpus ◽

Significant Performance ◽

Performance Gains ◽

Immense Potential ◽

Model Approach

Existing studies show that using single GPU can lead to obtaining significant performance gains. We should be able to achieve further performance speedup if we use more than one GPU. Heterogeneous processors consisting of multiple CPUs and GPUs offer immense potential and are often considered as a leading candidate for porting complex scientific applications. Unfortunately programming heterogeneous systems requires more effort than what is required for traditional multicore systems. Directive-based programming approaches are being widely adopted since they make it easy to use/port/maintain application code. OpenMP and OpenACC are two popular models used to port applications to accelerators. However, neither of the models provides support for multiple GPUs. A plausible solution is to use combination of OpenMP and OpenACC that forms a hybrid model; however, building this model has its own limitations due to lack of necessary compilers’ support. Moreover, the model also lacks support for direct device-to-device communication. To overcome these limitations, an alternate strategy is to extend OpenACC by proposing and developing extensions that follow a task-based implementation for supporting multiple GPUs. We critically analyze the applicability of the hybrid model approach and evaluate the proposed strategy using several case studies and demonstrate their effectiveness.

Investigating Exchange Rate Dynamics in Mozambique: A Hybrid Model Approach

SSRN Electronic Journal ◽

10.2139/ssrn.3732947 ◽

2020 ◽

Author(s):

Aurélio Bucuane

Keyword(s):

Exchange Rate ◽

Hybrid Model ◽

Exchange Rate Dynamics ◽

Model Approach

Executing linear algebra kernels in heterogeneous distributed infrastructures with PyCOMPSs

Oil & Gas Science and Technology – Revue d’IFP Energies nouvelles ◽

10.2516/ogst/2018047 ◽

2018 ◽

Vol 73 ◽

pp. 47 ◽

Cited By ~ 3

Author(s):

Ramon Amela ◽

Cristian Ramon-Cortes ◽

Jorge Ejarque ◽

Javier Conejero ◽

Rosa M. Badia

Keyword(s):

Programming Languages ◽

Linear Algebra ◽

Programming Model ◽

Xeon Phi ◽

Scientific Communities ◽

Heterogeneous Architectures ◽

Parallel Programming Model ◽

Significant Performance ◽

Thread Level Parallelism ◽

Level Parallelism

Python is a popular programming language due to the simplicity of its syntax, while still achieving a good performance even being an interpreted language. The adoption from multiple scientific communities has evolved in the emergence of a large number of libraries and modules, which has helped to put Python on the top of the list of the programming languages [1]. Task-based programming has been proposed in the recent years as an alternative parallel programming model. PyCOMPSs follows such approach for Python, and this paper presents its extensions to combine task-based parallelism and thread-level parallelism. Also, we present how PyCOMPSs has been adapted to support heterogeneous architectures, including Xeon Phi and GPUs. Results obtained with linear algebra benchmarks demonstrate that significant performance can be obtained with a few lines of Python.

DDxNet: a deep learning model for automatic interpretation of electronic health records, electrocardiograms and electroencephalograms

Scientific Reports ◽

10.1038/s41598-020-73126-9 ◽

2020 ◽

Vol 10 (1) ◽

Author(s):

Jayaraman J. Thiagarajan ◽

Deepta Rajan ◽

Sameeksha Katoch ◽

Andreas Spanias

Keyword(s):

Electronic Health Records ◽

Rapid Development ◽

Benchmark Problems ◽

Health Records ◽

Automatic Interpretation ◽

Deep Architecture ◽

Significant Performance ◽

Electronic Health ◽

Improving Accuracy ◽

Performance Gains

Abstract Effective patient care mandates rapid, yet accurate, diagnosis. With the abundance of non-invasive diagnostic measurements and electronic health records (EHR), manual interpretation for differential diagnosis has become time-consuming and challenging. This has led to wide-spread adoption of AI-powered tools, in pursuit of improving accuracy and efficiency of this process. While the unique challenges presented by each modality and clinical task demand customized tools, the cumbersome process of making problem-specific choices has triggered the critical need for a generic solution to enable rapid development of models in practice. In this spirit, we develop DDxNet, a deep architecture for time-varying clinical data, which we demonstrate to be well-suited for diagnostic tasks involving different modalities (ECG/EEG/EHR), required level of characterization (abnormality detection/phenotyping) and data fidelity (single-lead ECG/22-channel EEG). Using multiple benchmark problems, we show that DDxNet produces high-fidelity predictive models, and sometimes even provides significant performance gains over problem-specific solutions.

Active Data: A programming model to manage data life cycle across heterogeneous systems and infrastructures

Future Generation Computer Systems ◽

10.1016/j.future.2015.05.015 ◽

2015 ◽

Vol 53 ◽

pp. 25-42 ◽

Cited By ~ 11

Author(s):

Anthony Simonet ◽

Gilles Fedak ◽

Matei Ripeanu

Keyword(s):

Life Cycle ◽

Programming Model ◽

Heterogeneous Systems ◽

Data Life Cycle ◽

Active Data

Simulation of a Floating Offshore Wind Turbine With an Integrated Response Mitigation Technology

ASME 2018 1st International Offshore Wind Technical Conference ◽

10.1115/iowtc2018-1087 ◽

2018 ◽

Author(s):

Christopher K. Allen ◽

Andrew J. Goupee ◽

Jeffrey Lindner ◽

Robert Berry

Keyword(s):

Offshore Wind ◽

Offshore Wind Turbine ◽

The Novel ◽

Mass Reduction ◽

Turbine Performance ◽

Additional Equipment ◽

Significant Performance ◽

Integrated Response ◽

The University ◽

Performance Gains

This work investigates the implementation of a novel, NASA-developed Fluid Harmonic Absorber (FHA) technology to mitigate platform motions and structural loads that can lead to lighter platforms, increased turbine performance, and ultimately, a lower LCOE. The novel damping strategy takes advantage of existing water ballast in the VolturnUS semi-submersible platform to achieve significant performance gains with minimal additional equipment and complexity. NREL’s FOWT software FAST is modified to include the primary features of the FHA technology. A study of the University of Maine-developed VolturnUS semi-submersible FOWT augmented with FHA technology is undertaken to quantify global performance of the system. When compared to the baseline technology, numerical simulations of a redesigned platform utilizing the FHA dampers indicate a reduction of 15.8% in hull structural material. Finally, the improvements in LCOE resulting from this mass reduction are assessed to demonstrate the advantages of NASA’s FHA technology for FOWT applications.

Comparative Analysis of Customer Sentiments on Competing Brands using Hybrid Model Approach

2018 3rd International Conference on Inventive Computation Technologies (ICICT) ◽

10.1109/icict43934.2018.9034283 ◽

2018 ◽

Author(s):

N Srivats Athindran ◽

S. Manikandaraj ◽

R. Kamaleshwar

Keyword(s):

Comparative Analysis ◽

Hybrid Model ◽

Model Approach

A Generic Data-Driven Recommendation System for Large-Scale Regular and Ride-Hailing Taxi Services

Electronics ◽

10.3390/electronics9040648 ◽

2020 ◽

Vol 9 (4) ◽

pp. 648 ◽

Cited By ~ 3

Author(s):

Xiangpeng Wan ◽

Hakim Ghazzai ◽

Yehia Massoud

Keyword(s):

Waiting Time ◽

Large Scale ◽

Recommendation System ◽

Performance Metrics ◽

Routing Algorithm ◽

Area Of Interest ◽

The Future ◽

Taxi Service ◽

Significant Performance ◽

Performance Gains

Modern taxi services are usually classified into two major categories: traditional taxicabs and ride-hailing services. For both services, it is required to design highly efficient recommendation systems to satisfy passengers’ quality of experience and drivers’ benefits. Customers desire to minimize their waiting time before rides, while drivers aim to speed up their customer hunting. In this paper, we propose to leverage taxi service efficiency by designing a generic and smart recommendation system that exploits the benefits of Vehicular Social Networks (VSNs). Aiming at optimizing three key performance metrics, number of pick-ups, customer waiting time, and vacant traveled distance for both taxi services, the proposed recommendation system starts by efficiently estimating the future customer demands in different clusters of the area of interest. Then, it proposes an optimal taxi-to-region matching according to the location of each taxi and the future requested demand of each region. Finally, an optimized geo-routing algorithm is developed to minimize the navigation time spent by drivers. Our simulation model is applied to the borough of Manhattan and is validated with realistic data. Selected results show that significant performance gains are achieved thanks to the additional cooperation among taxi drivers enabled by VSN, as compared to traditional cases.

Dirichlet Multinomial Mixture with Variational Manifold Regularization: Topic Modeling over Short Texts

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v33i01.33017884 ◽

2019 ◽

Vol 33 ◽

pp. 7884-7891 ◽

Cited By ~ 2

Author(s):

Ximing Li ◽

Jiaojiao Zhang ◽

Jihong Ouyang

Keyword(s):

Topic Model ◽

Topic Models ◽

Manifold Regularization ◽

Neighborhood Structure ◽

Short Text ◽

Semantic Level ◽

Clustering And Classification ◽

Significant Performance ◽

Sparsity Problem ◽

Performance Gains

Conventional topic models suffer from a severe sparsity problem when facing extremely short texts such as social media posts. The family of Dirichlet multinomial mixture (DMM) can handle the sparsity problem, however, they are still very sensitive to ordinary and noisy words, resulting in inaccurate topic representations at the document level. In this paper, we alleviate this problem by preserving local neighborhood structure of short texts, enabling to spread topical signals among neighboring documents, so as to correct the inaccurate topic representations. This is achieved by using variational manifold regularization, constraining the close short texts should have similar variational topic representations. Upon this idea, we propose a novel Laplacian DMM (LapDMM) topic model. During the document graph construction, we further use the word mover’s distance with word embeddings to measure document similarities at the semantic level. To evaluate LapDMM, we compare it against the state-of-theart short text topic models on several traditional tasks. Experimental results demonstrate that our LapDMM achieves very significant performance gains over baseline models, e.g., achieving even about 0.2 higher scores on clustering and classification tasks in many cases.

Multi-Component Graph Convolutional Collaborative Filtering

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i04.6094 ◽

2020 ◽

Vol 34 (04) ◽

pp. 6267-6274

Author(s):

Xiao Wang ◽

Ruijia Wang ◽

Chuan Shi ◽

Guojie Song ◽

Qingyong Li

Keyword(s):

Collaborative Filtering ◽

Research Effort ◽

User Preference ◽

Cost Performance ◽

Fine Grained ◽

Multiple Components ◽

Significant Performance ◽

Performance Gains ◽

Latent Components ◽

Component Graph

The interactions of users and items in recommender system could be naturally modeled as a user-item bipartite graph. In recent years, we have witnessed an emerging research effort in exploring user-item graph for collaborative filtering methods. Nevertheless, the formation of user-item interactions typically arises from highly complex latent purchasing motivations, such as high cost performance or eye-catching appearance, which are indistinguishably represented by the edges. The existing approaches still remain the differences between various purchasing motivations unexplored, rendering the inability to capture fine-grained user preference. Therefore, in this paper we propose a novel Multi-Component graph convolutional Collaborative Filtering (MCCF) approach to distinguish the latent purchasing motivations underneath the observed explicit user-item interactions. Specifically, there are two elaborately designed modules, decomposer and combiner, inside MCCF. The former first decomposes the edges in user-item graph to identify the latent components that may cause the purchasing relationship; the latter then recombines these latent components automatically to obtain unified embeddings for prediction. Furthermore, the sparse regularizer and weighted random sample strategy are utilized to alleviate the overfitting problem and accelerate the optimization. Empirical results on three real datasets and a synthetic dataset not only show the significant performance gains of MCCF, but also well demonstrate the necessity of considering multiple components.

Modeling land cover change dynamic using a hybrid model approach in Qeshm Island, Southern Iran

Environmental Monitoring and Assessment ◽

10.1007/s10661-020-08270-w ◽

2020 ◽

Vol 192 (5) ◽

Author(s):

Amir Tajbakhsh ◽

Azadeh Karimi ◽

Anlu Zhang

Keyword(s):

Land Cover ◽

Land Cover Change ◽

Hybrid Model ◽

Qeshm Island ◽

Southern Iran ◽

Change Dynamic ◽

Model Approach