Information graph-based creation of parallel queries for databases

Abstract: A scalable graph-based method is presented for selecting and partitioning datasets for the training phase of a classification task. The heuristic first applies a clustering algorithm to keep its own computational cost in reasonable proportion to that of the training task. It then constructs an information graph of the underlying classification patterns using approximate nearest neighbor methods. The method consists of two approaches: one reduces a given training set, and the other partitions the selected (reduced) set. The heuristic targets large datasets, since the primary goal is a significant reduction in training run-time without compromising prediction accuracy. Test results show that both approaches significantly speed up training compared with the state-of-the-art shrinking heuristics available in LIBSVM. Furthermore, their prediction accuracy closely follows or even exceeds that of the LIBSVM heuristics. A network design is also presented for a partitioning-based distributed training formulation, which yields additional speed-up in training run-time over the serial implementation of the approaches.
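
The reduction idea in this abstract can be illustrated with a small sketch. It is not the authors' algorithm: it assumes an exact brute-force k-nearest-neighbor graph instead of an approximate one, skips the clustering step, and uses a hypothetical selection rule (keep a point only if one of its neighbors carries a different label). The names `knn_graph` and `boundary_indices` are illustrative.

```python
from math import dist

def knn_graph(points, k):
    """Return {i: indices of the k nearest other points}, by brute force."""
    graph = {}
    for i, p in enumerate(points):
        others = sorted((j for j in range(len(points)) if j != i),
                        key=lambda j: dist(p, points[j]))
        graph[i] = others[:k]
    return graph

def boundary_indices(points, labels, k):
    """Keep points that have at least one differently labeled neighbor.

    Assumption: such points lie near the class boundary and are the ones
    most likely to matter when training a margin-based classifier.
    """
    graph = knn_graph(points, k)
    return [i for i, nbrs in graph.items()
            if any(labels[j] != labels[i] for j in nbrs)]

# Six points on a line: class 0 on the left, class 1 on the right.
points = [(0.0, 0.0), (1.0, 0.0), (2.0, 0.0),
          (3.0, 0.0), (4.0, 0.0), (5.0, 0.0)]
labels = [0, 0, 0, 1, 1, 1]
kept = boundary_indices(points, labels, k=2)
```

In this toy case only the two points adjacent to the class change survive, so a classifier would be trained on a third of the data. Brute-force neighbor search is quadratic in the number of points, which is why a method aimed at large datasets would need approximate search.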

A proof of Beigel's cardinality conjecture

Journal of Symbolic Logic ◽

10.2307/2275299 ◽

1992 ◽

Vol 57 (2) ◽

pp. 677-681 ◽

Cited By ~ 36

Author(s):

Martin Kummer

Keyword(s):

Complexity Theory ◽

Structural Complexity ◽

Halting Problem ◽

Parallel Queries

In 1986, Beigel [Be87] (see also [Od89, III.5.9]) proved the nonspeedup theorem: if A, B ⊆ ω, and the characteristic function of A, as a function of $2^n$ variables, can be computed by an algorithm which makes at most n queries to B, then A is recursive (informally, $2^n$ parallel queries to a nonrecursive oracle A cannot be answered by making n sequential (or “adaptive”) queries to an arbitrary oracle B). Here, $2^n$ cannot be replaced by $2^n − 1$. In subsequent papers of Beigel, Gasarch, Gill, Hay, and Owings the theory of “bounded query classes” has been further developed (see, for example, [BGGOta], [BGH89], and [Ow89]). The topic has also been studied in the context of structural complexity theory (see, for example, [AG88], [Be90], and [JY90]). If A ⊆ ω and n ≥ 1, let $\#_n^A(x_1, \dots, x_n) = |\{i : x_i \in A\}|$. Beigel [Be87] stated the powerful “cardinality conjecture” (CC): if A, B ⊆ ω, and $\#_{2^n}^A$ can be computed by an algorithm which makes at most n queries to B, then A is recursive. Owings [Ow89] verified CC for n = 1, and, for n > 1, he proved that A is recursive in the halting problem. We prove that CC is true for all n.
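
The counting intuition behind these bounded-query results can be sketched in code: an algorithm that asks at most n adaptive yes/no questions can follow at most 2^n different answer sequences, so without access to the oracle its output is already narrowed down to at most 2^n candidates. The sketch below is only an illustration of that observation, not part of the proof; `candidate_outputs` and `toy_algorithm` are hypothetical names, and the oracle is modeled as a callback.

```python
from itertools import product

def candidate_outputs(algorithm, x, n):
    """Collect the outputs of `algorithm` on x over all 2**n answer sequences.

    `algorithm(x, ask)` may call `ask(query)` at most n times; each call is
    answered from the current sequence, regardless of the query asked.
    """
    outputs = set()
    for answers in product([False, True], repeat=n):
        remaining = iter(answers)
        outputs.add(algorithm(x, lambda _query: next(remaining)))
    return outputs

def toy_algorithm(x, ask):
    """Hypothetical 2-query algorithm; the second query depends on the first answer."""
    first = ask(("is x in B?", x))
    second = ask(("follow-up", x, first))
    return (first, second)

candidates = candidate_outputs(toy_algorithm, 7, n=2)
```

With n = 2 there are at most four candidates, whereas the number of elements of A among four inputs can take five values (0 to 4), which gives a feel for why the conjecture is stated for $2^n$ arguments.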

Organization of Information Search in the Global Network Based on the Information-graph Data Model

10.1109/summa53307.2021.9632017 ◽

2021 ◽

Author(s):

Igor Zemtsov ◽

Olga Ivanova ◽

Sergey Danilkin ◽

Oksana Petrova

Keyword(s):

Information Search ◽

Data Model ◽

Global Network ◽

Graph Data ◽

Information Graph

Differentiation of Healthy Individuals from Those with Autism Spectrum Disorders using Information Graph of Complementary Opposites

10.32592/ajnpp.2021.8.3.100 ◽

2020 ◽

Keyword(s):

Children With Autism ◽

Autism Spectrum ◽

Brain Signals ◽

Three Stages ◽

The Mean ◽

Spectrum Disorders ◽

Pediatric Psychiatry ◽

The Brain ◽

Optimal Feature ◽

Information Graph

This study aimed to examine the brain signals of children with Autism Spectrum Disorder (ASD) and to use a method based on the concept of complementary opposites to obtain the prominent features, or a pattern, of the EEG signal that represents the biological characteristics of such children. In this study, 20 children with a mean±SD age of 8±5 years were divided into two groups: normal control (NC) and ASD. The diagnosis of the individuals in both groups was made and confirmed by two experts in pediatric psychiatry and neurology. The recording protocol was designed for maximum accuracy, so the brain signals were recorded with minimal noise while the individuals in both groups were awake. The recording was conducted in three stages from two EEG channels (C3-C4, the central part of the brain), which are symmetrical in function. The Mandala method, based on the concept of complementary opposites, was adopted to investigate the features extracted from the Mandala pattern topology and to obtain new features and pseudo-patterns for the screening and early diagnosis of ASD. The optimal feature was the Pattern Detection Capability (PDC), selected through several stages of processing and statistical analysis. The PDC is a biomarker derived from the Mandala pattern for differentiating the NC group from the ASD group.

The Application of Graph Theory and Adjacency Lists to Create Parallel Queries to Relational Databases

Lecture Notes in Computer Science - Internet of Things, Smart Spaces, and Next Generation Networks and Systems ◽

10.1007/978-3-030-01168-0_13 ◽

2018 ◽

pp. 138-149

Author(s):

Yulia Shichkina ◽

Mikhail Kupriyanov ◽

Vladislav Shevsky

Keyword(s):

Graph Theory ◽

Relational Databases ◽

Parallel Queries

Parallel Queries of Cluster-Based k Nearest Neighbor in MapReduce

Advances in Systems Analysis, Software Engineering, and High Performance Computing - Managing Big Data in Cloud Computing Environments ◽

10.4018/978-1-4666-9834-5.ch007 ◽

2016 ◽

pp. 163-182

Author(s):

Wei Yan

Keyword(s):

Spatial Data ◽

Spatial Databases ◽

Nearest Neighbor ◽

Programming Model ◽

K Nearest Neighbor ◽

K Nearest Neighbors ◽

Data Intensive ◽

Parallel Queries ◽

Massive Spatial Data ◽

Nearest Neighbor Queries

Parallel k nearest neighbor queries over massive spatial data are an important problem. The k nearest neighbor query (kNN query), which finds the k nearest neighbors in a dataset S for every point in another dataset R, is a useful tool widely adopted in applications including knowledge discovery, data mining, and spatial databases. In cloud computing environments, the MapReduce programming model is a well-accepted framework for data-intensive applications over clusters of computers. This chapter proposes a parallel, cluster-based method for kNN queries in the MapReduce programming model. First, it proposes a method for partitioning spatial data using a Voronoi diagram. Then, it clusters the data points within each partition using the k-means method. Next, it proposes an efficient algorithm for processing kNN queries over the k-means clusters using the MapReduce programming model. Finally, extensive experiments evaluate the efficiency of the proposed approach.
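
The partition-then-search pattern described in this abstract can be sketched as a single-process emulation of a map step and a reduce step. This is a simplified illustration under stated assumptions, not the chapter's algorithm: the pivots are given by hand, the k-means step is omitted, and each query point searches only its own Voronoi cell, so answers near cell borders may be approximate. All function names are hypothetical.

```python
from collections import defaultdict
from math import dist

def nearest_pivot(p, pivots):
    """Index of the pivot closest to p, i.e. the Voronoi cell containing p."""
    return min(range(len(pivots)), key=lambda i: dist(p, pivots[i]))

def map_phase(R, S, pivots):
    """Route every point of R and S to the cell of its nearest pivot."""
    cells = defaultdict(lambda: {"R": [], "S": []})
    for tag, data in (("R", R), ("S", S)):
        for p in data:
            cells[nearest_pivot(p, pivots)][tag].append(p)
    return cells

def reduce_phase(cells, k):
    """Within each cell, find for every r its k nearest points of S by brute force."""
    result = {}
    for cell in cells.values():
        for r in cell["R"]:
            result[r] = sorted(cell["S"], key=lambda s: dist(r, s))[:k]
    return result

pivots = [(0.0, 0.0), (10.0, 10.0)]  # assumed, hand-picked pivots
R = [(1.0, 1.0), (9.0, 9.0)]
S = [(0.0, 1.0), (2.0, 2.0), (10.0, 9.0), (8.0, 8.0), (0.5, 0.5)]
knn = reduce_phase(map_phase(R, S, pivots), k=2)
```

Because each cell is processed independently in `reduce_phase`, the per-cell work could be handed to separate workers. An exact method would also have to inspect neighboring cells whenever a query point lies closer to a cell border than to its current k-th neighbor.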

Mining Heterogeneous Information Graph for Health Status Classification

2018 5th International Conference on Behavioral, Economic, and Socio-Cultural Computing (BESC) ◽

10.1109/besc.2018.8697292 ◽

2018 ◽

Cited By ~ 3

Author(s):

Thuan Pham ◽

Xiaohui Tao ◽

Ji Zhang ◽

Jianming Yong ◽

Wenping Zhang ◽

...

Keyword(s):

Health Status ◽

Heterogeneous Information ◽

Status Classification ◽

Information Graph

Music recommendation via heterogeneous information graph embedding

2017 International Joint Conference on Neural Networks (IJCNN) ◽

10.1109/ijcnn.2017.7965907 ◽

2017 ◽

Cited By ~ 7

Author(s):

Dongjing Wang ◽

Guandong Xu ◽

Shuiguang Deng

Keyword(s):

Graph Embedding ◽

Music Recommendation ◽

Heterogeneous Information ◽

Information Graph

Multi-dimensional resource scheduling for parallel queries

ACM SIGMOD Record ◽

10.1145/235968.233352 ◽

1996 ◽

Vol 25 (2) ◽

pp. 365-376 ◽

Cited By ~ 16

Author(s):

Minos N. Garofalakis ◽

Yannis E. Ioannidis

Keyword(s):

Resource Scheduling ◽

Parallel Queries

THEORETICAL CONSIDERATIONS FOR LAND INFORMATION SYSTEMS

The Canadian Surveyor ◽

10.1139/tcs-1987-0004 ◽

1987 ◽

Vol 41 (1) ◽

pp. 51-64

Author(s):

J.A.R. Blais

Keyword(s):

Information System ◽

Information Systems ◽

System Analysis ◽

Natural Topology ◽

Design And Development ◽

Information Theoretic ◽

Land Information System ◽

Theoretical Considerations ◽

Information Graph ◽

Land Information Systems

In general, land information includes all information that is related to the land and its resources. Among the necessary considerations in the design and development of a land information system, the topological aspects are fundamental as they refer to the interconnectivity of the information. Graph and information theoretic considerations, based on the natural topology of the information, are also required for system analysis, optimization and other purposes. Some practical aspects of these considerations are briefly discussed with suggestions for further studies and investigations.
