Mining Spatio-Temporal Graph Patterns

Temporal and Spatio-Temporal Data Mining ◽

10.4018/978-1-59904-387-6.ch011 ◽

2008 ◽

pp. 227-261

Author(s):

Wynne Hsu ◽

Mong Li Lee ◽

Junmei Wang

Keyword(s):

Data Mining ◽

Graph Mining ◽

Main Memory ◽

Graph Database ◽

Graph Databases ◽

Complete Set ◽

Temporal Graph ◽

Spatio Temporal

Data mining in graph databases has received much attention. We have witnessed many algorithms proposed for mining frequent graphs. Inokuchi, Washio, and Nishimura (2002) and Karpis and Kumar (1998) introduce the Apriori-like algorithms, AGM and FSG, to mine the complete set of frequent graphs. However, both algorithms are not scalable as they require multiple scans of databases and tend to generate many candidates during the mining process. Subsequently, Yan and Han (2002) and Nijssen and Kok (2004) propose depth-first graph mining approaches called gSpan and Gaston, respectively. These approaches are essentially memory-based and their efficiencies decrease dramatically if the graph database is too large to fit into the main memory.

Download Full-text

Concept and assumptions about the temporal graph database

MATEC Web of Conferences ◽

10.1051/matecconf/201821004017 ◽

2018 ◽

Vol 210 ◽

pp. 04017

Author(s):

Jarosław Koszela ◽

Paulina Szczepańczyk-Wysocka

Keyword(s):

Temporal Databases ◽

Graph Database ◽

Graph Databases ◽

Temporal Data ◽

Incremental Method ◽

Temporal Graph

The article outlines existing solutions in the area of graphs and temporal databases. It provides explanation for why the temporal graph database was created. Furthermore, the article also describes the concept and assumptions about the temporal graph database, including a proposal of two methods for representing temporal data in graph databases. Full write method assumes creating a new database object for each state of being. While incremental method writes only such features and relationships that were subject to change. Regardless of the data write method used, the data may be returned in a historically unordered or ordered manner. The article outlines assumptions for both methods of representing data.

Download Full-text

Subgraph Mining

Encyclopedia of Data Warehousing and Mining, Second Edition ◽

10.4018/978-1-60566-010-3.ch285 ◽

2011 ◽

pp. 1865-1870

Author(s):

Ingrid Fischer

Keyword(s):

Data Mining ◽

Data Structure ◽

Graph Mining ◽

Tertiary Structure ◽

Inductive Logic ◽

Graph Databases ◽

Application Area ◽

Web Searches ◽

Subgraph Mining ◽

Molecular Databases

The amount of available data is increasing very fast. With this data, the desire for data mining is also growing. More and larger databases have to be searched to find interesting (and frequent) elements and connections between them. Most often the data of interest is very complex. It is common to model complex data with the help of graphs consisting of nodes and edges that are often labeled to store additional information. Having a graph database, the main goal is to find connections and similarities between its graphs. Based on these connections and similarities, the graphs can be categorized, clustered or changed according to the application area. Regularly occurring patterns in the form of subgraphs —called fragments in this context—that appear at least in a certain percentage of graphs, are a common method to analyze graph databases. The actual occurrence of a fragment in a database graph is called embedding. Finding the fragments and their embeddings is the goal of subgraph mining described in detail in this chapter. The first published graph mining algorithm, called Subdue, appeared in the mid-1990s and is still used in different application areas and was extended in several ways. (Cook & Holder, 2000). Subdue is based on a heuristic search and does not find all possible fragments and embeddings. It took a few more years before more and faster approaches appeared. In (Helma, Kramer, & de Raedt, 2002) graph databases are mined for simple paths, for a lot of other applications only trees are of interest (Rückert & Kramer, 2004). Also Inductive Logic Programming (Finn et al., 1998) was applied in this area. At the beginning of the new millennium finally more and more and every time faster approaches for general mining of graph databases were developed that were able to find all possible fragments. (Borgelt & Berthold, 2002; Yan & Han, 2002; Kuramochi & Karypis, 2001; Nijssen & Kok, 2004). Several different application areas for graph mining are researched. The most common area is mining molecular databases where the molecules are displayed by their two-dimensional structure. When analyzing molecules it is interesting to find patterns that might explain why a certain set of molecules is useful as a drug against certain diseases (Borgelt & Berthold, 2002). Similar problems occur for protein databases. Here graph data mining can be used to find structural patterns in the primary, secondary and tertiary structure of protein categories (Cook & Holder, 2000). Another application area are web searches (Cook, Manocha, & Holder, 2003). Existing search engines use linear feature matches. Using graphs as underlying data structure, nodes represent pages, documents or document keywords and edges represent links between them. Posing a query as a graph means a smaller graph has to be embedded in the larger one. The graph modeling the data structure can be mined to find similar clusters. Quite new is the application of subgraph mining in optimizing code for embedded devices. With the help of so-called procedural abstraction, the size of pre-compiled binaries can be reduced which is often crucial because of the limited storage capacities of embedded systems. There, subgraph mining helps identifying common structures in the program’s control flow graph which can then be combined (“abstracted”) into a single procedure (Dreweke et.al., 2007).

Download Full-text

Advantages of using graph databases to explore chromatin conformation capture experiments

BMC Bioinformatics ◽

10.1186/s12859-020-03937-0 ◽

2021 ◽

Vol 22 (S2) ◽

Author(s):

Daniele D’Agostino ◽

Pietro Liò ◽

Marco Aldinucci ◽

Ivan Merelli

Keyword(s):

Web Application ◽

High Throughput Sequencing ◽

Cell Types ◽

Graph Database ◽

Graph Databases ◽

Sources Of Information ◽

Chromosome Conformation ◽

Wide Scale ◽

User Friendly ◽

Different Cell Types

Abstract Background High-throughput sequencing Chromosome Conformation Capture (Hi-C) allows the study of DNA interactions and 3D chromosome folding at the genome-wide scale. Usually, these data are represented as matrices describing the binary contacts among the different chromosome regions. On the other hand, a graph-based representation can be advantageous to describe the complex topology achieved by the DNA in the nucleus of eukaryotic cells. Methods Here we discuss the use of a graph database for storing and analysing data achieved by performing Hi-C experiments. The main issue is the size of the produced data and, working with a graph-based representation, the consequent necessity of adequately managing a large number of edges (contacts) connecting nodes (genes), which represents the sources of information. For this, currently available graph visualisation tools and libraries fall short with Hi-C data. The use of graph databases, instead, supports both the analysis and the visualisation of the spatial pattern present in Hi-C data, in particular for comparing different experiments or for re-mapping omics data in a space-aware context efficiently. In particular, the possibility of describing graphs through statistical indicators and, even more, the capability of correlating them through statistical distributions allows highlighting similarities and differences among different Hi-C experiments, in different cell conditions or different cell types. Results These concepts have been implemented in NeoHiC, an open-source and user-friendly web application for the progressive visualisation and analysis of Hi-C networks based on the use of the Neo4j graph database (version 3.5). Conclusion With the accumulation of more experiments, the tool will provide invaluable support to compare neighbours of genes across experiments and conditions, helping in highlighting changes in functional domains and identifying new co-organised genomic compartments.

Download Full-text

Applying graph database technology for analyzing perturbed co-expression networks in cancer

Database ◽

10.1093/database/baaa110 ◽

2020 ◽

Vol 2020 ◽

Author(s):

Claire M Simpson ◽

Florian Gnad

Keyword(s):

Relational Databases ◽

Molecular Mechanisms ◽

Biological Data ◽

Database Management System ◽

Graph Database ◽

Graph Databases ◽

Graph Representations ◽

Rnaseq Data ◽

Database Technology ◽

Speed Accuracy

Abstract Graph representations provide an elegant solution to capture and analyze complex molecular mechanisms in the cell. Co-expression networks are undirected graph representations of transcriptional co-behavior indicating (co-)regulations, functional modules or even physical interactions between the corresponding gene products. The growing avalanche of available RNA sequencing (RNAseq) data fuels the construction of such networks, which are usually stored in relational databases like most other biological data. Inferring linkage by recursive multiple-join statements, however, is computationally expensive and complex to design in relational databases. In contrast, graph databases store and represent complex interconnected data as nodes, edges and properties, making it fast and intuitive to query and analyze relationships. While graph-based database technologies are on their way from a fringe domain to going mainstream, there are only a few studies reporting their application to biological data. We used the graph database management system Neo4j to store and analyze co-expression networks derived from RNAseq data from The Cancer Genome Atlas. Comparing co-expression in tumors versus healthy tissues in six cancer types revealed significant perturbation tracing back to erroneous or rewired gene regulation. Applying centrality, community detection and pathfinding graph algorithms uncovered the destruction or creation of central nodes, modules and relationships in co-expression networks of tumors. Given the speed, accuracy and straightforwardness of managing these densely connected networks, we conclude that graph databases are ready for entering the arena of biological data.

Download Full-text

The use of cultural algorithms with evolutionary programming to control the data mining of large-scale spatio-temporal databases

1997 IEEE International Conference on Systems, Man, and Cybernetics. Computational Cybernetics and Simulation ◽

10.1109/icsmc.1997.637338 ◽

2002 ◽

Cited By ~ 1

Author(s):

R. Reynolds ◽

H. Al-Shehri

Keyword(s):

Data Mining ◽

Large Scale ◽

Evolutionary Programming ◽

Temporal Databases ◽

Cultural Algorithms ◽

Spatio Temporal

Download Full-text

Comprehensive performance analysis of Spatio-Temporal Data Mining approach on multi-temporal coastal remote sensing datasets

2011 IEEE International Geoscience and Remote Sensing Symposium ◽

10.1109/igarss.2011.6049592 ◽

2011 ◽

Author(s):

Balakrishna Gokaraju ◽

Surya S. Durbha ◽

Roger L. King ◽

Nicolas H. Younan

Keyword(s):

Remote Sensing ◽

Data Mining ◽

Performance Analysis ◽

Temporal Data Mining ◽

Temporal Data ◽

Data Mining Approach ◽

Comprehensive Performance ◽

Multi Temporal ◽

Spatio Temporal

Download Full-text

Symmetric Sub-graph Spatio-Temporal Graph Convolution and its application in Complex Activity Recognition

ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ◽

10.1109/icassp39728.2021.9413833 ◽

2021 ◽

Author(s):

Pratyusha Das ◽

Antonio Ortega

Keyword(s):

Activity Recognition ◽

Complex Activity ◽

Temporal Graph ◽

Spatio Temporal

Download Full-text

Based on spatio-temporal graph convolution networks with residual connection for intelligence behavior recognition

International Journal of Electrical Engineering Education ◽

10.1177/0020720921996600 ◽

2021 ◽

pp. 002072092199660

Author(s):

Yinong Zhang ◽

Shanshan Guan ◽

Cheng Xu ◽

Hongzhe Liu

Keyword(s):

Human Behavior ◽

Behavior Recognition ◽

Convolutional Network ◽

Skeleton Model ◽

Basic Technology ◽

Temporal Model ◽

Temporal Graph ◽

Spatio Temporal ◽

Human Behavior Recognition ◽

Behavior Characteristics

In the era of intelligent education, human behavior recognition based on computer vision is an important branch of pattern recognition. Human behavior recognition is a basic technology in the fields of intelligent monitoring and human-computer interaction in education. The dynamic changes of human skeleton provide important information for the recognition of educational behavior. Traditional methods usually use manual information to label or traverse rules only, resulting in limited representation capabilities and poor generalization performance of the model. In this paper, a kind of dynamic skeleton model with residual is adopted—a spatio-temporal graph convolutional network based on residual connections, which not only overcomes the limitations of previous methods, but also can learn the spatio-temporal model from the skeleton data. In the big bone NTU-RGB + D dataset, the network model not only improved the representation ability of human behavior characteristics, but also improved the generalization ability, and achieved better recognition effect than the existing model. In addition, this paper also compares the results of behavior recognition on subsets of different joint points, and finds that spatial structure division have better effects.

Download Full-text