Tree Pattern Matching

Author(s):  
K. Zhang ◽  
D. Shasha

Most of this book is about stringology, the study of strings. So why this chapter on trees? Why not graphs or geometry or something else? First, trees generalize strings in a very direct sense: a string is simply a tree with a single leaf. This has the unsurprising consequence that many of our algorithms specialize to strings and the happy consequence that some of those algorithms are as efficient as the best string algorithms. From the point of view of “treeology”, there is the additional pragmatic advantage of this relationship between trees and strings: some techniques from strings carry over to trees, e.g., suffix trees, and others show promise though we don’t know of work that exploits it. So, treeology provides a good example area for applications of stringologic techniques. Second, some of our friends in stringology may wonder whether there is some easy reduction that can take any tree edit problem, map it to strings, solve it in the string domain and then map it back. We don’t believe there is, because, as you will see, tree editing seems inherently to have more data dependence than string editing. (Specifically, the dynamic programming approach to string editing is always a local operation depending on the left, upper, and upper left neighbor of a cell. In tree editing, the upper left neighbor is usually irrelevant — instead the relevant cell depends on the tree topology.) That is a belief not a theorem, so we would like to state right at the outset the key open problem of treeology: can all tree edit problems on ordered trees (trees where the order among the siblings matters) be reduced efficiently to string edit problems and back again?. The rest of this chapter proceeds on the assumption that this question has a negative response. In particular, we discuss the best known algorithms for tree editing and several variations having to do with subtree removal, variable length don’t cares, and alignment. We discuss both sequential and parallel algorithms.

Entropy ◽  
2020 ◽  
Vol 23 (1) ◽  
pp. 14
Author(s):  
Krzysztof Żabiński ◽  
Beata Zielosko

In the paper, an approach for decision rules construction is proposed. It is studied from the point of view of the supervised machine learning task, i.e., classification, and from the point of view of knowledge representation. Generated rules provide comparable classification results to the dynamic programming approach for optimization of decision rules relative to length or support. However, the proposed algorithm is based on transformation of decision table into entity–attribute–value (EAV) format. Additionally, standard deviation function for computation of averages’ values of attributes in particular decision classes was introduced. It allows to select from the whole set of attributes only these which provide the highest degree of information about the decision. Construction of decision rules is performed based on idea of partitioning of a decision table into corresponding subtables. In opposite to dynamic programming approach, not all attributes need to be taken into account but only these with the highest values of standard deviation per decision classes. Consequently, the proposed solution is more time efficient because of lower computational complexity. In the framework of experimental results, support and length of decision rules were computed and compared with the values of optimal rules. The classification error for data sets from UCI Machine Learning Repository was also obtained and compared with the ones for dynamic programming approach. Performed experiments show that constructed rules are not far from the optimal ones and classification results are comparable to these obtained in the framework of the dynamic programming extension.


Although for many years the study of cytology has tended to concentrate attention more and more on the protoplast as the fundamental unit of the plant, there can be no doubt that the membrane surrounding this unit plays a part of considerable importance in its life processes. The deposition of such a membrane, by a process which is as yet quite obscure, is obviously closely connected with protoplasmic activity, and a detailed investigation of its structure is bound to lead to a better understanding of this connexion. At the same time, the shape and size of a cell are clearly due in some degree to the action of forces external and internal on the membrane, so that a study of the structure of the plant cell wall should therefore also yield information of considerable importance in the solution of botanical problems concerned with cell elongation and growth. Comparatively recent investigations, carried out chiefly on plant fibres, have shown that the most important component of cell walls, from a structural point of view, is the polysaccharide cellulose. This substance is known to occur in varying proportions in the walls of almost all plant tissue and its structure has been worked out, chiefly by X-ray and chemical methods, with some degree of certainty. Although much remains to be discovered of the organization of cellulose in the wall, certain details are now quite clear. Celluloses obtained from many and varied plant sources have all proved to have essentially the same structure. They exist only in the form of chains of β -glucose residues, at least 500 A long (Hengstenberg and Mark 1928), bound together laterally by secondary valences to form a three-dimensional lattice. The conception of a definite micelle, in the sense of Nägeli, is no longer widely held, although the lattice is not uniformly regular throughout the wall. The chains of cellulose are more probably bound together into ill-defined bundles separated by regions in which they are not so perfectly oriented. This conception of the existence of cellulose in long molecular chains has arisen from the examination of the secondary walls of plants, but as yet no direct experimental determinations have been possible of its structure in primary walls where it is known to occur ( e. g . in Vicia faba , see TupperCarey and Priestley 1922). Recent work (Preston 1934) on the tracheids of the conifer, however, show that it is possible to carry over the idea of the long-chain structure of cellulose even to these delicate primary walls.


Author(s):  
Peppino Fazio ◽  
Mauro Tropea ◽  
Miroslav Voznak ◽  
Andrea Lupia

The employment of an appropriate Bandwidth Management Scheme (BMS) is needed in wireless networking, given that the main desire of end-users is to take advantage of satisfactory services, in terms of Quality of Service (QoS), especially when a particular charge is paid to meet the requirement. In this paper the authors are interested in investigating how the continuity of services can be guaranteed in QoS networks, when users move from a cell to another one, under an infrastructure cellular coverage. The only way to face this issue is represented by the employment of in-advance bandwidth reservations, although it leads the system to waste bandwidth resources, since they are not used until the mobile host enters the coverage cell where the passive request has been made. A new scheme for predicting user movements is proposed, taking the advantage of the dynamic programming approach, that is able to reduce the number of possible roads to be considered and thereby increasing/decreasing the accuracy/redundancy of the proposed model. Several simulation runs have been carried out in order to assess the effectiveness of the proposed idea.


Sign in / Sign up

Export Citation Format

Share Document