dynamic tree
Recently Published Documents


TOTAL DOCUMENTS

117
(FIVE YEARS 19)

H-INDEX

12
(FIVE YEARS 3)

2022 ◽  
Vol 16 (4) ◽  
pp. 1-33
Author(s):  
Danlu Liu ◽  
Yu Li ◽  
William Baskett ◽  
Dan Lin ◽  
Chi-Ren Shyu

Risk patterns are crucial in biomedical research and have served as an important factor in precision health and disease prevention. Despite recent development in parallel and high-performance computing, existing risk pattern mining methods still struggle with problems caused by large-scale datasets, such as redundant candidate generation, inability to discover long significant patterns, and prolonged post pattern filtering. In this article, we propose a novel dynamic tree structure, Risk Hierarchical Pattern Tree (RHPTree), and a top-down search method, RHPSearch, which are capable of efficiently analyzing a large volume of data and overcoming the limitations of previous works. The dynamic nature of the RHPTree avoids costly tree reconstruction for the iterative search process and dataset updates. We also introduce two specialized search methods, the extended target search (RHPSearch-TS) and the parallel search approach (RHPSearch-SD), to further speed up the retrieval of certain items of interest. Experiments on both UCI machine learning datasets and sampled datasets of the Simons Foundation Autism Research Initiative (SFARI)—Simon’s Simplex Collection (SSC) datasets demonstrate that our method is not only faster but also more effective in identifying comprehensive long risk patterns than existing works. Moreover, the proposed new tree structure is generic and applicable to other pattern mining problems.


2021 ◽  
Vol 7 (3) ◽  
pp. 1-33
Author(s):  
Joachim Gudmundsson ◽  
Michael Horton ◽  
John Pfeifer ◽  
Martin P. Seybold

We present a scalable approach for range and k nearest neighbor queries under computationally expensive metrics, like the continuous Fréchet distance on trajectory data. Based on clustering for metric indexes, we obtain a dynamic tree structure whose size is linear in the number of trajectories, regardless of the trajectory’s individual sizes or the spatial dimension, which allows one to exploit low “intrinsic dimensionality” of datasets for effective search space pruning. Since the distance computation is expensive, generic metric indexing methods are rendered impractical. We present strategies that (i) improve on known upper and lower bound computations, (ii) build cluster trees without any or very few distance calls, and (iii) search using bounds for metric pruning, interval orderings for reduction, and randomized pivoting for reporting the final results. We analyze the efficiency and effectiveness of our methods with extensive experiments on diverse synthetic and real-world datasets. The results show improvement over state-of-the-art methods for exact queries, and even further speedups are achieved for queries that may return approximate results. Surprisingly, the majority of exact nearest-neighbor queries on real datasets are answered without any distance computations.


PeerJ ◽  
2021 ◽  
Vol 9 ◽  
pp. e11120
Author(s):  
Gilda Varliero ◽  
Jared Wray ◽  
Cédric Malandain ◽  
Gary Barker

Many environmental and biomedical biomonitoring and detection studies aim to explore the presence of specific organisms or gene functionalities in microbiome samples. In such cases, when the study hypotheses can be answered with the exploration of a small number of genes, a targeted PCR-approach is appropriate. However, due to the complexity of environmental microbial communities, the design of specific primers is challenging and can lead to non-specific results. We designed PhyloPrimer, the first user-friendly platform to semi-automate the design of taxon-specific oligos (i.e., PCR primers) for a gene of interest. The main strength of PhyloPrimer is the ability to retrieve and align GenBank gene sequences matching the user’s input, and to explore their relationships through an online dynamic tree. PhyloPrimer then designs oligos specific to the gene sequences selected from the tree and uses the tree non-selected sequences to look for and maximize oligo differences between targeted and non-targeted sequences, therefore increasing oligo taxon-specificity (positive/negative consensus approach). Designed oligos are then checked for the presence of secondary structure with the nearest-neighbor (NN) calculation and the presence of off-target matches with in silico PCR tests, also processing oligos with degenerate bases. Whilst the main function of PhyloPrimer is the design of taxon-specific oligos (down to the species level), the software can also be used for designing oligos to target a gene without any taxonomic specificity, for designing oligos from preselected sequences and for checking predesigned oligos. We validated the pipeline on four commercially available microbial mock communities using PhyloPrimer to design genus- and species-specific primers for the detection of Streptococcus species in the mock communities. The software performed well on these mock microbial communities and can be found at https://www.cerealsdb.uk.net/cerealgenomics/phyloprimer.


Forests ◽  
2021 ◽  
Vol 12 (3) ◽  
pp. 379
Author(s):  
Zi Yang ◽  
Ka Wai Hui ◽  
Sawaid Abbas ◽  
Rui Zhu ◽  
Coco Yin Tung Kwok ◽  
...  

Urban forest ecosystems are being developed to provide various environmental services (e.g., the preservation of urban trees) to urban inhabitants. However, some trees are deteriorated asymptomatically without exhibiting an early sign of tree displacement, which results in a higher vulnerability under dynamic wind loads, especially during typhoon seasons, in the subtropical and tropical regions. As such, it is important to understand the tilt and sway behaviors of trees to cope up with the probability of tree failure and to improve the efficacy of tree management. Tree behaviors under wind loads have been broadly reviewed in the past literature, yet thorough discussions on the measurement methods for tree displacement and its analysis of broadleaf specimens are lacking. To understand the behavioral pattern of both broadleaf and conifer species, this paper presents a detailed review of sway behavior analysis from the perspectives of the aerial parts of the individual tree, including tree stem, canopy, and trunk, alongside a highlighted focus on the root–plate movement amid the soil-root system. The analytical approaches associated with the time-space domain and the time-frequency domain are being introduced. In addition to the review of dynamic tree behaviors, an integrated tree monitoring framework based on geographic information systems (GIS) to detect and visualize the extent of tree displacement using smart sensing technology (SST) is introduced. The monitoring system aims to establish an early warning indicator system for monitoring the displacement angles of trees over the territory of Hong Kong’s urban landscape. This pilot study highlights the importance of the monitoring system at an operational scale to be applicable in the urban areas showcasing the practical use of the Internet of Things (IoT) with an in-depth understanding of the wind-load effect toward the urban trees in the tropical and subtropical cities.


2021 ◽  
Vol 182 ◽  
pp. 106007
Author(s):  
Christopher Alexander Maximilian Busch ◽  
Karl A. Stol ◽  
Wannes van der Mark

Genes ◽  
2021 ◽  
Vol 12 (1) ◽  
pp. 87
Author(s):  
Jie Hou ◽  
Xiufen Ye ◽  
Chuanlong Li ◽  
Yixing Wang

Among biological networks, co-expression networks have been widely studied. One of the most commonly used pipelines for the construction of co-expression networks is weighted gene co-expression network analysis (WGCNA), which can identify highly co-expressed clusters of genes (modules). WGCNA identifies gene modules using hierarchical clustering. The major drawback of hierarchical clustering is that once two objects are clustered together, it cannot be reversed; thus, re-adjustment of the unbefitting decision is impossible. In this paper, we calculate the similarity matrix with the distance correlation for WGCNA to construct a gene co-expression network, and present a new approach called the k-module algorithm to improve the WGCNA clustering results. This method can assign all genes to the module with the highest mean connectivity with these genes. This algorithm re-adjusts the results of hierarchical clustering while retaining the advantages of the dynamic tree cut method. The validity of the algorithm is verified using six datasets from microarray and RNA-seq data. The k-module algorithm has fewer iterations, which leads to lower complexity. We verify that the gene modules obtained by the k-module algorithm have high enrichment scores and strong stability. Our method improves upon hierarchical clustering, and can be applied to general clustering algorithms based on the similarity matrix, not limited to gene co-expression network analysis.


2021 ◽  
pp. 1-12
Author(s):  
Huda Althumali ◽  
Mohamed Othman ◽  
Nor Kamariah Noordin ◽  
Zurina Mohd Hanapi

Sign in / Sign up

Export Citation Format

Share Document