scholarly journals HiCEnterprise: identifying long range chromosomal contacts in Hi-C data

PeerJ ◽  
2021 ◽  
Vol 9 ◽  
pp. e10558
Author(s):  
Hanna Kranas ◽  
Irina Tuszynska ◽  
Bartek Wilczynski

Motivation Computational analysis of chromosomal contact data is currently gaining popularity with the rapid advance in experimental techniques providing access to a growing body of data. An important problem in this area is the identification of long range contacts between distinct chromatin regions. Such loops were shown to exist at different scales, either mediating relatively short range interactions between enhancers and promoters or providing interactions between much larger, distant chromosome domains. A proper statistical analysis as well as availability to a wide research community are crucial in a tool for this task. Results We present HiCEnterprise, a first freely available software tool for identification of long range chromatin contacts not only between small regions, but also between chromosomal domains. It implements four different statistical tests for identification of significant contacts for user defined regions or domains as well as necessary functions for input, output and visualization of chromosome contacts. Availability The software and the corresponding documentation are available at: github.com/regulomics/HiCEnterprise. Supplementary information Supplemental data are available in the online version of the article and at the website regulomics.mimuw.edu.pl/wp/hicenterprise.

2019 ◽  
Author(s):  
Hanna Kranas ◽  
Irina Tuszyńska ◽  
Bartek Wilczynski

Computational analysis of chromosomal capture data is currently gaining popularity with the rapid advance in experimental techniques providing access to growing body of data. An important problem in this area is the identification of long-range contacts be- tween distinct chromatin regions. Such loops were shown to exist at different scales, either mediating interactions between enhancers and promoters or providing much longer interactions between functionally interacting distant chromosome domains. A proper statistical analysis is crucial for accurate identification of such interactions from experi- mental data. We present HiCEnterprise, a software tool for identification of long-range chromatin contacts. It implements three different sta- tistical tests for identification of significant contacts at different scales as well as necessary functions for input, output and visualization of chromosome contact matrices.


2019 ◽  
Author(s):  
Hanna Kranas ◽  
Irina Tuszyńska ◽  
Bartek Wilczynski

Computational analysis of chromosomal capture data is currently gaining popularity with the rapid advance in experimental techniques providing access to growing body of data. An important problem in this area is the identification of long-range contacts be- tween distinct chromatin regions. Such loops were shown to exist at different scales, either mediating interactions between enhancers and promoters or providing much longer interactions between functionally interacting distant chromosome domains. A proper statistical analysis is crucial for accurate identification of such interactions from experi- mental data. We present HiCEnterprise, a software tool for identification of long-range chromatin contacts. It implements three different sta- tistical tests for identification of significant contacts at different scales as well as necessary functions for input, output and visualization of chromosome contact matrices.


2019 ◽  
Vol 35 (16) ◽  
pp. 2724-2729 ◽  
Author(s):  
L Carron ◽  
J B Morlot ◽  
V Matthys ◽  
A Lesne ◽  
J Mozziconacci

Abstract Motivation Genome-wide chromosomal contact maps are widely used to uncover the 3D organization of genomes. They rely on collecting millions of contacting pairs of genomic loci. Contacts at short range are usually well measured in experiments, while there is a lot of missing information about long-range contacts. Results We propose to use the sparse information contained in raw contact maps to infer high-confidence contact counts between all pairs of loci. Our algorithmic procedure, Boost-HiC, enables the detection of Hi-C patterns such as chromosomal compartments at a resolution that would be otherwise only attainable by sequencing a hundred times deeper the experimental Hi-C library. Boost-HiC can also be used to compare contact maps at an improved resolution. Availability and implementation Boost-HiC is available at https://github.com/LeopoldC/Boost-HiC. Supplementary information Supplementary data are available at Bioinformatics online.


2010 ◽  
Vol 43 (2) ◽  
pp. 367-369 ◽  
Author(s):  
Md. Aftabuddin ◽  
Sudip Kundu

AMINONETis a Java-based software tool to construct different protein contact networks (unweighted and weighted; long range, short range and any range; hydrophobic, hydrophilic, charged or all-amino-acid networks). The networks thus constructed can be visualized. The software will also help in the calculation of the values of the different topological parameters of the constructed networks. The user can either provide a PDB ID or upload a structure file in PDB format as input. If necessary, the user can also do the same for a large number of proteins, uploading a batch file as input (details described in the document available online).


2020 ◽  
Vol 36 (9) ◽  
pp. 2690-2696
Author(s):  
Jarkko Toivonen ◽  
Pratyush K Das ◽  
Jussi Taipale ◽  
Esko Ukkonen

Abstract Motivation Position-specific probability matrices (PPMs, also called position-specific weight matrices) have been the dominating model for transcription factor (TF)-binding motifs in DNA. There is, however, increasing recent evidence of better performance of higher order models such as Markov models of order one, also called adjacent dinucleotide matrices (ADMs). ADMs can model dependencies between adjacent nucleotides, unlike PPMs. A modeling technique and software tool that would estimate such models simultaneously both for monomers and their dimers have been missing. Results We present an ADM-based mixture model for monomeric and dimeric TF-binding motifs and an expectation maximization algorithm MODER2 for learning such models from training data and seeds. The model is a mixture that includes monomers and dimers, built from the monomers, with a description of the dimeric structure (spacing, orientation). The technique is modular, meaning that the co-operative effect of dimerization is made explicit by evaluating the difference between expected and observed models. The model is validated using HT-SELEX and generated datasets, and by comparing to some earlier PPM and ADM techniques. The ADM models explain data slightly better than PPM models for 314 tested TFs (or their DNA-binding domains) from four families (bHLH, bZIP, ETS and Homeodomain), the ADM mixture models by MODER2 being the best on average. Availability and implementation Software implementation is available from https://github.com/jttoivon/moder2. Supplementary information Supplementary data are available at Bioinformatics online.


Nature ◽  
2021 ◽  
Author(s):  
Siyu Chen ◽  
Linda Lee ◽  
Tasmin Naila ◽  
Susan Fishbain ◽  
Annie Wang ◽  
...  

2021 ◽  
Vol 5 (1) ◽  
Author(s):  
Roman Sherrod ◽  
Eric C. O’Quinn ◽  
Igor M. Gussev ◽  
Cale Overstreet ◽  
Joerg Neuefeind ◽  
...  

AbstractThe structural response of Dy2TiO5 oxide under swift heavy ion irradiation (2.2 GeV Au ions) was studied over a range of structural length scales utilizing neutron total scattering experiments. Refinement of diffraction data confirms that the long-range orthorhombic structure is susceptible to ion beam-induced amorphization with limited crystalline fraction remaining after irradiation to 8 × 1012 ions/cm2. In contrast, the local atomic arrangement, examined through pair distribution function analysis, shows only subtle changes after irradiation and is still described best by the original orthorhombic structural model. A comparison to Dy2Ti2O7 pyrochlore oxide under the same irradiation conditions reveals a different behavior: while the dysprosium titanate pyrochlore is more radiation resistant over the long-range with smaller degree of amorphization as compared to Dy2TiO5, the former involves more local atomic rearrangements, best described by a pyrochlore-to-weberite-type transformation. These results highlight the importance of short-range and medium-range order analysis for a comprehensive description of radiation behavior.


Author(s):  
Markus Ekvall ◽  
Michael Höhle ◽  
Lukas Käll

Abstract Motivation Permutation tests offer a straightforward framework to assess the significance of differences in sample statistics. A significant advantage of permutation tests are the relatively few assumptions about the distribution of the test statistic are needed, as they rely on the assumption of exchangeability of the group labels. They have great value, as they allow a sensitivity analysis to determine the extent to which the assumed broad sample distribution of the test statistic applies. However, in this situation, permutation tests are rarely applied because the running time of naïve implementations is too slow and grows exponentially with the sample size. Nevertheless, continued development in the 1980s introduced dynamic programming algorithms that compute exact permutation tests in polynomial time. Albeit this significant running time reduction, the exact test has not yet become one of the predominant statistical tests for medium sample size. Here, we propose a computational parallelization of one such dynamic programming-based permutation test, the Green algorithm, which makes the permutation test more attractive. Results Parallelization of the Green algorithm was found possible by non-trivial rearrangement of the structure of the algorithm. A speed-up—by orders of magnitude—is achievable by executing the parallelized algorithm on a GPU. We demonstrate that the execution time essentially becomes a non-issue for sample sizes, even as high as hundreds of samples. This improvement makes our method an attractive alternative to, e.g. the widely used asymptotic Mann-Whitney U-test. Availabilityand implementation In Python 3 code from the GitHub repository https://github.com/statisticalbiotechnology/parallelPermutationTest under an Apache 2.0 license. Supplementary information Supplementary data are available at Bioinformatics online.


1977 ◽  
Vol 38 (C7) ◽  
pp. C7-202-C7-206 ◽  
Author(s):  
R. MORET ◽  
M. HUBER ◽  
R. COMÈS

2016 ◽  
Vol 6 (1) ◽  
Author(s):  
R. S. Markiewicz ◽  
J. Lorenzana ◽  
G. Seibold ◽  
A. Bansil
Keyword(s):  

Sign in / Sign up

Export Citation Format

Share Document