scholarly journals Redshift inference from the combination of galaxy colours and clustering in a hierarchical Bayesian model – Application to realistic N-body simulations

2020 ◽  
Vol 498 (2) ◽  
pp. 2614-2631 ◽  
Author(s):  
Alex Alarcon ◽  
Carles Sánchez ◽  
Gary M Bernstein ◽  
Enrique Gaztañaga

ABSTRACT Photometric galaxy surveys constitute a powerful cosmological probe but rely on the accurate characterization of their redshift distributions using only broad-band imaging, and can be very sensitive to incomplete or biased priors used for redshift calibration. A hierarchical Bayesian model has recently been developed to estimate those from the robust combination of prior information, photometry of single galaxies, and the information contained in the galaxy clustering against a well-characterized tracer population. In this work, we extend the method so that it can be applied to real data, developing some necessary new extensions to it, especially in the treatment of galaxy clustering information, and we test it on realistic simulations. After marginalizing over the mapping between the clustering estimator and the actual density distribution of the sample galaxies, and using prior information from a small patch of the survey, we find the incorporation of clustering information with photo-z’s tightens the redshift posteriors and overcomes biases in the prior that mimic those happening in spectroscopic samples. The method presented here uses all the information at hand to reduce prior biases and incompleteness. Even in cases where we artificially bias the spectroscopic sample to induce a shift in mean redshift of $\Delta \bar{z} \approx 0.05,$ the final biases in the posterior are $\Delta \bar{z} \lesssim 0.003.$ This robustness to flaws in the redshift prior or training samples would constitute a milestone for the control of redshift systematic uncertainties in future weak lensing analyses.

2018 ◽  
Vol 35 (5) ◽  
pp. 787-797 ◽  
Author(s):  
Yuanyuan Bian ◽  
Chong He ◽  
Jie Hou ◽  
Jianlin Cheng ◽  
Jing Qiu

Abstract Motivation Several methods have been proposed for the paired RNA-seq analysis. However, many of them do not consider the heterogeneity in treatment effect among pairs that can naturally arise in real data. In addition, it has been reported in literature that the false discovery rate (FDR) control of some popular methods has been problematic. In this paper, we present a full hierarchical Bayesian model for the paired RNA-seq count data that accounts for variation of treatment effects among pairs and controls the FDR through the posterior expected FDR. Results Our simulation studies show that most competing methods can have highly inflated FDR for small to moderate sample sizes while PairedFB is able to control FDR close to the nominal levels. Furthermore, PairedFB has overall better performance in ranking true differentially expressed genes (DEGs) on the top than others, especially when the sample size gets bigger or when the heterogeneity level of treatment effects is high. In addition, PairedFB can be applied to identify the biologically significant DEGs with controlled FDR. The real data analysis also indicates PairedFB tends to find more biologically relevant genes even when the sample size is small. PairedFB is also shown to be robust with respect to the model misspecification in terms of its relative performance compared to others. Availability and implementation Software to implement this method (PairedFB) can be downloaded at: https://sites.google.com/a/udel.edu/qiujing/publication. Supplementary information Supplementary data are available at Bioinformatics online.


2016 ◽  
Vol 73 (7) ◽  
pp. 1725-1738 ◽  
Author(s):  
Yan Jiao ◽  
Rob O'Reilly ◽  
Eric Smith ◽  
Don Orth ◽  

Abstract In many marine fisheries assessments, population abundance indices from surveys collected by different states and agencies do not always agree with each other. This phenomenon is often due to the spatial synchrony/asynchrony. Those indices that are asynchronous may result in discrepancies in the assessment of temporal trends. In addition, commonly employed stock assessment models, such as the statistical catch-at-age (SCA) models, do not account for spatial synchrony/asynchrony associated with spatial autocorrelation, dispersal, and environmental noise. This limits the value of statistical inference on key parameters associated with population dynamics and management reference points. To address this problem, a set of geospatial analyses of relative abundance indices is proposed to model the indices from different surveys using spatial hierarchical Bayesian models. This approach allows better integration of different surveys with spatial synchrony and asynchrony. We used Atlantic weakfish (Cynoscion regalis) as an example for which there are state-wide surveys and expansive coastal surveys. We further compared the performance of the proposed spatially structured hierarchical Bayesian SCA models with a commonly used Bayesian SCA model that assumes relative abundance indices are spatially independent. Three spatial models developed to mimic different potential spatial patterns were compared. The random effect spatially structured hierarchical Bayesian model was found to be better than the commonly used SCA model and the other two spatial models. A simulation study was conducted to evaluate the uncertainty resulting from model selection and the robustness of the recommended model. The spatially structured hierarchical Bayesian model was shown to be able to integrate different survey indices with/without spatial synchrony. It is suggested as a useful tool when there are surveys with different spatial characteristics that need to be combined in a fisheries stock assessment.


2017 ◽  
Vol 33 (19) ◽  
pp. 3018-3027
Author(s):  
Hao Peng ◽  
Yifan Yang ◽  
Shandian Zhe ◽  
Jian Wang ◽  
Michael Gribskov ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document