scholarly journals BEERE: a web server for biomedical entity expansion, ranking and explorations

2019 ◽  
Vol 47 (W1) ◽  
pp. W578-W586
Author(s):  
Zongliang Yue ◽  
Christopher D Willey ◽  
Anita B Hjelmeland ◽  
Jake Y Chen

Abstract BEERE (Biomedical Entity Expansion, Ranking and Explorations) is a new web-based data analysis tool to help biomedical researchers characterize any input list of genes/proteins, biomedical terms or their combinations, i.e. ‘biomedical entities’, in the context of existing literature. Specifically, BEERE first aims to help users examine the credibility of known entity-to-entity associative or semantic relationships supported by database or literature references from the user input of a gene/term list. Then, it will help users uncover the relative importance of each entity—a gene or a term—within the user input by computing the ranking scores of all entities. At last, it will help users hypothesize new gene functions or genotype–phenotype associations by an interactive visual interface of constructed global entity relationship network. The output from BEERE includes: a list of the original entities matched with known relationships in databases; any expanded entities that may be generated from the analysis; the ranks and ranking scores reported with statistical significance for each entity; and an interactive graphical display of the gene or term network within data provenance annotations that link to external data sources. The web server is free and open to all users with no login requirement and can be accessed at http://discovery.informatics.uab.edu/beere/.

Author(s):  
Mr. Debasis Dash ◽  
Mr. Shatyaprakasha Satapathy ◽  
Dr. Chittaranjan Panda

The field programmable gate array technology can design high performance system at low cost for wavelet analysis. Wavelet transform has gained the reputation of being a very effective signal analysis tool for much practical application. Implementation of transform needs the meeting of real-time processing for most application. The objectives of this paper are to compare the Haar and Daubeches technology and to calculate the bit error rate (BER) between the input audio signal and reconstructed output signal. It is seen that the BER using Daubechies wavelet technology is less than Haar wavelet. The design procedure is explained using the stat of art electronic design. Automation tools for system design on FPGA, simulation, synthesis and implementation on the FPGA technology has been carried out. The power hovmoller, cross wavelet spectra and coherence are described. A Practical step-up-step guide to wavelet analysis is given with examples taken from time series. The guide includes a comparison to the windowed Fourier transform. New statistical significance test for wavelet power spectra are developed by deriving theoretical wavelet spectra for white and red noise. Empirical formula is given for the effect of smoothing on significance levels and filtering. The notion of orthogonal no separable trivet wavelet packets, which is the generation of orthogonal university wavelet packets is introduced. A de-noising method based on wavelet packet shrinkage is developed. The principle of wavelet packet shrinkage for de-noising and the section of thresholds and threshold function are analyzed.


2020 ◽  
Vol 36 (12) ◽  
pp. 3902-3904
Author(s):  
Timothy O’Connor ◽  
Charles E Grant ◽  
Mikael Bodén ◽  
Timothy L Bailey

Abstract Motivation Identifying the genes regulated by a given transcription factor (TF) (its ‘target genes’) is a key step in developing a comprehensive understanding of gene regulation. Previously, we developed a method (CisMapper) for predicting the target genes of a TF based solely on the correlation between a histone modification at the TF’s binding site and the expression of the gene across a set of tissues or cell lines. That approach is limited to organisms for which extensive histone and expression data are available, and does not explicitly incorporate the genomic distance between the TF and the gene. Results We present the T-Gene algorithm, which overcomes these limitations. It can be used to predict which genes are most likely to be regulated by a TF, and which of the TF’s binding sites are most likely involved in regulating particular genes. T-Gene calculates a novel score that combines distance and histone/expression correlation, and we show that this score accurately predicts when a regulatory element bound by a TF is in contact with a gene’s promoter, achieving median precision above 60%. T-Gene is easy to use via its web server or as a command-line tool, and can also make accurate predictions (median precision above 40%) based on distance alone when extensive histone/expression data is not available for the organism. T-Gene provides an estimate of the statistical significance of each of its predictions. Availability and implementation The T-Gene web server, source code, histone/expression data and genome annotation files are provided at http://meme-suite.org. Supplementary information Supplementary data are available at Bioinformatics online.


2011 ◽  
Vol 26 (S2) ◽  
pp. 947-947
Author(s):  
S. Otero ◽  
R. Mehrotra

IntroductionThe UK NICE technology guidance “Structural Neuroimaging in First-Episode Psychosis” concludes that CT/MRI is not routinely recommended as an initial investigation for first-episode psychosis.ObjectivesTo evaluate the use of CT/MRI in a group of Early Intervention Service (EIS) patients with a first-episode psychosis aged 18–35 years at presentation.AimsTo develop practice guidelines for use of neuroimaging in first-episode psychosis.MethodsAll 107 patients registered with the EIS in Hounslow, London, UK, were eligible for inclusion in this review. Data was collected from the medical records and the Picture Archiving and Communications System. Data was analysed using a microsoft excel data analysis tool. Additionally, comparisons were made between the group of patients with normal scans and that with abnormal scans. Statistical significance was determined using the chi-squared method with a significance of P < 0.05.Results17 patients had documented neuroimaging results. 4 scans were abnormal. There was no significant difference between the group with normal and abnormal scans in terms of gender, abnormalities of physical/neurological health, blood tests and whether the patient had any additional medical conditions. Abnormal scan results did not influence treatment or outcome for any patient.ConclusionsThe abnormal scans were not correlated to clinical indices of history, examination and laboratory tests. Abnormal scans appear to have a low yield in terms of clinical effectiveness. The findings support selective use of neuroimaging in this cohort of patients. The indications for it usage would appear to rely on clinical judgement as well clinical findings.


2019 ◽  
Author(s):  
Timothy O’Connor ◽  
Charles E. Grant ◽  
Mikael Bodén ◽  
Timothy L. Bailey

AbstractMotivationIdentifying the genes regulated by a given transcription factor (its “target genes”) is a key step in developing a comprehensive understanding of gene regulation. Previously we developed a method for predicting the target genes of a transcription factor (TF) based solely on the correlation between a histone modification at the TF’s binding site and the expression of the gene across a set of tissues. That approach is limited to organisms for which extensive histone and expression data is available, and does not explicitly incorporate the genomic distance between the TF and the gene.ResultsWe present the T-Gene algorithm, which overcomes these limitations. T-Gene can be used to predict which genes are most likely to be regulated by a TF, and which of the TF’s binding sites are most likely involved in regulating particular genes. T-Gene calculates a novel score that combines distance and histone/expression correlation, and we show that this score accurately predicts when a regulatory element bound by a TF is in contact with a gene’s promoter, achieving median positive predictive value (PPV) above 50%. T-Gene is easy to use via its web server or as a command-line tool, and can also make accurate predictions (median PPV above 40%) based on distance alone when extensive histone/expression data is not available for the organism. T-Gene provides an estimate of the statistical significance of each of its predictions.AvailabilityThe T-Gene web server, source code, histone/expression data and genome annotation files are provided at http://[email protected]


F1000Research ◽  
2020 ◽  
Vol 9 ◽  
pp. 832
Author(s):  
Finn Kuusisto ◽  
Daniel Ng ◽  
John Steill ◽  
Ian Ross ◽  
Miron Livny ◽  
...  

Many important scientific discoveries require lengthy experimental processes of trial and error and could benefit from intelligent prioritization based on deep domain understanding. While exponential growth in the scientific literature makes it difficult to keep current in even a single domain, that same rapid growth in literature also presents an opportunity for automated extraction of knowledge via text mining. We have developed a web application implementation of the KinderMiner algorithm for proposing ranked associations between a list of target terms and a key phrase. Any key phrase and target term list can be used for biomedical inquiry. We built the web application around a text index derived from PubMed. It is the first publicly available implementation of the algorithm, is fast and easy to use, and includes an interactive analysis tool. The KinderMiner web application is a public resource offering scientists a cohesive summary of what is currently known about a particular topic within the literature, and helping them to prioritize experiments around that topic. It performs comparably or better to similar state-of-the-art text mining tools, is more flexible, and can be applied to any biomedical topic of interest. It is also continually improving with quarterly updates to the underlying text index and through response to suggestions from the community. The web application is available at https://www.kinderminer.org.


2018 ◽  
Vol 14 (04) ◽  
pp. 181-182
Author(s):  
Manosh Kumar Biswas ◽  
◽  
Sathishkumar Natarajan ◽  
Dhiman Biswas ◽  
Ujjal Kumar Nath ◽  
...  

Blood ◽  
2011 ◽  
Vol 118 (21) ◽  
pp. 1817-1817
Author(s):  
Christopher P Wardell ◽  
Brian A Walker ◽  
David Johnson ◽  
Iwanka Kozarewa ◽  
Kerry Fenwick ◽  
...  

Abstract Abstract 1817 The two most frequent etiological translocations in multiple myeloma (MM) are t(4;14), which deregulates FGFR3 and MMSET and has a poor outcome and t(11;14) which directly deregulates cyclin D1 and has an indolent course. The t(11;14) is present at 10–15% in both MGUS and MM but the t(4;14) is in only 3–4% of MGUS compared to 11% in MM. Consequently it is thought that patients with a t(4;14) have a less stable disease which progresses more quickly to myeloma than other subtypes. In order to address the hypothesis that cases with the t(4;14) are more prone to acquire mutations and so progress, we have compared the number and mutational spectrum of cases with these two variants. DNA was extracted from CD138-selected bone marrow cells from 10 t(4;14) and 12 t(11;14) cases of newly presenting MM. 50 ng of genomic DNA was used to capture the exome using the SureSelect Human All Exon 50Mb target enrichment set (Agilent). We have previously validated this approach and shown it to have parity with approaches using larger starting amounts of DNA. Libraries were prepared from tumor and non-tumor DNA from the same patient and sequenced using 76 bp paired end reads on a GAIIx (Illumina). Samples were sequenced to a median depth of 61x, with 99% >1x and 85% >20x exomic coverage. Following base calling and quality control metrics the raw fastq reads were aligned to the reference human genome. The Genome Analysis Tool Kit was used to call indels and single nucleotide variants (SNVs), with BreakDancer used to detect structural variants. These variant calls were recalibrated and soft filters applied to remove potential false-positives using dbSNP, HapMap and the thousand genomes project as truth sets. Variants that occurred in both the normal and tumor samples were filtered out and the tumor-specific variants were annotated using Annovar. As well as the identification of commonly affected genes, functional annotation enrichment analysis was used to identify commonly affected pathways. The group of 22 cases sequenced at the exome level showed a mutation spectrum that comprised 32,000 SNVs and 1,800 indels per patient, with 1,600 SNVs and 500 indels in the tumor sample only. Structural and copy number variants inferred from this data were also identified and validated previous results using other technologies. We identified 250 SNVs and indels, per patient, that were not in dbSNP and constitute tumor-acquired mutations. We were able to validate some of these mutations that we had previously analyzed using other platforms (98% concordance). The Ti/Tv ratio of mutations was not consistent with any specific exposure or mechanism. The distribution of indels was biased toward insertions rather than deletions, with both predominantly being multiples of three to produce in-frame mutants. In total sequencing data from 60 exomes is available and pathway analysis of the SNVs newly identified confirmed the deregulation of pathways previously identified as being mutated in myeloma, in addition we also identify novel deregulated genes and pathways. We note a consistent increase in the number of variants in the t(4;14). Each tumor had on average 60 non-synonymous SNVs per sample with a range of 29 to 101, some patients being clear outliers. There was a bias to an excess number of mutations within the t(4;14) group which did not reach statistical significance. Importantly, the overlap between the SNVs in individual patients was limited with few consistently mutated genes across the sample set as a whole. In contrast, pathway analysis of the genes mutated in these two different entities shows marked similarities, with more frequent involvement of genes mediating cell adhesion in the t(4;14)s. Although the t(4;14) group had a greater number of mutations, a larger number of genes were affected in the t(11;14) group with the number of mutated genes in two or more samples being 111 versus 237, respectively. This observation implies a more consistent group of genes are deregulated in the t(4;14) group, suggesting that they are under greater selective pressure than in the t(11;14) group. In this work we show a higher mutation frequency but with more limited numbers of genes affected in the t(4;14) group compared to the t(11;14) group. Overall, the data are consistent within the two etiologically distinct groups of MM having a similar spectrum of mutations driving disease progression, with a focus on pathway deregulation rather than any single gene. Disclosures: No relevant conflicts of interest to declare.


F1000Research ◽  
2021 ◽  
Vol 9 ◽  
pp. 832
Author(s):  
Finn Kuusisto ◽  
Daniel Ng ◽  
John Steill ◽  
Ian Ross ◽  
Miron Livny ◽  
...  

Many important scientific discoveries require lengthy experimental processes of trial and error and could benefit from intelligent prioritization based on deep domain understanding. While exponential growth in the scientific literature makes it difficult to keep current in even a single domain, that same rapid growth in literature also presents an opportunity for automated extraction of knowledge via text mining. We have developed a web application implementation of the KinderMiner algorithm for proposing ranked associations between a list of target terms and a key phrase. Any key phrase and target term list can be used for biomedical inquiry. We built the web application around a text index derived from PubMed. It is the first publicly available implementation of the algorithm, is fast and easy to use, and includes an interactive analysis tool. The KinderMiner web application is a public resource offering scientists a cohesive summary of what is currently known about a particular topic within the literature, and helping them to prioritize experiments around that topic. It performs comparably or better to similar state-of-the-art text mining tools, is more flexible, and can be applied to any biomedical topic of interest. It is also continually improving with quarterly updates to the underlying text index and through response to suggestions from the community. The web application is available at https://www.kinderminer.org.


2015 ◽  
Vol 308 (8) ◽  
pp. G652-G663 ◽  
Author(s):  
Sreerup Banerjee ◽  
Sudeepa Dixit ◽  
Mark Fox ◽  
Anupam Pal

Magnetic resonance imaging (MRI) has advantages for the assessment of gastrointestinal structures and functions; however, processing MRI data is time consuming and this has limited uptake to a few specialist centers. This study introduces a semiautomatic image processing system for rapid analysis of gastrointestinal MRI. For assessment of simpler regions of interest (ROI) such as the stomach, the system generates virtual images along arbitrary planes that intersect the ROI edges in the original images. This generates seed points that are joined automatically to form contours on each adjacent two-dimensional image and reconstructed in three dimensions (3D). An alternative thresholding approach is available for rapid assessment of complex structures like the small intestine. For assessment of dynamic gastrointestinal function, such as gastric accommodation and emptying, the initial 3D reconstruction is used as reference to process adjacent image stacks automatically. This generates four-dimensional (4D) reconstructions of dynamic volume change over time. Compared with manual processing, this semiautomatic system reduced the user input required to analyze a MRI gastric emptying study (estimated 100 vs. 10,000 mouse clicks). This analysis was not subject to variation in volume measurements seen between three human observers. In conclusion, the image processing platform presented processed large volumes of MRI data, such as that produced by gastric accommodation and emptying studies, with minimal user input. 3D and 4D reconstructions of the stomach and, potentially, other gastrointestinal organs are produced faster and more accurately than manual methods. This system will facilitate the application of MRI in gastrointestinal research and clinical practice.


Sign in / Sign up

Export Citation Format

Share Document