scholarly journals A Semi-Automated Workflow for FAIR Maturity Indicators in the Life Sciences

Nanomaterials ◽  
2020 ◽  
Vol 10 (10) ◽  
pp. 2068 ◽  
Author(s):  
Ammar Ammar ◽  
Serena Bonaretti ◽  
Laurent Winckers ◽  
Joris Quik ◽  
Martine Bakker ◽  
...  

Data sharing and reuse are crucial to enhance scientific progress and maximize return of investments in science. Although attitudes are increasingly favorable, data reuse remains difficult due to lack of infrastructures, standards, and policies. The FAIR (findable, accessible, interoperable, reusable) principles aim to provide recommendations to increase data reuse. Because of the broad interpretation of the FAIR principles, maturity indicators are necessary to determine the FAIRness of a dataset. In this work, we propose a reproducible computational workflow to assess data FAIRness in the life sciences. Our implementation follows principles and guidelines recommended by the maturity indicator authoring group and integrates concepts from the literature. In addition, we propose a FAIR balloon plot to summarize and compare dataset FAIRness. We evaluated the feasibility of our method on three real use cases where researchers looked for six datasets to answer their scientific questions. We retrieved information from repositories (ArrayExpress, Gene Expression Omnibus, eNanoMapper, caNanoLab, NanoCommons and ChEMBL), a registry of repositories, and a searchable resource (Google Dataset Search) via application program interfaces (API) wherever possible. With our analysis, we found that the six datasets met the majority of the criteria defined by the maturity indicators, and we showed areas where improvements can easily be reached. We suggest that use of standard schema for metadata and the presence of specific attributes in registries of repositories could increase FAIRness of datasets.

2019 ◽  
Author(s):  
Serena Bonaretti ◽  
Egon Willighagen

AbstractData sharing and reuse are crucial to enhance scientific progress and maximize return of investments in science. Although attitudes are increasingly favorable, data reuse remains difficult for lack of infrastructures, standards, and policies. The FAIR (findable, accessible, interoperable, reusable) principles aim to provide recommendations to increase data reuse. Because of the broad interpretation of the FAIR principles, maturity indicators are necessary to determine FAIRness of a dataset. In this work, we propose a reproducible computational workflow to assess data FAIRness in the life sciences. Our implementation follows principles and guidelines recommended by the maturity indicator authoring group and integrates concepts from the literature. In addition, we propose a FAIR balloon plot to summarize and compare dataset FAIRness. We evaluated our method on two real use cases where researchers looked for datasets to answer their scientific questions. We retrieved information from repositories (ArrayExpress and Gene Expression Omnibus), a registry of repositories (re3data.org), and a searchable resource (Google Dataset Search) via application program interface (API) wherever possible. With our analysis, we found that the two datasets met the majority of the criteria defined by the maturity indicators, and we showed areas where improvements can easily be reached. We suggest that use of standard schema for metadata and presence of specific attributes in registries of repositories could increase FAIRness of datasets.


2019 ◽  
Vol 39 (06) ◽  
pp. 290-299
Author(s):  
Naushad Ali PM ◽  
Sidra Saeed

This study investigates perception of research scholars towards research data management and sharing. A survey was conducted among research scholars from Faculty of Life Sciences and Social Sciences, Aligarh Muslim University (AMU). In total, 352 participants filled out the questionnaire. The study shows that research scholars ofFaculty of Social Sciences are more willing to share their research data as compared to Research Scholars of Life Sciences. Contributing to scientific progress and increasing research citations and visibility were the key factors that motivated researchers to share data. However, confidentiality and data misuse were the main concerns among those who were unwilling to share. Finally, some recommendations to improve the of data management and sharing practices are presented.


2020 ◽  
Vol 26 (29) ◽  
pp. 3619-3630
Author(s):  
Saumya Choudhary ◽  
Dibyabhaba Pradhan ◽  
Noor S. Khan ◽  
Harpreet Singh ◽  
George Thomas ◽  
...  

Background: Psoriasis is a chronic immune mediated skin disorder with global prevalence of 0.2- 11.4%. Despite rare mortality, the severity of the disease could be understood by the accompanying comorbidities, that has even led to psychological problems among several patients. The cause and the disease mechanism still remain elusive. Objective: To identify potential therapeutic targets and affecting pathways for better insight of the disease pathogenesis. Method: The gene expression profile GSE13355 and GSE14905 were retrieved from NCBI, Gene Expression Omnibus database. The GEO profiles were integrated and the DEGs of lesional and non-lesional psoriasis skin were identified using the affy package in R software. The Kyoto Encyclopaedia of Genes and Genomes pathways of the DEGs were analyzed using clusterProfiler. Cytoscape, V3.7.1 was utilized to construct protein interaction network and analyze the interactome map of candidate proteins encoded in DEGs. Functionally relevant clusters were detected through Cytohubba and MCODE. Results: A total of 1013 genes were differentially expressed in lesional skin of which 557 were upregulated and 456 were downregulated. Seven dysregulated genes were extracted in non-lesional skin. The disease gene network of these DEGs revealed 75 newly identified differentially expressed gene that might have a role in development and progression of the disease. GO analysis revealed keratinocyte differentiation and positive regulation of cytokine production to be the most enriched biological process and molecular function. Cytokines -cytokine receptor was the most enriched pathways. Among 1013 identified DEGs in lesional group, 36 DEGs were found to have altered genetic signature including IL1B and STAT3 which are also reported as hub genes. CCNB1, CCNA2, CDK1, IL1B, CXCL8, MKI 67, ESR1, UBE2C, STAT1 and STAT3 were top 10 hub gene. Conclusion: The hub genes, genomic altered DEGs and other newly identified differentially dysregulated genes would improve our understanding of psoriasis pathogenesis, moreover, the hub genes could be explored as potential therapeutic targets for psoriasis.


2021 ◽  
Vol 21 (1) ◽  
Author(s):  
Ying Xie ◽  
Xiaofeng Hang ◽  
Wensheng Xu ◽  
Jing Gu ◽  
Yuanjing Zhang ◽  
...  

Abstract Background Most of the biological functions of circular RNAs (circRNAs) and the potential underlying mechanisms in hepatocellular carcinoma (HCC) have not yet been discovered. Methods In this study, using circRNA expression data from HCC tumor tissues and adjacent tissues from the Gene Expression Omnibus database, we identified out differentially expressed circRNAs and verified them by qRT-PCT. Functional experiments were performed to evaluate the effects of circFAM13B in HCC in vitro and in vivo. Results We found that circFAM13B was the most significantly differentially expressed circRNA in HCC tissue. Subsequently, in vitro and in vivo studies also demonstrated that circFAM13B promoted the proliferation of HCC. Further studies revealed that circFAM13B, a sponge of miR-212, is involved in the regulation of E2F5 gene expression by competitively binding to miR-212, inhibits the activation of the P53 signalling pathway, and promotes the proliferation of HCC cells. Conclusions Our findings revealed the mechanism underlying the regulatory role played by circFAM13B, miR-212 and E2F5 in HCC. This study provides a new theoretical basis and novel target for the clinical prevention and treatment of HCC.


Cancers ◽  
2021 ◽  
Vol 13 (1) ◽  
pp. 158
Author(s):  
Valentina Condelli ◽  
Giovanni Calice ◽  
Alessandra Cassano ◽  
Michele Basso ◽  
Maria Grazia Rodriquenz ◽  
...  

Epigenetics is involved in tumor progression and drug resistance in human colorectal carcinoma (CRC). This study addressed the hypothesis that the DNA methylation profiling may predict the clinical behavior of metastatic CRCs (mCRCs). The global methylation profile of two human mCRC subgroups with significantly different outcome was analyzed and compared with gene expression and methylation data from The Cancer Genome Atlas COlon ADenocarcinoma (TCGA COAD) and the NCBI GENE expression Omnibus repository (GEO) GSE48684 mCRCs datasets to identify a prognostic signature of functionally methylated genes. A novel epigenetic signature of eight hypermethylated genes was characterized that was able to identify mCRCs with poor prognosis, which had a CpG-island methylator phenotype (CIMP)-high and microsatellite instability (MSI)-like phenotype. Interestingly, methylation events were enriched in genes located on the q-arm of chromosomes 13 and 20, two chromosomal regions with gain/loss alterations associated with adenoma-to-carcinoma progression. Finally, the expression of the eight-genes signature and MSI-enriching genes was confirmed in oxaliplatin- and irinotecan-resistant CRC cell lines. These data reveal that the hypermethylation of specific genes may provide prognostic information that is able to identify a subgroup of mCRCs with poor prognosis.


2020 ◽  
Vol 7 (Supplement_1) ◽  
pp. S516-S517
Author(s):  
Kulachanya Suwanwongse ◽  
Nehad Shabarek

Abstract Background Human immunodeficiency virus (HIV) disease progression are different among genders, in which women usually progress to acquired immunodeficiency syndrome (AIDS) faster than men. The mechanisms resulting in the gender biases of HIV progression are unclear. We conducted a bioinformatics analysis of differentially expressed genes (DEGs) in women and men with HIV disease to understand the sex-based differences in HIV pathogenesis. Methods We obtained microarray data from the Gene Expression Omnibus (GEO) database using our pre-defined search strategy and analyzed data using the GEO2R platform. The t-test was done to compare DEGs between females and males with HIV diseases. The Database for Annotation, Visualization, and Integrated Discovery (DAVID) was implemented to systematically extract biological features and processes of retrieving DEGs via gene ontology (GO) analysis. A Systemic search was performed to evaluate each DEG function and its possible association with HIV. Results One gene expression profiling data were retrieved: GSE 140713, composed of 40 males and 10 females with HIV1 infected samples. A GEO2R analysis yielded 19 DEGs (Table 1). The GO analysis result was demonstrated in Tables 2 and 3. Following a systemic search, we found two DEGs, which have previous studies reported an association with HIV: DDX3X (20 studies) and PDS5 (1 study). We proposed DDX3X (t 5.3, p 0.0037) is responsible for gender inequalities of HIV progression because of: 1. DDX3X is needed in the HIV1 life cycle. 2. Several studies confirmed a positive correlation between DDX3X expression and HIV1 replication. 3. Our study found an up-regulated DDX3X expression in women corresponded to the fact that women progress to AIDS faster than men. 4. Our GO analysis showed female up-regulated genes were enriched in positive regulation of the gene expression pathway, which can be explained by DDX3X and its underlying mechanism. Table 1: DEGs in women and men with HIV1 disease Table 2: GO functional enrichment pathway analyses of overall retrieving DEGs Table 3: GO functional enrichment pathway analyses of down- and up-regulated clusters of DEGs Conclusion Aberrant DDX3X expression may contribute to sex-based differences in HIV disease. Drugs modifying DDX3X gene expression will be beneficial in the treatment of HIV especially resolving the HIV drug resistance problem because current anti-HIV drugs target viral components posed the risk of viral mutation. Disclosures All Authors: No reported disclosures


BMC Cancer ◽  
2021 ◽  
Vol 21 (1) ◽  
Author(s):  
Xiaodong Yang ◽  
Yuexin Zheng ◽  
Zhihai Han ◽  
Xiliang Zhang

Abstract Background As a marker of differentiation, Killer cell lectin like receptor G1 (KLRG1) plays an inhibitory role in human NK cells and T cells. However, its clinical role remains inexplicit. This work intended to investigate the predictive ability of KLRG1 on the efficacy of immune-checkpoint inhibitor in the treatment of lung adenocarcinoma (LUAD), as well as contribute to the possible molecular mechanisms of KLRG1 on LUAD development. Methods Using data from the Gene Expression Omnibus, the Cancer Genome Atlas and the Genotype-Tissue Expression, we compared the expression of KLRG1 and its related genes Bruton tyrosine kinase (BTK), C-C motif chemokine receptor 2 (CCR2), Scm polycomb group protein like 4 (SCML4) in LUAD and normal lung tissues. We also established stable LUAD cell lines with KLRG1 gene knockdown and investigated the effect of KLRG1 knockdown on tumor cell proliferation. We further studied the prognostic value of the four factors in terms of overall survival (OS) in LUAD. Using data from the Gene Expression Omnibus, we further investigated the expression of KLRG1 in the patients with different responses after immunotherapy. Results The expression of KLRG1, BTK, CCR2 and SCML4 was significantly downregulated in LUAD tissues compared to normal controls. Knockdown of KLRG1 promoted the proliferation of A549 and H1299 tumor cells. And low expression of these four factors was associated with unfavorable overall survival in patients with LUAD. Furthermore, low expression of KLRG1 also correlated with poor responses to immunotherapy in LUAD patients. Conclusion Based on these findings, we inferred that KLRG1 had significant correlation with immunotherapy response. Meanwhile, KLRG1, BTK, CCR2 and SCML4 might serve as valuable prognostic biomarkers in LUAD.


2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Michael Kenn ◽  
Dan Cacsire Castillo-Tong ◽  
Christian F. Singer ◽  
Rudolf Karch ◽  
Michael Cibena ◽  
...  

AbstractCorrectly estimating the hormone receptor status for estrogen (ER) and progesterone (PGR) is crucial for precision therapy of breast cancer. It is known that conventional diagnostics (immunohistochemistry, IHC) yields a significant rate of wrongly diagnosed receptor status. Here we demonstrate how Dempster Shafer decision Theory (DST) enhances diagnostic precision by adding information from gene expression. We downloaded data of 3753 breast cancer patients from Gene Expression Omnibus. Information from IHC and gene expression was fused according to DST, and the clinical criterion for receptor positivity was re-modelled along DST. Receptor status predicted according to DST was compared with conventional assessment via IHC and gene-expression, and deviations were flagged as questionable. The survival of questionable cases turned out significantly worse (Kaplan Meier p < 1%) than for patients with receptor status confirmed by DST, indicating a substantial enhancement of diagnostic precision via DST. This study is not only relevant for precision medicine but also paves the way for introducing decision theory into OMICS data science.


NAR Cancer ◽  
2021 ◽  
Vol 3 (1) ◽  
Author(s):  
Zachary V Thomas ◽  
Zhenjia Wang ◽  
Chongzhi Zang

Abstract Dysregulation of gene expression plays an important role in cancer development. Identifying transcriptional regulators, including transcription factors and chromatin regulators, that drive the oncogenic gene expression program is a critical task in cancer research. Genomic profiles of active transcriptional regulators from primary cancer samples are limited in the public domain. Here we present BART Cancer (bartcancer.org), an interactive web resource database to display the putative transcriptional regulators that are responsible for differentially regulated genes in 15 different cancer types in The Cancer Genome Atlas (TCGA). BART Cancer integrates over 10000 gene expression profiling RNA-seq datasets from TCGA with over 7000 ChIP-seq datasets from the Cistrome Data Browser database and the Gene Expression Omnibus (GEO). BART Cancer uses Binding Analysis for Regulation of Transcription (BART) for predicting the transcriptional regulators from the differentially expressed genes in cancer samples compared to normal samples. BART Cancer also displays the activities of over 900 transcriptional regulators across cancer types, by integrating computational prediction results from BART and the Cistrome Cancer database. Focusing on transcriptional regulator activities in human cancers, BART Cancer can provide unique insights into epigenetics and transcriptional regulation in cancer, and is a useful data resource for genomics and cancer research communities.


Sign in / Sign up

Export Citation Format

Share Document