scholarly journals Multiomics data analysis using tensor decomposition based unsupervised feature extraction --Comparison with DIABLO--

2019 ◽  
Author(s):  
Y-h. Taguchi

AbstractMultiomics data analysis is the central issue of genomics science. In spite of that, there are not well defined methods that can integrate multomics data sets, which are formatted as matrices with different sizes. In this paper, I propose the usage of tensor decomposition based unsupervised feature extraction as a data mining tool for multiomics data set. It can successfully integrate miRNA expression, mRNA expression and proteome, which were used as a demonstration example of DIABLO that is the recently proposed advanced method for the integrated analysis of multiomics data set.

2019 ◽  
Author(s):  
Ka-Lok Ng ◽  
Y-h Taguchi

AbstractCancer is a highly complex disease caused by multiple genetic factors. MicroRNA (miRNA) and mRNA expression profiles are useful for identifying prognostic biomarkers for cancer. Kidney renal clear cell carcinoma (KIRC), which accounts for more than 70% of all renal malignant tumour cases, was selected for our analysis.Traditional methods of identifying cancer prognostic markers may not be accurate. Tensor decomposition (TD) is a useful method uncovering the underlying low-dimensional structures in the tensor. The TD-based unsupervised feature extraction method was applied to analyse mRNA and miRNA expression profiles. Biological annotations of the prognostic miRNAs and mRNAs were examined utilizing the pathway and oncogenic signature databases DIANA-miRPath and MSigDB.TD identified the miRNA signatures and the associated genes. These genes were found to be involved in cancer-related pathways, and 23 genes were significantly correlated with the survival of KIRC patients. We demonstrated that the results are robust and not highly dependent upon the databases we selected. Compared with traditional supervised methods tested, TD achieves much better performance in selecting prognostic miRNAs and mRNAs.These results suggest that integrated analysis using the TD-based unsupervised feature extraction technique is an effective strategy for identifying prognostic signatures in cancer studies.


2017 ◽  
Author(s):  
Y-h. Taguchi

AbstractIdentifying drug target genes in gene expression profiles is not straightforward. Because a drug targets not mRNAs but proteins, mRNA expression of drug target genes is not always altered. In addition, the interaction between a drug and protein can be context dependent; this means that simple drug incubation experiments on cell lines do not always reflect the real situation during active disease. In this paper, I apply tensor decomposition-based unsupervised feature extraction to the integrated analysis of gene expression between heart failure and the DrugMatrix dataset where comprehensive data on gene expression during various drug treatments of rats were reported. I found that this strategy, in a fully unsupervised manner, enables us to identify a combined set of genes and compounds, for which various associations with heart failure were reported.


2020 ◽  
Author(s):  
Y-h. Taguchi ◽  
Turki Turki

In this work, we extended the recently developed tensor decomposition (TD) based unsupervised feature extraction (FE) to a kernel-based method through a mathematical formulation. Subsequently, the kernel TD (KTD) based unsupervised FE was applied to two synthetic examples as well as real data sets, and the findings were compared with those obtained previously using the TD-based unsupervised FE approaches. The KTD-based unsupervised FE outperformed or performed comparably with the TD-based unsupervised FE in large p small n situations, which are situations involving a limited number of samples with many variables (observations). Nevertheless, the KTD-based unsupervised FE outperformed the TD-based unsupervised FE in non large p small n situations. In general, although the use of the kernel trick can help the TD-based unsupervised FE gain more variations, a wider range of problems may also be encountered. Considering the outperformance or comparable performance of the KTD-based unsupervised FE compared to the TD-based unsupervised FE when applied to large p small n problems, it is expected that the KTD-based unsupervised FE can be applied in the genomic science domain, which involves many large p small n problems, and, in which, the TD-based unsupervised FE approach has been effectively applied.


Author(s):  
Y-h. Taguchi ◽  
Turki Turki

ABSTRACTGene expression profiles of tissues treated with drugs have recently been used to infer clinical outcomes. Although this method is often successful from the application point of view, gene expression altered by drugs is rarely analyzed in detail, because of the extremely large number of genes involved. Here, we applied tensor decomposition (TD)-based unsupervised feature extraction (FE) to the gene expression profiles of 24 mouse tissues treated with 15 drugs. TD-based unsupervised FE enabled identification of the common effects of 15 drugs including an interesting universal feature: these drugs affect genes in a gene-group-wide manner and were dependent on three tissue types (neuronal, muscular, and gastroenterological). For each tissue group, TD-based unsupervised FE enabled identification of a few tens to a few hundreds of genes affected by the drug treatment. These genes are distinctly expressed between drug treatments and controls as well as between tissues in individual tissue groups and other tissues. We also validated the assignment of genes to individual tissue groups using multiple enrichment analyses. We conclude that TD-based unsupervised FE is a promising method for integrated analysis of gene expression profiles from multiple tissues treated with multiple drugs in a completely unsupervised manner.


2018 ◽  
Author(s):  
Y-h. Taguchi ◽  
Ka-Lok Ng

AbstractIntegrated analysis of epigenetic profiles is important but difficult. Tensor decomposition–based unsupervised feature extraction was applied here to data on microRNA (miRNA) expression and promoter methylation of genes in ovarian cancer. It selected seven miRNAs and 241 genes by expression levels and promoter methylation degrees, respectively, such that they showed differences between eight normal ovarian tissue samples and 569 tumor samples. The expression levels of the seven miRNAs and the degrees of promoter methylation of the 241 genes also correlated significantly. Conventional Student’s t test–based feature selection failed to identify miRNAs and genes that have the above properties. On the other hand, biological evaluation of the seven identified miRNAs and 241 identified genes suggests that they are strongly related to cancer as expected.


Sign in / Sign up

Export Citation Format

Share Document