Identifying Significant Features in Cancer Methylation Data Using Gene Pathway Segmentation

Cancer Informatics ◽

10.4137/cin.s39859 ◽

2016 ◽

Vol 15 ◽

pp. CIN.S39859

Author(s):

Zena M. Hira ◽

Duncan F. Gillies

Keyword(s):

Machine Learning ◽

Hypothesis Testing ◽

Cancer Progression ◽

Biological Knowledge ◽

Cancer Data ◽

Gene Pathway ◽

Machine Learning Methods ◽

Gene Probes ◽

Microarray Datasets ◽

Methylation Profiling

In order to provide the most effective therapy for cancer, it is important to be able to diagnose whether a patient's cancer will respond to a proposed treatment. Methylation profiling could contain information from which such predictions could be made. Currently, hypothesis testing is used to determine whether possible biomarkers for cancer progression produce statistically significant results. However, this approach requires the identification of individual genes, or sets of genes, as candidate hypotheses, and with the increasing size of modern microarrays, this task is becoming progressively harder. Exhaustive testing of small sets of genes is computationally infeasible, and so hypothesis generation depends either on the use of established biological knowledge or on heuristic methods. As an alternative machine learning, methods can be used to identify groups of genes that are acting together within sets of cancer data and associate their behaviors with cancer progression. These methods have the advantage of being multivariate and unbiased but unfortunately also rapidly become computationally infeasible as the number of gene probes and datasets increases. To address this problem, we have investigated a way of utilizing prior knowledge to segment microarray datasets in such a way that machine learning can be used to identify candidate sets of genes for hypothesis testing. A methylation dataset is divided into subsets, where each subset contains only the probes that relate to a known gene pathway. Each of these pathway subsets is used independently for classification. The classification method is AdaBoost with decision trees as weak classifiers. Since each pathway subset contains a relatively small number of gene probes, it is possible to train and test its classification accuracy quickly and determine whether it has valuable diagnostic information. Finally, genes from successful pathway subsets can be combined to create a classifier of high accuracy.

Analysis of Cancer Data Set with Statistical and Unsupervised Machine Learning Methods

Smart Intelligent Computing and Applications - Smart Innovation, Systems and Technologies ◽

10.1007/978-981-13-1921-1_27 ◽

2018 ◽

pp. 267-276

Author(s):

T. Panduranga Vital ◽

K. Dileep Kumar ◽

H. V. Bhagya Sri ◽

M. Murali Krishna

Keyword(s):

Machine Learning ◽

Learning Methods ◽

Data Set ◽

Unsupervised Machine Learning ◽

Cancer Data ◽

Machine Learning Methods

Microarray breast cancer data classification using machine learning methods

2018 Electric Electronics, Computer Science, Biomedical Engineerings' Meeting (EBBT) ◽

10.1109/ebbt.2018.8391468 ◽

2018 ◽

Cited By ~ 7

Author(s):

Siyabend Turgut ◽

Mustafa Dagtekin ◽

Tolga Ensari

Keyword(s):

Breast Cancer ◽

Machine Learning ◽

Data Classification ◽

Breast Cancer Data ◽

Learning Methods ◽

Cancer Data ◽

Machine Learning Methods

Advanced Interpretable Machine Learning Methods for Clinical NGS Big Data of Complex Hereditary Diseases

10.3389/978-2-88966-274-6 ◽

2020 ◽

Keyword(s):

Machine Learning ◽

Big Data ◽

Hereditary Diseases ◽

Learning Methods ◽

Machine Learning Methods ◽

Interpretable Machine Learning

Application of machine learning methods for automatic interpretation of open hole logging data

Neftyanoe khozyaystvo - Oil Industry ◽

10.24887/0028-2448-2020-11-44-47 ◽

2020 ◽

pp. 44-47

Author(s):

M.A. Basyrov ◽

◽

A.V. Akinshin ◽

I.R. Makhmutov ◽

Yu.D. Kantemirov ◽

...

Keyword(s):

Machine Learning ◽

Learning Methods ◽

Machine Learning Methods ◽

Automatic Interpretation ◽

Open Hole

TESTING PREDICTION ACCURACY OF HDU ADMISSION FOLLOWING HIGH GRADE SEROUS ADVANCED OVARIAN CANCER CYTOREDUCTIVE SURGERY USING MACHINE LEARNING METHODS.

10.26226/morressier.5fa3ee5d55b1fd4cc4dd93d7 ◽

2020 ◽

Author(s):

Alexandros Laios ◽

Angelika Kaufmann ◽

Mohamed Otify ◽

Diederick De Jong ◽

Tim Broadhead ◽

...

Keyword(s):

Machine Learning ◽

Ovarian Cancer ◽

Cytoreductive Surgery ◽

Prediction Accuracy ◽

Advanced Ovarian Cancer ◽

High Grade ◽

Learning Methods ◽

Machine Learning Methods

Emperical Evaluation of Machine Learning algorithms for Breast Cancer Data Classification

International Journal of Computer Sciences and Engineering ◽

10.26438/ijcse/v6i10.346351 ◽

2018 ◽

Vol 6 (10) ◽

pp. 346-351

Author(s):

S. Kumaravel ◽

S. Ophilia Domanica Vithya

Keyword(s):

Breast Cancer ◽

Machine Learning ◽

Learning Algorithms ◽

Data Classification ◽

Machine Learning Algorithms ◽

Breast Cancer Data ◽

Cancer Data

Evolution of Machine Learning Methods for Memography Classification

International Journal of Computer Sciences and Engineering ◽

10.26438/ijcse/v6i3.499502 ◽

2018 ◽

Vol 6 (3) ◽

pp. 499-502

Author(s):

R. Swathi ◽

◽

R. Seshadri ◽

Keyword(s):

Machine Learning ◽

Learning Methods ◽

Machine Learning Methods

FORECASTING THREATS AND CHOOSING THE OPTIMAL STRATEGY FOR ENSURING ECONOMIC SECURITY USING MACHINE LEARNING METHODS

Scientific Review Series 1 Economics and Law ◽

10.26653/2076-4650-2019-6-09 ◽

2019 ◽

pp. 115-123

Author(s):

Evgeniy A. Voronin ◽

◽

Igor V. Yushin ◽

Keyword(s):

Machine Learning ◽

Optimal Strategy ◽

Economic Security ◽

Learning Methods ◽

Machine Learning Methods

Utilizing Blockchain Technology in Social Media Bot Identification

10.36227/techrxiv.12049374 ◽

2020 ◽

Author(s):

Shreya Reddy ◽

Lisa Ewen ◽

Pankti Patel ◽

Prerak Patel ◽

Ankit Kundal ◽

...

Keyword(s):

Machine Learning ◽

Social Media ◽

Gold Standard ◽

The Internet ◽

Learning Models ◽

Current Time ◽

Machine Learning Methods ◽

Blockchain Technology ◽

Modern Age ◽

Machine Learning Models

<p>As bots become more prevalent and smarter in the modern age of the internet, it becomes ever more important that they be identified and removed. Recent research has dictated that machine learning methods are accurate and the gold standard of bot identification on social media. Unfortunately, machine learning models do not come without their negative aspects such as lengthy training times, difficult feature selection, and overwhelming pre-processing tasks. To overcome these difficulties, we are proposing a blockchain framework for bot identification. At the current time, it is unknown how this method will perform, but it serves to prove the existence of an overwhelming gap of research under this area.<i></i></p>

A Generalized Approach to Soil Strength Prediction With Machine Learning Methods

10.21236/ada464726 ◽

2006 ◽

Author(s):

Peter M. Semen

Keyword(s):

Machine Learning ◽

Soil Strength ◽

Strength Prediction ◽

Learning Methods ◽

Machine Learning Methods