Supervised Machine Learning Approach For The Prediction of Breast Cancer

Mol2vec: Unsupervised Machine Learning Approach with Chemical Intuition

10.26434/chemrxiv.5513581.v1 ◽

2017 ◽

Author(s):

Sabrina Jaeger ◽

Simone Fulle ◽

Samo Turk

Keyword(s):

Machine Learning ◽

Language Processing ◽

Supervised Machine Learning ◽

Learning Approach ◽

Learning Approaches ◽

Unsupervised Machine Learning ◽

Feature Representations ◽

Machine Learning Approach ◽

The Individual ◽

Vector Representations

Inspired by natural language processing techniques we here introduce Mol2vec which is an unsupervised machine learning approach to learn vector representations of molecular substructures. Similarly, to the Word2vec models where vectors of closely related words are in close proximity in the vector space, Mol2vec learns vector representations of molecular substructures that are pointing in similar directions for chemically related substructures. Compounds can finally be encoded as vectors by summing up vectors of the individual substructures and, for instance, feed into supervised machine learning approaches to predict compound properties. The underlying substructure vector embeddings are obtained by training an unsupervised machine learning approach on a so-called corpus of compounds that consists of all available chemical matter. The resulting Mol2vec model is pre-trained once, yields dense vector representations and overcomes drawbacks of common compound feature representations such as sparseness and bit collisions. The prediction capabilities are demonstrated on several compound property and bioactivity data sets and compared with results obtained for Morgan fingerprints as reference compound representation. Mol2vec can be easily combined with ProtVec, which employs the same Word2vec concept on protein sequences, resulting in a proteochemometric approach that is alignment independent and can be thus also easily used for proteins with low sequence similarities.

Download Full-text

Machine Learning Approach towards Mammographic Breast Density Measurement for Breast Cancer Risk Prediction: An Overview

SSRN Electronic Journal ◽

10.2139/ssrn.3599187 ◽

2020 ◽

Author(s):

Shivaji Pawar ◽

Suhas Sapate ◽

Kamal Sharma

Keyword(s):

Breast Cancer ◽

Machine Learning ◽

Breast Cancer Risk ◽

Cancer Risk ◽

Risk Prediction ◽

Density Measurement ◽

Mammographic Breast Density ◽

Learning Approach ◽

Machine Learning Approach ◽

Breast Density Measurement

Download Full-text

Coreference resolution of Korean anaphoric zero objects: Towards a supervised machine learning approach

International Journal of Computer Science and Information Technology for Education ◽

10.21742/ijcsite.2016.1.01 ◽

2016 ◽

Vol 1 (1) ◽

pp. 1-6

Author(s):

Euhee Kim ◽

◽

Myung-Kwan Park ◽

Keyword(s):

Machine Learning ◽

Supervised Machine Learning ◽

Learning Approach ◽

Coreference Resolution ◽

Machine Learning Approach

Download Full-text

Applying a Machine Learning Approach to Predict Acute Toxicities During Radiation for Breast Cancer Patients

International Journal of Radiation Oncology*Biology*Physics ◽

10.1016/j.ijrobp.2018.06.167 ◽

2018 ◽

Vol 102 (3) ◽

pp. S59

Author(s):

J. Reddy ◽

W.D. Lindsay ◽

C.G. Berlind ◽

C.A. Ahern ◽

B.D. Smith

Keyword(s):

Breast Cancer ◽

Machine Learning ◽

Cancer Patients ◽

Learning Approach ◽

Breast Cancer Patients ◽

Machine Learning Approach

Download Full-text

A machine learning approach to predict healthcare cost of breast cancer patients

Scientific Reports ◽

10.1038/s41598-021-91580-x ◽

2021 ◽

Vol 11 (1) ◽

Author(s):

Pratyusha Rakshit ◽

Onintze Zaballa ◽

Aritz Pérez ◽

Elisa Gómez-Inhiesto ◽

Maria T. Acaiturri-Ayesta ◽

...

Keyword(s):

Breast Cancer ◽

Machine Learning ◽

Cancer Patients ◽

Healthcare Cost ◽

Percentage Error ◽

Learning Approach ◽

Early Prediction ◽

Breast Cancer Patients ◽

Machine Learning Approach ◽

Clinical Records

AbstractThis paper presents a novel machine learning approach to perform an early prediction of the healthcare cost of breast cancer patients. The learning phase of our prediction method considers the following two steps: (1) in the first step, the patients are clustered taking into account the sequences of actions undergoing similar clinical activities and ensuring similar healthcare costs, and (2) a Markov chain is then learned for each group to describe the action-sequences of the patients in the cluster. A two step procedure is undertaken in the prediction phase: (1) first, the healthcare cost of a new patient’s treatment is estimated based on the average healthcare cost of its k-nearest neighbors in each group, and (2) finally, an aggregate measure of the healthcare cost estimated by each group is used as the final predicted cost. Experiments undertaken reveal a mean absolute percentage error as small as 6%, even when half of the clinical records of a patient is available, substantiating the early prediction capability of the proposed method. Comparative analysis substantiates the superiority of the proposed algorithm over the state-of-the-art techniques.

Download Full-text