scholarly journals Machine learning enables detection of early-stage colorectal cancer by whole-genome sequencing of plasma cell-free DNA

2018 ◽  
Author(s):  
Nathan Wan ◽  
David Weinberg ◽  
Tzu-Yu Liu ◽  
Katherine Niehaus ◽  
Daniel Delubac ◽  
...  

AbstractBackgroundBlood-based methods using cell-free DNA (cfDNA) are under development as an alternative to existing screening tests. However, early-stage detection of cancer using tumor-derived cfDNA has proven challenging because of the small proportion of cfDNA derived from tumor tissue in early-stage disease. A machine learning approach to discover signatures in cfDNA, potentially reflective of both tumor and non-tumor contributions, may represent a promising direction for the early detection of cancer.MethodsWhole-genome sequencing was performed on cfDNA extracted from plasma samples (N=546 colorectal cancer and 271 non-cancer controls). Reads aligning to protein-coding gene bodies were extracted, and read counts were normalized. cfDNA tumor fraction was estimated using IchorCNA. Machine learning models were trained using k-fold cross-validation and confounder-based cross-validation to assess generalization performance.ResultsIn a colorectal cancer cohort heavily weighted towards early-stage cancer (80% stage I/II), we achieved a mean AUC of 0.92 (95% CI 0.91-0.93) with a mean sensitivity of 85% (95% CI 83-86%) at 85% specificity. Sensitivity generally increased with tumor stage and increasing tumor fraction. Stratification by age, sequencing batch, and institution demonstrated the impact of these confounders and provided a more accurate assessment of generalization performance.ConclusionsA machine learning approach using cfDNA achieved high sensitivity and specificity in a large, predominantly early-stage, colorectal cancer cohort. The possibility of systematic technical and institution-specific biases warrants similar confounder analyses in other studies. Prospective validation of this machine learning method and evaluation of a multi-analyte approach are underway.

BMC Cancer ◽  
2019 ◽  
Vol 19 (1) ◽  
Author(s):  
Nathan Wan ◽  
David Weinberg ◽  
Tzu-Yu Liu ◽  
Katherine Niehaus ◽  
Eric A. Ariazi ◽  
...  

2019 ◽  
Vol 156 (6) ◽  
pp. S-600-S-601 ◽  
Author(s):  
Nathan Wan ◽  
David Weinberg ◽  
Tzu-yu Liu ◽  
Katherine Niehaus ◽  
Daniel Delubac ◽  
...  

2021 ◽  
Vol 13 (1) ◽  
Author(s):  
Agata Stodolna ◽  
Miao He ◽  
Mahesh Vasipalli ◽  
Zoya Kingsbury ◽  
Jennifer Becq ◽  
...  

Abstract Background Clinical-grade whole-genome sequencing (cWGS) has the potential to become the standard of care within the clinic because of its breadth of coverage and lack of bias towards certain regions of the genome. Colorectal cancer presents a difficult treatment paradigm, with over 40% of patients presenting at diagnosis with metastatic disease. We hypothesised that cWGS coupled with 3′ transcriptome analysis would give new insights into colorectal cancer. Methods Patients underwent PCR-free whole-genome sequencing and alignment and variant calling using a standardised pipeline to output SNVs, indels, SVs and CNAs. Additional insights into the mutational signatures and tumour biology were gained by the use of 3′ RNA-seq. Results Fifty-four patients were studied in total. Driver analysis identified the Wnt pathway gene APC as the only consistently mutated driver in colorectal cancer. Alterations in the PI3K/mTOR pathways were seen as previously observed in CRC. Multiple private CNAs, SVs and gene fusions were unique to individual tumours. Approximately 30% of patients had a tumour mutational burden of > 10 mutations/Mb of DNA, suggesting suitability for immunotherapy. Conclusions Clinical whole-genome sequencing offers a potential avenue for the identification of private genomic variation that may confer sensitivity to targeted agents and offer patients new options for targeted therapies.


Author(s):  
Sandhya N. dhage, Dr. Vijay Kumar Garg

Qualitative and quantitative agricultural production leads to economic benefits which can be achieved by periodic monitoring of crop, detection and prevention of crop diseases and insects. Quality of crop production is reduced by pest infection and crop diseases. Existing measures involves manual detection of cotton diseases by farmers and experts which requires  regular monitoring and detection manifest at middle to later stage of infection which causes many disadvantages such as becoming  too late for diseases to be cured.  Lack of early detection of diseases causes the diseases to be spread in nearby crops in the field and also spraying of pesticides is done on entire field for minimizing the infection of disease. The main goal of proposed research topic is to find the solution to the agriculture problem which involves detecting disease in cotton plant at early stage and classify the disease based on symptoms. Early detection of disease at an early stage prevent it from spreading to another area and preventive measures can be taken by farmers by spraying pesticides to control its growth which helps to increase the cotton yield production. Automatic identification of the different diseases affecting cotton crop will give many benefits to the farmers so that time, money will be saved and also gives healthy life to the crop. The contribution of this paper is to present the machine learning approach used for cotton crop disease diagnosis and classification.


2017 ◽  
Vol 23 (20) ◽  
pp. 6305-6314 ◽  
Author(s):  
Nadine Van Roy ◽  
Malaïka Van Der Linden ◽  
Björn Menten ◽  
Annelies Dheedene ◽  
Charlotte Vandeputte ◽  
...  

2019 ◽  
Author(s):  
Han Liang ◽  
Fuqiang Li ◽  
Sitan Qiao ◽  
Xinlan Zhou ◽  
Guoyun Xie ◽  
...  

AbstractSomatic mosaicism is widespread among tissues and could indicate distinct tissue origins of circulating cell-free DNA (cfDNA), DNA fragments released by lytic cells into the blood. By investigating the alignment patterns of whole genome sequencing reads with the genomic DNA of different tissues, we found that the read distributions formed type-specific patterns in some regions as a result of somatic mosaicism. We then utilized this information to construct a tissue-of-origin mapping model and evaluated its predictive performance on whole genome sequencing data from tissue and cfDNA samples. In total, 1,545 tissue samples associated with 13 cancer types were included, and identification of the tissue of origin achieved a specificity of 82% and a sensitivity of 80%. Furthermore, a total of 30 cfDNA samples from lung cancer and liver cancer patients and healthy controls were analyzed to predict their tissues of origin with a specificity of 87% and a sensitivity of 87%. Our results show that read distribution patterns from whole genome sequencing could be used to identify cfDNA tissues of origin with high accuracy, suggesting the potential application of our model to early cancer detection and diagnosis.


Sign in / Sign up

Export Citation Format

Share Document