A review on machine learning for neutrino experiments

Neutrino experiments study the least understood of the Standard Model particles by observing their direct interactions with matter or searching for ultra-rare signals. The study of neutrinos typically requires overcoming large backgrounds, elusive signals, and small statistics. The introduction of state-of-the-art machine learning tools to solve analysis tasks has made major impacts to these challenges in neutrino experiments across the board. Machine learning algorithms have become an integral tool of neutrino physics, and their development is of great importance to the capabilities of next generation experiments. An understanding of the roadblocks, both human and computational, and the challenges that still exist in the application of these techniques is critical to their proper and beneficial utilization for physics applications. This review presents the current status of machine learning applications for neutrino physics in terms of the challenges and opportunities that are at the intersection between these two fields.

Download Full-text

Machine Learning Applications in Breast Cancer Diagnosis

Handbook of Research on Machine Learning Innovations and Trends - Advances in Computational Intelligence and Robotics ◽

10.4018/978-1-5225-2229-4.ch020 ◽

2017 ◽

pp. 465-490

Author(s):

Syed Jamal Safdar Gardezi ◽

Mohamed Meselhy Eltoukhy ◽

Ibrahima Faye

Keyword(s):

Breast Cancer ◽

Machine Learning ◽

Feature Extraction ◽

Machine Learning Algorithms ◽

Learning Tools ◽

Extraction Techniques ◽

Cad Systems ◽

Cad System ◽

Machine Learning Applications ◽

Detection And Diagnosis

Breast cancer is one of the leading causes of death in women worldwide. Early detection is the key to reduce the mortality rates. Mammography screening has proven to be one of the effective tools for diagnosis of breast cancer. Computer aided diagnosis (CAD) system is a fast, reliable, and cost-effective tool in assisting the radiologists/physicians for diagnosis of breast cancer. CAD systems play an increasingly important role in the clinics by providing a second opinion. Clinical trials have shown that CAD systems have improved the accuracy of breast cancer detection. A typical CAD system involves three major steps i.e. segmentation of suspected lesions, feature extraction and classification of these regions into normal or abnormal class and further into benign or malignant stages. The diagnostics ability of any CAD system is dependent on accurate segmentation, feature extraction techniques and most importantly classification tools that have ability to discriminate the normal tissues from the abnormal tissues. In this chapter we discuss the application of machine learning algorithms e.g. ANN, binary tree, SVM, etc. together with segmentation and feature extraction techniques in a CAD system development. Various methods used in the detection and diagnosis of breast lesions in mammography are reviewed. A brief introduction of machine learning tools, used in diagnosis and their classification performance on various segmentation and feature extraction techniques is presented.

Download Full-text

Smooth input preparation for quantum and quantum-inspired machine learning

Quantum Machine Intelligence ◽

10.1007/s42484-021-00045-x ◽

2021 ◽

Vol 3 (1) ◽

Author(s):

Zhikuan Zhao ◽

Jack K. Fitzsimons ◽

Patrick Rebentrost ◽

Vedran Dunjko ◽

Joseph F. Fitzsimons

Keyword(s):

Machine Learning ◽

Quantum Algorithms ◽

Machine Learning Algorithms ◽

Low Rank ◽

Smoothed Analysis ◽

Machine Learning Applications ◽

Data Points ◽

Input Model ◽

The Cost ◽

Input Perturbation

AbstractMachine learning has recently emerged as a fruitful area for finding potential quantum computational advantage. Many of the quantum-enhanced machine learning algorithms critically hinge upon the ability to efficiently produce states proportional to high-dimensional data points stored in a quantum accessible memory. Even given query access to exponentially many entries stored in a database, the construction of which is considered a one-off overhead, it has been argued that the cost of preparing such amplitude-encoded states may offset any exponential quantum advantage. Here we prove using smoothed analysis that if the data analysis algorithm is robust against small entry-wise input perturbation, state preparation can always be achieved with constant queries. This criterion is typically satisfied in realistic machine learning applications, where input data is subjective to moderate noise. Our results are equally applicable to the recent seminal progress in quantum-inspired algorithms, where specially constructed databases suffice for polylogarithmic classical algorithm in low-rank cases. The consequence of our finding is that for the purpose of practical machine learning, polylogarithmic processing time is possible under a general and flexible input model with quantum algorithms or quantum-inspired classical algorithms in the low-rank cases.

Download Full-text

Accelerating organic solar cell material's discovery: high-throughput screening and big data

Energy & Environmental Science ◽

10.1039/d1ee00559f ◽

2021 ◽

Author(s):

Xabier Rodríguez-Martínez ◽

Enrique Pascual-San-José ◽

Mariano Campoy-Quiles

Keyword(s):

Machine Learning ◽

Big Data ◽

High Throughput ◽

Organic Solar Cells ◽

High Throughput Screening ◽

Organic Solar Cell ◽

State Of The Art ◽

Review Article ◽

Machine Learning Algorithms ◽

Device Optimization

This review article presents the state-of-the-art in high-throughput computational and experimental screening routines with application in organic solar cells, including materials discovery, device optimization and machine-learning algorithms.

Download Full-text

Clinician checklist for assessing suitability of machine learning applications in healthcare

BMJ Health & Care Informatics ◽

10.1136/bmjhci-2020-100251 ◽

2021 ◽

Vol 28 (1) ◽

pp. e100251

Author(s):

Ian Scott ◽

Stacey Carter ◽

Enrico Coiera

Keyword(s):

Machine Learning ◽

Large Scale ◽

Clinical Decision Making ◽

Improve Patient Care ◽

Clinical Decision ◽

Routine Care ◽

Machine Learning Algorithms ◽

Clinical Settings ◽

Machine Learning Applications ◽

Key Issues

Machine learning algorithms are being used to screen and diagnose disease, prognosticate and predict therapeutic responses. Hundreds of new algorithms are being developed, but whether they improve clinical decision making and patient outcomes remains uncertain. If clinicians are to use algorithms, they need to be reassured that key issues relating to their validity, utility, feasibility, safety and ethical use have been addressed. We propose a checklist of 10 questions that clinicians can ask of those advocating for the use of a particular algorithm, but which do not expect clinicians, as non-experts, to demonstrate mastery over what can be highly complex statistical and computational concepts. The questions are: (1) What is the purpose and context of the algorithm? (2) How good were the data used to train the algorithm? (3) Were there sufficient data to train the algorithm? (4) How well does the algorithm perform? (5) Is the algorithm transferable to new clinical settings? (6) Are the outputs of the algorithm clinically intelligible? (7) How will this algorithm fit into and complement current workflows? (8) Has use of the algorithm been shown to improve patient care and outcomes? (9) Could the algorithm cause patient harm? and (10) Does use of the algorithm raise ethical, legal or social concerns? We provide examples where an algorithm may raise concerns and apply the checklist to a recent review of diagnostic imaging applications. This checklist aims to assist clinicians in assessing algorithm readiness for routine care and identify situations where further refinement and evaluation is required prior to large-scale use.

Download Full-text

A new generic method to improve machine learning applications in official statistics

Statistical Journal of the IAOS ◽

10.3233/sji-210885 ◽

2021 ◽

pp. 1-16

Author(s):

Kevin Kloos

Keyword(s):

Machine Learning ◽

Mean Squared Error ◽

Statistical Properties ◽

Machine Learning Algorithms ◽

Official Statistics ◽

Academic Literature ◽

Misclassification Bias ◽

Squared Error ◽

Machine Learning Applications ◽

Applications Of Machine Learning

The use of machine learning algorithms at national statistical institutes has increased significantly over the past few years. Applications range from new imputation schemes to new statistical output based entirely on machine learning. The results are promising, but recent studies have shown that the use of machine learning in official statistics always introduces a bias, known as misclassification bias. Misclassification bias does not occur in traditional applications of machine learning and therefore it has received little attention in the academic literature. In earlier work, we have collected existing methods that are able to correct misclassification bias. We have compared their statistical properties, including bias, variance and mean squared error. In this paper, we present a new generic method to correct misclassification bias for time series and we derive its statistical properties. Moreover, we show numerically that it has a lower mean squared error than the existing alternatives in a wide variety of settings. We believe that our new method may improve machine learning applications in official statistics and we aspire that our work will stimulate further methodological research in this area.

Download Full-text

State of the Art: Machine Learning Applications in Glioma Imaging

American Journal of Roentgenology ◽

10.2214/ajr.18.20218 ◽

2019 ◽

Vol 212 (1) ◽

pp. 26-37 ◽

Cited By ~ 23

Author(s):

Eyal Lotan ◽

Rajan Jain ◽

Narges Razavian ◽

Girish M. Fatterpekar ◽

Yvonne W. Lui

Keyword(s):

Machine Learning ◽

State Of The Art ◽

Machine Learning Applications

Download Full-text

Knowledge discovery in Chinese herbal medicine: a machine learning perspective

MATEC Web of Conferences ◽

10.1051/matecconf/202133606024 ◽

2021 ◽

Vol 336 ◽

pp. 06024

Author(s):

Nan Liang ◽

Qing Liang ◽

Fenglei Ji

Keyword(s):

Machine Learning ◽

Herbal Medicine ◽

Knowledge Discovery ◽

Chinese Herbal Medicine ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Current Status ◽

Network Pharmacology ◽

Valuable Insight ◽

Chinese Herbal

Traditional Chinese Medicine (TCM) has attracted more and more attention due to its remarkable effects on treating diseases, and Chinese herbal medicine (CHM) is an important partition of TCM, rich in natural active ingredients. Researchers are trying multiple analytical methods to dig out more valuable information about CHM and reveal the principle of TCM. Machine learning is playing an important role in the studies. Knowledge discovery of CHM using machine learning mainly includes quality control of CHM, network pharmacology in CHM, and medical prescriptions composed by CHM, aiming to understand TCM better, provide more efficiency methods in the production of CHM and find novel treatment of disease not curable nowadays. In this paper, we summarized the basic idea of frequently used classification and clustering machine learning algorithms, introduced pre-processing algorithms commonly used to simplify and accelerate machine learning procedure, presented current status of machine learning algorithms’ applications in knowledge discovery of CHM, discussed challenges and future trends of machine learning’s application in CHM. It is believed that the paper provides a valuable insight for the starters trying to apply machine learning in the study of CHM and catch up the recent status of related researches.

Download Full-text

Machine Learning for Landslides Prevention: A Survey

10.36227/techrxiv.12546098 ◽

2020 ◽

Author(s):

Zhengjing Ma ◽

Gang Mei

Keyword(s):

Machine Learning ◽

State Of The Art ◽

Human Life ◽

Machine Learning Algorithms ◽

Warning Systems ◽

Negative Effects ◽

Relevant Research ◽

Related Information ◽

Urgent Task ◽

Comprehensive Survey

Landslides are one of the most critical categories of natural disasters worldwide and induce severely destructive outcomes to human life and the overall economic system. To reduce its negative effects, landslides prevention has become an urgent task, which includes investigating landslide-related information and predicting potential landslides. Machine learning is a state-of-the-art analytics tool that has been widely used in landslides prevention. This paper presents a comprehensive survey of relevant research on machine learning applied in landslides prevention, mainly focusing on (1) landslides detection based on images, (2) landslides susceptibility assessment, and (3) the development of landslide warning systems. Moreover, this paper discusses the current challenges and potential opportunities in the application of machine learning algorithms for landslides prevention.

Download Full-text

Personalized Transcriptomic Analyses Identify Unique Signatures That Correlate with Genomic Subtypes in Acute Myeloid Leukemia (AML) Using Explainable Artificial Intelligence

Blood ◽

10.1182/blood-2020-139522 ◽

2020 ◽

Vol 136 (Supplement 1) ◽

pp. 33-34 ◽

Cited By ~ 1

Author(s):

Yazan Rouphail ◽

Nathan Radakovich ◽

Jacob Shreve ◽

Sudipto Mukherjee ◽

Babal K. Jha ◽

...

Keyword(s):

Machine Learning ◽

Board Of Directors ◽

Research Funding ◽

State Of The Art ◽

Data Monitoring Committee ◽

Biological Pathways ◽

Machine Learning Algorithms ◽

Advisory Committees ◽

Altered Expression ◽

Individual Level

Background Multi-omic analysis can identify unique signatures that correlate with cancer subtypes. While clinically meaningful molecular subtypes of AML have been defined based on the status of single genes such as NPM1 and FLT3, such categories remain heterogeneous and further work is needed to characterize their genetic and transcriptomic diversity on a truly individualized basis. Further, patients (pts) with NPM1+/FLT3-ITD- AML have a better overall survival compared to patients with NPM1-/FLT3-ITD+, suggesting that these pts could have different transcriptomic signature that impact phenotype, pathophysiology, and outcomes. Many current transcriptome analytic techniques use clustering analysis to aggregate samples and look at relationships on a cohort-wide basis to build transcriptomic signatures that correlate with phenotype or outcome. Such approaches can undermine the heterogeneity of the gene expression in pts with the same signatures. In this study, we took advantage of state of the art machine learning algorithms to identify unique transcriptomic signatures that correlate with AML genomic phenotype. Methods Genomic (whole exome sequencing and targeted deep sequencing) and transcriptomic data from 451 AML pts included in the Beat AML study (publicly available data) were used to build transcriptomic signatures that are specific for AML patients with NPM1+/FLT3-ITD+ compared to NPM1+/FLT3-ITD, and NPM1-/FLT3-ITD-. We chose these AML phenotypes as they have been described extensively and they correlate with clinical outcomes. Results A total of 242 patients (54%) had NPM1-/FLT3-, 35 (8%) were NPM1+/FLT3-, and 47 (10%) were NPM1+/FLT3+. Our algorithm identified 20 genes that are highly specific for NPM1/FLT3ITD phenotype: HOXB-AS3, SCRN1, LMX1B, PCBD1, DNAJC15, HOXA3, NPTXq, RP11-1055B8, ABDH128, HOXB8, SOCS2, HOXB3, HOXB9, MIR503HG, FAM221B, NRP1, NDUFAF3, MEG3, CCDC136, and HIST1H2BC. Interestingly, several of those genes were overexpressed or underexpressed in specific phenotypes. For example, SCRN1, LMX1B, RP11-1055B8, ABDH128, HOXB8, MIR503HG, NRP1 are only overexpressed or underexpressed in patients with NPM1-/FLT3-, while PCBD1, NDUFAF3, FAM221B are overexpressed or underexpressed in pts with NPM1+/FLT3+. These genes affect several important pathways that regulate cell differentiation, proliferation, mitochondrial oxidative phosphorylation, histone modification and lipid metabolism. All these genes had previously been reported as having altered expression in genomic studies of AML, confirming our approach's ability to identify biologically meaningful relationships. Further, our algorithm can provide a personalized explanation of overexpressed and underexpressed genes specific for a given patient, thus identifying targetable pathways for each pt. Figure 1 below shows three pts with the same genotype (NPM1+/FLT3-ITD+) but demonstrate different transcriptomic patterns of overexpression or underexpression that affect different biological pathways. Conclusions We describe the use of a state of the art explainable machine learning approach to define transcriptomic signatures that are specific for individual pts. In addition to correctly distinguishing AML subtype based on specific transcriptomic signatures, our model was able to accurately identify upregulated and downregulated genes that affecte several important biological pathways in AML and can summarize these pathways at an individual level. Such an approach can be used to provide personalized treatment options that can target the activated pathways at an individual level. Disclosures Mukherjee: Partnership for Health Analytic Research, LLC (PHAR, LLC): Honoraria; Novartis: Consultancy, Membership on an entity's Board of Directors or advisory committees, Research Funding; EUSA Pharma: Consultancy; Celgene/Acceleron: Membership on an entity's Board of Directors or advisory committees; Bristol Myers Squib: Honoraria; Aplastic Anemia and MDS International Foundation: Honoraria; Celgene: Consultancy, Honoraria, Research Funding. Maciejewski:Alexion, BMS: Speakers Bureau; Novartis, Roche: Consultancy, Honoraria. Sekeres:BMS: Consultancy; Takeda/Millenium: Consultancy; Pfizer: Consultancy. Nazha:Jazz: Research Funding; Incyte: Speakers Bureau; Novartis: Speakers Bureau; MEI: Other: Data monitoring Committee.

Download Full-text

Improved Antibiotic Detection in Raw Milk Using Machine Learning Tools over the Absorption Spectra of a Problem-Specific Nanobiosensor

Sensors ◽

10.3390/s20164552 ◽

2020 ◽

Vol 20 (16) ◽

pp. 4552

Author(s):

Pablo Gutiérrez ◽

Sebastián E. Godoy ◽

Sergio Torres ◽

Patricio Oyarzún ◽

Ignacio Sanhueza ◽

...

Keyword(s):

Machine Learning ◽

Milk Sample ◽

Learning Algorithm ◽

State Of The Art ◽

Raw Milk ◽

Detection Algorithm ◽

Learning Tools ◽

Antibiotic Detection ◽

Biosensor System ◽

Spectral Detection

In this article we present the development of a biosensor system that integrates nanotechnology, optomechanics and a spectral detection algorithm for sensitive quantification of antibiotic residues in raw milk of cow. Firstly, nanobiosensors were designed and synthesized by chemically bonding gold nanoparticles (AuNPs) with aptamer bioreceptors highly selective for four widely used antibiotics in the field of veterinary medicine, namely, Kanamycin, Ampicillin, Oxytetracycline and Sulfadimethoxine. When molecules of the antibiotics are present in the milk sample, the interaction with the aptamers induces random AuNP aggregation. This phenomenon modifies the initial absorption spectrum of the milk sample without antibiotics, producing spectral features that indicate both the presence of antibiotics and, to some extent, its concentration. Secondly, we designed and constructed an electro-opto-mechanic device that performs automatic high-resolution spectral data acquisition in a wavelength range of 400 to 800 nm. Thirdly, the acquired spectra were processed by a machine-learning algorithm that is embedded into the acquisition hardware to determine the presence and concentration ranges of the antibiotics. Our approach outperformed state-of-the-art standardized techniques (based on the 520/620 nm ratio) for antibiotic detection, both in speed and in sensitivity.

Download Full-text