machine learning method Latest Research Papers

A physics-informed machine learning method for predicting grain structure characteristics in directed energy deposition

Computational Materials Science ◽

10.1016/j.commatsci.2021.110958 ◽

2022 ◽

Vol 202 ◽

pp. 110958

Author(s):

Dmitriy Kats ◽

Zhidong Wang ◽

Zhengtao Gan ◽

Wing Kam Liu ◽

Gregory J. Wagner ◽

...

Keyword(s):

Machine Learning ◽

Energy Deposition ◽

Grain Structure ◽

Machine Learning Method ◽

Learning Method ◽

Structure Characteristics ◽

Directed Energy Deposition ◽

Directed Energy

Improvements of response surface modeling with self-adaptive machine learning method for PM2.5 and O3 predictions

Journal of Environmental Management ◽

10.1016/j.jenvman.2021.114210 ◽

2022 ◽

Vol 303 ◽

pp. 114210

Author(s):

Jinying Li ◽

Youzhi Dai ◽

Yun Zhu ◽

Xiangbo Tang ◽

Shuxiao Wang ◽

...

Keyword(s):

Machine Learning ◽

Response Surface ◽

Surface Modeling ◽

Response Surface Modeling ◽

Machine Learning Method ◽

Learning Method ◽

Self Adaptive

PENERAPAN GRADIENT BOOSTING DENGAN HYPEROPT UNTUK MEMPREDIKSI KEBERHASILAN TELEMARKETING BANK

Jurnal Gaussian ◽

10.14710/j.gauss.v10i4.31335 ◽

2022 ◽

Vol 10 (4) ◽

pp. 617-623

Author(s):

Silvia Elsa Suryana ◽

Budi Warsito ◽

Suparti Suparti

Keyword(s):

Machine Learning ◽

Bayesian Optimization ◽

Gradient Boosting ◽

Machine Learning Method ◽

Classification Result ◽

Independent Variables ◽

Binary Dependent Variable ◽

Boosting Method ◽

Optimal Classification

Telemarketing is another form of marketing which is conducted via telephone. Bank can use telemarketing to offer its products such as term deposit. One of the most important strategy to the success of telemarketing is opting the potential customer to create effective telemarketing. Predicting the success of telemarketing can use machine learning. Gradient boosting is machine learning method with advanced decision tree. Gardient boosting involves many classification trees which are continually upgraded from previous tree. The optimal classification result cannot be separated from the role of the optimal hyperparameter. Hyperopt is Python library that can be used to tune hyperparameter effectively because it uses Bayesian optimization. Hyperopt uses hyperparameter prior distribution to find optimal hyperparameter. Data in this study including 20 independent variables and binary dependent variable which has ‘yes’ and ‘no’ classes. The study showed that gradient boosting reached classification accuracy up to 90,39%, precision 94,91%, and AUC 0,939. These values describe gradient boosting method is able to predict both classes ‘yes’ and ‘no’ relatively accurate.

Explainable t-SNE for single-cell RNA-seq data analysis

10.1101/2022.01.12.476084 ◽

2022 ◽

Author(s):

Henry Han ◽

Tianyu Zhang ◽

Mary Lauren Benton ◽

Chun Li ◽

Juan Wang ◽

...

Keyword(s):

Gene Expression ◽

Machine Learning ◽

Data Analysis ◽

Dimension Reduction ◽

Single Cell ◽

Method Development ◽

Robustness Analysis ◽

High Dimensional ◽

Machine Learning Method ◽

Learning Method

Single-cell RNA (scRNA-seq) sequencing technologies trigger the study of individual cell gene expression and reveal the diversity within cell populations. To measure cell-to-cell similarity based on their transcription and gene expression, many dimension reduction methods are employed to retrieve the corresponding low-dimensional embeddings of input scRNA-seq data to conduct clustering. However, the methods lack explainability and may not perform well with scRNA-seq data because they are often migrated from other fields and not customized for high-dimensional sparse scRNA-seq data. In this study, we propose an explainable t-SNE: cell-driven t-SNE (c-TSNE) that fuses the cell differences reflected from biologically meaningful distance metrics for input scRNA-seq data. Our study shows that the proposed method not only enhances the interpretation of the original t-SNE visualization for scRNA-seq data but also demonstrates favorable single cell segregation performance on benchmark datasets compared to the state-of-the-art peers. The robustness analysis shows that the proposed cell-driven t-SNE demonstrates robustness to dropout and noise in dimension reduction and clustering. It provides a novel and practical way to investigate the interpretability of t-SNE in scRNA-seq data analysis. Unlike the general assumption that the explainanbility of a machine learning method needs to compromise with the learning efficiency, the proposed explainable t-SNE improves both clustering efficiency and explainanbility in scRNA-seq analysis. More importantly, our work suggests that widely used t-SNE can be easily misused in the existing scRNA-seq analysis, because its default Euclidean distance can bring biases or meaningless results in cell difference evaluation for high-dimensional sparse scRNA-seq data. To the best of our knowledge, it is the first explainable t-SNE proposed in scRNA-seq analysis and will inspire other explainable machine learning method development in the field.

Imputation by feature importance (IBFI): A methodology to envelop machine learning method for imputing missing patterns in time series data

PLoS ONE ◽

10.1371/journal.pone.0262131 ◽

2022 ◽

Vol 17 (1) ◽

pp. e0262131

Author(s):

Adil Aslam Mir ◽

Kimberlee Jane Kearfott ◽

Fatih Vehbi Çelebi ◽

Muhammad Rafique

Keyword(s):

Machine Learning ◽

Time Series Data ◽

Mean Squared Error ◽

Learning Algorithm ◽

Series Data ◽

Machine Learning Method ◽

Learning Method ◽

Imputation Methods ◽

Squared Error ◽

Feature Importance

A new methodology, imputation by feature importance (IBFI), is studied that can be applied to any machine learning method to efficiently fill in any missing or irregularly sampled data. It applies to data missing completely at random (MCAR), missing not at random (MNAR), and missing at random (MAR). IBFI utilizes the feature importance and iteratively imputes missing values using any base learning algorithm. For this work, IBFI is tested on soil radon gas concentration (SRGC) data. XGBoost is used as the learning algorithm and missing data are simulated using R for different missingness scenarios. IBFI is based on the physically meaningful assumption that SRGC depends upon environmental parameters such as temperature and relative humidity. This assumption leads to a model obtained from the complete multivariate series where the controls are available by taking the attribute of interest as a response variable. IBFI is tested against other frequently used imputation methods, namely mean, median, mode, predictive mean matching (PMM), and hot-deck procedures. The performance of the different imputation methods was assessed using root mean squared error (RMSE), mean squared log error (MSLE), mean absolute percentage error (MAPE), percent bias (PB), and mean squared error (MSE) statistics. The imputation process requires more attention when multiple variables are missing in different samples, resulting in challenges to machine learning methods because some controls are missing. IBFI appears to have an advantage in such circumstances. For testing IBFI, Radon Time Series Data (RTS) has been used and data was collected from 1st March 2017 to the 11th of May 2018, including 4 seismic activities that have taken place during the data collection time.

MT-MAG: Accurate and interpretable machine learning based taxonomic assignment of metagenome-assembled genomes, with a partial classification option

10.1101/2022.01.12.475159 ◽

2022 ◽

Author(s):

Wanxin Li ◽

Lila Kari ◽

Yaoliang Yu ◽

Laura A Hug

Keyword(s):

Machine Learning ◽

Species Level ◽

Metagenomic Data ◽

Machine Learning Method ◽

Numerical Classification ◽

Training Set ◽

Taxonomic Assignment ◽

Taxonomic Rank ◽

Partial Classification

We propose MT-MAG, a novel machine learning-based taxonomic assignment tool for hierarchically-structured local classification of metagenome-assembled genomes (MAGs). MT-MAG is capable of classifying large and diverse real metagenomic datasets, having analyzed for this study a total of 240 Gbp of data in the training set, and 7 Gbp of data in the test set. MT-MAG is, to the best of our knowledge, the first machine learning method for taxonomic assignment of metagenomic data that offers a "partial classification" option. MT-MAG outputs complete or a partial classification paths, and interpretable numerical classification confidences of its classifications, at all taxonomic ranks. MT-MAG is able to completely classify 48% more sequences than DeepMicrobes to the Species level (the only comparable taxonomic rank for DeepMicrobes), and it outperforms DeepMicrobes by an average of 33% in weighted accuracy, and by 89% in constrained accuracy.

Optimising Energy Management in Hybrid Microgrids

Mathematics ◽

10.3390/math10020214 ◽

2022 ◽

Vol 10 (2) ◽

pp. 214

Author(s):

Javier Bilbao ◽

Eugenio Bravo ◽

Olatz García ◽

Carolina Rebollar ◽

Concepción Varela

Keyword(s):

Machine Learning ◽

Energy Storage ◽

Energy Balance ◽

Energy Management ◽

Electricity Market ◽

Ensemble Methods ◽

Machine Learning Method ◽

Learning Method ◽

Electricity System ◽

Different Time Scales

This article deals with the optimization of the operation of hybrid microgrids. Both the problem of controlling the management of load sharing between the different generators and energy storage and possible solutions for the integration of the microgrid into the electricity market will be discussed. Solar and wind energy as well as hybrid storage with hydrogen, as renewable sources, will be considered, which allows management of the energy balance on different time scales. The Machine Learning method of Decision Trees, combined with ensemble methods, will also be introduced to study the optimization of microgrids. The conclusions obtained indicate that the development of suitable controllers can facilitate a competitive participation of renewable energies and the integration of microgrids in the electricity system.

An ensemble machine learning method for crash responsibility assignment in quasi-induced exposure theory

Journal of Transportation Safety & Security ◽

10.1080/19439962.2022.2026543 ◽

2022 ◽

pp. 1-19

Author(s):

Guopeng Zhang ◽

Ying Cai ◽

Xinguo Jiang ◽

Yingfei Fan ◽

Yue Zhou ◽

...

Keyword(s):

Machine Learning ◽

Machine Learning Method ◽

Learning Method ◽

Ensemble Machine Learning

Discovering Rational Heuristics for Risky Choice

10.31234/osf.io/mg7dn ◽

2022 ◽

Author(s):

Paul Krueger ◽

Frederick Callaway ◽

Sayan Gul ◽

Tom Griffiths ◽

Falk Lieder

Keyword(s):

Risky Choice ◽

Behavioral Experiment ◽

Cognitive Resources ◽

Machine Learning Method ◽

Learning Method ◽

Rational Decision ◽

Decision Strategies ◽

Rational Decision Making ◽

Wide Range ◽

Order Of Magnitude

For computationally limited agents such as humans, perfectly rational decision-making is almost always out of reach. Instead, people may rely on computationally frugal heuristics that usually yield good outcomes. Although previous research has identified many such heuristics, discovering good heuristics and predicting when they will be used remains challenging. Here, we present a machine learning method that identifies the best heuristics to use in any given situation. To demonstrate the generalizability and accuracy of our method, we compare the strategies it discovers against those used by people across a wide range of multi-alternative risky choice environments in a behavioral experiment that is an order of magnitude larger than any previous experiments of its type. Our method rediscovered known heuristics, identifying them as rational strategies for specific environments, and discovered novel heuristics that had been previously overlooked. Our results show that people adapt their decision strategies to the structure of the environment and generally make good use of their limited cognitive resources, although they tend to collect too little information and their strategy choices do not always fully exploit the structure of the environment.

A data-driven analytical model for wind turbine wakes using machine learning method

Energy Conversion and Management ◽

10.1016/j.enconman.2021.115130 ◽

2022 ◽

Vol 252 ◽

pp. 115130

Author(s):

Guo Nai-Zhi ◽

Zhang Ming-Ming ◽

Li Bo

Keyword(s):

Machine Learning ◽

Wind Turbine ◽

Analytical Model ◽

Data Driven ◽

Machine Learning Method ◽

Learning Method

machine learning method
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

A physics-informed machine learning method for predicting grain structure characteristics in directed energy deposition

Improvements of response surface modeling with self-adaptive machine learning method for PM2.5 and O3 predictions

PENERAPAN GRADIENT BOOSTING DENGAN HYPEROPT UNTUK MEMPREDIKSI KEBERHASILAN TELEMARKETING BANK

Explainable t-SNE for single-cell RNA-seq data analysis

Imputation by feature importance (IBFI): A methodology to envelop machine learning method for imputing missing patterns in time series data

MT-MAG: Accurate and interpretable machine learning based taxonomic assignment of metagenome-assembled genomes, with a partial classification option

Optimising Energy Management in Hybrid Microgrids

An ensemble machine learning method for crash responsibility assignment in quasi-induced exposure theory

Discovering Rational Heuristics for Risky Choice

A data-driven analytical model for wind turbine wakes using machine learning method

Export Citation Format

machine learning methodRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

A physics-informed machine learning method for predicting grain structure characteristics in directed energy deposition

Improvements of response surface modeling with self-adaptive machine learning method for PM2.5 and O3 predictions

PENERAPAN GRADIENT BOOSTING DENGAN HYPEROPT UNTUK MEMPREDIKSI KEBERHASILAN TELEMARKETING BANK

Explainable t-SNE for single-cell RNA-seq data analysis

Imputation by feature importance (IBFI): A methodology to envelop machine learning method for imputing missing patterns in time series data

MT-MAG: Accurate and interpretable machine learning based taxonomic assignment of metagenome-assembled genomes, with a partial classification option

Optimising Energy Management in Hybrid Microgrids

An ensemble machine learning method for crash responsibility assignment in quasi-induced exposure theory

Discovering Rational Heuristics for Risky Choice

A data-driven analytical model for wind turbine wakes using machine learning method

machine learning method
Recently Published Documents