scholarly journals Machine Learning for Statistical Modeling

2021 ◽  
Vol 26 (3) ◽  
pp. 1-17
Author(s):  
Urmimala Roy ◽  
Tanmoy Pramanik ◽  
Subhendu Roy ◽  
Avhishek Chatterjee ◽  
Leonard F. Register ◽  
...  

We propose a methodology to perform process variation-aware device and circuit design using fully physics-based simulations within limited computational resources, without developing a compact model. Machine learning (ML), specifically a support vector regression (SVR) model, has been used. The SVR model has been trained using a dataset of devices simulated a priori, and the accuracy of prediction by the trained SVR model has been demonstrated. To produce a switching time distribution from the trained ML model, we only had to generate the dataset to train and validate the model, which needed ∼500 hours of computation. On the other hand, if 10 6 samples were to be simulated using the same computation resources to generate a switching time distribution from micromagnetic simulations, it would have taken ∼250 days. Spin-transfer-torque random access memory (STTRAM) has been used to demonstrate the method. However, different physical systems may be considered, different ML models can be used for different physical systems and/or different device parameter sets, and similar ends could be achieved by training the ML model using measured device data.

2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Hyeon-Kyu Park ◽  
Jae-Hyeok Lee ◽  
Jehyun Lee ◽  
Sang-Koog Kim

AbstractThe macroscopic properties of permanent magnets and the resultant performance required for real implementations are determined by the magnets’ microscopic features. However, earlier micromagnetic simulations and experimental studies required relatively a lot of work to gain any complete and comprehensive understanding of the relationships between magnets’ macroscopic properties and their microstructures. Here, by means of supervised learning, we predict reliable values of coercivity (μ0Hc) and maximum magnetic energy product (BHmax) of granular NdFeB magnets according to their microstructural attributes (e.g. inter-grain decoupling, average grain size, and misalignment of easy axes) based on numerical datasets obtained from micromagnetic simulations. We conducted several tests of a variety of supervised machine learning (ML) models including kernel ridge regression (KRR), support vector regression (SVR), and artificial neural network (ANN) regression. The hyper-parameters of these models were optimized by a very fast simulated annealing (VFSA) algorithm with an adaptive cooling schedule. In our datasets of randomly generated 1,000 polycrystalline NdFeB cuboids with different microstructural attributes, all of the models yielded similar results in predicting both μ0Hc and BHmax. Furthermore, some outliers, which deteriorated the normality of residuals in the prediction of BHmax, were detected and further analyzed. Based on all of our results, we can conclude that our ML approach combined with micromagnetic simulations provides a robust framework for optimal design of microstructures for high-performance NdFeB magnets.


2021 ◽  
Vol 21 (1) ◽  
pp. 37-49
Author(s):  
Yu-Pei Liang ◽  
Shuo-Han Chen ◽  
Yuan-Hao Chang ◽  
Yun-Fei Liu ◽  
Hsin-Wen Wei ◽  
...  

Owing to the energy-constraint nature of cyber-physical systems (CPS), energy efficiency has become a primary design consideration for CPS. On CPS, owing to the high leakage power issue of SRAM, the major portion of its energy consumption comes from static random-access memory (SRAM)-based processors. Recently, with the emerging and rapidly evolving nonvolatile Spin-Transfer Torque RAM (STT-RAM), STT-RAM is expected to replace SRAM within processors for enhancing the energy efficiency with its near-zero leakage power features. The advances in Magnetic Tunneling Junction (MTJ) technology also realize the multi-level cell (MLC) STT-RAM to pack more cells with the same die area for achieving the memory density. However, the write disturbance issue of MLC STT-RAM prevents STT-RAM from properly resolving the energy efficiency of CPS. Although studies have been proposed to alleviate this issue, previous strategies could induce additional management overhead due to the use of counters or lead to frequent swap operations. Such an observation motivates us to propose an effective and simple strategy to combine direct and split cache mapping designs to enhance the energy efficiency of MLC STT-RAM. A series of experiments have been conducted on an open-source emulator with encouraging results.


Author(s):  
Vatsal Gupta and Saurabh Gautam

Image recognition is one of the core disciplines in Computer Vision. It is one of the most widely researched topics of the last few decades. Many advances in image recognition in the past decade, has made it one of the most efficient and powerful disciplines of all, having its applications in every sector including Finance, Healthcare, Security services, Agriculture and many more. Feature extraction is an integral part of image recognition. It helps in training the model more efficiently and with a higher accuracy, by getting rid of any unwanted or unnecessary features, thus reducing the dimensionality of the input image. This also helps in reducing the computational resources required by the algorithm to train, thus making it affordable for people with low end setups. Here we compare the accuracies of different machine learning classification algorithms, and their training times, with and without using feature Extraction. For the purpose of extracting features, a convolutional neural network was used. The model was trained and tested on the data of 12 classes containing a total of 2,175 images. For comparisons, we chose the Logistic regression, K-Nearest Neighbors Classifier, Random forest Classifier, and Support Vector Machine Classifier.


2020 ◽  
Vol 6 (1) ◽  
Author(s):  
Kena Zhang ◽  
Jianjun Wang ◽  
Yuhui Huang ◽  
Long-Qing Chen ◽  
P. Ganesh ◽  
...  

AbstractMetal oxide-based Resistive Random-Access Memory (RRAM) exhibits multiple resistance states, arising from the activation/deactivation of a conductive filament (CF) inside a switching layer. Understanding CF formation kinetics is critical to achieving optimal functionality of RRAM. Here a phase-field model is developed, based on materials properties determined by ab initio calculations, to investigate the role of electrical bias, heat transport and defect-induced Vegard strain in the resistive switching behavior, using MO2−x systems such as HfO2−x as a prototypical model system. It successfully captures the CF formation and resultant bipolar resistive switching characteristics. High-throughput simulations are performed for RRAMs with different material parameters to establish a dataset, based on which a compressed-sensing machine learning is conducted to derive interpretable analytical models for device performance (current on/off ratio and switching time) metrics in terms of key material parameters (electrical and thermal conductivities, Vegard strain coefficients). These analytical models reveal that optimal performance (i.e., high current on/off ratio and low switching time) can be achieved in materials with a low Lorenz number, a fundamental material constant. This work provides a fundamental understanding to the resistive switching in RRAM and demonstrates a computational data-driven methodology of materials selection for improved RRAM performance, which can also be applied to other electro-thermo-mechanical systems.


2021 ◽  
Author(s):  
Konrad Thorner ◽  
Aaron M. Zorn ◽  
Praneet Chaturvedi

AbstractAnnotation of single cells has become an important step in the single cell analysis framework. With advances in sequencing technology thousands to millions of cells can be processed to understand the intricacies of the biological system in question. Annotation through manual curation of markers based on a priori knowledge is cumbersome given this exponential growth. There are currently ~200 computational tools available to help researchers automatically annotate single cells using supervised/unsupervised machine learning, cell type markers, or tissue-based markers from bulk RNA-seq. But with the expansion of publicly available data there is also a need for a tool which can help integrate multiple references into a unified atlas and understand how annotations between datasets compare. Here we present ELeFHAnt: Ensemble learning for harmonization and annotation of single cells. ELeFHAnt is an easy-to-use R package that employs support vector machine and random forest algorithms together to perform three main functions: 1) CelltypeAnnotation 2) LabelHarmonization 3) DeduceRelationship. CelltypeAnnotation is a function to annotate cells in a query Seurat object using a reference Seurat object with annotated cell types. LabelHarmonization can be utilized to integrate multiple cell atlases (references) into a unified cellular atlas with harmonized cell types. Finally, DeduceRelationship is a function that compares cell types between two scRNA-seq datasets. ELeFHAnt can be accessed from GitHub at https://github.com/praneet1988/ELeFHAnt.


2019 ◽  
Vol 9 (3) ◽  
pp. 228 ◽  
Author(s):  
AMM Sharif Ullah

This article addresses some fundamental issues of concept mapping relevant to discipline-based education. The focus is on manufacturing knowledge representation from the viewpoints of both human and machine learning. The concept of new-generation manufacturing (Industry 4.0, smart manufacturing, and connected factory) necessitates learning factory (human learning) and human-cyber-physical systems (machine learning). Both learning factory and human-cyber-physical systems require semantic web-embedded dynamic knowledge bases, which are subjected to syntax (machine-to-machine communication), semantics (the meaning of the contents), and pragmatics (the preferences of individuals involved). This article argues that knowledge-aware concept mapping is a solution to create and analyze the semantic web-embedded dynamic knowledge bases for both human and machine learning. Accordingly, this article defines five types of knowledge, namely, analytic a priori knowledge, synthetic a priori knowledge, synthetic a posteriori knowledge, meaningful knowledge, and skeptic knowledge. These types of knowledge help find some rules and guidelines to create and analyze concept maps for the purposes human and machine learning. The presence of these types of knowledge is elucidated using a real-life manufacturing knowledge representation case. Their implications in learning manufacturing knowledge are also described. The outcomes of this article help install knowledge-aware concept maps for discipline-based education.


Author(s):  
F. WEICHERT ◽  
M. GASPAR ◽  
M. WAGNER

The present paper describes a novel approach to performing feature extraction and classification in possibly layered circular structures, as seen in two-dimensional cutting planes of three-dimensional tube-shaped objects. The algorithm can therefore be used to analyze histological specimens of blood vessels as well as intravascular ultrasound (IVUS) datasets. The approach uses a radial signal-based extraction of textural features in combination with methods of machine learning to integrate a priori domain knowledge. The algorithm in principle solves a two-dimensional classification problem that is reduced to parallel viable time series analysis. A multiscale approach hereby determines a feature vector for each analysis using either a Wavelet-transform (WT) or a S-transform (ST). The classification is done by methods of machine learning — here support vector machines. A modified marching squares algorithm extracts the polygonal segments for the two-dimensional classification. The accuracy is above 80% even in datasets with a considerable quantity of artifacts, while the mean accuracy is above 90%. The benefit of the approach therefore mainly lies in its robustness, efficient calculation, and the integration of domain knowledge.


Blood ◽  
2019 ◽  
Vol 134 (Supplement_1) ◽  
pp. 4136-4136 ◽  
Author(s):  
Raoul Santiago ◽  
Johanna Ortiz Jimenez ◽  
Reza Forghani ◽  
Nikesh Muthukrishnan ◽  
Olivier Del Corpo ◽  
...  

Introduction Approximately 15% of diffuse large B-cell lymphomas (DLBCL) do not respond to R-CHOP (rituximab, cyclophosphamide, doxorubicin, vincristine and prednisone) or equivalent regimen. These primary refractory cases (prDLBCL) have a particularly poor survival. There are currently no reliable biomarkers to a priori identify prDLBCL patients and include them in clinical trials, while avoiding needless toxicity from predictably ineffective therapy. In this study, we evaluated the potential for radiomic analysis with machine learning for predicting prDLBCL. Method This study included adult patients with prDLBCL from a single institution from 2009 to 2018, who had first-line treatment with an R-CHOP like regimen, had never received systemic treatment for indolent lymphoma, and who had a CT scan at the time of diagnosis. Refractory (R) patients were defined by progression of disease (PD) after completion of at least one cycle, or failure to achieve a complete response (CR) after at least 4 cycles, as per Lugano criteria (Cheson, JCO 2014). Non-refractory (NR) patients were matched 1:1 on sex and R-IPI for the comparison group. Enlarged lymph nodes (≥1.5 cm in greatest diameter) were eligible for evaluation. The 6 largest nodes were selected at each node site (abdomen, chest, axilla and neck) and for each node category (refractory node (RN), partial response (PR) and CR, as per Lugano criteria). 3D Slicer software was used for the delineation of the region of interest (ROI) either for subsequent 2D analysis (largest axial section) or 3D analysis (total node volume). Each node was manually contoured by two independent readers and also was reviewed by an experienced senior oncologic radiologist. A total of 788 and 1218 features were extracted from 2D and 3D regions of interest, respectively, using Pyradiomics open source software. Two independent machine learning approaches, Random Forests (RF) and Support Vector Machine (SVM), were tested for constructing the prediction models. 70% of cases were randomly assigned to the training set and 30% to the independent testing set. In the node model (NM) each independent node's response to treatment was predicted. In the patient model (PM), groups of nodes per site (abdomen, chest, axilla and neck) were used to predict the overall patient response. Results A total of 26 refractory patients were identified with a total 149 nodes (RN=55, PR=20, CR=74) and matched to 26 NR patients for comparison, with a total of 105 CR nodes. Seventeen nodes with significant artifact were excluded from the analysis (7 from NR patients and 10 from R patients). RF had consistently superior performance compared to SVM and was used for constructing the final prediction models. Furthermore, 2D radiomic analysis had superior performance compared to 3D radiomic analysis. In the independent testing (prediction) set, the mean accuracy between the 2 readers for this model for distinguishing a R from NR patient was 80% (mean sensitivity and specificity, 73% and 88%, respectively). This model was able to predict a R patient (positive predictive value (PPV)) in 100% and 71% of the case, respectively for readers 1 and 2. The area under the ROC curve (AUC) was 0.96 and 0.81 for reader 1 and 2, respectively (Figure 1A). For performance of the radiomic model for distinguishing individual refractory from responsive nodes, the independent testing set had a mean accuracy of 75% (mean sensitivity, specificity, PPV, and NPV of 80%, 69%, 78%, and 71% respectively). The AUC per reader were 0.82 and 0.85 (Figure 1B). Conclusion We demonstrate that the use of CT radiomic analysis with machine learning for identifying a priori primary refractory DLBCL patients is feasible. These models provide a relatively high prediction accuracy, which currently cannot be done in the clinical setting based on standard, largely qualitative, imaging characteristics. The main limitations of our study include small patient numbers in this pilot study and exclusion of extranodal sites. The next step for this project would be to evaluate this approach in a larger cohort that includes a second independent institution. CT-based radiomics is promising and should be further explored to achieve this unmet need for predicting prDLBCL prior to therapy initiation. Disclosures Forghani: GE Healthcare: Consultancy, Honoraria, Research Funding; 4Intel Inc: Equity Ownership, Membership on an entity's Board of Directors or advisory committees, Other: Founder. Reinhold:FRQS: Other: FRQS Grant. Assouline:Pfizer: Consultancy, Honoraria, Speakers Bureau; Janssen: Consultancy, Honoraria, Speakers Bureau; Abbvie: Consultancy, Honoraria; F. Hoffmann-La Roche Ltd: Consultancy, Honoraria.


Sign in / Sign up

Export Citation Format

Share Document