Intelligent Image Classification for Grading Egyptian Cotton Lint

A Classification Approach for Predicting COVID-19 Patient Survival Outcome with Machine Learning Techniques

10.1101/2020.08.02.20129767 ◽

2020 ◽

Author(s):

Abdulhameed Ado Osi ◽

Hussaini Garba Dikko ◽

Mannir Abdu ◽

Auwalu Ibrahim ◽

Lawan Adamu Isma'il ◽

...

Keyword(s):

Machine Learning ◽

Survival Outcome ◽

Supervised Machine Learning ◽

Machine Learning Techniques ◽

Support Vector ◽

P Value ◽

Kappa Index ◽

Quality Health Care ◽

Linear Discriminant ◽

Learning Techniques

COVID-19 is an infectious disease discovered after the outbreak began in Wuhan, China, in December 2019. COVID-19 is still becoming an increasing global threat to public health. The virus has been escalated to many countries across the globe. This paper analyzed and compared the performance of three different supervised machine learning techniques; Linear Discriminant Analysis (LDA), Random Forest (RF), and Support Vector Machine (SVM) on COVID-19 dataset. The best level of accuracy between these three algorithms was determined by comparison of some metrics for assessing predictive performance such as accuracy, sensitivity, specificity, F-score, Kappa index, and ROC. From the analysis results, RF was found to be the best algorithm with 100% prediction accuracy in comparison with LDA and SVM with 95.2% and 90.9% respectively. Our analysis shows that out of these three classification models RF predicts COVID-19 patient's survival outcome with the highest accuracy. Chi-square test reveals that all the seven features except sex were significantly correlated with the COVID-19 patient's outcome (P-value < 0.005). Therefore, RF was recommended for COVID-19 patient outcome prediction that will help in early identification of possible sensitive cases for quick provision of quality health care, support and supervision.

Download Full-text

Using Machine Learning to Grade the Mango’s Quality Based on External Features Captured by Vision System

Applied Sciences ◽

10.3390/app10175775 ◽

2020 ◽

Vol 10 (17) ◽

pp. 5775

Author(s):

Nguyen Truong Minh Long ◽

Nguyen Truong Thinh

Keyword(s):

Machine Learning ◽

Vision System ◽

Human Perception ◽

Poor Quality ◽

Machine Learning Algorithms ◽

Internal Quality ◽

Support Vector ◽

K Nearest Neighbors ◽

Linear Discriminant ◽

Length Width

Nowadays, mangoes and other fruits are classified according to human perception of low productivity, which is a poor quality of classification. Therefore, in this study, we suggest a novel evaluation of internal quality focused on external features of mango as well as its weight. The results show that evaluation is more effective than using only one of the external features or weight combining an expensive nondestructive (NDT) measurement. Grading of fruits is implemented by four models of machine learning as Random Forest (RF), Linear Discriminant Analysis (LDA), Support Vector Machine (SVM), and K-Nearest Neighbors (KNN). Models have inputs such as length, width, defect, weight, and outputs being mango classifications such as grade G1, G2, and G3. The unstructured data of 4983 of captured images combining with load-cell signals are transferred to structured data to generate a completed dataset including density. The data normalization and elimination of outliers (DNEO) are used to create a better dataset which prepared for machine learning algorithms. Moreover, an unbiased performance estimate for the training process carried out by the nested cross-validation (NCV) method. In the experiment, the methods of machine learning have high accurate over 87.9%, especially the model of RF gets 98.1% accuracy.

Download Full-text

Machine Learning Assisted Signal Analysis in Acoustic Microscopy for Non-Destructive Defect Identification

ISTFA 2019: Conference Proceedings from the 45th International Symposium for Testing and Failure Analysis ◽

10.31399/asm.cp.istfa2019p0035 ◽

2019 ◽

Author(s):

Michael Kögel ◽

Sebastian Brand ◽

Frank Altmann

Keyword(s):

Machine Learning ◽

Failure Analysis ◽

Human Error ◽

Flip Chip ◽

Acoustic Microscopy ◽

Classification Model ◽

Supervised Machine Learning ◽

Machine Learning Techniques ◽

Scanning Acoustic Microscopy ◽

Chip Technology

Abstract Signal processing and data interpretation in scanning acoustic microscopy is often challenging and based on the subjective decisions of the operator, making the defect classification results prone to human error. The aim of this work was to combine unsupervised and supervised machine learning techniques for feature extraction and image segmentation that allows automated classification and predictive failure analysis on scanning acoustic microscopy (SAM) data. In the first part, conspicuous signal components of the time-domain echo signals and their weighting matrices are extracted using independent component analysis. The applicability was shown by the assisted separation of signal patterns to intact and defective bumps from a dataset of a CPU-device manufactured in flip-chip technology. The high success-rate was verified by physical cross-sectioning and high-resolution imaging. In the second part, the before mentioned signal separation was employed to generate a labeled dataset for training and finetuning of a classification model based on a one-dimensional convolutional neural network. The learning model was sensitive to critical features of the given task without human intervention for classification between intact bumps, defective bumps and background. This approach was evaluated on two individual test samples that contained multiple defects in the solder bumps and has been verified by physical inspection. The verification of the classification model reached an accuracy of more than 97% and was successfully applied to an unknown sample which demonstrates the high potential of machine learning concepts for further developments in assisted failure analysis.

Download Full-text

Classification of Sentiment of Reviews using Supervised Machine Learning Techniques

International Journal of Rough Sets and Data Analysis ◽

10.4018/ijrsda.2017010104 ◽

2017 ◽

Vol 4 (1) ◽

pp. 56-74 ◽

Cited By ~ 14

Author(s):

Abinash Tripathy ◽

Santanu Kumar Rath

Keyword(s):

Machine Learning ◽

Sentiment Analysis ◽

Machine Learning Algorithms ◽

Supervised Machine Learning ◽

Machine Learning Techniques ◽

Support Vector ◽

Performance Parameters ◽

Linear Discriminant ◽

Learning Techniques

Sentiment analysis helps to determine hidden intention of the concerned author of any topic and provides an evaluation report on the polarity of any document. The polarity may be positive, negative or neutral. It is observed that very often the data associated with the sentiment analysis consist of the feedback given by various specialists on any topic or product. Thus, the review may be categorized properly into any sort of class based on the polarity, in order to have a good knowledge about the product. This article proposes an approach to classify the review dataset made on basis of sentiment analysis into different polarity groups. Four machine learning algorithms viz., Naive Bayes (NB), Support Vector Machine (SVM), Random Forest, and Linear Discriminant Analysis (LDA) have been considered in this paper for classification process. The obtained result on values of accuracy of the algorithms are critically examined by using different performance parameters, applied on two different datasets.

Download Full-text

Genes and Mechanisms Associated With Experimentally Induced Bovine Respiratory Disease Identified With Supervised Machine Learning Methodology on Integrated Transcriptomic Datasets

10.21203/rs.3.rs-789747/v1 ◽

2021 ◽

Author(s):

Matthew Scott ◽

Amelia Woolums ◽

Cyprianna Swiderski ◽

Andy Perkins ◽

Bindu Nanduri

Keyword(s):

Machine Learning ◽

Respiratory Disease ◽

Bovine Respiratory Disease ◽

Atp Synthesis ◽

Supervised Machine Learning ◽

Support Vector ◽

Type I ◽

Linear Discriminant ◽

Alternative Pathways ◽

Validation Parameters

Abstract Bovine respiratory disease (BRD) is a multifactorial disease involving complex host immune interactions shaped by pathogenic agents and environmental factors. Advancements in RNA sequencing and associated analytical methods are improving our understanding of host response related to BRD pathophysiology. Supervised machine learning (ML) approaches present one such method for analyzing new and previously published transcriptome data to identify novel genes and mechanisms. Our objective was to apply ML models to lung and immunological tissue datasets acquired from previous clinical BRD experiments to identify genes that classify disease with high accuracy. Raw mRNA sequencing reads from 151 bovine datasets (n=123 BRD, n=28 control) were downloaded from NCBI-GEO. Quality filtered reads were assembled in a HISAT2/Stringtie2 pipeline. Raw gene counts for ML analysis were normalized, transformed, and analyzed with MLSeq, utilizing six ML models. Cross-validation parameters (5-fold, repeated 10 times) were applied in a 70:30 training/testing ratio. Downstream analysis of genes identified by the top sparse classifiers for each etiological association was performed within WebGestalt and Reactome (FDR < 0.05). Support vector machines was routinely the top non-sparse classifier for predicting etiological disease versus sham control. Nearest shrunken centroid and Poisson linear discriminant analysis with power transformation could reliably classify IBR and BRSV with 100% accuracy. Genes identified in IBR and BRSV, but not BVDV, were related to type I interferon production and IL-8 secretion, specifically in lymphoid tissue and not lung. Genes identified in Mannheimia haemolytica infections were involved in activating classical and alternative pathways of complement. Novel findings, including expression of genes related to reduced mitochondrial oxygenation and ATP synthesis in consolidated lung tissue, were discovered. Genes identified in each analysis represent distinct genomic events relevant to understanding and predicting clinical BRD. The few genes shared across analyses may be reliably associated with clinical BRD. Our analysis demonstrates the utility of ML with published datasets for discovering functional information to support prediction and understanding BRD acquisition.

Download Full-text

Texture-Based Metallurgical Phase Identification in Structural Steels: A Supervised Machine Learning Approach

Metals ◽

10.3390/met9050546 ◽

2019 ◽

Vol 9 (5) ◽

pp. 546 ◽

Cited By ~ 8

Author(s):

Dayakar L. Naik ◽

Hizb Ullah Sajid ◽

Ravi Kiran

Keyword(s):

Machine Learning ◽

Visual Information ◽

Supervised Machine Learning ◽

Automatic Identification ◽

Phase Identification ◽

Textural Features ◽

K Nearest Neighbor ◽

Pixel Intensity ◽

Image Domain ◽

Linear Discriminant

Automatic identification of metallurgical phases based on thresholding methods in microstructural images may not be possible when the pixel intensities associated with the metallurgical phases overlap and, hence, are indistinguishable. To circumvent this problem, additional visual information about the metallurgical phases, referred to as textural features, are considered in this study. Mathematically, textural features are the second order statistics of an image domain and can be distinct for each metallurgical phase. Textural features are evaluated from the gray level co-occurrence matrix (GLCM) of each metallurgical phase (ferrite, pearlite, and martensite) present in heat-treated ASTM A36 steels in this study. The dataset of textural features and pixel intensities generated for the metallurgical phases is used to train supervised machine learning classifiers, which are subsequently employed to predict the metallurgical phases in the microstructure. Naïve Bayes (NB), k-nearest neighbor (K-NN), linear discriminant analysis (LDA), and decision tree (DT) classifiers are the four classifiers employed in this study. The performances of all four classifiers were assessed prior to their deployment, and the classification accuracy was found to be >97%. The proposed technique has two unique advantages: (1) unlike pixel intensity-based methods, the proposed method does not misclassify the grain boundaries as a metallurgical phase, and (2) the proposed method does not require the end-user to input the number of phases present in the microstructure.

Download Full-text

Machine learning to automate rapid soil health assessment using infrared spectroscopy

10.5194/ismc2021-83 ◽

2021 ◽

Author(s):

Leonardo Deiss ◽

Shameema Oottikkal ◽

Karen Tomko ◽

Wanyu Huang ◽

Steve Culman ◽

...

Keyword(s):

Machine Learning ◽

Infrared Spectroscopy ◽

Soil Properties ◽

Data Science ◽

Intelligent System ◽

Soil Health ◽

Health Indicators ◽

Supervised Machine Learning ◽

Machine Learning Techniques ◽

Support Vector

<p>Soil infrared spectroscopy has great potential for estimating soil properties, but reference soil measurements are typically required in combination with multivariate statistical models to estimate soil properties. User-friendly predictive tools based on open-source statistical environment remain one of the main limitations to enable technology diffusion to non-specialist users. Our aim is to build capacity for an automated machine learning routine for rapid and robust prediction of soil health indicators using lab acquired soil infrared spectra. This intelligent system runs on R statistical environment and includes (1) a diverse soil spectral library comprising main physiographic regions from the USA Midwest region under diverse land uses and various sampling depths, (2) a classification process to detect potential outliers in newly acquired spectra using supervised machine learning techniques, and (3) a multi-model optimized prediction process based on linear and non-linear statistical procedures (partial least squares, support vector machines, and neural network). This prediction system works at the intersection of soil and data science and high-performance computing to enable efficient parallel processing of spectral data on multi-core coprocessors. Using artificial intelligence to automate soil infrared spectroscopy is a fundamental demand that will make this technique an effective routine in soil laboratories to estimate soil health.</p>

Download Full-text

Classification of Sentiment of Reviews using Supervised Machine Learning Techniques

Cognitive Analytics ◽

10.4018/978-1-7998-2460-2.ch009 ◽

2020 ◽

pp. 143-163

Author(s):

Abinash Tripathy ◽

Santanu Kumar Rath

Keyword(s):

Machine Learning ◽

Sentiment Analysis ◽

Machine Learning Algorithms ◽

Supervised Machine Learning ◽

Machine Learning Techniques ◽

Support Vector ◽

Evaluation Report ◽

Linear Discriminant ◽

Learning Techniques

Sentiment analysis helps to determine hidden intention of the concerned author of any topic and provides an evaluation report on the polarity of any document. The polarity may be positive, negative or neutral. It is observed that very often the data associated with the sentiment analysis consist of the feedback given by various specialists on any topic or product. Thus, the review may be categorized properly into any sort of class based on the polarity, in order to have a good knowledge about the product. This article proposes an approach to classify the review dataset made on basis of sentiment analysis into different polarity groups. Four machine learning algorithms viz., Naive Bayes (NB), Support Vector Machine (SVM), Random Forest, and Linear Discriminant Analysis (LDA) have been considered in this paper for classification process. The obtained result on values of accuracy of the algorithms are critically examined by using different performance parameters, applied on two different datasets.

Download Full-text

Genes and regulatory mechanisms associated with experimentally-induced bovine respiratory disease identified using supervised machine learning methodology

Scientific Reports ◽

10.1038/s41598-021-02343-7 ◽

2021 ◽

Vol 11 (1) ◽

Author(s):

Matthew A. Scott ◽

Amelia R. Woolums ◽

Cyprianna E. Swiderski ◽

Andy D. Perkins ◽

Bindu Nanduri

Keyword(s):

Machine Learning ◽

Respiratory Disease ◽

Lung Tissue ◽

Bovine Respiratory Disease ◽

Atp Synthesis ◽

Parameter Tuning ◽

Supervised Machine Learning ◽

Type I ◽

Linear Discriminant ◽

Validation Parameters

AbstractBovine respiratory disease (BRD) is a multifactorial disease involving complex host immune interactions shaped by pathogenic agents and environmental factors. Advancements in RNA sequencing and associated analytical methods are improving our understanding of host response related to BRD pathophysiology. Supervised machine learning (ML) approaches present one such method for analyzing new and previously published transcriptome data to identify novel disease-associated genes and mechanisms. Our objective was to apply ML models to lung and immunological tissue datasets acquired from previous clinical BRD experiments to identify genes that classify disease with high accuracy. Raw mRNA sequencing reads from 151 bovine datasets (n = 123 BRD, n = 28 control) were downloaded from NCBI-GEO. Quality filtered reads were assembled in a HISAT2/Stringtie2 pipeline. Raw gene counts for ML analysis were normalized, transformed, and analyzed with MLSeq, utilizing six ML models. Cross-validation parameters (fivefold, repeated 10 times) were applied to 70% of the compiled datasets for ML model training and parameter tuning; optimized ML models were tested with the remaining 30%. Downstream analysis of significant genes identified by the top ML models, based on classification accuracy for each etiological association, was performed within WebGestalt and Reactome (FDR ≤ 0.05). Nearest shrunken centroid and Poisson linear discriminant analysis with power transformation models identified 154 and 195 significant genes for IBR and BRSV, respectively; from these genes, the two ML models discriminated IBR and BRSV with 100% accuracy compared to sham controls. Significant genes classified by the top ML models in IBR (154) and BRSV (195), but not BVDV (74), were related to type I interferon production and IL-8 secretion, specifically in lymphoid tissue and not homogenized lung tissue. Genes identified in Mannheimia haemolytica infections (97) were involved in activating classical and alternative pathways of complement. Novel findings, including expression of genes related to reduced mitochondrial oxygenation and ATP synthesis in consolidated lung tissue, were discovered. Genes identified in each analysis represent distinct genomic events relevant to understanding and predicting clinical BRD. Our analysis demonstrates the utility of ML with published datasets for discovering functional information to support the prediction and understanding of clinical BRD.

Download Full-text

Intelligent system of English composition scoring model based on improved machine learning algorithm

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-189235 ◽

2020 ◽

pp. 1-11

Author(s):

Jie Liu ◽

Lin Lin ◽

Xiufang Liang

Keyword(s):

Machine Learning ◽

Evaluation System ◽

Intelligent System ◽

Learning Algorithm ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Assessment System ◽

English Composition ◽

Region Extraction ◽

Constraint Model

The online English teaching system has certain requirements for the intelligent scoring system, and the most difficult stage of intelligent scoring in the English test is to score the English composition through the intelligent model. In order to improve the intelligence of English composition scoring, based on machine learning algorithms, this study combines intelligent image recognition technology to improve machine learning algorithms, and proposes an improved MSER-based character candidate region extraction algorithm and a convolutional neural network-based pseudo-character region filtering algorithm. In addition, in order to verify whether the algorithm model proposed in this paper meets the requirements of the group text, that is, to verify the feasibility of the algorithm, the performance of the model proposed in this study is analyzed through design experiments. Moreover, the basic conditions for composition scoring are input into the model as a constraint model. The research results show that the algorithm proposed in this paper has a certain practical effect, and it can be applied to the English assessment system and the online assessment system of the homework evaluation system algorithm system.

Download Full-text