METHOD OF KEY VECTORS EXTRACTION USING R-CLOUD CLASSIFIERS

2007 ◽  
Vol 03 (03) ◽  
pp. 419-426
Author(s):  
ANTON BOUGAEV ◽  
ALEKSEY URMANOV ◽  
LEFTERI TSOUKALAS ◽  
KENNY GROSS

A novel method for reducing a training data set in the context of nonparametric classification is proposed. The new method builds on the recently introduced R-cloud classification method, whose advantages are also examined. The separating boundary of the R-cloud classifier is represented using Rvachev functions. The key vector extraction method uses the value of the R-cloud function to quantify the disturbance of the separating boundary caused by removing a single data vector from the design data set. The R-cloud method has proved instructive and practical in a number of engineering problems related to pattern classification.
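Since the key-vector criterion is the disturbance of the separating boundary caused by removing one design vector, a generic leave-one-out sketch of that scoring idea is given below; `train_classifier` and `decision_value` are hypothetical stand-ins for the R-cloud construction and the R-cloud function, not the authors' implementation.

```python
# Leave-one-out boundary-disturbance scoring (generic sketch, not the R-cloud code).
import numpy as np

def key_vector_scores(X, y, train_classifier, decision_value):
    """Score each design vector by how much its removal perturbs the boundary."""
    full_model = train_classifier(X, y)
    baseline = np.array([decision_value(full_model, x) for x in X])
    scores = np.empty(len(X))
    for i in range(len(X)):
        mask = np.arange(len(X)) != i                      # drop one design vector
        reduced = train_classifier(X[mask], y[mask])
        perturbed = np.array([decision_value(reduced, x) for x in X])
        scores[i] = np.abs(perturbed - baseline).max()     # proxy for boundary disturbance
    return scores  # small score -> removable vector, large score -> key vector
```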

2018 ◽  
Vol 2018 ◽  
pp. 1-10
Author(s):  
Siyu Ji ◽  
Chenglin Wen

A neural network is a data-driven algorithm: building the network model requires a large amount of training data, so a significant amount of time is spent training its parameters. However, the system mode changes from time to time, and predictions made with the original model parameters then deviate greatly from the true values. Traditional methods such as gradient descent and least squares are centralized, which makes it difficult to adaptively update model parameters as the system changes. First, to update the network parameters adaptively, this paper introduces an evaluation function and gives a new method for evaluating its parameters. The new method updates selected parameters of the model in real time, without changing the others, to maintain the model's accuracy. Then, based on the evaluation function, the Mean Impact Value (MIV) algorithm is used to calculate feature weights, and the weighted data are fed into the established fault diagnosis model for fault diagnosis. Finally, the validity of the algorithm is verified on the UCI Combined Cycle Power Plant (UCI-CCPP) benchmark data set.
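As a concrete illustration of the feature-weighting step, the sketch below computes Mean Impact Values by perturbing each input feature of an already trained network; the ±10% perturbation and the `model.predict` interface are common MIV conventions assumed here, not details taken from the paper.

```python
# Mean Impact Value (MIV) feature weighting (sketch); `model` is any trained
# regressor with a predict(X) method, e.g. an sklearn MLPRegressor.
import numpy as np

def miv_weights(model, X, delta=0.10):
    mivs = []
    for j in range(X.shape[1]):
        X_up, X_down = X.copy(), X.copy()
        X_up[:, j] *= (1.0 + delta)            # increase feature j by 10%
        X_down[:, j] *= (1.0 - delta)          # decrease feature j by 10%
        impact = model.predict(X_up) - model.predict(X_down)
        mivs.append(np.mean(np.abs(impact)))   # mean impact value of feature j
    mivs = np.asarray(mivs)
    return mivs / mivs.sum()                   # normalized feature weights

# Weighted data for the downstream fault-diagnosis model:
# X_weighted = X * miv_weights(model, X)
```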


2012 ◽  
Vol 66 (2) ◽  
pp. 239-246
Author(s):  
Xu Hua ◽  
Xue Hengxin ◽  
Chen Zhiguo

To overcome the tendency of traditional TSK (Takagi-Sugeno-Kang) fuzzy inference training to become trapped in local minima, this paper develops a TSK fuzzy system modeling approach based on principles of the human visual system and the Weber law. This approach not only exploits the strong object-identification capability of the human eye, but also takes the distribution structure of the training data set into account during parameter regulation. To overcome the slow convergence of the gradient learning algorithm it employs, a novel visual TSK fuzzy system model based on evolutionary learning is proposed by introducing the particle swarm optimization algorithm. The main advantages of this method are its strong optimization capability, strong noise immunity, and good interpretability. The new method is applied to long-term hydrological forecasting examples. The simulation results show that the method is feasible and effective; it not only inherits the advantages of traditional visual TSK fuzzy models but also achieves better global convergence and accuracy than the traditional model.
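For reference, a compact particle swarm optimization loop of the kind used to tune the TSK parameters is sketched below; the `loss` callable stands in for the fuzzy model's training error on the hydrological data, and the hyperparameters are textbook defaults rather than the paper's settings.

```python
# Generic particle swarm optimization over a flat parameter vector (sketch).
import numpy as np

def pso(loss, dim, n_particles=30, iters=200, w=0.7, c1=1.5, c2=1.5, seed=0):
    rng = np.random.default_rng(seed)
    x = rng.uniform(-1, 1, (n_particles, dim))        # particle positions
    v = np.zeros_like(x)                              # particle velocities
    pbest, pbest_val = x.copy(), np.array([loss(p) for p in x])
    g = pbest[pbest_val.argmin()].copy()              # global best position
    for _ in range(iters):
        r1, r2 = rng.random(x.shape), rng.random(x.shape)
        v = w * v + c1 * r1 * (pbest - x) + c2 * r2 * (g - x)
        x = x + v
        vals = np.array([loss(p) for p in x])
        improved = vals < pbest_val
        pbest[improved], pbest_val[improved] = x[improved], vals[improved]
        g = pbest[pbest_val.argmin()].copy()
    return g                                          # best parameter vector found
```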


Author(s):  
Xingjie Fang ◽  
Liping Wang ◽  
Don Beeson ◽  
Gene Wiggs

Radial Basis Function (RBF) metamodels have recently attracted increased interest due to their significant advantages over other types of non-parametric metamodels. However, because of the interpolating nature of the RBF mathematics, the accuracy of the model may dramatically deteriorate if the training data set contains duplicate information, noise, or outliers. Also, constructing the metamodel may be time-consuming whenever the training data sets are large or a high-dimensional model is required. In this paper, we propose a robust and efficient RBF metamodeling approach based on data pre-processing techniques that alleviate the accuracy and efficiency issues commonly encountered when RBF models are used in typical real engineering situations. These techniques include 1) the removal of duplicate training data information, 2) the generation of smaller, uniformly distributed subsets of training data from large data sets, and 3) the quantification and identification of outliers by principal component analysis (PCA) and Hotelling statistics. Simulation results are used to validate the generalization accuracy and efficiency of the proposed approach.
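A minimal sketch of the three pre-processing steps, using plain NumPy/scikit-learn equivalents rather than the authors' implementation, might look as follows; the evenly spaced subsetting and the chi-square control limit for Hotelling's T² are simplifying assumptions.

```python
# Duplicate removal, crude uniform subsetting, and PCA/Hotelling-T2 outlier filtering (sketch).
import numpy as np
from scipy.stats import chi2
from sklearn.decomposition import PCA

def preprocess(X, subset_size=None, n_components=2, quantile=0.99):
    X = np.unique(X, axis=0)                                   # 1) remove duplicate rows
    if subset_size is not None and subset_size < len(X):
        idx = np.linspace(0, len(X) - 1, subset_size, dtype=int)
        X = X[np.argsort(X[:, 0])][idx]                        # 2) crude evenly spaced subset
    pca = PCA(n_components=n_components).fit(X)
    t2 = np.sum(pca.transform(X) ** 2 / pca.explained_variance_, axis=1)  # Hotelling T^2
    limit = chi2.ppf(quantile, df=n_components)                # 3) approximate control limit
    return X[t2 < limit]                                       # keep non-outlying points
```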


2020 ◽  
Vol 17 (8) ◽  
pp. 1406-1410 ◽  
Author(s):  
Chuan Zhao ◽  
Haitao Guo ◽  
Jun Lu ◽  
Donghang Yu ◽  
Daoji Li ◽  
...  

Author(s):  
YAN LI ◽  
EDWARD HUNG ◽  
KORRIS CHUNG ◽  
JOSHUA HUANG

In this paper, a new classification method (ADCC) for high-dimensional data is proposed. In this method, a decision cluster classification (DCC) model consists of a set of disjoint decision clusters, each labeled with a dominant class that determines the class of new objects falling in the cluster. A cluster tree is first generated from the training data set by recursively calling a variable-weighting k-means algorithm; the DCC model is then extracted from the tree. Various tests, including the Anderson–Darling test, are used to determine the stopping condition for growing the tree. A series of experiments on both synthetic and real data sets shows that ADCC performs better in accuracy and scalability than existing methods such as k-NN, decision trees, and SVM. ADCC is particularly suitable for large, high-dimensional data with many classes.
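A simplified sketch of a decision-cluster classifier in this spirit is shown below; it uses plain scikit-learn k-means and a purity threshold in place of the variable-weighting k-means and the Anderson–Darling stopping test described in the abstract.

```python
# Recursive decision-cluster construction and nearest-centre classification (sketch).
import numpy as np
from collections import Counter
from sklearn.cluster import KMeans

def build_clusters(X, y, k=2, min_size=20, purity=0.95, clusters=None):
    if clusters is None:
        clusters = []
    dominant, n_dom = Counter(y).most_common(1)[0]
    if len(X) <= min_size or n_dom / len(X) >= purity:
        clusters.append((X.mean(axis=0), dominant))            # leaf: centre + dominant class
        return clusters
    labels = KMeans(n_clusters=k, n_init=10).fit_predict(X)
    if len(set(labels)) < k:                                   # degenerate split: stop here
        clusters.append((X.mean(axis=0), dominant))
        return clusters
    for c in range(k):
        m = labels == c
        build_clusters(X[m], y[m], k, min_size, purity, clusters)
    return clusters

def classify(clusters, x):
    centres = np.array([c for c, _ in clusters])
    labels = [lab for _, lab in clusters]
    return labels[int(np.argmin(np.linalg.norm(centres - x, axis=1)))]
```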


2016 ◽  
Vol 9 (2) ◽  
pp. 753-764 ◽  
Author(s):  
Qingyong Li ◽  
Zhen Zhang ◽  
Weitao Lu ◽  
Jun Yang ◽  
Ying Ma ◽  
...  

Abstract. Automatic cloud classification has attracted more and more attention with the development of whole-sky imagers, but it remains a work in progress for ground-based cloud observation. This paper proposes a new cloud classification method, named bag of micro-structures (BoMS). This method treats an all-sky image as a collection of micro-structures mapped from image patches, rather than a collection of pixels, and represents the image with a weighted histogram of micro-structures. Based on this representation, BoMS recognizes the cloud class of the image with a support vector machine (SVM) classifier. Five classes of sky condition are identified: cirriform, cumuliform, stratiform, clear sky, and mixed cloudiness. BoMS is evaluated on a large data set containing 5000 all-sky images captured by a total-sky cloud imager located in Tibet (29.25° N, 88.88° E). BoMS achieves an accuracy of 90.9 % for 10-fold cross-validation, outperforming state-of-the-art methods by 19 %. Furthermore, the influence of key parameters in BoMS is investigated to verify its robustness.
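The bag-of-features pipeline described above can be sketched roughly as follows; the patch size, codebook size, and plain histogram weighting are illustrative assumptions, not the settings used in BoMS.

```python
# Patches -> micro-structure codebook -> per-image histogram -> SVM (sketch).
import numpy as np
from sklearn.cluster import MiniBatchKMeans
from sklearn.feature_extraction.image import extract_patches_2d
from sklearn.svm import SVC

def patch_features(img, patch=8, max_patches=200):
    p = extract_patches_2d(img, (patch, patch), max_patches=max_patches, random_state=0)
    return p.reshape(len(p), -1).astype(float)

def bag_of_microstructures(images, codebook):
    hists = []
    for img in images:
        words = codebook.predict(patch_features(img))           # map each patch to a "micro-structure"
        h = np.bincount(words, minlength=codebook.n_clusters).astype(float)
        hists.append(h / h.sum())                               # normalized histogram of micro-structures
    return np.array(hists)

# codebook = MiniBatchKMeans(n_clusters=300, n_init=3).fit(
#     np.vstack([patch_features(img) for img in train_images]))
# clf = SVC(kernel="rbf").fit(bag_of_microstructures(train_images, codebook), train_labels)
```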


1992 ◽  
Vol 26 (9-11) ◽  
pp. 2345-2348 ◽  
Author(s):  
C. N. Haas

A new method for the quantitative analysis of multiple toxicity data is described and illustrated using a data set on metal exposure to copepods. Positive interactions are observed for Ni-Pb and Pb-Cr, with weak negative interactions observed for Ni-Cr.


2019 ◽  
Vol 12 (2) ◽  
pp. 120-127 ◽  
Author(s):  
Wael Farag

Background: In this paper, a Convolutional Neural Network (CNN) that learns safe driving behavior and smooth steering manoeuvring is proposed in support of autonomous driving technologies. The training data are collected from a front-facing camera and the steering commands issued by an experienced driver driving in traffic as well as on urban roads. Methods: These data are then used to train the proposed CNN to perform what is called "Behavioral Cloning". The proposed behavioral cloning CNN is named "BCNet", and its deep seventeen-layer architecture was selected after extensive trials. BCNet is trained using the Adam optimization algorithm, a variant of the Stochastic Gradient Descent (SGD) technique. Results: The paper describes the development and training process in detail and presents the image processing pipeline harnessed in the development. Conclusion: After extensive simulations, the proposed approach proved successful in cloning the driving behavior embedded in the training data set.
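A minimal behavioral-cloning regressor in Keras is sketched below for illustration: camera frames in, a single steering command out, trained with Adam and a mean-squared-error loss. The layer sizes and input shape are illustrative and do not reproduce the paper's seventeen-layer BCNet.

```python
# Small end-to-end steering regressor trained with Adam (sketch, not BCNet itself).
import tensorflow as tf

def build_model(input_shape=(66, 200, 3)):
    m = tf.keras.Sequential([
        tf.keras.layers.Rescaling(1.0 / 255.0, input_shape=input_shape),
        tf.keras.layers.Conv2D(24, 5, strides=2, activation="relu"),
        tf.keras.layers.Conv2D(36, 5, strides=2, activation="relu"),
        tf.keras.layers.Conv2D(48, 5, strides=2, activation="relu"),
        tf.keras.layers.Conv2D(64, 3, activation="relu"),
        tf.keras.layers.Flatten(),
        tf.keras.layers.Dense(100, activation="relu"),
        tf.keras.layers.Dropout(0.5),
        tf.keras.layers.Dense(10, activation="relu"),
        tf.keras.layers.Dense(1),                      # predicted steering angle
    ])
    m.compile(optimizer=tf.keras.optimizers.Adam(1e-4), loss="mse")
    return m

# model = build_model()
# model.fit(frames, steering_angles, epochs=10, validation_split=0.2)
```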


Author(s):  
Ritu Khandelwal ◽  
Hemlata Goyal ◽  
Rajveer Singh Shekhawat

Introduction: Machine learning is an intelligent technology that acts as a bridge between business and data science. With data science involved, the business goal is to obtain valuable insights from the available data. A large part of Indian cinema is Bollywood, a multi-million-dollar industry. This paper attempts to predict whether an upcoming Bollywood movie will be a Blockbuster, Superhit, Hit, Average, or Flop by applying machine learning techniques for classification and prediction. The first step in building a classifier or prediction model is the learning stage, in which the training data set is used to train the model with a chosen technique or algorithm; the rules generated in this stage constitute the model used to predict future trends in different types of organizations. Methods: Classification and prediction techniques including Support Vector Machine (SVM), Random Forest, Decision Tree, Naïve Bayes, Logistic Regression, AdaBoost, and KNN are applied in search of efficient and effective results. All of these functionalities are available through GUI-based workflows organized into categories such as Data, Visualize, Model, and Evaluate. Result: The classifiers built in the learning stage are compared on parameters such as accuracy and the confusion matrix to identify the best possible model for predicting movie success. Conclusion: Using this comparative analysis, production houses can plan advertisement propaganda and the best time to release a movie according to its predicted success rate, in order to gain higher benefits. Discussion: Data mining is the process of discovering patterns in large data sets and, from those patterns, relationships that help solve business problems and predict forthcoming trends; such predictions can help production houses plan advertisement propaganda and their costs, making a movie more profitable.
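A minimal scikit-learn version of the comparative setup might look like the sketch below; the feature engineering for the Bollywood data set is outside its scope, and `X`/`y` are assumed to be a prepared feature matrix and success labels.

```python
# Train the listed classifiers on one split and compare accuracy and confusion matrices (sketch).
from sklearn.model_selection import train_test_split
from sklearn.metrics import accuracy_score, confusion_matrix
from sklearn.svm import SVC
from sklearn.ensemble import RandomForestClassifier, AdaBoostClassifier
from sklearn.tree import DecisionTreeClassifier
from sklearn.naive_bayes import GaussianNB
from sklearn.linear_model import LogisticRegression
from sklearn.neighbors import KNeighborsClassifier

def compare_models(X, y):
    X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.2, stratify=y, random_state=0)
    models = {
        "SVM": SVC(), "RandomForest": RandomForestClassifier(),
        "DecisionTree": DecisionTreeClassifier(), "NaiveBayes": GaussianNB(),
        "LogisticRegression": LogisticRegression(max_iter=1000),
        "AdaBoost": AdaBoostClassifier(), "KNN": KNeighborsClassifier(),
    }
    for name, clf in models.items():
        pred = clf.fit(X_tr, y_tr).predict(X_te)
        print(name, accuracy_score(y_te, pred))        # headline accuracy per model
        print(confusion_matrix(y_te, pred))            # per-class breakdown
```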


2019 ◽  
Vol 9 (6) ◽  
pp. 1128 ◽  
Author(s):  
Yundong Li ◽  
Wei Hu ◽  
Han Dong ◽  
Xueyan Zhang

Using aerial cameras, satellite remote sensing, or unmanned aerial vehicles (UAVs) equipped with cameras can facilitate search and rescue tasks after disasters. The traditional manual interpretation of huge volumes of aerial imagery is inefficient and can be replaced by machine-learning-based methods combined with image processing techniques. With the development of machine learning, researchers have found that convolutional neural networks can effectively extract features from images. Some target detection methods based on deep learning, such as the single-shot multibox detector (SSD) algorithm, achieve better results than traditional methods. However, the impressive performance of machine-learning-based methods rests on numerous labeled samples, and given the complexity of post-disaster scenarios, obtaining many samples in the aftermath of a disaster is difficult. To address this issue, a damaged-building assessment method using SSD with pretraining and data augmentation is proposed in the current study and highlights the following aspects. (1) Objects are detected and classified as undamaged buildings, damaged buildings, or ruins. (2) A convolutional auto-encoder (CAE) based on VGG16 is constructed and trained using unlabeled post-disaster images. As a transfer learning strategy, the weights of the SSD model are initialized with the weights of the CAE counterpart. (3) Data augmentation strategies, such as image mirroring, rotation, Gaussian blur, and Gaussian noise, are used to augment the training data set. As a case study, aerial images of Hurricane Sandy in 2012 were used to validate the proposed method's effectiveness. Experiments show that the pretraining strategy improves overall accuracy by 10% compared with an SSD trained from scratch, and that the data augmentation strategies improve mAP and mF1 by 72% and 20%, respectively. Finally, the experiment is further verified on another data set, from Hurricane Irma, and it is concluded that the proposed method is feasible.
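The augmentation step (mirroring, rotation, Gaussian blur, Gaussian noise) can be sketched as follows with NumPy and SciPy; the parameter ranges are illustrative, and the SSD/CAE training itself is not shown.

```python
# Simple image augmentation: mirror, rotate, blur, and add Gaussian noise (sketch).
import numpy as np
from scipy.ndimage import gaussian_filter, rotate

def augment(img, rng=None):
    if rng is None:
        rng = np.random.default_rng(0)
    out = [img, np.fliplr(img)]                                   # horizontal mirror
    out.append(rotate(img, angle=rng.uniform(-15, 15), reshape=False, mode="nearest"))
    out.append(gaussian_filter(img, sigma=1.0))                   # Gaussian blur
    noisy = img.astype(float) + rng.normal(0.0, 5.0, img.shape)   # additive Gaussian noise
    out.append(np.clip(noisy, 0, 255).astype(img.dtype))
    return out  # original plus four augmented variants
```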

