Climate Regionalization of Asphalt Pavement Based on the K-Means Clustering Algorithm

The climate regionalization of asphalt pavement plays an active role in ensuring the good performance and service life of asphalt pavement. In order to better adapt to the climate characteristics of a region, this study developed a multi-index method of climate regionalization of asphalt pavement. First, meteorological data from the research region were statistically analyzed and the major climate variables were identified. Then, a principal component analysis (PCA) was used to eliminate any correlation between the major climate variables. Three principal components were extracted by the PCA as cluster factors, namely, the temperature factor, precipitation factor, and radiation factor. The research region was divided into the following four asphalt pavement climate zones via the K-means clustering algorithm. Those zones are affected by the climate comprehensively: an inland zone with high temperatures, little rainfall, and radiation, a coastal zone with high temperatures, and a rainy mountainous zone. The results of the climate regionalization were compared with the results of on-site investigations. The pavement degradation in each climatic zone was related to the climate characteristics of the region. Probabilistic neural network (PNN) and support vector machine (SVM) climate regionalization predictive models were established with MATLAB. The clustering factors were used as the input data to identify the climate zones, and the identification accuracy rate was determined to be over 90%. The climate regionalization of pavement can provide reference and guidance for the selection of reasonable technical measures, parameters, and building materials in highway projects with similar climatic conditions.

Download Full-text

AUTOMATED DIAGNOSIS OF BRAIN TUMOURS ASTROCYTOMAS USING PROBABILISTIC NEURAL NETWORK CLUSTERING AND SUPPORT VECTOR MACHINES

International Journal of Neural Systems ◽

10.1142/s0129065705000013 ◽

2005 ◽

Vol 15 (01n02) ◽

pp. 1-11 ◽

Cited By ~ 26

Author(s):

DIMITRIS GLOTSOS ◽

JUSSI TOHKA ◽

PANAGIOTA RAVAZOULA ◽

DIONISIS CAVOURAS ◽

GEORGE NIKIFORIDIS

Keyword(s):

Neural Network ◽

Clustering Algorithm ◽

Probabilistic Neural Network ◽

Support Vector ◽

Network Clustering ◽

High Grade ◽

Cell Nuclei ◽

Decision Tree Classification ◽

Average Accuracy ◽

Malignancy Grading

A computer-aided diagnosis system was developed for assisting brain astrocytomas malignancy grading. Microscopy images from 140 astrocytic biopsies were digitized and cell nuclei were automatically segmented using a Probabilistic Neural Network pixel-based clustering algorithm. A decision tree classification scheme was constructed to discriminate low, intermediate and high-grade tumours by analyzing nuclear features extracted from segmented nuclei with a Support Vector Machine classifier. Nuclei were segmented with an average accuracy of 86.5%. Low, intermediate, and high-grade tumours were identified with 95%, 88.3%, and 91% accuracies respectively. The proposed algorithm could be used as a second opinion tool for the histopathologists.

Download Full-text

Detection and Classification of Breast Cancer in Mammography Images Using Pattern Recognition Methods

10.30699/acadpub.mci.3.4.13 ◽

2019 ◽

Vol 3 (4) ◽

pp. 13-24 ◽

Cited By ~ 1

Author(s):

Naser Safdarian ◽

Mohammadreza Hedyezadeh

Keyword(s):

Breast Cancer ◽

Neural Network ◽

Clustering Algorithm ◽

Probabilistic Neural Network ◽

Screening Mammography ◽

Fuzzy Classification ◽

Support Vector ◽

Svm Classifier ◽

Final Decision ◽

Pattern Recognition Methods

Introduction: In this paper, a method is presented to classify the breast cancer masses according to new geometric features. Methods: After obtaining digital breast mammogram images from the digital database for screening mammography (DDSM), image preprocessing was performed. Then, by using image processing methods, an algorithm was developed for automatic extracting of masses from other normal parts of the breast image. In this study, 19 final different features of each image were extracted to generate the feature vector for classifier input. The proposed method not only determined the boundary of masses but also classified the type of masses such as benign and malignant ones. The neural network classification methods such as the radial basis function (RBF), probabilistic neural network (PNN), and multi-layer perceptron (MLP) as well as the Takagi-Sugeno-Kang (TSK) fuzzy classification, the binary statistic classifier, and the k-nearest neighbors (KNN) clustering algorithm were used for the final decision of mass class. Results: The best results of the proposed method for accuracy, sensitivity, and specificity metrics were obtained 97%±4.36, 100%±0 and 96%±5.81, respectively for support vector machine (SVM) classifier. Conclusions: By comparing the results of the proposed method with the results of the other previous methods, the efficiency of the proposed algorithm was reported.

Download Full-text

Enhanced clustering algorithm based on fuzzy C-means and support vector machine

Journal of Computer Applications ◽

10.3724/sp.j.1087.2013.00991 ◽

2013 ◽

Vol 33 (4) ◽

pp. 991-993

Author(s):

Lei HU ◽

Qinzhou NIU ◽

Yan CHEN

Keyword(s):

Support Vector Machine ◽

Clustering Algorithm ◽

Support Vector ◽

Fuzzy C Means

Download Full-text

Application of Machine Learning in Animal Disease Analysis and Prediction

Current Bioinformatics ◽

10.2174/1574893615999200728195613 ◽

2020 ◽

Vol 15 ◽

Author(s):

Shuwen Zhang ◽

Qiang Su ◽

Qin Chen

Keyword(s):

Machine Learning ◽

Unsupervised Learning ◽

Supervised Learning ◽

Clustering Algorithm ◽

Principal Component ◽

Support Vector ◽

Animal Disease ◽

Human Beings ◽

Animal Diseases ◽

Disease Analysis

Abstract: Major animal diseases pose a great threat to animal husbandry and human beings. With the deepening of globalization and the abundance of data resources, the prediction and analysis of animal diseases by using big data are becoming more and more important. The focus of machine learning is to make computers learn how to learn from data and use the learned experience to analyze and predict. Firstly, this paper introduces the animal epidemic situation and machine learning. Then it briefly introduces the application of machine learning in animal disease analysis and prediction. Machine learning is mainly divided into supervised learning and unsupervised learning. Supervised learning includes support vector machines, naive bayes, decision trees, random forests, logistic regression, artificial neural networks, deep learning, and AdaBoost. Unsupervised learning has maximum expectation algorithm, principal component analysis hierarchical clustering algorithm and maxent. Through the discussion of this paper, people have a clearer concept of machine learning and understand its application prospect in animal diseases.

Download Full-text

Similarity detection of English text and teaching evaluation based on improved TCUSS clustering algorithm

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-189576 ◽

2020 ◽

pp. 1-11

Author(s):

Yu Wang

Keyword(s):

Language Processing ◽

Calculation Method ◽

Clustering Algorithm ◽

Research Result ◽

Structural Features ◽

English Text ◽

Support Vector ◽

Similarity Calculation ◽

Other Information ◽

Calculation Task

The semantic similarity calculation task of English text has important influence on other fields of natural language processing and has high research value and application prospect. At present, research on the similarity calculation of short texts has achieved good results, but the research result on long text sets is still poor. This paper proposes a similarity calculation method that combines planar features with structured features and uses support vector regression models. Moreover, this paper uses PST and PDT to represent the syntax, semantics and other information of the text. In addition, through the two structural features suitable for text similarity calculation, this paper proposes a similarity calculation method combining structural features with Tree-LSTM model. Experiments show that this method provides a new idea for interest network extraction.

Download Full-text

Pinball Loss Twin Support Vector Clustering

ACM Transactions on Multimedia Computing Communications and Applications ◽

10.1145/3409264 ◽

2021 ◽

Vol 17 (2s) ◽

pp. 1-23

Author(s):

M. Tanveer ◽

Tarun Gupta ◽

Miten Shah ◽

Keyword(s):

Loss Function ◽

Clustering Algorithm ◽

Clustering Algorithms ◽

Structural Mri ◽

Twin Support Vector Machine ◽

Support Vector ◽

Support Vector Clustering ◽

Hinge Loss ◽

Pinball Loss ◽

Vector Clustering

Twin Support Vector Clustering (TWSVC) is a clustering algorithm inspired by the principles of Twin Support Vector Machine (TWSVM). TWSVC has already outperformed other traditional plane based clustering algorithms. However, TWSVC uses hinge loss, which maximizes shortest distance between clusters and hence suffers from noise-sensitivity and low re-sampling stability. In this article, we propose Pinball loss Twin Support Vector Clustering (pinTSVC) as a clustering algorithm. The proposed pinTSVC model incorporates the pinball loss function in the plane clustering formulation. Pinball loss function introduces favorable properties such as noise-insensitivity and re-sampling stability. The time complexity of the proposed pinTSVC remains equivalent to that of TWSVC. Extensive numerical experiments on noise-corrupted benchmark UCI and artificial datasets have been provided. Results of the proposed pinTSVC model are compared with TWSVC, Twin Bounded Support Vector Clustering (TBSVC) and Fuzzy c-means clustering (FCM). Detailed and exhaustive comparisons demonstrate the better performance and generalization of the proposed pinTSVC for noise-corrupted datasets. Further experiments and analysis on the performance of the above-mentioned clustering algorithms on structural MRI (sMRI) images taken from the ADNI database, face clustering, and facial expression clustering have been done to demonstrate the effectiveness and feasibility of the proposed pinTSVC model.

Download Full-text

The transferability of random forest and support vector machine for estimating daily global solar radiation using sunshine duration over different climate zones

Theoretical and Applied Climatology ◽

10.1007/s00704-021-03726-6 ◽

2021 ◽

Author(s):

Wei Wu ◽

Mao-Fen Li ◽

Xia Xu ◽

Xiao-Ping Tang ◽

Chao Yang ◽

...

Keyword(s):

Support Vector Machine ◽

Random Forest ◽

Solar Radiation ◽

Sunshine Duration ◽

Global Solar Radiation ◽

Support Vector ◽

Climate Zones

Download Full-text

Pretrained Convolutional Neural Networks Perform Well in a Challenging Test Case: Identification of Plant Bugs (Hemiptera: Miridae) Using a Small Number of Training Images

Insect Systematics and Diversity ◽

10.1093/isd/ixab004 ◽

2021 ◽

Vol 5 (2) ◽

Author(s):

Alexander Knyshov ◽

Samantha Hoang ◽

Christiane Weirauch

Keyword(s):

Neural Networks ◽

Convolutional Neural Networks ◽

Training Image ◽

Species Level ◽

Identification Accuracy ◽

Support Vector ◽

Test Case ◽

Learning Approaches ◽

Lower Accuracy ◽

Neural Network Classifiers

Abstract Automated insect identification systems have been explored for more than two decades but have only recently started to take advantage of powerful and versatile convolutional neural networks (CNNs). While typical CNN applications still require large training image datasets with hundreds of images per taxon, pretrained CNNs recently have been shown to be highly accurate, while being trained on much smaller datasets. We here evaluate the performance of CNN-based machine learning approaches in identifying three curated species-level dorsal habitus datasets for Miridae, the plant bugs. Miridae are of economic importance, but species-level identifications are challenging and typically rely on information other than dorsal habitus (e.g., host plants, locality, genitalic structures). Each dataset contained 2–6 species and 126–246 images in total, with a mean of only 32 images per species for the most difficult dataset. We find that closely related species of plant bugs can be identified with 80–90% accuracy based on their dorsal habitus alone. The pretrained CNN performed 10–20% better than a taxon expert who had access to the same dorsal habitus images. We find that feature extraction protocols (selection and combination of blocks of CNN layers) impact identification accuracy much more than the classifying mechanism (support vector machine and deep neural network classifiers). While our network has much lower accuracy on photographs of live insects (62%), overall results confirm that a pretrained CNN can be straightforwardly adapted to collection-based images for a new taxonomic group and successfully extract relevant features to classify insect species.

Download Full-text

Random Forest Model for Trip End Identification Using Cellular Phone and Points of Interest Data

Transportation Research Record Journal of the Transportation Research Board ◽

10.1177/03611981211031537 ◽

2021 ◽

pp. 036119812110315

Author(s):

Fei Yang ◽

Yanchen Wang ◽

Peter J. Jin ◽

Dingbang Li ◽

Zhenxing Yao

Keyword(s):

Random Forest ◽

Clustering Algorithm ◽

Subjective Experience ◽

Average Distance ◽

Cellular Phone ◽

Random Forest Model ◽

Support Vector ◽

Rule Based ◽

Forest Model ◽

Points Of Interest

Cellular phone data has been proven to be valuable in the analysis of residents’ travel patterns. Existing studies mostly identify the trip ends through rule-based or clustering algorithms. These methods largely depend on subjective experience and users’ communication behaviors. Moreover, limited by privacy policy, the accuracy of these methods is difficult to assess. In this paper, points of interest data is applied to supplement cellular phone data’s missing information generated by users’ behaviors. Specifically, a random forest model for trip end identification is proposed using multi-dimensional attributes. A field data acquisition test is designed and conducted with communication operators to implement synchronized cellular phone data and real trip information collection. The proposed identification approach is empirically evaluated with real trip information. Results show that the overall trip end detection precision and recall reach 95.2% and 88.7% with an average distance error of 269 m, and the time errors of the trip ends are less than 10 min. Compared with the rule-based approach, clustering algorithm, naive Bayes method, and support vector machine, the proposed method has better performance in accuracy and consistency.

Download Full-text

A novel method for spacecraft electrical fault detection based on FCM clustering and WPSVM classification with PCA feature extraction

Proceedings of the Institution of Mechanical Engineers Part G Journal of Aerospace Engineering ◽

10.1177/0954410016638874 ◽

2016 ◽

Vol 231 (1) ◽

pp. 98-108 ◽

Cited By ~ 11

Author(s):

Ke Li ◽

Yalei Wu ◽

Shimin Song ◽

Yi sun ◽

Jun Wang ◽

...

Keyword(s):

Feature Extraction ◽

Clustering Algorithm ◽

High Efficiency ◽

Computing Time ◽

Principal Component ◽

Component Analysis ◽

Support Vector ◽

Electrical Characteristics ◽

Complex Signals ◽

Fcm Clustering

The measurement of spacecraft electrical characteristics and multi-label classification issues are generally including a large amount of unlabeled test data processing, high-dimensional feature redundancy, time-consumed computation, and identification of slow rate. In this paper, a fuzzy c-means offline (FCM) clustering algorithm and the approximate weighted proximal support vector machine (WPSVM) online recognition approach have been proposed to reduce the feature size and improve the speed of classification of electrical characteristics in the spacecraft. In addition, the main component analysis for the complex signals based on the principal component feature extraction is used for the feature selection process. The data capture contribution approach by using thresholds is furthermore applied to resolve the selection problem of the principal component analysis (PCA), which effectively guarantees the validity and consistency of the data. Experimental results indicate that the proposed approach in this paper can obtain better fault diagnosis results of the spacecraft electrical characteristics’ data, improve the accuracy of identification, and shorten the computing time with high efficiency.

Download Full-text