Coke Index Analysis Based on Principal Component Analysis and Decision Tree Mining

2011 ◽  
Vol 48-49 ◽  
pp. 318-322 ◽  
Author(s):  
Hong Wei Guo ◽  
Bu Xin Su ◽  
Jian Chang ◽  
Jian Liang Zhang ◽  
Wei Chao Cao

Current analysis in the relations between blast furnace production index and coke index is still using the traditional statistical analysis method,but it involves too many coke quality evaluation indexes and there are some overlap between the indexes. According to this situation, this paper puts forward a new method based on principal component analysis and decision tree mining to analyze the relations between blast furnace production index and coke index . The materials of blast furnace production mainly include ore, coke and coal, in which the coke quality index have the biggest influence on the blast furnace production index. It has profound meaning to analyze the relation between coke index and blast furnace production index to evaluate Coke quality indicators reasonably[1] and improve the blast furnace production index. Current analysis in the relations between blast furnace production index and coke index is still using the traditional statistical analysis method[2],but it involves too many coke quality evaluation indexes and there are some overlap between the indexes. According to this situation, this paper puts forward a new method based on principal component analysis and decision-tree-based data-mining to analyze the relations between blast furnace production index and coke index. On the one hand this method can get few representative indexes from so many evaluation indexes by principal component analysis; on the other hand, decision-tree-based data-mining on the coke representative index based on the principal component analysis can get accurately quantitative relation between blast furnace production index and coke index.

2009 ◽  
Vol 147-149 ◽  
pp. 588-593 ◽  
Author(s):  
Marcin Derlatka ◽  
Jolanta Pauk

In the paper the procedure of processing biomechanical data has been proposed. It consists of selecting proper noiseless data, preprocessing data by means of model’s identification and Kernel Principal Component Analysis and next classification using decision tree. The obtained results of classification into groups (normal and two selected pathology of gait: Spina Bifida and Cerebral Palsy) were very good.


2021 ◽  
Author(s):  
Anwar Yahya Ebrahim ◽  
Hoshang Kolivand

The authentication of writers, handwritten autograph is widely realized throughout the world, the thorough check of the autograph is important before going to the outcome about the signer. The Arabic autograph has unique characteristics; it includes lines, and overlapping. It will be more difficult to realize higher achievement accuracy. This project attention the above difficulty by achieved selected best characteristics of Arabic autograph authentication, characterized by the number of attributes representing for each autograph. Where the objective is to differentiate if an obtain autograph is genuine, or a forgery. The planned method is based on Discrete Cosine Transform (DCT) to extract feature, then Spars Principal Component Analysis (SPCA) to selection significant attributes for Arabic autograph handwritten recognition to aid the authentication step. Finally, decision tree classifier was achieved for signature authentication. The suggested method DCT with SPCA achieves good outcomes for Arabic autograph dataset when we have verified on various techniques.


2021 ◽  
Vol 13 (20) ◽  
pp. 4123
Author(s):  
Hanqi Wang ◽  
Zhiling Wang ◽  
Linglong Lin ◽  
Fengyu Xu ◽  
Jie Yu ◽  
...  

Vehicle pose estimation is essential in autonomous vehicle (AV) perception technology. However, due to the different density distributions of the point cloud, it is challenging to achieve sensitive direction extraction based on 3D LiDAR by using the existing pose estimation methods. In this paper, an optimal vehicle pose estimation network based on time series and spatial tightness (TS-OVPE) is proposed. This network uses five pose estimation algorithms proposed as candidate solutions to select each obstacle vehicle's optimal pose estimation result. Among these pose estimation algorithms, we first propose the Basic Line algorithm, which uses the road direction as the prior knowledge. Secondly, we propose improving principal component analysis based on point cloud distribution to conduct rotating principal component analysis (RPCA) and diagonal principal component analysis (DPCA) algorithms. Finally, we propose two global algorithms independent of the prior direction. We provided four evaluation indexes to transform each algorithm into a unified dimension. These evaluation indexes’ results were input into the ensemble learning network to obtain the optimal pose estimation results from the five proposed algorithms. The spatial dimension evaluation indexes reflected the tightness of the bounding box and the time dimension evaluation index reflected the coherence of the direction estimation. Since the network was indirectly trained through the evaluation index, it could be directly used on untrained LiDAR and showed a good pose estimation performance. Our approach was verified on the SemanticKITTI dataset and our urban environment dataset. Compared with the two mainstream algorithms, the polygon intersection over union (P-IoU) average increased by about 5.25% and 9.67%, the average heading error decreased by about 29.49% and 44.11%, and the average speed direction error decreased by about 3.85% and 46.70%. The experiment results showed that the ensemble learning network could effectively select the optimal pose estimation from the five abovementioned algorithms, making pose estimation more accurate.


2020 ◽  
Vol 27 (4) ◽  
pp. 1-16
Author(s):  
Meenal Jain ◽  
Gagandeep Kaur

Due to the launch of new applications the behavior of internet traffic is changing. Hackers are always looking for sophisticated tools to launch attacks and damage the services. Researchers have been working on intrusion detection techniques involving machine learning algorithms for supervised and unsupervised detection of these attacks. However, with newly found attacks these techniques need to be refined. Handling data with large number of attributes adds to the problem. Therefore, dimensionality based feature reduction of the data is required. In this work three reduction techniques, namely, Principal Component Analysis (PCA), Artificial Neural Network (ANN), and Nonlinear Principal Component Analysis (NLPCA) have been studied and analyzed. Secondly, performance of four classifiers, namely, Decision Tree (DT), Support Vector Machine (SVM), K Nearest Neighbor (KNN) and Naïve Bayes (NB) has been studied for the actual and reduced datasets. In addition, novel performance measurement metrics, Classification Difference Measure (CDM), Specificity Difference Measure (SPDM), Sensitivity Difference Measure (SNDM), and F1 Difference Measure (F1DM) have been defined and used to compare the outcomes on actual and reduced datasets. Comparisons have been done using new Coburg Intrusion Detection Data Set (CIDDS-2017) dataset as well widely referred NSL-KDD dataset. Successful results were achieved for Decision Tree with 99.0 percent and 99.8 percent accuracy on CIDDS and NSLKDD datasets respectively.


Author(s):  
Putri Kurnia Handayani

Data mining merupakan salah bidang ilmu yang bermanfaat untuk pengenalan pola/knowledge yang tersimpan dalam database. Klasifikasi merupakan salah satu peran dalam bidang data mining. Termasuk ke dalam supervised learning, klasifikasi digunakan untuk memprediksi objek yang belum memiliki kelas/label. Penggunaan algoritma decision tree untuk proses mining dataset bunga iris dikarenakan kemudahan dalam representasi knowledge yang dihasilkan. Selain itu, decision tree juga termasuk ke dalam eager learner sehingga akurasi dari knowledge yang dihasilkan lebih baik. Penggunaan principal component analysis (PCA) dalam optimasi algoritma decision tree, dilakukan saat preprocessing dataset. PCA berfungsi untuk mereduksi dimensi, fitur yang saling berkorelasi akan dipertahankan. Penggunaan dataset publik bunga iris diambil dari UCI Repository. Berdasarkan hasil perhitungan, akurasi algoritma decision tree setelah dilakukan optimasi dengan PCA terhadap dataset bunga iris sebesar 95.33%.


JOM ◽  
2020 ◽  
Vol 72 (11) ◽  
pp. 3908-3916
Author(s):  
Dewen Jiang ◽  
Jianliang Zhang ◽  
Zhenyang Wang ◽  
Chenfan Feng ◽  
Kexin Jiao ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document