scholarly journals Improved support vector machine using optimization techniques for an aerobic granular sludge

2020 ◽  
Vol 9 (5) ◽  
pp. 1835-1843
Author(s):  
Nur Sakinah Ahmad Yasmin ◽  
Norhaliza Abdul Wahab ◽  
Aznah Nor Anuar

Aerobic granular sludge (AGS) is one of the treatment methods often used in wastewater systems. The dynamic behavior of AGS is complex and hard to predict especially when it comes to a limited data set. Theoretically, support vector machine (SVM) is a good prediction tool in handling limited data set. In this paper, an improved SVM using optimization approaches for better predictions is proposed. Two different types of optimization are built which are particle swarm optimization (PSO) and genetic algorithm (GA). The prediction of the models using SVM-PSO, SVM-GA and SVM-Grid Search are developed and compared prior to several feature analysis for verification purposes. The experimental data under hot temperature of 50˚C obtained from sequencing batch reactor is used. From simulation results, the proposed SVM with optimizations improve the prediction of chemical oxygen demand compared to the conventional grid search method and hence provide better prediction of effluent quality using AGS wastewater treatment systems.

Author(s):  
Nur Sakinah Ahmad Yasmin ◽  
Norhaliza Abdul Wahab ◽  
Aznah Nor Anuar ◽  
Mustafa Bob

To comply with growing demand for high effluent quality of Domestic Wastewater Treatment Plant (WWTP), a simple and reliable prediction model is thus needed. The wastewater treatment technology considered in this paper is an Aerobic Granular Sludge (AGS). The AGS systems are fundamentally complex due to uncertainty and non-linearity of the system makes it hard to predict. This paper presents model predictions and optimization as a tool in predicting the performance of the AGS. The input-output data used in model prediction are (COD, TN, TP, AN, and MLSS). After feature analysis, the prediction of the models using Support Vector Machine (SVM) and Feed-Forward Neural Network (FFNN) are developed and compared. The simulation of the model uses the experimental data obtained from Sequencing Batch Reactor under hot temperature of 50˚C. The simulation results indicated that the SVM is preferable to FFNN and it can provide a useful tool in predicting the effluent quality of WWTP.


Membranes ◽  
2021 ◽  
Vol 11 (8) ◽  
pp. 554
Author(s):  
Nur Sakinah Ahmad Yasmin ◽  
Norhaliza Abdul Wahab ◽  
Fatimah Sham Ismail ◽  
Mu’azu Jibrin Musa ◽  
Mohd Hakim Ab Halim ◽  
...  

Support vector regression (SVR) models have been designed to predict the concentration of chemical oxygen demand in sequential batch reactors under high temperatures. The complex internal interaction between the sludge characteristics and their influent were used to develop the models. The prediction becomes harder when dealing with a limited dataset due to the limitation of the experimental works. A radial basis function algorithm with selected kernel parameters of cost and gamma was used to developed SVR models. The kernel parameters were selected by using a grid search method and were further optimized by using particle swarm optimization and genetic algorithm. The SVR models were then compared with an artificial neural network. The prediction results R2 were within >90% for all predicted concentration of COD. The results showed the potential of SVR for simulating the complex aerobic granulation process and providing an excellent tool to help predict the behaviour in aerobic granular reactors of wastewater treatment.


2020 ◽  
Vol 27 (4) ◽  
pp. 329-336 ◽  
Author(s):  
Lei Xu ◽  
Guangmin Liang ◽  
Baowen Chen ◽  
Xu Tan ◽  
Huaikun Xiang ◽  
...  

Background: Cell lytic enzyme is a kind of highly evolved protein, which can destroy the cell structure and kill the bacteria. Compared with antibiotics, cell lytic enzyme will not cause serious problem of drug resistance of pathogenic bacteria. Thus, the study of cell wall lytic enzymes aims at finding an efficient way for curing bacteria infectious. Compared with using antibiotics, the problem of drug resistance becomes more serious. Therefore, it is a good choice for curing bacterial infections by using cell lytic enzymes. Cell lytic enzyme includes endolysin and autolysin and the difference between them is the purpose of the break of cell wall. The identification of the type of cell lytic enzymes is meaningful for the study of cell wall enzymes. Objective: In this article, our motivation is to predict the type of cell lytic enzyme. Cell lytic enzyme is helpful for killing bacteria, so it is meaningful for study the type of cell lytic enzyme. However, it is time consuming to detect the type of cell lytic enzyme by experimental methods. Thus, an efficient computational method for the type of cell lytic enzyme prediction is proposed in our work. Method: We propose a computational method for the prediction of endolysin and autolysin. First, a data set containing 27 endolysins and 41 autolysins is built. Then the protein is represented by tripeptides composition. The features are selected with larger confidence degree. At last, the classifier is trained by the labeled vectors based on support vector machine. The learned classifier is used to predict the type of cell lytic enzyme. Results: Following the proposed method, the experimental results show that the overall accuracy can attain 97.06%, when 44 features are selected. Compared with Ding's method, our method improves the overall accuracy by nearly 4.5% ((97.06-92.9)/92.9%). The performance of our proposed method is stable, when the selected feature number is from 40 to 70. The overall accuracy of tripeptides optimal feature set is 94.12%, and the overall accuracy of Chou's amphiphilic PseAAC method is 76.2%. The experimental results also demonstrate that the overall accuracy is improved by nearly 18% when using the tripeptides optimal feature set. Conclusion: The paper proposed an efficient method for identifying endolysin and autolysin. In this paper, support vector machine is used to predict the type of cell lytic enzyme. The experimental results show that the overall accuracy of the proposed method is 94.12%, which is better than some existing methods. In conclusion, the selected 44 features can improve the overall accuracy for identification of the type of cell lytic enzyme. Support vector machine performs better than other classifiers when using the selected feature set on the benchmark data set.


Solid Earth ◽  
2016 ◽  
Vol 7 (2) ◽  
pp. 481-492 ◽  
Author(s):  
Faisal Khan ◽  
Frieder Enzmann ◽  
Michael Kersten

Abstract. Image processing of X-ray-computed polychromatic cone-beam micro-tomography (μXCT) data of geological samples mainly involves artefact reduction and phase segmentation. For the former, the main beam-hardening (BH) artefact is removed by applying a best-fit quadratic surface algorithm to a given image data set (reconstructed slice), which minimizes the BH offsets of the attenuation data points from that surface. A Matlab code for this approach is provided in the Appendix. The final BH-corrected image is extracted from the residual data or from the difference between the surface elevation values and the original grey-scale values. For the segmentation, we propose a novel least-squares support vector machine (LS-SVM, an algorithm for pixel-based multi-phase classification) approach. A receiver operating characteristic (ROC) analysis was performed on BH-corrected and uncorrected samples to show that BH correction is in fact an important prerequisite for accurate multi-phase classification. The combination of the two approaches was thus used to classify successfully three different more or less complex multi-phase rock core samples.


2013 ◽  
Vol 295-298 ◽  
pp. 644-647 ◽  
Author(s):  
Yu Kai Yao ◽  
Hong Mei Cui ◽  
Ming Wei Len ◽  
Xiao Yun Chen

SVM (Support Vector Machine) is a powerful data mining algorithm, and is mainly used to finish classification or regression tasks. In this literature, SVM is used to conduct disease prediction. We focus on integrating with stratified sample and grid search technology to improve the classification accuracy of SVM, thus, we propose an improved algorithm named SGSVM: Stratified sample and Grid search based SVM. To testify the performance of SGSVM, heart-disease data from UCI are used in our experiment, and the results show SGSVM has obvious improvement in classification accuracy, and this is very valuable especially in disease prediction.


Author(s):  
Sajid Umair ◽  
Muhammad Majid Sharif

Prediction of student performance on the basis of habits has been a very important research topic in academics. Studies show that selection of the correct data set also plays a vital role in these predictions. In this chapter, the authors took data from different schools that contains student habits and their comments, analyzed it using latent semantic analysis to get semantics, and then used support vector machine to classify the data into two classes, important for prediction and not important. Finally, they used artificial neural networks to predict the grades of students. Regression was also used to predict data coming from support vector machine, while giving only the important data for prediction.


2018 ◽  
Vol 11 (5) ◽  
pp. 2863-2878 ◽  
Author(s):  
Yu Oishi ◽  
Haruma Ishida ◽  
Takashi Y. Nakajima ◽  
Ryosuke Nakamura ◽  
Tsuneo Matsunaga

Abstract. The Greenhouse Gases Observing Satellite (GOSAT) was launched in 2009 to measure global atmospheric CO2 and CH4 concentrations. GOSAT is equipped with two sensors: the Thermal And Near infrared Sensor for carbon Observations (TANSO)-Fourier transform spectrometer (FTS) and TANSO-Cloud and Aerosol Imager (CAI). The presence of clouds in the instantaneous field of view of the FTS leads to incorrect estimates of the concentrations. Thus, the FTS data suspected to have cloud contamination must be identified by a CAI cloud discrimination algorithm and rejected. Conversely, overestimating clouds reduces the amount of FTS data that can be used to estimate greenhouse gas concentrations. This is a serious problem in tropical rainforest regions, such as the Amazon, where the amount of useable FTS data is small because of cloud cover. Preparations are continuing for the launch of the GOSAT-2 in fiscal year 2018. To improve the accuracy of the estimates of greenhouse gases concentrations, we need to refine the existing CAI cloud discrimination algorithm: Cloud and Aerosol Unbiased Decision Intellectual Algorithm (CLAUDIA1). A new cloud discrimination algorithm using a support vector machine (CLAUDIA3) was developed and presented in another paper. Although the use of visual inspection of clouds as a standard for judging is not practical for screening a full satellite data set, it has the advantage of allowing for locally optimized thresholds, while CLAUDIA1 and -3 use common global thresholds. Thus, the accuracy of visual inspection is better than that of these algorithms in most regions, with the exception of snow- and ice-covered surfaces, where there is not enough spectral contrast to identify cloud. In other words, visual inspection results can be used as truth data for accuracy evaluation of CLAUDIA1 and -3. For this reason visual inspection can be used for the truth metric for the cloud discrimination verification exercise. In this study, we compared CLAUDIA1–CAI and CLAUDIA3–CAI for various land cover types, and evaluated the accuracy of CLAUDIA3–CAI by comparing both CLAUDIA1–CAI and CLAUDIA3–CAI with visual inspection (400  ×  400 pixels) of the same CAI images in tropical rainforests. Comparative results between CLAUDIA1–CAI and CLAUDIA3–CAI for various land cover types indicated that CLAUDIA3–CAI had a tendency to identify bright surface and optically thin clouds. However, CLAUDIA3–CAI had a tendency to misjudge the edges of clouds compared with CLAUDIA1–CAI. The accuracy of CLAUDIA3–CAI was approximately 89.5 % in tropical rainforests, which is greater than that of CLAUDIA1–CAI (85.9 %) for the test cases presented here.


Sign in / Sign up

Export Citation Format

Share Document