Combining partial least squares regression and least squares support vector machine for data mining

Author(s):  
Chen Gaobo ◽  
Chen Xiufang
2017 ◽  
Author(s):  
Jan-Patrick Voß

Die vorliegende Arbeit befasst sich mit dem Bioprozessmonitoring unter Verwendung spektroskopischer Messverfahren und multivariater Datenanalyse nach den Grundsätzen von PAT – Process Analytical Technology. Mit NIR-Spektroskopie und dem Verfahren Soft Independent Modelling of Class Analogy (SIMCA) wurde eine Quali¬tätsbewertung von Hefeextrakten realisiert. Im Vordergrund stand jedoch die Quanti¬fizierung nicht direkt messbarer Größen aus NIR-, Raman- und 2D-Fluoreszenzspektren in pharmazeutischen Produktionsprozessen mit Pichia pastoris. Eine entsprechende Online-Bestimmung mit der Methode Partial Least Squares Regression (PLSR) kam weiterführend zur Regelung der Glycerolkonzentration zum Einsatz. Darüber hinaus wurde die Verwendung nichtspektraler Online-Daten zur Prozessbeobachtung erprobt. Dabei gelang mit Hilfe des nichtlinearen Verfahrens Support Vector Regression (SVR) unter anderem die Bestimmung zellspezifischer Reaktionsraten. ...


2020 ◽  
Vol 22 (5) ◽  
pp. 1283-1305
Author(s):  
Zhaoxin Yue ◽  
Ping Ai ◽  
Chuansheng Xiong ◽  
Min Hong ◽  
Yanhong Song

Abstract Data representation and prediction model design play an important role in mid- to long-term runoff prediction. However, it is challenging to extract key factors that accurately characterize the changes in the runoff of a river basin because of the complex nature of the runoff process. In addition, the low accuracy is another problem for mid- to long-term runoff prediction. With an aim to solve these problems, two improvements are proposed in this paper. First, the partial mutual information (PMI)-based approach was employed for estimating the importance of various factors. Second, a deep learning architecture was introduced by using the deep belief network (DBN) with partial least-squares regression (PLSR), together denoted as PDBN, for mid- to long-term runoff prediction, which solves the problem of parameter optimization for the DBN using PLSR. The novelty of the proposed method lies in the key factor selection and a novel forecasting method for mid- to long-term runoff. Experimental results demonstrated that the proposed method can significantly improve the effect of mid- to long-term runoff prediction. Also, compared with the results obtained by current state-of-the-art prediction methods, i.e., DBN, backpropagation neural networks, and support vector machine models, our prediction results demonstrate the performance of the proposed method.


Sign in / Sign up

Export Citation Format

Share Document