scholarly journals Comparison of dimension reduction methods on fatty acids food source study

2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Yifan Chen ◽  
Yusuke Miura ◽  
Toshihiro Sakurai ◽  
Zhen Chen ◽  
Rojeet Shrestha ◽  
...  

AbstractSerum fatty acids (FAs) exist in the four lipid fractions of triglycerides (TGs), phospholipids (PLs), cholesteryl esters (CEs) and free fatty acids (FFAs). Total fatty acids (TFAs) indicate the sum of FAs in them. In this study, four statistical analysis methods, which are independent component analysis (ICA), factor analysis, common principal component analysis (CPCA) and principal component analysis (PCA), were conducted to uncover food sources of FAs among the four lipid fractions (CE, FFA, and TG + PL). Among the methods, ICA provided the most suggestive results. To distinguish the animal fat intake from endogenous fatty acids, FFA variables in ICA and factor analysis were studied. ICA provided more distinct suggestions of FA food sources (endogenous, plant oil intake, animal fat intake, and fish oil intake) than factor analysis. Moreover, ICA was discovered as a new approach to distinguish animal FAs from endogenous FAs, which will have an impact on epidemiological studies. In addition, the correlation coefficients between a published dataset of food FA compositions and the loading values obtained in the present ICA study suggested specific foods as serum FA sources. In conclusion, we found that ICA is a useful tool to uncover food sources of serum FAs.

2022 ◽  
pp. 146808742110707
Author(s):  
Aran Mohammad ◽  
Reza Rezaei ◽  
Christopher Hayduk ◽  
Thaddaeus Delebinski ◽  
Saeid Shahpouri ◽  
...  

The development of internal combustion engines is affected by the exhaust gas emissions legislation and the striving to increase performance. This demands for engine-out emission models that can be used for engine optimization for real driving emission controls. The prediction capability of physically and data-driven engine-out emission models is influenced by the system inputs, which are specified by the user and can lead to an improved accuracy with increasing number of inputs. Thereby the occurrence of irrelevant inputs becomes more probable, which have a low functional relation to the emissions and can lead to overfitting. Alternatively, data-driven methods can be used to detect irrelevant and redundant inputs. In this work, thermodynamic states are modeled based on 772 stationary measured test bench data from a commercial vehicle diesel engine. Afterward, 37 measured and modeled variables are led into a data-driven dimensionality reduction. For this purpose, approaches of supervised learning, such as lasso regression and linear support vector machine, and unsupervised learning methods like principal component analysis and factor analysis are applied to select and extract the relevant features. The selected and extracted features are used for regression by the support vector machine and the feedforward neural network to model the NOx, CO, HC, and soot emissions. This enables an evaluation of the modeling accuracy as a result of the dimensionality reduction. Using the methods in this work, the 37 variables are reduced to 25, 22, 11, and 16 inputs for NOx, CO, HC, and soot emission modeling while maintaining the accuracy. The features selected using the lasso algorithm provide more accurate learning of the regression models than the extracted features through principal component analysis and factor analysis. This results in test errors RMSETe for modeling NOx, CO, HC, and soot emissions 19.22 ppm, 6.46 ppm, 1.29 ppm, and 0.06 FSN, respectively.


Sign in / Sign up

Export Citation Format

Share Document