Dimensionality Reduction for Countermovement Jump Metrics

Author(s):  
Lachlan P. James ◽  
Haresh Suppiah ◽  
Michael R. McGuigan ◽  
David L. Carey

Purpose: Dozens of variables can be derived from the countermovement jump (CMJ). However, this does not guarantee an increase in useful information because many of the variables are highly correlated. Furthermore, practitioners should seek the simplest solution to performance testing and reporting challenges. The purpose of this investigation was to show how to apply dimensionality reduction to CMJ data with a view to offering practitioners solutions to aid applications in high-performance settings. Methods: The data were collected from 3 cohorts using 3 different devices. Dimensionality reduction was undertaken on the extracted variables by way of principal component analysis and maximum likelihood factor analysis. Results: Over 90% of the variance in each CMJ data set could be explained in 3 or 4 principal components. Similarly, 2 to 3 factors could successfully explain the CMJ. Conclusions: The application of dimensionality reduction through principal component analysis and factor analysis allowed for the identification of key variables that strongly contributed to distinct aspects of jump performance. Practitioners and scientists can consider the information derived from these procedures in several ways to streamline the transfer of CMJ test information.
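The idea of counting how many principal components are needed to reach a variance threshold can be sketched as follows. This is a minimal illustration and not the authors' procedure: the data are synthetic stand-ins (12 correlated variables generated from 3 latent factors), and the function name `components_for_variance` and the 0.90 threshold are assumptions chosen for the example.

```python
import numpy as np

def components_for_variance(X, threshold=0.90):
    """Return (k, ratios): the smallest number of principal components whose
    cumulative explained variance reaches `threshold`, plus the per-component
    explained-variance ratios (hypothetical helper for illustration)."""
    # Standardize each column so variables in different units are comparable.
    Z = (X - X.mean(axis=0)) / X.std(axis=0, ddof=1)
    # Squared singular values of the standardized matrix are proportional
    # to the variance captured by each principal component.
    s = np.linalg.svd(Z, compute_uv=False)
    ratios = s**2 / np.sum(s**2)
    cumulative = np.cumsum(ratios)
    # First index where the cumulative ratio reaches the threshold, as a count.
    k = int(np.searchsorted(cumulative, threshold) + 1)
    return k, ratios

# Synthetic stand-in data (assumption): 12 correlated "metrics" driven by
# 3 latent factors plus a small amount of noise.
rng = np.random.default_rng(0)
latent = rng.normal(size=(200, 3))
mixing = rng.normal(size=(3, 12))
X = latent @ mixing + 0.1 * rng.normal(size=(200, 12))

k, ratios = components_for_variance(X)
print("components needed:", k)
print("leading explained-variance ratios:", np.round(ratios[:4], 3))
```

Because the synthetic variables are built from only 3 latent factors, a handful of components captures nearly all of the variance, which mirrors the kind of redundancy the abstract describes for highly correlated jump metrics.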

2022 ◽  
pp. 146808742110707
Author(s):  
Aran Mohammad ◽  
Reza Rezaei ◽  
Christopher Hayduk ◽  
Thaddaeus Delebinski ◽  
Saeid Shahpouri ◽  
...  

The development of internal combustion engines is affected by exhaust gas emissions legislation and the drive to increase performance. This demands engine-out emission models that can be used for engine optimization under real driving emission controls. The prediction capability of physical and data-driven engine-out emission models is influenced by the system inputs, which are specified by the user, and accuracy can improve as the number of inputs increases. However, this makes the occurrence of irrelevant inputs more probable; such inputs have a weak functional relation to the emissions and can lead to overfitting. Alternatively, data-driven methods can be used to detect irrelevant and redundant inputs. In this work, thermodynamic states are modeled based on 772 stationary test bench measurements from a commercial vehicle diesel engine. Afterward, 37 measured and modeled variables are fed into a data-driven dimensionality reduction. For this purpose, supervised learning approaches, such as lasso regression and the linear support vector machine, and unsupervised learning methods, such as principal component analysis and factor analysis, are applied to select and extract the relevant features. The selected and extracted features are used for regression by the support vector machine and the feedforward neural network to model the NOx, CO, HC, and soot emissions. This enables an evaluation of the modeling accuracy as a result of the dimensionality reduction. Using the methods in this work, the 37 variables are reduced to 25, 22, 11, and 16 inputs for NOx, CO, HC, and soot emission modeling while maintaining the accuracy. The features selected using the lasso algorithm provide more accurate learning of the regression models than the features extracted through principal component analysis and factor analysis. This results in test errors RMSETe for modeling NOx, CO, HC, and soot emissions of 19.22 ppm, 6.46 ppm, 1.29 ppm, and 0.06 FSN, respectively.
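How lasso regression can act as a feature selector is sketched below. The abstract does not describe the implementation, so this is a generic coordinate-descent lasso on synthetic data rather than the authors' pipeline; the function name `lasso_coordinate_descent`, the penalty `alpha=0.2`, and the 10-input data set with 3 relevant inputs are all assumptions made for illustration.

```python
import numpy as np

def lasso_coordinate_descent(X, y, alpha, n_iter=200):
    """Minimize (1/(2n)) * ||y - Xw||^2 + alpha * ||w||_1 by cyclic
    coordinate descent. Assumes standardized columns in X and centered y."""
    n, p = X.shape
    w = np.zeros(p)
    col_sq = (X**2).sum(axis=0) / n
    residual = y - X @ w
    for _ in range(n_iter):
        for j in range(p):
            # Remove feature j's current contribution from the residual.
            residual += X[:, j] * w[j]
            rho = X[:, j] @ residual / n
            # Soft-thresholding sets weakly related coefficients exactly to zero.
            w[j] = np.sign(rho) * max(abs(rho) - alpha, 0.0) / col_sq[j]
            residual -= X[:, j] * w[j]
    return w

# Synthetic stand-in data (assumption): 10 candidate inputs, of which only
# inputs 0, 3, and 7 actually influence the target.
rng = np.random.default_rng(1)
X = rng.normal(size=(300, 10))
X = (X - X.mean(axis=0)) / X.std(axis=0)
true_w = np.zeros(10)
true_w[[0, 3, 7]] = [2.0, -1.5, 1.0]
y = X @ true_w + 0.1 * rng.normal(size=300)
y = y - y.mean()

w = lasso_coordinate_descent(X, y, alpha=0.2)
selected = np.flatnonzero(w)
print("selected input indices:", selected)
```

Inputs whose coefficient is driven exactly to zero are dropped, and the remaining inputs can then be passed to a downstream regression model. In practice the penalty strength would be tuned, for example by cross-validation.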


Author(s):  
Sam McCormack ◽  
Ben Jones ◽  
Sean Scantlebury ◽  
Neil Collins ◽  
Cameron Owen ◽  
...  

Purpose: To compare the physical qualities between academy and international youth rugby league (RL) players using principal component analysis. Methods: Six hundred fifty-four males (age = 16.7 [1.4] y; height = 178.4 [13.3] cm; body mass = 82.2 [14.5] kg) from 11 English RL academies participated in this study. Participants completed anthropometric, power (countermovement jump), strength (isometric midthigh pull; IMTP), speed (10 and 40 m speed), and aerobic endurance (prone Yo-Yo IR1) assessments. Principal component analysis was conducted on all physical quality measures. A 1-way analysis of variance with effect sizes was performed on 2 principal components (PCs) to identify differences between academy and international backs, forwards, and pivots at under 16 and 18 age groups. Results: Physical quality measures were reduced to 2 PCs explaining 69.4% of variance. The first PC (35.3%) was influenced by maximum and 10-m momentum, absolute IMTP, and body mass. Ten and forty-meter speed, body mass and fat, prone Yo-Yo, IMTP relative, maximum speed, and countermovement jump contributed to PC2 (34.1%). Significant differences (P < .05, effect size = −1.83) were identified between U18 academy and international backs within PC1. Conclusion: Running momentum, absolute IMTP, and body mass contributed to PC1, while numerous qualities influenced PC2. The physical qualities of academy and international youth RL players are similar, excluding U18 backs. Principal component analysis can reduce the dimensionality of a data set and help identify overall differences between playing levels. Findings suggest that RL practitioners should measure multiple physical qualities when assessing physical performance.


2016 ◽  
Vol 26 (2) ◽  
Author(s):  
Peter Filzmoser

In this paper we introduce a statistical method which can be used in combination with principal component analysis or factor analysis. Certain variables of a large data set which are of interest can be selected in order to calculate loadings and scores of these variables. We describe how the remaining variables of the data set can be presented in the previously extracted factor space. Furthermore, a possibility for the representation of the results is shown which is helpful for the interpretation.


Mathematics ◽  
2021 ◽  
Vol 9 (17) ◽  
pp. 2067
Author(s):  
Viliam Ďuriš ◽  
Renáta Bartková ◽  
Anna Tirpáková

The present contribution is devoted to the theory of fuzzy sets, especially Atanassov Intuitionistic Fuzzy sets (IF sets) and their use in practice. We define the correlation between IF sets and the correlation coefficient, and we bring a new perspective to solving the problem of data file reduction in cases where the input data come from IF sets. We present specific applications of the two best-known methods, Principal Component Analysis and Factor Analysis, used to solve the problem of reducing the size of a data file. We examine input data from IF sets from three perspectives: through the membership function, the non-membership function, and the hesitation margin. This examination better reflects the character of the input data and also better captures and preserves the information that the input data carry. In the article, we also present and solve a specific example from practice in which we show the behavior of these methods on data from IF sets. The example is solved using the R programming language, which is useful for statistical analysis of data and their graphical representation.


2020 ◽  
Vol 0 (0) ◽  
Author(s):  
Alexandra-Maria Tăuţan ◽  
Alessandro C. Rossi ◽  
Ruben de Francisco ◽  
Bogdan Ionescu

Methods developed for automatic sleep stage detection make use of large amounts of data in the form of polysomnographic (PSG) recordings to build predictive models. In this study, we investigate the effect of several dimensionality reduction techniques, i.e., principal component analysis (PCA), factor analysis (FA), and autoencoders (AE), on common classifiers, e.g., random forests (RF), multilayer perceptron (MLP), and long short-term memory (LSTM) networks, for automated sleep stage detection. Experimental testing is carried out on the MGH Dataset provided in the “You Snooze, You Win: The PhysioNet/Computing in Cardiology Challenge 2018”. The signals used as input are the six available electroencephalographic (EEG) channels and combinations with the other PSG signals provided: electrocardiogram (ECG), electromyogram (EMG), and respiration-based signals (respiratory effort and airflow). We observe that a similar or improved accuracy is obtained in most cases when using all dimensionality reduction techniques, which is a promising result as it allows the computational load to be reduced while maintaining performance and, in some cases, also improves the accuracy of automated sleep stage detection. In our study, using autoencoders for dimensionality reduction maintains the performance of the model, while using PCA and FA improves the accuracy of the models in most cases.


Author(s):  
Guang-Ho Cha

Principal component analysis (PCA) is an important tool in many areas including data reduction and interpretation, information retrieval, image processing, and so on. Kernel PCA has recently been proposed as a nonlinear extension of the popular PCA. The basic idea is to first map the input space into a feature space via a nonlinear map and then compute the principal components in that feature space. This paper illustrates the potential of kernel PCA for dimensionality reduction and feature extraction in multimedia retrieval. Using Gaussian kernels, the principal components were computed in the feature space of an image data set and used as new dimensions to approximate image features. Extensive experimental results show that kernel PCA performs better than linear PCA with respect to retrieval quality as well as retrieval precision in content-based image retrieval.

Keywords: Principal component analysis, kernel principal component analysis, multimedia retrieval, dimensionality reduction, image retrieval
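The basic idea of kernel PCA with a Gaussian kernel, mapping implicitly into a feature space and computing principal components there, can be sketched as follows. This is a textbook-style illustration and not the paper's implementation: the function name `gaussian_kernel_pca`, the kernel width `gamma=0.1`, and the random 8-dimensional "feature vectors" are assumptions.

```python
import numpy as np

def gaussian_kernel_pca(X, n_components, gamma):
    """Project the rows of X onto the leading kernel principal components,
    using the Gaussian kernel k(x, y) = exp(-gamma * ||x - y||^2)."""
    # Pairwise squared Euclidean distances between all rows of X.
    sq_norms = (X**2).sum(axis=1)
    sq_dists = sq_norms[:, None] + sq_norms[None, :] - 2.0 * X @ X.T
    K = np.exp(-gamma * np.maximum(sq_dists, 0.0))
    # Center the kernel matrix, which centers the data in feature space.
    n = K.shape[0]
    one_n = np.full((n, n), 1.0 / n)
    K_centered = K - one_n @ K - K @ one_n + one_n @ K @ one_n
    # eigh returns eigenvalues in ascending order; keep the largest ones.
    eigvals, eigvecs = np.linalg.eigh(K_centered)
    idx = np.argsort(eigvals)[::-1][:n_components]
    eigvals, eigvecs = eigvals[idx], eigvecs[:, idx]
    # Projections of the training points: eigenvectors scaled by sqrt(eigenvalue).
    return eigvecs * np.sqrt(np.maximum(eigvals, 0.0))

# Synthetic stand-in for image feature vectors (assumption): 50 items, 8 features.
rng = np.random.default_rng(2)
X = rng.normal(size=(50, 8))
Z = gaussian_kernel_pca(X, n_components=3, gamma=0.1)
print(Z.shape)
```

The rows of `Z` are the reduced representations that could serve as new dimensions for similarity search. Projecting unseen query items would additionally require evaluating the kernel between each query and the training items, which this sketch omits.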


2017 ◽  
Vol 727 ◽  
pp. 447-449 ◽  
Author(s):  
Jun Dai ◽  
Hua Yan ◽  
Jian Jian Yang ◽  
Jun Jun Guo

To evaluate the aging behavior of high density polyethylene (HDPE) under an artificial accelerated environment, principal component analysis (PCA) was used to establish a non-dimensional expression Z from a data set of multiple degradation parameters of HDPE. In this study, HDPE samples were exposed to the accelerated thermal oxidative environment for different time intervals up to 64 days. The results showed that the combined evaluating parameter Z was characterized by three-stage changes. The combined evaluating parameter Z increased quickly in the first 16 days of exposure and then leveled off. After 40 days, it began to increase again. Among the 10 degradation parameters, branching degree, carbonyl index and hydroxyl index are strongly associated. The tensile modulus is highly correlated with the impact strength. The tensile strength, tensile modulus and impact strength are negatively correlated with the crystallinity.

