scholarly journals Assessment of coal pillar stability using principal component analysis and stepwise selection and elimination

2021 ◽  
Vol 69 (3) ◽  
pp. 81
Author(s):  
Brijesh Kumar ◽  
Punit Paurush ◽  
Sanjay K. Sharma ◽  
Gauri S. Prasad Singh

Prediction of pillar stability is one of the most critical tasks in underground mining industries. This pillar stability analysis requires many input parameters and some of them are difficult to be determined. Various statistical based analysis is presented in literature for assessing pillar stability successfully. In the present work, the data from three mines had been to determine the factor of safety. A total of 63 pillar cases had been collected from the mines. Principal component analysis (PCA) and Stepwise selection and elimination (SSE) models were developed by using multi variate linear regression (MLR) on 45 data sets and subsequently the proposed models were validated on 18 different data sets. The value of coefficient of determination (R2) is 0.86 and 0.84 for PCA and SSE respectively. The root mean square error for PCA and SSE are found to be 0.112 and 0.123 respectively. On validation of the proposed model developed by PCA and SSE, the PCA model provided a better validation results. Hence, PCA is recommended for modelling pillar stability.

Sensors ◽  
2021 ◽  
Vol 21 (10) ◽  
pp. 3527
Author(s):  
Melanija Vezočnik ◽  
Roman Kamnik ◽  
Matjaz B. Juric

Inertial sensor-based step length estimation has become increasingly important with the emergence of pedestrian-dead-reckoning-based (PDR-based) indoor positioning. So far, many refined step length estimation models have been proposed to overcome the inaccuracy in estimating distance walked. Both the kinematics associated with the human body during walking and actual step lengths are rarely used in their derivation. Our paper presents a new step length estimation model that utilizes acceleration magnitude. To the best of our knowledge, we are the first to employ principal component analysis (PCA) to characterize the experimental data for the derivation of the model. These data were collected from anatomical landmarks on the human body during walking using a highly accurate optical measurement system. We evaluated the performance of the proposed model for four typical smartphone positions for long-term human walking and obtained promising results: the proposed model outperformed all acceleration-based models selected for the comparison producing an overall mean absolute stride length estimation error of 6.44 cm. The proposed model was also least affected by walking speed and smartphone position among acceleration-based models and is unaffected by smartphone orientation. Therefore, the proposed model can be used in the PDR-based indoor positioning with an important advantage that no special care regarding orientation is needed in attaching the smartphone to a particular body segment. All the sensory data acquired by smartphones that we utilized for evaluation are publicly available and include more than 10 h of walking measurements.


Author(s):  
Petr Praus

In this chapter the principals and applications of principal component analysis (PCA) applied on hydrological data are presented. Four case studies showed the possibility of PCA to obtain information about wastewater treatment process, drinking water quality in a city network and to find similarities in the data sets of ground water quality results and water-related images. In the first case study, the composition of raw and cleaned wastewater was characterised and its temporal changes were displayed. In the second case study, drinking water samples were divided into clusters in consistency with their sampling localities. In the case study III, the similar samples of ground water were recognised by the calculation of cosine similarity, the Euclidean and Manhattan distances. In the case study IV, 32 water-related images were transformed into a large image matrix whose dimensionality was reduced by PCA. The images were clustered using the PCA scatter plots.


2016 ◽  
Vol 2 (4) ◽  
pp. 211
Author(s):  
Girdhari Lal Chaurasia ◽  
Mahesh Kumar Gupta ◽  
Praveen Kumar Tandon

Water is an essential resource for all the organisms, plants and animals including the human beings. It is the backbone for agricultural and industrial sectors and all the small business units. Increase in human population and economic activities have tremendously increased the demand for large-scale suppliers of fresh water for various competing end users.The quality evaluation of water is represented in terms of physical, chemical and Biological parameters. A particular problem in the case of water quality monitoring is the complexity associated with analyzing the large number of measured variables. The data sets contain rich information about the behavior of the water resources. Multivariate statistical approaches allow deriving hidden information from the data sets about the possible influences of the environment on water quality. Classification, modeling and interpretation of monitored data are the most important steps in the assessment of water quality. The application of different multivariate statistical techniques, such as cluster analysis (CA), principal component analysis (PCA) and factor analysis (FA) help to identify important components or factors accounting for most of the variances of a system. In the present study water samples were analyzed for various physicochemical analyses by different methods following the standards of APHA, BIS and WHO and were subjected to further statistical analysis viz. the cluster analysis to understand the similarity and differences among the various sampling stations.  Three clusters were found. Cluster 1 was marked with 3 sampling locations 1, 3 & 5; Cluster-2 was marked with sampling location-2 and cluster-3 was marked with sampling location-4. Principal component analysis/factor analysis is a pattern reorganization technique which is used to assess the correlation between the observations in terms of different factors which are not observable. Observations correlated either positively or negatively, are likely to be affected by the same factors while the observations which are not correlated are influenced by different factors. In our study three factors explained 99.827% of variances. F1 marked  51.619% of total variances, high positive strong loading with TSS, TS, Temp, TDS, phosphate and moderate with electrical conductivity with loading values of 0.986, 0.970, 0.792, 0.744, 0.695,  0.701, respectively. Factor 2 marked 27.236% of the total variance with moderate positive loading with total alkalinity & temp. with loading values 0.723 & 0.606 respectively. It also explained the moderate negative loading with conductivity, TDS, and chloride with loading values -0.698, -0.690, -0.582. Factor F 3 marked 20.972 % of the variances with positive loading with PH, chloride, and phosphate with strong loading of pH 0.872 and moderate positive loading with chloride and phosphate with loading values 0.721, and 0.569 respectively. 


2011 ◽  
Vol 199-200 ◽  
pp. 850-857
Author(s):  
Jian Chao Dong ◽  
Tie Jun Yang ◽  
Xin Hui Li ◽  
Zhi Jun Shuai ◽  
You Hong Xiao

Principal component analysis (PCA), serving as one of the basic blind signal processing techniques, is extensively employed in all forms of analysis for extracting relevant information from confusing data sets. The principle of PCA is explained in this paper firstly, then the simulation and experiment are carried out to a simply supported beam rig, and PCA is used in frequency domain to identify sources number of several cases. Meanwhile principal components (PCs) contribution coefficient and signal to noise ratio between neighboring PCs (neighboring SNR) are introduced to cutoff minor components quantificationally. The results show that when observation number is equal to or larger than source number and additive noise is feebleness, accurate prediction of the number of uncorrelated excitation sources in a multiple input multiple output system could be obtained by principal component analysis.


1993 ◽  
Vol 13 (1) ◽  
pp. 5-14 ◽  
Author(s):  
K. J. Friston ◽  
C. D. Frith ◽  
P. F. Liddle ◽  
R. S. J. Frackowiak

The distributed brain systems associated with performance of a verbal fluency task were identified in a nondirected correlational analysis of neurophysiological data obtained with positron tomography. This analysis used a recursive principal-component analysis developed specifically for large data sets. This analysis is interpreted in terms of functional connectivity, defined as the temporal correlation of a neurophysiological index measured in different brain areas. The results suggest that the variance in neurophysiological measurements, introduced experimentally, was accounted for by two independent principal components. The first, and considerably larger, highlighted an intentional brain system seen in previous studies of verbal fluency. The second identified a distributed brain system including the anterior cingulate and Wernicke's area that reflected monotonic time effects. We propose that this system has an attentional bias.


2017 ◽  
Vol 71 (11) ◽  
pp. 2497-2503 ◽  
Author(s):  
Saranjam Khan ◽  
Rahat Ullah ◽  
Samina Javaid ◽  
Shaheen Shahzad ◽  
Hina Ali ◽  
...  

This study demonstrates the analysis of nasopharyngeal cancer (NPC) in human blood sera using Raman spectroscopy combined with the multivariate analysis technique. Blood samples of confirmed NPC patients and healthy individuals have been used in this study. The Raman spectra from all these samples were recorded using 785 nm laser for excitation. Important Raman bands at 760, 800, 815, 834, 855, 1003, 1220–1275, and 1524 cm−1, have been observed in both normal and NPC samples. A decrease in the lipids content, phenylalanine, and β-carotene, whereas increases in amide III, tyrosine, and tryptophan have been observed in the NPC samples. The two data sets were well separated using principal component analysis (PCA) based on Raman spectral data. The spectral variations between the healthy and cancerous samples have been further highlighted by plotting loading vectors PC1 and PC2, which shows only those spectral regions where the differences are obvious.


2021 ◽  
Vol 23 (06) ◽  
pp. 1699-1715
Author(s):  
Mohamed, A. M. ◽  
◽  
Abdel Latif, S. H ◽  
Alwan, A. S. ◽  
◽  
...  

The principle component analysis is used more frequently as a variables reduction technique. And recently, an evolving group of studies makes use of machine learning regression algorithms to improve the estimation of empirical models. One of the most frequently used machines learning regression models is support vector regression with various kernel functions. However, an ensemble of support vector regression and principal component analysis is also possible. So, this paper aims to investigate the competence of support vector regression techniques after performing principal component analysis to explore the possibility of reducing data and having more accurate estimations. Some new proposals are introduced and the behavior of two different models 𝜀𝜀-SVR and 𝑣𝑣-SVR are compared through an extensive simulation study under four different kernel functions; linear, radial, polynomial, and sigmoid kernel functions, with different sample sizes, ranges from small, moderate to large. The models are compared with their counterparts in terms of coefficient of determination (𝑅𝑅2 ) and root mean squared error (RMSE). The comparative results show that applying SVR after PCA models improve the results in terms of SV numbers between 30% and 60% on average and it can be applied with real data. In addition, the linear kernel function gave the best values rather than other kernel functions and the sigmoid kernel gave the worst values. Under 𝜀𝜀-SVR the results improved which did not happen with 𝑣𝑣-SVR. It is also drawn that, RMSE values decreased with increasing sample size.


2020 ◽  
Author(s):  
Christiane Scherer ◽  
James Grover ◽  
Darby Kammeraad ◽  
Gabe Rudy ◽  
Andreas Scherer

AbstractSince the beginning of the global SARS-CoV-2 pandemic, there have been a number of efforts to understand the mutations and clusters of genetic lines of the SARS-CoV-2 virus. Until now, phylogenetic analysis methods have been used for this purpose. Here we show that Principal Component Analysis (PCA), which is widely used in population genetics, can not only help us to understand existing findings about the mutation processes of the virus, but can also provide even deeper insights into these processes while being less sensitive to sequencing gaps. Here we describe a comprehensive analysis of a 46,046 SARS-CoV-2 genome sequence dataset downloaded from the GISAID database in June of this year.SummaryPCA provides deep insights into the analysis of large data sets of SARS-CoV-2 genomes, revealing virus lineages that have thus far been unnoticed.


2020 ◽  
Vol 39 (3) ◽  
pp. 3183-3193
Author(s):  
Jieya Li ◽  
Liming Yang

The classical principal component analysis (PCA) is not sparse enough since it is based on the L2-norm that is also prone to be adversely affected by the presence of outliers and noises. In order to address the problem, a sparse robust PCA framework is proposed based on the min of zero-norm regularization and the max of Lp-norm (0 < p ≤ 2) PCA. Furthermore, we developed a continuous optimization method, DC (difference of convex functions) programming algorithm (DCA), to solve the proposed problem. The resulting algorithm (called DC-LpZSPCA) is convergent linearly. In addition, when choosing different p values, the model can keep robust and is applicable to different data types. Numerical simulations are simulated in artificial data sets and Yale face data sets. Experiment results show that the proposed method can maintain good sparsity and anti-outlier ability.


Sign in / Sign up

Export Citation Format

Share Document