good prediction accuracy
Recently Published Documents


TOTAL DOCUMENTS

112
(FIVE YEARS 69)

H-INDEX

7
(FIVE YEARS 2)

Author(s):  
Xin-yu Li ◽  
Jian-xiong You ◽  
Lu-yu Zhang ◽  
Li-xin Su ◽  
Xi-tao Yang

Background: Necroptosis is a newly recognized form of cell death. Here, we applied bioinformatics tools to identify necroptosis-related genes using a dataset from The Cancer Genome Atlas (TCGA) database, then constructed a model for prognosis of patients with prostate cancer.Methods: RNA sequence (RNA‐seq) data and clinical information for Prostate adenocarcinoma (PRAD) patients were obtained from the TCGA portal (http://tcga-data.nci.nih.gov/tcga/). We performed comprehensive bioinformatics analyses to identify hub genes as potential prognostic biomarkers in PRAD u followed by establishment and validation of a prognostic model. Next, we assessed the overall prediction performance of the model using receiver operating characteristic (ROC) curves and the area under curve (AUC) of the ROC.Results: A total of 5 necroptosis-related genes, namely ALOX15, BCL2, IFNA1, PYGL and TLR3, were used to construct a survival prognostic model. The model exhibited excellent performance in the TCGA cohort and validation group and had good prediction accuracy in screening out high-risk prostate cancer patients.Conclusion: We successfully identified necroptosis-related genes and constructed a prognostic model that can accurately predict 1- 3-and 5-years overall survival (OS) rates of PRAD patients. Our riskscore model has provided novel strategy for the prediction of PRAD patients’ prognosis.


Information ◽  
2021 ◽  
Vol 12 (12) ◽  
pp. 530
Author(s):  
Congmin Yang ◽  
Tao Zhu ◽  
Yang Zhang ◽  
Huansheng Ning ◽  
Liming Chen ◽  
...  

The particle swarm optimization (PSO) algorithm has been widely used in various optimization problems. Although PSO has been successful in many fields, solving optimization problems in big data applications often requires processing of massive amounts of data, which cannot be handled by traditional PSO on a single machine. There have been several parallel PSO based on Spark, however they are almost proposed for solving numerical optimization problems, and few for big data optimization problems. In this paper, we propose a new Spark-based parallel PSO algorithm to predict the co-authorship of academic papers, which we formulate as an optimization problem from massive academic data. Experimental results show that the proposed parallel PSO can achieve good prediction accuracy.


Energies ◽  
2021 ◽  
Vol 14 (21) ◽  
pp. 7202
Author(s):  
Jianfei Huang ◽  
Xinchun Cheng ◽  
Yuying Shen ◽  
Dewen Kong ◽  
Jixin Wang

Accurate prediction of the throttle value and state for wheel loaders can help to achieve autonomous operation, thereby reducing the cost and accident rate. However, existing methods based on a physical model cannot accurately reflect the operator’s driving habits and the interaction between wheel loaders and the environment. In this paper, a deep-learning-based prediction model is developed to predict the throttle value and state for wheel loaders by learning from driving data. Multiple long–short-term memory (LSTM) networks are used to extract the temporal features of different stages during the operation of the wheel loader. Two backward-propagation neural networks (BPNNs), which use the temporal feature extracted by LSTM as the input, are designed to output the final prediction results of throttle value and state, respectively. The proposed prediction model is trained and tested using the data from two different conditions. The end-to-end LSTM prediction model and BPNNs are used as benchmark models. The results indicate that the proposed prediction model has good prediction accuracy and adaptability. Furthermore, the relationship between the prediction performance and signal sampling frequency is also studied. The proposed prediction method that combines driving data and deep learning can make the throttle action conform to the decisions of an experienced operator, providing technical support for the autonomous operation of construction machinery.


Author(s):  
Maurizio Marra ◽  
Olivia Di Vincenzo ◽  
Iolanda Cioffi ◽  
Rosa Sammarco ◽  
Delia Morlino ◽  
...  

Abstract Background An accurate estimation of athletes’ energy needs is crucial in diet planning to improve sport performance and to maintain an appropriate body composition. This study aimed to develop and validate in elite athletes new equations for estimating resting energy expenditure (REE) based on anthropometric parameters as well as bioimpedance analysis (BIA)-derived raw variables and to validate the accuracy of selected predictive equations. Methods Adult elite athletes aged 18–40 yrs were studied. Anthropometry, indirect calorimetry and BIA were performed in all subjects. The new predictive equations were generated using different regression models. The accuracy of the new equations was assessed at the group level (bias) and at the individual level (precision accuracy), and then compared with the one of five equations used in the general population or three athletes-specific formulas. Results One-hundred and twenty-six male athletes (age 26.9 ± 9.1 yrs; weight 71.3 ± 10.9 kg; BMI 22.8 ± 2.7 kg/m2) from different sport specialties were randomly assigned to the calibration (n = 75) or validation group (n = 51). REE was directly correlated with individual characteristics, except for age, and raw BIA variables. Most of the equations from the literature were reasonably accurate at the population level (bias within ±5%). The new equations showed a mean bias −0.3% (Eq. A based on anthropometric parameters) and −0.6% (Eq. B based on BIA-derived raw variables). Precision accuracy (individual predicted-measured differences within ±5%) was ~75% in six out of eight of the selected equations and even higher for Eq. A (82.4%) and Eq. B (92.2%). Conclusion In elite athletes, BIA-derived phase angle is a significant predictor of REE. The new equations have a very good prediction accuracy at both group and individual levels. The use of phase angle as predictor of REE requires further research with respect to different sport specialties, training programs and training level.


2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Seunghyun Moon ◽  
Ruimin Ma ◽  
Ross Attardo ◽  
Charles Tomonto ◽  
Mark Nordin ◽  
...  

AbstractIn this study, the effects of surface roughness and pore characteristics on fatigue lives of laser powder bed fusion (LPBF) Ti–6Al–4V parts were investigated. The 197 fatigue bars were printed using the same laser power but with varied scanning speeds. These actions led to variations in the geometries of microscale pores, and such variations were characterized using micro-computed tomography. To generate differences in surface roughness in fatigue bars, half of the samples were grit-blasted and the other half were machined. Fatigue behaviors were analyzed with respect to surface roughness and statistics of the pores. For the grit-blasted samples, the contour laser scan in the LPBF strategy led to a pore-depletion zone isolating surface and internal pores with different features. For the machined samples, where surface pores resemble internal pores, the fatigue life was highly correlated with the average pore size and projected pore area in the plane perpendicular to the stress direction. Finally, a machine learning model using a drop-out neural network (DONN) was employed to establish a link between surface and pore features to the fatigue data (logN), and good prediction accuracy was demonstrated. Besides predicting fatigue lives, the DONN can also estimate the prediction uncertainty.


2021 ◽  
Vol 8 (1) ◽  
Author(s):  
Elisa Cappetta ◽  
Giuseppe Andolfo ◽  
Anna Guadagno ◽  
Antonio Di Matteo ◽  
Amalia Barone ◽  
...  

AbstractMany studies showed that few degrees above tomato optimum growth temperature threshold can lead to serious loss in production. Therefore, the development of innovative strategies to obtain tomato cultivars with improved yield under high temperature conditions is a main goal both for basic genetic studies and breeding activities. In this paper, a F4 segregating population was phenotypically evaluated for quantitative and qualitative traits under heat stress conditions. Moreover, a genotyping by sequencing (GBS) approach has been employed for building up genomic selection (GS) models both for yield and soluble solid content (SCC). Several parameters, including training population size, composition and marker quality were tested to predict genotype performance under heat stress conditions. A good prediction accuracy for the two analyzed traits (0.729 for yield production and 0.715 for SCC) was obtained. The predicted models improved the genetic gain of selection in the next breeding cycles, suggesting that GS approach is a promising strategy to accelerate breeding for heat tolerance in tomato. Finally, the annotation of SNPs located in gene body regions combined with QTL analysis allowed the identification of five candidates putatively involved in high temperatures response, and the building up of a GS model based on calibrated panel of SNP markers.


2021 ◽  
Author(s):  
Hongquan Chen ◽  
Deepthi Sen ◽  
Akhil Datta-Gupta ◽  
Masahiro Nagao

Abstract Routine well-wise injection and production measurements contain significant information on subsurface structure and properties. Data-driven technology that interprets surface data into subsurface structure or properties can assist operators in making informed decisions by providing a better understanding of field assets. Our machine-learning framework is built on the statistical recurrent unit (SRU) model and interprets well-based injection/production data into inter-well connectivity without relying on a geologic model. We test it on synthetic and field-scale CO2 EOR projects utilizing the water-alternating-gas (WAG) process. SRU is a special type of recurrent neural network (RNN) that allows for better characterization of temporal trends, by learning various statistics of the input at different time scales. In our application, the complete states (injection rate, pressure and cumulative injection) at injectors and pressure states at producers are fed to SRU as the input and the phase rates at producers are treated as the output. Once the SRU is trained and validated, it is then used to assess the connectivity of each injector to any producer using permutation variable importance method, wherein inputs corresponding to an injector are shuffled and the increase in prediction error at a given producer is recorded as the importance (connectivity metric) of the injector to the producer. This method is tested in both synthetic and field-scale cases. The validation of the proposed data-driven inter-well connectivity assessment is performed using synthetic data from simulation models where inter-well connectivity can be easily measured using the streamline-based flux allocation. The SRU model is shown to offer excellent prediction performance on the synthetic case. Despite significant measurement noise and frequent well shut-ins imposed in the field-scale case, the SRU model offers good prediction accuracy, the overall relative error of the phase production rates at most producers ranges from 10% to 30%. It is shown that the dominant connections identified by the data-driven method and streamline method are in close agreement. This significantly improves confidence in our data-driven procedure. The novelty of this work is that it is purely data-driven method and can directly interpret routine surface measurements to intuitive subsurface knowledge. Furthermore, the streamline-based validation procedure provides physics-based backing to the results obtained from data analytics. The study results in a reliable and efficient data analytics framework that is well-suited for large field applications.


2021 ◽  
Vol 108 (Supplement_6) ◽  
Author(s):  
C W L Chia ◽  
S Bhatia ◽  
D Shastin ◽  
M Chamberland

Abstract Aim A third of epilepsy patients suffer from medically refractory seizures. In patients eligible for surgical treatment, seizure freedom rates remain variable. Machine learning (ML) utilises large datasets to detect patterns to make predictions. We systematically review studies employing ML models for prediction of outcome following resective epilepsy surgery to evaluate their efficacy, applicability and value in determining surgical candidacy. Method MEDLINE, Cochrane and EMBASE databases were searched for literature published between 2010 – 2020 according to PRISMA guidance. Non-refractory epilepsy, non-clinical outcome prediction, or non-human studies were excluded. Clinical and demographic data, ML features, discrimination and prediction accuracy metrics were extracted. Results 15 studies were included. Median cohort size was 49 (range 16 – 4211). Heterogeneous input data sources were utilised: MRI (n = 10) , electrophysiology (n = 4), PET (n = 2), clinical data (n = 2), and neuropsychological testing (n = 1). The most common ML model used was support vector machines (n = 7). All studies had good discrimination (AUC > 0.70, range: 0.79 [95% CI NR] - 0.94 [95% CI 0.92 – 0.96]), and good prediction accuracy (> 0.70, range: 0.76 [95% CI NR] – 0.95 [95% CI NR]). Limitations included small sample sizes, limited external validation and lack of comparison with clinician-predicted outcomes. Conclusions Machine Learning for outcome prediction could enhance clinical decision-making for surgical candidacy in epilepsy, and lead to improved precision medicine delivery. Outcome reporting remains inconsistent, and further work is required to externally validate such models to implement these to large-scale clinical populations.


2021 ◽  
Vol 8 ◽  
Author(s):  
Zhihao Qu ◽  
Dezhi Tang ◽  
Zhu Wang ◽  
Xiaqiao Li ◽  
Hongjian Chen ◽  
...  

Pitting corrosion seriously harms the service life of oil field gathering and transportation pipelines, which is an important subject of corrosion prevention. In this study, we collected the corrosion data of pipeline steel immersion experiment and established a pitting judgment model based on machine learning algorithm. Feature reduction methods, including feature importance calculation and pearson correlation analysis, were first adopted to find the important factors affecting pitting. Then, the best input feature set for pitting judgment was constructed by combining feature combination and feature creation. Through receiver operating characteristic (ROC) curve and area under curve (AUC) calculation, random forest algorithm was selected as the modeling algorithm. As a result, the pitting judgment model based on machine learning and high dimensional feature parameters (i.e., material factors, solution factors, environment factors) showed good prediction accuracy. This study provided an effective means for processing high-dimensional and complex corrosion data, and proved the feasibility of machine learning in solving material corrosion problems.


2021 ◽  
Vol 13 (7) ◽  
pp. 168781402110300
Author(s):  
Zhiming Zhang ◽  
Yapeng Shang ◽  
Tong Zhang

The aim of this study is to obtain the deflection curve equations of endplates with one to five clamping belts which allows investigating endplates deflection for uniform contact pressure distribution. Based on an equivalent mechanical model for a large fuel cell stack, the effects of the thicknesses of endplates, numbers, and positions of clamping belts are discussed, and the optimal thickness of endplate with different clamping belts is obtained, and moreover the optimal position of intermediate and outer clamping belts on the endplates. Finally, a three-dimensional finite element analysis (FEA) of a fuel cell stack clamping with steel belts and nonlinear contact elements is compared to what the equivalent mechanical beam model predicts. The result of this study shows that the equivalent mechanical model gives good prediction accuracy for the deflection behavior of endplates and the clamping force of the fuel cell stack, which is effective and helpful for the design of a large fuel cell stack assembly.


Sign in / Sign up

Export Citation Format

Share Document