scholarly journals iPReditor-CMG: Improving Predictive RNA Editor for Crop Mitochondrial Genomes Using Genomic Sequence Features and Optimal Support Vector Machine

Author(s):  
Sidong Qin ◽  
Yanjun Fan ◽  
Shengnan Hu ◽  
Yongqiang Wang ◽  
Ziqi Wang ◽  
...  

Cytosine (C) to uracil (U) RNA editing is one of the most important post-transcriptional processes, however exploring C-to-U editing events efficiently within the crop mitochondrial genome remains a challenge. An improving predictive RNA editor for crop mitochondrial genomes, iPReditor-CMG, was proposed, which was based on SVM, three common crop mitochondrial genomes and self-sequenced tobacco mitochondrial ATPase. After multi-combination feature extracting, high-dimension feature screening and multi-test independent predicting, the results showed that the average accuracy of intraspecific prediction was 0.85, and the highest value even up to 0.91, which outperformed the previous reference models. While the prediction accuracies were 0.78 between dicotyledons and no more than 0.56 between dicotyledons and monocotyledons, implying a possible similarity in C-to-U editing mechanisms among close relatives. The best model was finally identified with an independent test accuracy of 0.91 and an area under the curve of 0.88, and further suggested that five unreported feature sequences TGACA, ACAAC, GTAGA, CCGTT and TAACA were closely associated with the editing phenomenon. Multiple evaluation findings supported that the iPReditor-CMG could be effectively applied to predict crop mitochondrial editing sites, which may contribute to insight into their recognition mechanisms and even other post-transcriptional events in crop mitochondria.


2021 ◽  
Author(s):  
Hsueh-Chun Lin ◽  
Wei-Ting Hsiao ◽  
Yao-Chiang Kan ◽  
Chin-Chi Kuo ◽  
Sin-Kuo Chai

BACKGROUND Ubiquitous health management (UHM) services are getting used to working with intelligent medical technology. People expect to track their daily electrocardiogram (ECG) signals at home with AI-enabled computing for assessing arrhythmias, such as atrial premature atrial complex (APC), atrial fibrillation (AFib), ventricular premature complex (VPC), and ventricular tachycardia (VT), versus normal sinus rhythm (NSR). OBJECTIVE We established a uHealth management system prototype “ECG4UHM” with coupled machine learning models to recognize hybrid arrhythmia ECG patterns via online AI-computation. METHODS We coupled analytical modeling with machine-learning methods, such as multiple layer perceptron (MLP), random forest (RF), support vector machine (SVM), and naïve Bayes (NB), to recognize the hybrid patterns of four arrhythmia symptoms. The data pre-processing used Hilbert-Huang transform (HHT) with empirical mode decomposition to calculate ECG’s intrinsic mode functions (IMFs). The area centroids of the marginal Hilbert spectrum for the first three-order IMFs were suggested as the features. A strategy of simulative dataset modeling was conducted in the few-data resource analysis for comparison. The modeling was compiled to the modules “MMLCA” and “ECGHHT” for implementation. We engaged the MATLAB compiler and runtime server in the web-based ECG4UHM to build the recognition modules in the Java class library “HHTFeature” and drive the AI-computation to identify the arrhythmia symptoms, respectively. RESULTS The crucial datasets were extracted from the MIT-BIH arrhythmia open database in the modeling, which also derived the simulative datasets based on the features’ means and standard deviations. Four arrhythmia symptoms were studied, including the premature pattern (APC, VPC) and the fibril-rapid pattern (AFib, VT) against NSR. With 5-fold cross-validation, the RF model performed the best accuracy of 0.99 for the both patterns due to the crucial dataset. The test dataset evaluated the model and reached the area under the curve (AUC) of receiver operating characteristic about 0.99. The models for all hybrid patterns, excluding VPC versus AFib and VT, achieved an average accuracy of around 90%. With the prediction test, the respective AUCs of the NSR & APC versus the AFib, VPC, & VT were 0.94 and 0.93 for the RF and SVM on average. The average accuracy and the AUC of MLP, RF, and SVM models for APC-VT reached the value of 0.98 due to both crucial and simulative datasets. The feasible models for the hybrid patterns were installed in ECG4UHM to recognize four arrhythmia symptoms online for the UHM. CONCLUSIONS The self-developed system involved the AI-enabled modeling in recognizing hybrid patterns of the multiple arrhythmia symptoms in practice. The modeling can be extended to serve the UHM and home-isolated care in the future.



Author(s):  
Alexis David Pascual ◽  
Kenneth McIsaac ◽  
Gordon Osinski

Autonomous image recognition has numerous potential applications in the field of planetary science and geology. For instance, having the ability to classify images of rocks would allow geologists to have immediate feedback without having to bring back samples to the laboratory. Also, planetary rovers could classify rocks in remote places and even in other planets without needing human intervention. Shu et al. classified 9 different types of rock images using a Support Vector Machine (SVM) with the image features extracted autonomously. Through this method, the authors achieved a test accuracy of 96.71%. In this research, Convolutional Neural Networks(CNN) have been used to classify the same set of rock images. Results show that a 3-layer network obtains an average accuracy of 99.60% across 10 trials on the test set. A version of Self-taught Learning was also implemented to prove the generalizability of the features extracted by the CNN. Finally, one model has been chosen to be deployed on a mobile device to demonstrate practicality and portability. The deployed model achieves a perfect classification accuracy on the test set, while taking only 0.068 seconds to make a prediction, equivalent to about 14 frames per second.



2021 ◽  
Vol 10 (5) ◽  
pp. 992
Author(s):  
Martina Barchitta ◽  
Andrea Maugeri ◽  
Giuliana Favara ◽  
Paolo Marco Riela ◽  
Giovanni Gallo ◽  
...  

Patients in intensive care units (ICUs) were at higher risk of worsen prognosis and mortality. Here, we aimed to evaluate the ability of the Simplified Acute Physiology Score (SAPS II) to predict the risk of 7-day mortality, and to test a machine learning algorithm which combines the SAPS II with additional patients’ characteristics at ICU admission. We used data from the “Italian Nosocomial Infections Surveillance in Intensive Care Units” network. Support Vector Machines (SVM) algorithm was used to classify 3782 patients according to sex, patient’s origin, type of ICU admission, non-surgical treatment for acute coronary disease, surgical intervention, SAPS II, presence of invasive devices, trauma, impaired immunity, antibiotic therapy and onset of HAI. The accuracy of SAPS II for predicting patients who died from those who did not was 69.3%, with an Area Under the Curve (AUC) of 0.678. Using the SVM algorithm, instead, we achieved an accuracy of 83.5% and AUC of 0.896. Notably, SAPS II was the variable that weighted more on the model and its removal resulted in an AUC of 0.653 and an accuracy of 68.4%. Overall, these findings suggest the present SVM model as a useful tool to early predict patients at higher risk of death at ICU admission.



2019 ◽  
Vol 45 (10) ◽  
pp. 3193-3201 ◽  
Author(s):  
Yajuan Li ◽  
Xialing Huang ◽  
Yuwei Xia ◽  
Liling Long

Abstract Purpose To explore the value of CT-enhanced quantitative features combined with machine learning for differential diagnosis of renal chromophobe cell carcinoma (chRCC) and renal oncocytoma (RO). Methods Sixty-one cases of renal tumors (chRCC = 44; RO = 17) that were pathologically confirmed at our hospital between 2008 and 2018 were retrospectively analyzed. All patients had undergone preoperative enhanced CT scans including the corticomedullary (CMP), nephrographic (NP), and excretory phases (EP) of contrast enhancement. Volumes of interest (VOIs), including lesions on the images, were manually delineated using the RadCloud platform. A LASSO regression algorithm was used to screen the image features extracted from all VOIs. Five machine learning classifications were trained to distinguish chRCC from RO by using a fivefold cross-validation strategy. The performance of the classifier was mainly evaluated by areas under the receiver operating characteristic (ROC) curve and accuracy. Results In total, 1029 features were extracted from CMP, NP, and EP. The LASSO regression algorithm was used to screen out the four, four, and six best features, respectively, and eight features were selected when CMP and NP were combined. All five classifiers had good diagnostic performance, with area under the curve (AUC) values greater than 0.850, and support vector machine (SVM) classifier showed a diagnostic accuracy of 0.945 (AUC 0.964 ± 0.054; sensitivity 0.999; specificity 0.800), showing the best performance. Conclusions Accurate preoperative differential diagnosis of chRCC and RO can be facilitated by a combination of CT-enhanced quantitative features and machine learning.



Cancers ◽  
2021 ◽  
Vol 13 (6) ◽  
pp. 1407
Author(s):  
Matyas Bukva ◽  
Gabriella Dobra ◽  
Juan Gomez-Perez ◽  
Krisztian Koos ◽  
Maria Harmati ◽  
...  

Investigating the molecular composition of small extracellular vesicles (sEVs) for tumor diagnostic purposes is becoming increasingly popular, especially for diseases for which diagnosis is challenging, such as central nervous system (CNS) malignancies. Thorough examination of the molecular content of sEVs by Raman spectroscopy is a promising but hitherto barely explored approach for these tumor types. We attempt to reveal the potential role of serum-derived sEVs in diagnosing CNS tumors through Raman spectroscopic analyses using a relevant number of clinical samples. A total of 138 serum samples were obtained from four patient groups (glioblastoma multiforme, non-small-cell lung cancer brain metastasis, meningioma and lumbar disc herniation as control). After isolation, characterization and Raman spectroscopic assessment of sEVs, the Principal Component Analysis–Support Vector Machine (PCA–SVM) algorithm was performed on the Raman spectra for pairwise classifications. Classification accuracy (CA), sensitivity, specificity and the Area Under the Curve (AUC) value derived from Receiver Operating Characteristic (ROC) analyses were used to evaluate the performance of classification. The groups compared were distinguishable with 82.9–92.5% CA, 80–95% sensitivity and 80–90% specificity. AUC scores in the range of 0.82–0.9 suggest excellent and outstanding classification performance. Our results support that Raman spectroscopic analysis of sEV-enriched isolates from serum is a promising method that could be further developed in order to be applicable in the diagnosis of CNS tumors.



2020 ◽  
Vol 2020 ◽  
pp. 1-11
Author(s):  
Yuanyuan Xu ◽  
Genke Yang ◽  
Jiliang Luo ◽  
Jianan He

Electronic component recognition plays an important role in industrial production, electronic manufacturing, and testing. In order to address the problem of the low recognition recall and accuracy of traditional image recognition technologies (such as principal component analysis (PCA) and support vector machine (SVM)), this paper selects multiple deep learning networks for testing and optimizes the SqueezeNet network. The paper then presents an electronic component recognition algorithm based on the Faster SqueezeNet network. This structure can reduce the size of network parameters and computational complexity without deteriorating the performance of the network. The results show that the proposed algorithm performs well, where the Receiver Operating Characteristic Curve (ROC) and Area Under the Curve (AUC), capacitor and inductor, reach 1.0. When the FPR is less than or equal 10 − 6   level, the TPR is greater than or equal to 0.99; its reasoning time is about 2.67 ms, achieving the industrial application level in terms of time consumption and performance.



Author(s):  
Zepei Wu ◽  
Shuo Liu ◽  
Delong Zhao ◽  
Ling Yang ◽  
Zixin Xu ◽  
...  

AbstractCloud particles have different shapes in the atmosphere. Research on cloud particle shapes plays an important role in analyzing the growth of ice crystals and the cloud microphysics. To achieve an accurate and efficient classification algorithm on ice crystal images, this study uses image-based morphological processing and principal component analysis, to extract features of images and apply intelligent classification algorithms for the Cloud Particle Imager (CPI). Currently, there are mainly two types of ice-crystal classification methods: one is the mode parameterization scheme, and the other is the artificial intelligence model. Combined with data feature extraction, the dataset was tested on ten types of classifiers, and the highest average accuracy was 99.07%. The fastest processing speed of the real-time data processing test was 2,000 images/s. In actual application, the algorithm should consider the processing speed, because the images are in the order of millions. Therefore, a support vector machine (SVM) classifier was used in this study. The SVM-based optimization algorithm can classify ice crystals into nine classes with an average accuracy of 95%, blurred frame accuracy of 100%, with a processing speed of 2,000 images/s. This method has a relatively high accuracy and faster classification processing speed than the classic neural network model. The new method could be also applied in physical parameter analysis of cloud microphysics.



Sensors ◽  
2018 ◽  
Vol 18 (9) ◽  
pp. 2936 ◽  
Author(s):  
Xianghao Zhan ◽  
Xiaoqing Guan ◽  
Rumeng Wu ◽  
Zhan Wang ◽  
You Wang ◽  
...  

As alternative herbal medicine gains soar in popularity around the world, it is necessary to apply a fast and convenient means for classifying and evaluating herbal medicines. In this work, an electronic nose system with seven classification algorithms is used to discriminate between 12 categories of herbal medicines. The results show that these herbal medicines can be successfully classified, with support vector machine (SVM) and linear discriminant analysis (LDA) outperforming other algorithms in terms of accuracy. When principal component analysis (PCA) is used to lower the number of dimensions, the time cost for classification can be reduced while the data is visualized. Afterwards, conformal predictions based on 1NN (1-Nearest Neighbor) and 3NN (3-Nearest Neighbor) (CP-1NN and CP-3NN) are introduced. CP-1NN and CP-3NN provide additional, yet significant and reliable, information by giving the confidence and credibility associated with each prediction without sacrificing of accuracy. This research provides insight into the construction of a herbal medicine flavor library and gives methods and reference for future works.



2020 ◽  
Vol 10 (18) ◽  
pp. 6417 ◽  
Author(s):  
Emanuele Lattanzi ◽  
Giacomo Castellucci ◽  
Valerio Freschi

Most road accidents occur due to human fatigue, inattention, or drowsiness. Recently, machine learning technology has been successfully applied to identifying driving styles and recognizing unsafe behaviors starting from in-vehicle sensors signals such as vehicle and engine speed, throttle position, and engine load. In this work, we investigated the fusion of different external sensors, such as a gyroscope and a magnetometer, with in-vehicle sensors, to increase machine learning identification of unsafe driver behavior. Starting from those signals, we computed a set of features capable to accurately describe the behavior of the driver. A support vector machine and an artificial neural network were then trained and tested using several features calculated over more than 200 km of travel. The ground truth used to evaluate classification performances was obtained by means of an objective methodology based on the relationship between speed, and lateral and longitudinal acceleration of the vehicle. The classification results showed an average accuracy of about 88% using the SVM classifier and of about 90% using the neural network demonstrating the potential capability of the proposed methodology to identify unsafe driver behaviors.



Sensors ◽  
2020 ◽  
Vol 20 (9) ◽  
pp. 2630 ◽  
Author(s):  
Erika Rovini ◽  
Carlo Maremmani ◽  
Filippo Cavallo

Objective assessment of the motor evaluation test for Parkinson’s disease (PD) diagnosis is an open issue both for clinical and technical experts since it could improve current clinical practice with benefits both for patients and healthcare systems. In this work, a wearable system composed of four inertial devices (two SensHand and two SensFoot), and related processing algorithms for extracting parameters from limbs motion was tested on 40 healthy subjects and 40 PD patients. Seventy-eight and 96 kinematic parameters were measured from lower and upper limbs, respectively. Statistical and correlation analysis allowed to define four datasets that were used to train and test five supervised learning classifiers. Excellent discrimination between the two groups was obtained with all the classifiers (average accuracy ranging from 0.936 to 0.960) and all the datasets (average accuracy ranging from 0.953 to 0.966), over three conditions that included parameters derived from lower, upper or all limbs. The best performances (accuracy = 1.00) were obtained when classifying all the limbs with linear support vector machine (SVM) or gaussian SVM. Even if further studies should be done, the current results are strongly promising to improve this system as a support tool for clinicians in objectifying PD diagnosis and monitoring.



Sign in / Sign up

Export Citation Format

Share Document