svm model
Recently Published Documents





SinkrOn ◽  
2022 ◽  
Vol 7 (1) ◽  
pp. 39-45
Nur Ghaniaviyanto Ramadhan ◽  
Teguh Ikhlas Ramadhan

A movie is a spectacle that can be done at a relaxed time. Currently, there are many movies that can be watched via the internet or cinema. Movies that are watched on the internet are sometimes charged to watch so that potential viewers before watching a movie will read comments from users who have watched the movie. The website that is often used to view movie comments today is IMDB. Movie comments are many and varied on the IMDB website, we can see comments based on the star rating aspect. This causes users to have difficulty analyzing other users' comments. So, this study aims to analyze the sentiment of opinions from several comments from IMDB website users using the star rating aspect and will be classified using the support vector machine method (SVM). Sentiment analysis is a classification process to understand the opinions, interactions, and emotions of a document or text. SVM is very efficient for many applications in science and engineering, especially for classification (pattern recognition) problems. In addition to the SVM method, the TF-IDF technique is also used to change the shape of the document into several words. The results obtained by applying the SVM model are 79% accuracy, 75% precision, and 87% recall. The SVM classification is also superior to other methods, namely logistic regression.

2022 ◽  
Vol 3 (4) ◽  
pp. 283-294
M. Duraipandian ◽  
R. Vinothkanna

Customers post online product reviews based on their own experience. They may share their thoughts and comments on items on online shopping websites. The sentiment analysis comprises of opinion or idea process and process of sorting high rating reviews according to how well the product satisfies. Opinion mining is a technique for extracting useful data from large amounts of texts in order to use those to enhance or expand a company's operations. According to consumer evaluations, many of the goods aren't as good as they seem. It's common that buyers submit their thoughts on a product but then forget to rate it. The prior data preprocessing is more efficient to extract the features by CNN approach. This proposed methodology breaks down each user's rating prediction model into two parts: one based on the review text and other based on the user rating matrix with the help of CNN feature engineering. The goal of this study is to classify all reviews into ratings by SVM model. This proposed classification model provides good accuracy to predict the online reviews efficiently. For reviews without ratings, a further prediction of feelings is generated using multiple classifiers. The benefits of this proposed model are honed using helpfulness ratings from a small number of evaluations such as accuracy, F1 score, sensitivity, and precision. According to studies using the standard benchmark dataset, the accuracy of customized recommendation services, user happiness, and corporate trust may all be enhanced by including review helpfulness information in the recommender system.

Zhiyong Yang ◽  
Jing Wen ◽  
Kaide Huang

AbstractThere is a wide demand for people counting and pedestrian flow monitoring in large public places such as scenic tourist areas, shopping malls, stations, squares, and so on. Based on the feedback from the pedestrian flow monitoring system, resources can be optimally allocated to maximize social and economic benefits. Moreover, trampling accidents can be avoided because pedestrian guidance is carried out in time. In order to meet these requirements, we propose a method of pedestrian flow monitoring based on the received signal strength (RSS) of wireless sensor networks. This method mainly utilizes the shadow attenuation effect of pedestrians on radio frequency (RF) signals of effective links. In this paper, a deployment structure of RF wireless sensor network is firstly designed to monitor the pedestrians. Secondly, the features are extracted from the wavelet decomposition of RSS signal series with a short time. Lastly, the support vector machine (SVM) algorithm is trained by an experimental data set to distinguish the instantaneous number of pedestrian passing through the monitoring point. In the case of dense and sparse indoor personnel density, the accuracy of the SVM model is 88.9% and 94.5%, respectively. In the outdoor environment, the accuracy of the SVM model is 92.9%. The experimental results show that this method can realize the high precision monitoring of the flow of people in the context of real-time pedestrian flow monitoring.

2021 ◽  
Vol 12 (1) ◽  
pp. 186
Mehmet Şükrü Özçoban ◽  
Muhammed Erdem Isenkul ◽  
Selçuk Sevgen ◽  
Seren Acarer ◽  
Mertol Tüfekci

Impermeable base layers that are made of materials with low permeability, such as clay soil, are necessary to prevent leachate in landfills from harming the environment. However, over time, the permeability of the clay soil changes. Therefore, to reduce and minimize the risk, the permeability-related characteristics of the base layers must be improved. Thus, this study aims to serve this purpose by experimentally investigating the effects of nanomaterial addition (aluminum oxide, iron oxide) into kaolin samples. The obtained samples are prepared by applying standard compaction, and the permeability of the soil sample is experimentally investigated by passing leachate from the reactors, in which these samples are placed. Therefore, Flow Resistance (FR) analysis is conducted and the obtained results show that the Al additives are more successful than the Fe additive in reducing leachate permeability. Besides, the concentration values of some polluting parameters (Chemical Oxygen Demand (COD), Total Kjeldahl Nitrogen (TKN), and Total Phosphorus (TP)) at the inlet and outlet of the reactors are analyzed. Three different models (Artificial Neural Networks (ANN), Multiple Linear Regression (MLR), Support Vector Machine (SVM)) are applied to the data obtained from the experimental study. The results have shown that polluting parameters produce high FR regression similarity rates (>75%), TKN, TP, and COD features are highly correlated with the FR value (>60%) and the most successful method is found to be the SVM model.

2021 ◽  
Guosheng Zhang ◽  
Tongyu Xu ◽  
Youwen Tian ◽  
Shuai Feng ◽  
Dongxue Zhao ◽  

Abstract Background: Hyperspectral imaging is an emerging technology applied in plant disease research, including disease detection, multiple disease identification, disease severity assessment, and disease resistance evaluation. Rice leaf blast is prevalent all over the world and is a serious threat to rice yield and quality. In this paper, the standard deviation (STD) of the spectral reflectance of whole leaves was calculated and a support vector machine (SVM) model was built to classify the degree of rice leaf blast at different growth stages.Results: The classification accuracy of the full-spectrum-based SVM model at jointing stage, booting stage and heading stage was 94.44%, 81.58% and 80.48%, respectively. The corresponding macro recall values were 0.9714, 0.715 and 0.79. The average STD of the spectral reflectance of the whole leaf differed not only within samples with different disease grades, but also those with the same disease level. Conclusion: The STD of the spectral reflectance of whole leaf could be utilized to classify the rice leaf blast degree at different growth stages. The classification method was derived from physiological phenomena that were visible to the naked eye, making it more intuitive and convincing.

2021 ◽  
Vol 2021 ◽  
pp. 1-11
Bin Li ◽  
Le Kui ◽  
Jingdong Luo ◽  
Shiyong Chen

Mechanical equipment is a key component of mechanical equipment, and its working condition is directly related to the overall performance of mechanical equipment. Accurate evaluation and prediction of the performance degradation trend of mechanical equipment is of great significance to ensure the reliability and safety of the mechanical equipment system. Based on the data of typical faulty equipment, this paper analyzes the energy characteristic parameters of mechanical equipment under different types and degrees of failure in the time domain. Using amplitude spectrum analysis, Hilbert envelope demodulation and wavelet packet decomposition method, and other vibration signal processing methods, preliminary extraction of multiple statistical feature parameters are given. Secondly, in view of the irrelevant and redundant components of multiple statistical parameters, a new method for extracting fault features of mechanical equipment based on variance value and principal component analysis is proposed. This method can effectively classify the fault status of mechanical equipment. The effectiveness of the method is verified by actual equipment signals. After that, the value extracted from the vibration signal of the double-row roller equipment is used as the degradation feature. In order to reduce the influence of irregular characteristics in the vibration signal and simplify the complexity of the vibration signal, the wavelet transform and the support vector machine model are combined, according to the degradation after decomposition. The 95% confidence interval of the predicted value is also given. The SVM model is established based on data characteristics, and single-step and multistep prediction of equipment degradation trends are carried out. The prediction result shows that, according to the mapping position formula, the distribution of equipment degradation prediction points is obtained, and a 95% confidence interval based on the distribution of the prediction points is given. Finally, on the basis of completing feature extraction, this paper applies an unsupervised feature selection method. The sensitive characteristics of life prediction and the prediction results of a single SVM model and a neural network model are compared and analyzed at the same time.

2021 ◽  
Vol 8 ◽  
Liang Chen ◽  
Ya Shen ◽  
Xiao Huang ◽  
Hua Li ◽  
Jian Li ◽  

Aim: The purpose of this work was to develop and evaluate magnetic resonance imaging (MRI)-based radiomics for differentiation of orbital cavernous hemangioma (OCH) and orbital schwannoma (OSC).Methods: Fifty-eight patients (40 OCH and 18 OSC, confirmed pathohistologically) screened out from 216 consecutive patients who presented between 2015 and 2020 were divided into a training group (28 OCH and 12 OSC) and a validation group (12 OCH and 6 OSC). Radiomics features were extracted from T1-weighted imaging (T1WI) and T2-weighted imaging (T2WI). T-tests, the least absolute shrinkage and selection operator (LASSO), and principal components analysis (PCA) were used to select features for use in the classification models. A logistic regression (LR) model, support vector machine (SVM) model, decision tree (DT) model, and random forest (RF) model were constructed to differentiate OCH from OSC. The models were evaluated according to their accuracy and the area under the receiver operator characteristic (ROC) curve (AUC).Results: Six features from T1WI, five features from T2WI, and eight features from combined T1WI and T2WI were finally selected for building the classification models. The models using T2WI features showed superior performance on the validation data than those using T1WI features, especially the LR model and SVM model, which showed accuracy of 93% (85–100%) and 92%, respectively, The SVM model showed high accuracy of 93% (91–96%) on the combined feature group with an AUC of 98% (97–99%). The DT and RF models did not perform as well as the SVM model.Conclusion: Radiomics analysis using an SVM model achieved an accuracy of 93% for distinguishing OCH and OSC, which may be helpful for clinical diagnosis.

2021 ◽  
Vol ahead-of-print (ahead-of-print) ◽  
Ikhlaas Gurrib ◽  
Firuz Kamalov

Purpose Cryptocurrencies such as Bitcoin (BTC) attracted a lot of attention in recent months due to their unprecedented price fluctuations. This paper aims to propose a new method for predicting the direction of BTC price using linear discriminant analysis (LDA) together with sentiment analysis. Design/methodology/approach Concretely, the authors train an LDA-based classifier that uses the current BTC price information and BTC news announcements headlines to forecast the next-day direction of BTC prices. The authors compare the results with a Support Vector Machine (SVM) model and random guess approach. The use of BTC price information and news announcements related to crypto enables us to value the importance of these different sources and types of information. Findings Relative to the LDA results, the SVM model was more accurate in predicting BTC next day’s price movement. All models yielded better forecasts of an increase in tomorrow’s BTC price compared to forecasting a decrease in the crypto price. The inclusion of news sentiment resulted in the highest forecast accuracy of 0.585 on the test data, which is superior to a random guess. The LDA (SVM) model with asset specific (news sentiment and asset specific) input features ranked first within their respective model classifiers, suggesting both BTC news sentiment and asset specific are prized factors in predicting tomorrow’s price direction. Originality/value To the best of the authors’ knowledge, this is the first study to analyze the potential effect of crypto-related sentiment and BTC specific news on BTC’s price using LDA and sentiment analysis.

2021 ◽  
Vol 8 ◽  
Yan Gao ◽  
Xueke Bai ◽  
Jiapeng Lu ◽  
Lihua Zhang ◽  
Xiaofang Yan ◽  

Background: Heart failure with preserved ejection fraction (HFpEF) is increasingly recognized as a major global public health burden and lacks effective risk stratification. We aimed to assess a multi-biomarker model in improving risk prediction in HFpEF.Methods: We analyzed 18 biomarkers from the main pathophysiological domains of HF in 380 patients hospitalized for HFpEF from a prospective cohort. The association between these biomarkers and 2-year risk of all-cause death was assessed by Cox proportional hazards model. Support vector machine (SVM), a supervised machine learning method, was used to develop a prediction model of 2-year all-cause and cardiovascular death using a combination of 18 biomarkers and clinical indicators. The improvement of this model was evaluated by c-statistics, net reclassification improvement (NRI), and integrated discrimination improvement (IDI).Results: The median age of patients was 71-years, and 50.5% were female. Multiple biomarkers independently predicted the 2-year risk of death in Cox regression model, including N-terminal pro B-type brain-type natriuretic peptide (NT-proBNP), high-sensitivity cardiac troponin T (hs-TnT), growth differentiation factor-15 (GDF-15), tumor necrosis factor-α (TNFα), endoglin, and 3 biomarkers of extracellular matrix turnover [tissue inhibitor of metalloproteinases (TIMP)-1, matrix metalloproteinase (MMP)-2, and MMP-9) (FDR < 0.05). The SVM model effectively predicted the 2-year risk of all-cause death in patients with acute HFpEF in training set (AUC 0.834, 95% CI: 0.771–0.895) and validation set (AUC 0.798, 95% CI: 0.719–0.877). The NRI and IDI indicated that the SVM model significantly improved patient classification compared to the reference model in both sets (p < 0.05).Conclusions: Multiple circulating biomarkers coupled with an appropriate machine-learning method could effectively predict the risk of long-term mortality in patients with acute HFpEF. It is a promising strategy for improving risk stratification in HFpEF.

Sign in / Sign up

Export Citation Format

Share Document