Classification of the fragrant styles and evaluation of the aromatic quality of flue-cured tobacco leaves by machine-learning methods

2016 ◽  
Vol 14 (06) ◽  
pp. 1650033 ◽  
Author(s):  
Li Gu ◽  
Lichun Xue ◽  
Qi Song ◽  
Fengji Wang ◽  
Huaqin He ◽  
...  

During commercial transactions, the quality of flue-cured tobacco leaves must be characterized efficiently, and the evaluation system should be easily transferable across different traders. However, there are over 3000 chemical compounds in flue-cured tobacco leaves; thus, it is impossible to evaluate the quality of flue-cured tobacco leaves using all the chemical compounds. In this paper, we used Support Vector Machine (SVM) algorithm together with 22 chemical compounds selected by ReliefF-Particle Swarm Optimization (R-PSO) to classify the fragrant style of flue-cured tobacco leaves, where the Accuracy (ACC) and Matthews Correlation Coefficient (MCC) were 90.95% and 0.80, respectively. SVM algorithm combined with 19 chemical compounds selected by R-PSO achieved the best assessment performance of the aromatic quality of tobacco leaves, where the PCC and MSE were 0.594 and 0.263, respectively. Finally, we constructed two online tools to classify the fragrant style and evaluate the aromatic quality of flue-cured tobacco leaf samples. These tools can be accessed at http://bioinformatics.fafu.edu.cn/tobacco .

2021 ◽  
Author(s):  
Hanna Klimczak ◽  
Wojciech Kotłowski ◽  
Dagmara Oszkiewicz ◽  
Francesca DeMeo ◽  
Agnieszka Kryszczyńska ◽  
...  

<p>The aim of the project is the classification of asteroids according to the most commonly used asteroid taxonomy (Bus-Demeo et al. 2009) with the use of various machine learning methods like Logistic Regression, Naive Bayes, Support Vector Machines, Gradient Boosting and Multilayer Perceptrons. Different parameter sets are used for classification in order to compare the quality of prediction with limited amount of data, namely the difference in performance between using the 0.45mu to 2.45mu spectral range and multiple spectral features, as well as performing the Prinicpal Component Analysis to reduce the dimensions of the spectral data.</p> <p> </p> <p>This work has been supported by grant No. 2017/25/B/ST9/00740 from the National Science Centre, Poland.</p>


2020 ◽  
Vol 4 (1) ◽  
pp. 1-6
Author(s):  
Irzal Ahmad Sabilla ◽  
Chastine Fatichah

Vegetables are ingredients for flavoring, such as tomatoes and chilies. A Both of these ingredients are processed to accompany the people's staple food in the form of sauce and seasoning. In supermarkets, these vegetables can be found easily, but many people do not understand how to choose the type and quality of chilies and tomatoes. This study discusses the classification of types of cayenne, curly, green, red chilies, and tomatoes with good and bad conditions using machine learning and contrast enhancement techniques. The machine learning methods used are Support Vector Machine (SVM), K-Nearest Neighbor (K-NN), Linear Discriminant Analysis (LDA), and Random Forest (RF). The results of testing the best method are measured based on the value of accuracy. In addition to the accuracy of this study, it also measures the speed of computation so that the methods used are efficient.


2013 ◽  
Vol 38 (2) ◽  
pp. 374-379 ◽  
Author(s):  
Zhi-Li PAN ◽  
Meng QI ◽  
Chun-Yang WEI ◽  
Feng LI ◽  
Shi-Xiang ZHANG ◽  
...  

The article aims to develop a model for forecasting the characteristics of traffic flows in real-time based on the classification of applications using machine learning methods to ensure the quality of service. It is shown that the model can forecast the mean rate and frequency of packet arrival for the entire flow of each class separately. The prediction is based on information about the previous flows of this class and the first 15 packets of the active flow. Thus, the Random Forest Regression method reduces the prediction error by approximately 1.5 times compared to the standard mean estimate for transmitted packets issued at the switch interface.


Sensors ◽  
2019 ◽  
Vol 19 (15) ◽  
pp. 3349 ◽  
Author(s):  
Maciej Roman Nowak ◽  
Rafał Zdunek ◽  
Edward Pliński ◽  
Piotr Świątek ◽  
Małgorzata Strzelecka ◽  
...  

In this study, we presented the concept and implementation of a fully functional system for the recognition of bi-heterocyclic compounds. We have conducted research into the application of machine learning methods to correctly recognize compounds based on THz spectra, and we have described the process of selecting optimal parameters for the kernel support vector machine (KSVM) with an additional `unknown’ class. The chemical compounds used in the study contain a target molecule, used in pharmacy to combat inflammatory states formed in living organisms. Ready-made medical products with similar properties are commonly referred to as non-steroidal anti-inflammatory drugs (NSAIDs) once authorised on the pharmaceutical market. It was crucial to clearly determine whether the tested sample is a chemical compound known to researchers or is a completely new structure which should be additionally tested using other spectrometric methods. Our approach allows us to achieve 100% accuracy of the classification of the tested chemical compounds in the time of several milliseconds counted for 30 samples of the test set. It fits perfectly into the concept of rapid recognition of bi-heterocyclic compounds without the need to analyse the percentage composition of compound components, assuming that the sample is classified in a known group. The method allows us to minimize testing costs and significant reduction of the time of analysis.


2014 ◽  
Vol 548-549 ◽  
pp. 1265-1269
Author(s):  
Yun Sik Hwang ◽  
Byeong Joo Jun ◽  
Tae Seon Yoon

As the stage of bioinformatics has been upgraded, classification of certain pathogen has been improved into a new manner. The main topic of this research is genetic singularity of HCV (Hepatitis C Virus) and our objective is to assay features of the HCV's amino acid under usage of Support Vector Machine (SVM) algorithm. HCV data used in our experiment has 10 kinds of sequences and 257 kinds of data. According to data analysis, some peculiar genetic patterns of HCV’s linearity that discord pre-existing neural network and C5.0 were found.


2014 ◽  
Vol 644-650 ◽  
pp. 2587-2590
Author(s):  
Yu Qin Xu ◽  
Xiang Ling Zhan ◽  
Zheng Ren ◽  
Tong Li ◽  
Guo Hua Qiao ◽  
...  

The quantitative classification of transformer supervision can improve the quality of transformer supervision. For the contents of transformer supervision, this paper establishes an index system of transformer supervision. Example results show that the extension matter-element theory which is applied to transformer supervision quantitative evaluation is more reasonable. According to the categories of transformer supervision, this paper quantifies the uncertainty and builds the evaluation system of transformer supervision. It realizes the evaluation of transformer supervision and gets the key factors which influence the transformer supervision. This evaluation system is conducive to guide the transformer supervision to be more scientific and effective.


2019 ◽  
Vol 7 ◽  
pp. 61-69
Author(s):  
Bikash Chawal ◽  
Sanjeev Prasad Panday

Crop disease epidemics can cause severe losses and affect agricultural products and food security especially in south Asian countries and Nepal where rice is enjoyed as a staple throughout the year. To achieve automatic diagnosis of crop disease the proposed system aims to develop a prototype system for detection of the paddy disease. Image recognition of the disease would be conducted based on Image Processing techniques to enhance the quality of the image and Twin Support Vector Machine (TSVM) technique to classify the paddy disease. The methodology involves image acquisition, pre-processing, analysis and classification of the paddy disease. All the paddy sample images will be passed through the RGB calculation before it proceeds to the binary conversion. If the sample is in the range of normal paddy RGB, then it is automatically classify as normal. Then, all the segmented paddy disease sample will be converted into the binary data in data base before proceed through the TSVM for training and testing. The proposed system is targeted to achieve better recognition results.


2021 ◽  
Vol 5 (3) ◽  
pp. 905
Author(s):  
Muhammad Afrizal Amrustian ◽  
Vika Febri Muliati ◽  
Elsa Elvira Awal

Japanese is one of the most difficult languages to understand and read. Japanese writing that does not use the alphabet is the reason for the difficulty of the Japanese language to read. There are three types of Japanese, namely kanji, katakana, and hiragana. Hiragana letters are the most commonly used type of writing. In addition, hiragana has a cursive nature, so each person's writing will be different. Machine learning methods can be used to read Japanese letters by recognizing the image of the letters. The Japanese letters that are used in this study are hiragana vowels. This study focuses on conducting a comparative study of machine learning methods for the image classification of Japanese letters. The machine learning methods that were successfully compared are Naïve Bayes, Support Vector Machine, Decision Tree, Random Forest, and K-Nearest Neighbor. The results of the comparative study show that the K-Nearest Neighbor method is the best method for image classification of hiragana vowels. K-Nearest Neighbor gets an accuracy of 89.4% with a low error rate.


2019 ◽  
Author(s):  
Rüdiger Pryss ◽  
Winfried Schlee ◽  
Burkhard Hoppenstedt ◽  
Manfred Reichert ◽  
Myra Spiliopoulou ◽  
...  

BACKGROUND Tinnitus is often described as the phantom perception of a sound and is experienced by 5.1% to 42.7% of the population worldwide, at least once during their lifetime. The symptoms often reduce the patient’s quality of life. The TrackYourTinnitus (TYT) mobile health (mHealth) crowdsensing platform was developed for two operating systems (OS)—Android and iOS—to help patients demystify the daily moment-to-moment variations of their tinnitus symptoms. In all platforms developed for more than one OS, it is important to investigate whether the crowdsensed data predicts the OS that was used in order to understand the degree to which the OS is a confounder that is necessary to consider. OBJECTIVE In this study, we explored whether the mobile OS—Android and iOS—used during user assessments can be predicted by the dynamic daily-life TYT data. METHODS TYT mainly applies the paradigms ecological momentary assessment (EMA) and mobile crowdsensing to collect dynamic EMA (EMA-D) daily-life data. The dynamic daily-life TYT data that were analyzed included eight questions as part of the EMA-D questionnaire. In this study, 518 TYT users were analyzed, who each completed at least 11 EMA-D questionnaires. Out of these, 221 were iOS users and 297 were Android users. The iOS users completed, in total, 14,708 EMA-D questionnaires; the number of EMA-D questionnaires completed by the Android users was randomly reduced to the same number to properly address the research question of the study. Machine learning methods—a feedforward neural network, a decision tree, a random forest classifier, and a support vector machine—were applied to address the research question. RESULTS Machine learning was able to predict the mobile OS used with an accuracy up to 78.94% based on the provided EMA-D questionnaires on the assessment level. In this context, the daily measurements regarding how users concentrate on the actual activity were particularly suitable for the prediction of the mobile OS used. CONCLUSIONS In the work at hand, two particular aspects have been revealed. First, machine learning can contribute to EMA-D data in the medical context. Second, based on the EMA-D data of TYT, we found that the accuracy in predicting the mobile OS used has several implications. Particularly, in clinical studies using mobile devices, the OS should be assessed as a covariate, as it might be a confounder.


Sign in / Sign up

Export Citation Format

Share Document