A Study on Acer Mono Sap Integration Management System Based on Energy Harvesting Electric Device and Sap Big Data Analysis Model

Se-Hoon Jung; Jun-Yeong Kim; Jun Park; Jun-Ho Huh; Chun-Bo Sim

doi:10.3390/electronics9111979

A Study on Acer Mono Sap Integration Management System Based on Energy Harvesting Electric Device and Sap Big Data Analysis Model

Electronics ◽

10.3390/electronics9111979 ◽

2020 ◽

Vol 9 (11) ◽

pp. 1979

Author(s):

Se-Hoon Jung ◽

Jun-Yeong Kim ◽

Jun Park ◽

Jun-Ho Huh ◽

Chun-Bo Sim

Keyword(s):

Random Forest ◽

Linear Regression ◽

Information And Communication Technologies ◽

Forest Products ◽

Environmental Information ◽

Mobile App ◽

Support Vector ◽

Electric Device ◽

Sap Exudation ◽

Acer Mono

This study set out to invent an Information and Communication Technologies (ICT)-based smart Acer mono sap collection electric device to make efficient use of the labor force by reducing inefficient activities of old manual work to record sap exudation and state information. Based on the assumption that environmental information would have close connections with Acer mono sap exudation to reinforce the competitive edge of production in forest products, the study analyzed correlations between Acer mono sap exudation and environmental information and predicted Acer mono exudation. A smart collection of electric devices would gather data about Acer mono sap exudation per hour on outdoor temperature, humidity, conductivity, and wind direction and velocity, and was installed in four areas in the Republic of Korea, including Sancheong, Gwangyang, Geoje, and Inje. Collected data were used to analyze correlations between environmental information and Acer mono sap exudation using four different algorithms, including linear regression, Support Vector Machine (SVM), Artificial Neural Network (ANN), and random forest, to predict Acer mono sap exudation. Remarkable outcomes were obtained across all the algorithms except for linear regression, demonstrating close connections between environmental information and Acer mono sap exudation. The random forest model, which showed the most outstanding performance, was used to make a mobile app capable of providing predicted Acer mono sap exudation and collected environmental information.

Performance of NO, NO<sub>2</sub> low cost sensors and three calibration approaches within a real world application

Atmospheric Measurement Techniques ◽

10.5194/amt-11-3717-2018 ◽

2018 ◽

Vol 11 (6) ◽

pp. 3717-3735 ◽

Cited By ~ 17

Author(s):

Alessandro Bigi ◽

Michael Mueller ◽

Stuart K. Grange ◽

Grazia Ghermandi ◽

Christoph Hueglin

Keyword(s):

Random Forest ◽

Linear Regression ◽

Support Vector Regression ◽

Low Cost ◽

Multivariate Linear Regression ◽

Pollution Level ◽

Support Vector ◽

Non Linear ◽

Linear Algorithms ◽

Urban Sites

Abstract. Low cost sensors for measuring atmospheric pollutants are experiencing an increase in popularity worldwide among practitioners, academia and environmental agencies, and a large amount of data by these devices are being delivered to the public. Notwithstanding their behaviour, performance and reliability are not yet fully investigated and understood. In the present study we investigate the medium term performance of a set of NO and NO2 electrochemical sensors in Switzerland using three different regression algorithms within a field calibration approach. In order to mimic a realistic application of these devices, the sensors were initially co-located at a rural regulatory monitoring site for a 4-month calibration period, and subsequently deployed for 4 months at two distant regulatory urban sites in traffic and urban background conditions, where the performance of the calibration algorithms was explored. The applied algorithms were Multivariate Linear Regression, Support Vector Regression and Random Forest; these were tested, along with the sensors, in terms of generalisability, selectivity, drift, uncertainty, bias, noise and suitability for spatial mapping intra-urban pollution gradients with hourly resolution. Results from the deployment at the urban sites show a better performance of the non-linear algorithms (Support Vector Regression and Random Forest) achieving RMSE <  5 ppb, R2 between 0.74 and 0.95 and MAE between 2 and 4 ppb. The combined use of both NO and NO2 sensor output in the estimate of each pollutant showed some contribution by NO sensor to NO2 estimate and vice-versa. All algorithms exhibited a drift ranging between 5 and 10 ppb for Random Forest and 15 ppb for Multivariate Linear Regression at the end of the deployment. The lowest concentration correctly estimated, with a 25 % relative expanded uncertainty, resulted in ca. 15–20 ppb and was provided by the non-linear algorithms. As an assessment for the suitability of the tested sensors for a targeted application, the probability of resolving hourly concentration difference in cities was investigated. It was found that NO concentration differences of 5–10 ppb (8–10 for NO2) can reliably be detected (90 % confidence), depending on the air pollution level. The findings of this study, although derived from a specific sensor type and sensor model, are based on a flexible methodology and have extensive potential for exploring the performance of other low cost sensors, that are different in their target pollutant and sensing technology.

Interpolation of Instantaneous Air Temperature Using Geographical and MODIS Derived Variables with Machine Learning Techniques

10.20944/preprints201906.0008.v1 ◽

2019 ◽

Author(s):

Marcos Ruiz-Álvarez ◽

Francisco Alonso-Sarría ◽

Francisco Gomariz-Castillo

Keyword(s):

Machine Learning ◽

Support Vector Machine ◽

Random Forest ◽

Linear Regression ◽

Air Temperature ◽

Satellite Data ◽

Multivariate Linear Regression ◽

Machine Learning Algorithms ◽

Machine Learning Techniques ◽

Support Vector

Several methods have been tried to estimate air temperature using satellite imagery. In this paper, the results of two machine learning algorithms, Support Vector Machine and Random Forest, are compared with Multivariate Linear Regression, TVX and Ordinary kriging. Several geographic, remote sensing and time variables are used as predictors. The validation is carried out using four different statistics on a daily basis allowing the use of ANOVA to compare the results. The main conclusion is that Random Forest with residual kriging produces the best results (R$^2$=0.612 $\pm$ 0.019, NSE=0.578 $\pm$ 0.025, RMSE=1.068 $\pm$ 0.027, PBIAS=-0.172 $\pm$ 0.046), whereas TVX produces the least accurate results. The environmental conditions in the study area are not really suited to TVX, moreover this method only takes into account satellite data. On the other hand, regression methods (Support Vector Machine, Random Forest and Multivariate Linear Regression) use several parameters that are easily calculated from a Digital Elevation Model, adding very little difficulty to the use of satellite data alone. The most important variables in the Random Forest Model were satellite temperature, potential irradiation and cdayt, a cosine transformation of the julian day.

Forecasting primary delay recovery of high-speed railway using multiple linear regression, supporting vector machine, artificial neural network, and random forest regression

Canadian Journal of Civil Engineering ◽

10.1139/cjce-2017-0642 ◽

2019 ◽

Vol 46 (5) ◽

pp. 353-363 ◽

Cited By ~ 6

Author(s):

Chaozhe Jiang ◽

Ping Huang ◽

Javad Lessan ◽

Liping Fu ◽

Chao Wen

Keyword(s):

Random Forest ◽

Linear Regression ◽

Multiple Linear Regression ◽

Prediction Accuracy ◽

High Speed ◽

Support Vector ◽

Random Forest Regression ◽

High Speed Railway ◽

Buffer Time ◽

Artificial Neural

Accurate prediction of recoverable train delay can support the train dispatchers’ decision-making with timetable rescheduling and improving service reliability. In this paper, we present the results of an effort aimed to develop primary delay recovery (PDR) predictor model using train operation records from Wuhan-Guangzhou (W-G) high-speed railway. To this end, we first identified the main variables that contribute to delay, including dwell buffer time, running buffer time, magnitude of primary delay time, and individual sections’ influence. Different models are applied and calibrated to predict the PDR. The validation results on test datasets indicate that the random forest regression (RFR) model outperforms the other three alternative models, namely, multiple linear regression (MLR), support vector machine (SVM), and artificial neural networks (ANN) regarding prediction accuracy measure. Specifically, the evaluation results show that when the prediction tolerance is less than 1 min, the RFR model can achieve up to 80.4% of prediction accuracy, while the accuracy level is 44.4%, 78.5%, and 78.5% for MLR, SVM, and ANN models, respectively.

Interpolation of Instantaneous Air Temperature Using Geographical and MODIS Derived Variables with Machine Learning Techniques

ISPRS International Journal of Geo-Information ◽

10.3390/ijgi8090382 ◽

2019 ◽

Vol 8 (9) ◽

pp. 382 ◽

Cited By ~ 2

Author(s):

Marcos Ruiz-Álvarez ◽

Francisco Alonso-Sarria ◽

Francisco Gomariz-Castillo

Keyword(s):

Machine Learning ◽

Random Forest ◽

Linear Regression ◽

Multiple Linear Regression ◽

Air Temperature ◽

Cross Validation ◽

Daily Basis ◽

Machine Learning Algorithms ◽

Machine Learning Techniques ◽

Support Vector

Several methods have been tried to estimate air temperature using satellite imagery. In this paper, the results of two machine learning algorithms, Support Vector Machines and Random Forest, are compared with Multiple Linear Regression and Ordinary kriging. Several geographic, remote sensing and time variables are used as predictors. The validation is carried out using two different approaches, a leave-one-out cross validation in the spatial domain and a spatio-temporal k-block cross-validation, and four different statistics on a daily basis, allowing the use of ANOVA to compare the results. The main conclusion is that Random Forest produces the best results (R 2 = 0.888 ± 0.026, Root mean square error = 3.01 ± 0.325 using k-block cross-validation). Regression methods (Support Vector Machine, Random Forest and Multiple Linear Regression) are calibrated with MODIS data and several predictors easily calculated from a Digital Elevation Model. The most important variables in the Random Forest model were satellite temperature, potential irradiation and cdayt, a cosine transformation of the julian day.

Predicting Future Products Rate using Machine Learning Algorithms

International Journal of Intelligent Systems and Applications ◽

10.5815/ijisa.2020.05.04 ◽

2020 ◽

Vol 12 (5) ◽

pp. 41-51

Author(s):

Shaimaa Mahmoud ◽

◽

Mahmoud Hussein ◽

Arabi Keshk

Keyword(s):

Machine Learning ◽

Random Forest ◽

Linear Regression ◽

Mean Squared Error ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Support Vector ◽

Random Forest Regression ◽

Data Set ◽

Squared Error

Opinion mining in social networks data is considered as one of most important research areas because a large number of users interact with different topics on it. This paper discusses the problem of predicting future products rate according to users’ comments. Researchers interacted with this problem by using machine learning algorithms (e.g. Logistic Regression, Random Forest Regression, Support Vector Regression, Simple Linear Regression, Multiple Linear Regression, Polynomial Regression and Decision Tree). However, the accuracy of these techniques still needs to be improved. In this study, we introduce an approach for predicting future products rate using LR, RFR, and SVR. Our data set consists of tweets and its rate from 1:5. The main goal of our approach is improving the prediction accuracy about existing techniques. SVR can predict future product rate with a Mean Squared Error (MSE) of 0.4122, Linear Regression model predict with a Mean Squared Error of 0.4986 and Random Forest Regression can predict with a Mean Squared Error of 0.4770. This is better than the existing approaches accuracy.

STUDY OF THE INFLUENCE OF WOOD PROPERTIES ON THE CHARCOAL PRODUCTION: APPLYING THE RANDOM FOREST ALGORITHM

Revista Árvore ◽

10.1590/1806-908820210000002 ◽

2021 ◽

Vol 45 ◽

Author(s):

Kaléo Dias Pereira ◽

Antônio Policarpo Souza Carneiro ◽

Gerson Rodrigues dos Santos ◽

Angélica de Cassia Oliveira Carneiro ◽

Hélio Garcia Leite ◽

...

Keyword(s):

Random Forest ◽

Linear Regression ◽

Multiple Linear Regression ◽

Support Vector Regression ◽

Wood Properties ◽

Coefficient Of Determination ◽

Support Vector ◽

Random Forest Algorithm ◽

Predicted Values

ABSTRACT The understanding of the relationship between the properties of wood and charcoal makes it possible to improve the production of charcoal. Therefore, the random forest algorithm was used in this study to analyze the influence of eucalyptus wood properties on the quality of charcoal as well as the accuracy of the predicted values concerning the results estimated by support vector regression and multiple linear regression. Six properties of wood and six properties of charcoal obtained from the hybrid Eucalyptus grandis x Eucalyptus urophylla and from twelve clones of Corymbia torelliana x Corymbia critriodora at the age of seven were measured. In the analysis, the measure of mean decrease in node impurity (residual sum of squares) calculated with the random forest and the copula correlation was used to evaluate the relationship between properties of wood and charcoal. The random forest was compared to the support vector regression and multiple linear regression through the coefficient of determination, linear correlation between observed and predicted values, mean absolute error and root mean squared error. The accuracy of the random forest was greater than that obtained with the support vector regression and multiple linear regression, mainly in terms of the coefficient of determination and the linear correlation between observed and predicted values. The yield and quality of the charcoal produced from clones were mainly influenced by the holocellulose content, heartwood/sapwood ratio, and basic wood density. The apparent relative density of charcoal was the variable in which the random forest algorithm reached the best level of explanation of the variability as a function of the properties of wood, while the minor error was observed for the fixed carbon content.

Influenza Prediction: Analyzing Machine Learning Algorithms

Asian Journal of Computer Science and Technology ◽

10.51983/ajcst-2020.9.1.2155 ◽

2020 ◽

Vol 9 (1) ◽

pp. 14-18

Author(s):

Sapna Yadav ◽

Pankaj Agarwal

Keyword(s):

Machine Learning ◽

Random Forest ◽

Linear Regression ◽

Random Forests ◽

Ridge Regression ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Digital Data ◽

Support Vector ◽

Lasso Regression

Analyzing online or digital data for detecting epidemics is one of the hot areas of research and now becomes more relevant during the present outbreak of Covid-19. There are several different types of the influenza virus and moreover they keep evolving constantly in the same manner the COVID-19 virus has done. As a result, they pose a greater challenge when it comes to analyzing them, predicting when, where and at what degree of severity it will outbreak during the flu season across the world. There is need for greater surveillance to both seasonal and pandemic influenza to ensure the health and safety of the mankind. The objective of work is to apply machine learning algorithms for building predictive models that can predict where the occurrence, peak and severity of influenza in each season. For this work we have considered a freely available dataset of Ireland which is recorded for the duration of 2005 to 2016. Specifically, we have tested three ML Algorithms namely Linear Regression, Support Vector Regression and Random Forests. We found Random Forests is giving better predictive results. We also conducted experiment through weka tool and tested Zero R, Linear Regression, Lazy Kstar, Random Forest, REP Tree, Multilayer Perceptron models. We again found the Random Forest is performing better in comparison to all other models. We also evaluated other regression models including Ridge Regression, modified Ridge regression, Lasso Regression, K Neighbor Regression and evaluated the mean absolute errors. We found that modified Ridge regression is producing minimum error. The proposed work is inclined towards finding the suitability & appropriate ML algorithm for solving this problem on Flu.

Investigating the use of random forest, gradient boosting machine, support vector machine and their ensemble applied to fault detection

10.26678/abcm.cobem2017.cob17-1600 ◽

2017 ◽

Author(s):

Luis Felipe Nogoseke ◽

Gabriel Herman Bernardim Andrade ◽

Marco Boaretto ◽

Leandro Coelho

Keyword(s):

Support Vector Machine ◽

Random Forest ◽

Fault Detection ◽

Gradient Boosting ◽

Support Vector ◽

Gradient Boosting Machine

Extraction of Arecanut Planting Distribution Based on the Feature Space Optimization of PlanetScope Imagery

Agriculture ◽

10.3390/agriculture11040371 ◽

2021 ◽

Vol 11 (4) ◽

pp. 371

Author(s):

Yu Jin ◽

Jiawei Guo ◽

Huichun Ye ◽

Jinling Zhao ◽

Wenjiang Huang ◽

...

Keyword(s):

Random Forest ◽

Satellite Imagery ◽

Feature Space ◽

Kappa Coefficient ◽

Classification Model ◽

Support Vector ◽

Textural Feature ◽

Monitoring Accuracy ◽

Areca Catechu ◽

High Level

The remote sensing extraction of large areas of arecanut (Areca catechu L.) planting plays an important role in investigating the distribution of arecanut planting area and the subsequent adjustment and optimization of regional planting structures. Satellite imagery has previously been used to investigate and monitor the agricultural and forestry vegetation in Hainan. However, the monitoring accuracy is affected by the cloudy and rainy climate of this region, as well as the high level of land fragmentation. In this paper, we used PlanetScope imagery at a 3 m spatial resolution over the Hainan arecanut planting area to investigate the high-precision extraction of the arecanut planting distribution based on feature space optimization. First, spectral and textural feature variables were selected to form the initial feature space, followed by the implementation of the random forest algorithm to optimize the feature space. Arecanut planting area extraction models based on the support vector machine (SVM), BP neural network (BPNN), and random forest (RF) classification algorithms were then constructed. The overall classification accuracies of the SVM, BPNN, and RF models optimized by the RF features were determined as 74.82%, 83.67%, and 88.30%, with Kappa coefficients of 0.680, 0.795, and 0.853, respectively. The RF model with optimized features exhibited the highest overall classification accuracy and kappa coefficient. The overall accuracy of the SVM, BPNN, and RF models following feature optimization was improved by 3.90%, 7.77%, and 7.45%, respectively, compared with the corresponding unoptimized classification model. The kappa coefficient also improved. The results demonstrate the ability of PlanetScope satellite imagery to extract the planting distribution of arecanut. Furthermore, the RF is proven to effectively optimize the initial feature space, composed of spectral and textural feature variables, further improving the extraction accuracy of the arecanut planting distribution. This work can act as a theoretical and technical reference for the agricultural and forestry industries.

A Mobile App to Optimize Social Participation for Individuals with Physical Disabilities: Content Validation and Usability Testing

International Journal of Environmental Research and Public Health ◽

10.3390/ijerph18041753 ◽

2021 ◽

Vol 18 (4) ◽

pp. 1753

Author(s):

Dahlia Kairy ◽

Mir Abolfazl Mostafavi ◽

Catherine Blanchette-Dallaire ◽

Eva Belanger ◽

Andrea Corbeil ◽

...

Keyword(s):

Information And Communication Technologies ◽

Social Participation ◽

Physical Disabilities ◽

Mobile App ◽

User Centered Design ◽

Mobility Limitations ◽

User Friendliness ◽

Cross Sectional ◽

Information And Communication ◽

Information Optimization

Background: Social participation is beneficial for individuals’ health. However, people with disabilities that may lead to mobility limitations tend to experience lower levels of social participation. Information and communication technologies such as the OnRoule mobile application (app) can help promote social participation. Objectives: To obtain potential users’ perceptions on the usability and content of the OnRoule app for providing information on accessibility, as well as its potential to optimize social participation. Materials and Methods: Cross-sectional user-centered design study. Individuals with physical disabilities (n = 18) were recruited through community organizations and interviewed using a semi-structured guide. Interviews were recorded, transcribed, and analyzed using thematic analysis. Results: Three main themes were identified: (1) “user-friendliness”; (2) “balance between the amount and relevance of information”; and (3) “potential use of the app”. Discussion and Conclusion: Findings from this study indicated that the app was easy to use, had pertinent information, and enabled a positive experience of finding information. However, several areas of improvement were identified, such as the clarity of specific elements, organization and amount of information, optimization of features, and inclusiveness. Apps such as OnRoule could optimize social participation by facilitating the process of finding resources in the community and building a sense of connectedness between users.