Fault detection for air conditioning system using machine learning

Noor Asyikin Sulaiman; Md Pauzi Abdullah; Hayati Abdullah; Muhammad Noorazlan Shah Zainudin; Azdiana Md Yusop

doi:10.11591/ijai.v9.i1.pp109-116

Fault detection for air conditioning system using machine learning

IAES International Journal of Artificial Intelligence (IJ-AI) ◽

10.11591/ijai.v9.i1.pp109-116 ◽

2020 ◽

Vol 9 (1) ◽

pp. 109

Author(s):

Noor Asyikin Sulaiman ◽

Md Pauzi Abdullah ◽

Hayati Abdullah ◽

Muhammad Noorazlan Shah Zainudin ◽

Azdiana Md Yusop

Keyword(s):

Machine Learning ◽

Supervised Learning ◽

Air Conditioning ◽

Machine Learning Algorithms ◽

Coefficient Of Performance ◽

Support Vector ◽

Air Conditioning System ◽

Learning Classifier ◽

Negative Impacts ◽

The Impact

Air conditioning system is a complex system and consumes the most energy in a building. Any fault in the system operation such as cooling tower fan faulty, compressor failure, damper stuck, etc. could lead to energy wastage and reduction in the system’s coefficient of performance (COP). Due to the complexity of the air conditioning system, detecting those faults is hard as it requires exhaustive inspections. This paper consists of two parts; i) to investigate the impact of different faults related to the air conditioning system on COP and ii) to analyse the performances of machine learning algorithms to classify those faults. Three supervised learning classifier models were developed, which were deep learning, support vector machine (SVM) and multi-layer perceptron (MLP). The performances of each classifier were investigated in terms of six different classes of faults. Results showed that different faults give different negative impacts on the COP. Also, the three supervised learning classifier models able to classify all faults for more than 94%, and MLP produced the highest accuracy and precision among all.

Get full-text (via PubEx)

Sentiment Analysis of Impact of Technology on Employment from Text on Twitter

International Journal of Interactive Mobile Technologies (iJIM) ◽

10.3991/ijim.v14i07.10600 ◽

2020 ◽

Vol 14 (07) ◽

pp. 88

Author(s):

Shahzad Qaiser ◽

Nooraini Yusoff ◽

Farzana Kabir Ahmad ◽

Ramsha Ali

Keyword(s):

Machine Learning ◽

Social Media ◽

Social Issues ◽

Support Vector ◽

Ripple Effect ◽

Learning Classifier ◽

The People ◽

Impact Of Technology ◽

Negative Sentiment ◽

The Impact

Many different studies are in progress to analyze the content created by the users on social media due to its influence and social ripple effect. Various content created on social media has pieces of information and user’s sentiments about social issues. This study aims to analyze people’s sentiments about the impact of technology on employment and advancements in technologies and build a machine learning classifier to classify the sentiments. People are getting nervous, depressed and even doing suicides due to unemployment; hence, it is essential to explore this relatively new area of research. The study has two main objectives 1) to preprocess text collected from Twitter concerning the impact of technology on employment and analyze its sentiment, 2) to evaluate the performance of machine learning Naïve Bayes (NB) classifier on the text. To achieve this, a methodology is proposed that includes 1) data collection and preprocessing 2) analyze sentiment, 3) building machine learning classifier and 4) compare the performance of NB and support vector machine (SVM). NB and SVM achieved 87.18% and 82.05% accuracy respectively. The study found that 65% of the people hold negative sentiment regarding the impact of technology on employment and technological advancements; hence people must acquire new skills to minimize the effect of structural unemployment.

Get full-text (via PubEx)

Performance of three machine learning algorithms for predicting soil organic carbon in German agricultural soil

10.5194/soil-2021-107 ◽

2021 ◽

Author(s):

Ali Sakhaee ◽

Anika Gebauer ◽

Mareike Ließ ◽

Axel Don

Keyword(s):

Machine Learning ◽

Organic Carbon ◽

Soil Organic Carbon ◽

Agricultural Soil ◽

Learning Algorithms ◽

Model Performance ◽

Machine Learning Algorithms ◽

Support Vector ◽

Organic Soils ◽

The Impact

Abstract. Soil organic carbon (SOC), as the largest terrestrial carbon pool, has the potential to influence climate change and mitigation, and consequently SOC monitoring is important in the frameworks of different international treaties. There is therefore a need for high resolution SOC maps. Machine learning (ML) offers new opportunities to do this due to its capability for data mining of large datasets. The aim of this study, therefore, was to test three commonly used algorithms in digital soil mapping – random forest (RF), boosted regression trees (BRT) and support vector machine for regression (SVR) – on the first German Agricultural Soil Inventory to model agricultural topsoil SOC content. Nested cross-validation was implemented for model evaluation and parameter tuning. Moreover, grid search and differential evolution algorithm were applied to ensure that each algorithm was tuned and optimised suitably. The SOC content of the German Agricultural Soil Inventory was highly variable, ranging from 4 g kg−1 to 480 g kg−1. However, only 4 % of all soils contained more than 87 g kg−1 SOC and were considered organic or degraded organic soils. The results show that SVR provided the best performance with RMSE of 32 g kg−1 when the algorithms were trained on the full dataset. However, the average RMSE of all algorithms decreased by 34 % when mineral and organic soils were modeled separately, with the best result from SVR with RMSE of 21 g kg−1. Model performance is often limited by the size and quality of the available soil dataset for calibration and validation. Therefore, the impact of enlarging the training data was tested by including 1223 data points from the European Land Use/Land Cover Area Frame Survey for agricultural sites in Germany. The model performance was enhanced for maximum 1 % for mineral soils and 2 % for organic soils. Despite the capability of machine learning algorithms in general, and particularly SVR, in modelling SOC on a national scale, the study showed that the most important to improve the model performance was separate modelling of mineral and organic soils.

Get full-text (via PubEx)

A Machine Learning Approach to Predict the Performance of Refrigerator and Air Conditioning Using Gaussian Process Regression and Support Vector Methods

Recent Patents on Mechanical Engineering ◽

10.2174/2212797614666210125143810 ◽

2021 ◽

Vol 14 ◽

Author(s):

Harinarayan Sharma ◽

Sonam Kumari ◽

Aniket K. Dutt ◽

Pawan Kumar ◽

Mamookho E. Makhatha

Keyword(s):

Machine Learning ◽

Air Conditioning ◽

Gaussian Process Regression ◽

Coefficient Of Performance ◽

Experimental Result ◽

Support Vector ◽

Nano Particle ◽

Learning Models ◽

Machine Learning Model ◽

Machine Learning Models

Aim: Develop machine learning models for the performance of refrigerator and airconditioning system. Background: The Coefficient Of Performance (COP) of Refrigerator and Air-Conditioning (RAC) is a complex function of evaporative temperature and concentration of nano-particle in lubricants. In recent years, researchers focus on experimental study for improvement of COP. Further, few researchers applied simulation techniques such as fuzzy system, Artificial Neural Network (ANN), simulated annealing, etc. to the Vapour Compression Refrigeration (VCR) cycle. There is a scarcity of modeling research work for the performance of RAC system. Objective: The study aims to develop the machine learning predictive models for the performance of refrigerator and air-conditioning system using experimental data. Methods: The experiment was performed on VCR system to determine COP. Three different concentration of lubricants (added 0.5, 1.0 and 1.5g nano-TiO2 particle on 1 liter of Polyolester (POE) oil) were used. The experimentally calculated COP was used to train and test the machine learning models. Gaussian Process Regression (GPR) and Support Vector Regression (SVR) methods were applied to develop the models. Results: The experimental result reveals that the COP increases with increasing the concentration (of nano particles) at a given temperature. The addition of 0.5 and 1.0g TiO2 in the POE oil shows better rate of increment in the COP in comparison to addition of 1.5g TiO2 in the POE oil. Machine learning models using GPR and SVR with RBF kernel function is the most appropriate machine learning model for the nonlinear relationship between the output parameter (COP) and the input parameter (evaporative temperature and concentration of TiO2). Conclusion: The present study was conducted to investigate the machine learning approaches for performance of RAC system using experimental data sets. The experimental result shows that R134a and TiO2-POE nanolubricant work efficiently and the coefficient of performance of VCR system increases with concentration of nano-particle. The developed model performance is compared using coefficient of correlation and RSME values. After comparison, it is concluded that RBF based GPR model is the best fit machine learning model to predict the COP in the context of any other model for this data set.

Get full-text (via PubEx)

Using Machine Learning to Predict Mortality for COVID-19 Patients on Day 0 in the ICU

Frontiers in Digital Health ◽

10.3389/fdgth.2021.681608 ◽

2022 ◽

Vol 3 ◽

Author(s):

Elham Jamshidi ◽

Amirhossein Asgary ◽

Nader Tavakoli ◽

Alireza Zali ◽

Soroush Setareh ◽

...

Keyword(s):

Machine Learning ◽

Treatment Strategies ◽

Machine Learning Algorithms ◽

Gradient Boosting ◽

Support Vector ◽

Early Prediction ◽

Distribution Width ◽

Predictive Values ◽

Prediction Of Mortality ◽

The Impact

Rationale: Given the expanding number of COVID-19 cases and the potential for new waves of infection, there is an urgent need for early prediction of the severity of the disease in intensive care unit (ICU) patients to optimize treatment strategies.Objectives: Early prediction of mortality using machine learning based on typical laboratory results and clinical data registered on the day of ICU admission.Methods: We retrospectively studied 797 patients diagnosed with COVID-19 in Iran and the United Kingdom (U.K.). To find parameters with the highest predictive values, Kolmogorov-Smirnov and Pearson chi-squared tests were used. Several machine learning algorithms, including Random Forest (RF), logistic regression, gradient boosting classifier, support vector machine classifier, and artificial neural network algorithms were utilized to build classification models. The impact of each marker on the RF model predictions was studied by implementing the local interpretable model-agnostic explanation technique (LIME-SP).Results: Among 66 documented parameters, 15 factors with the highest predictive values were identified as follows: gender, age, blood urea nitrogen (BUN), creatinine, international normalized ratio (INR), albumin, mean corpuscular volume (MCV), white blood cell count, segmented neutrophil count, lymphocyte count, red cell distribution width (RDW), and mean cell hemoglobin (MCH) along with a history of neurological, cardiovascular, and respiratory disorders. Our RF model can predict patient outcomes with a sensitivity of 70% and a specificity of 75%. The performance of the models was confirmed by blindly testing the models in an external dataset.Conclusions: Using two independent patient datasets, we designed a machine-learning-based model that could predict the risk of mortality from severe COVID-19 with high accuracy. The most decisive variables in our model were increased levels of BUN, lowered albumin levels, increased creatinine, INR, and RDW, along with gender and age. Considering the importance of early triage decisions, this model can be a useful tool in COVID-19 ICU decision-making.

Get full-text (via PubEx)

A machine learning-based study of the impact of traffic flow changes on air pollution combined with meteorological conditions: the case of Oslo

10.31235/osf.io/yp4gn ◽

2021 ◽

Author(s):

Cong Cao

Keyword(s):

Machine Learning ◽

Air Pollution ◽

Support Vector Machine ◽

Traffic Flow ◽

Arrival Time ◽

Meteorological Conditions ◽

Machine Learning Algorithms ◽

Support Vector ◽

Traffic Volumes ◽

The Impact

In this paper, we explore the impact of changes in traffic flow on local air pollution under specific meteorological conditions by integrating hourly traffic flow data, air pollution data and meteorological data, using generalized linear regression models and advanced machine learning algorithms: support vector machines and decision trees. The geographical location is Oslo, the capital of Norway, and the time we selected is from February 2020 to September 2020; We also selected 24-hour data for May 11 and 16 of the same year, representing weekday and holiday traffic flow, respectively, as a subset to further explore. Finally, we selected data from July 2020 for robustness testing, and algorithm performance verification.We found that: the maximum traffic flow on holidays is significantly higher than that on weekdays, but the holidays produce less concentration of {NO}_x throughout the month; the peak arrival time of {NO}_x,\ {NO}_2and NO concentrations is later than the peak arrival time of traffic flow. Among them, {NO}_x has a very significant variation, so we choose {NO}_x concentration as an air pollution indicator to measure the effect of traffic flow variation on air pollution; we also find that {NO}_xconcentration is negatively correlated with hourly precipitation, and the variation trend is like that of minimum air temperature. We used multiple imputation methods to interpolate the missing values. The decision tree results yield that when traffic volumes are high (>81%), low temperatures generate more concentrations of {NO}_x than high temperatures (an increase of 3.1%). Higher concentrations of {NO}_x (2.4%) are also generated when traffic volumes are low (no less than 22%) but there is some precipitation ≥ 0.27%.In the evaluation of the prediction accuracy of the machine learning algorithms, the support vector machine has the best prediction performance with high R-squared and small MAE, MSE and RMSE, indicating that the support vector machine has a better explanation for air pollution caused by traffic flow, while the decision tree is the second best, and the generalized linear regression model is the worst.The selected data for July 2020 obtained results consistent with the overall dataset.

Get full-text (via PubEx)

Data based predictive models for odor perception

Scientific Reports ◽

10.1038/s41598-020-73978-1 ◽

2020 ◽

Vol 10 (1) ◽

Author(s):

Rinu Chacko ◽

Deepak Jain ◽

Manasi Patwardhan ◽

Abhishek Puri ◽

Shirish Karande ◽

...

Keyword(s):

Machine Learning ◽

Data Analytics ◽

Model Performance ◽

Structural Features ◽

Machine Learning Algorithms ◽

Gradient Boosting ◽

Support Vector ◽

Structure Property ◽

Odor Perception ◽

The Impact

Abstract Machine learning and data analytics are being increasingly used for quantitative structure property relation (QSPR) applications in the chemical domain where the traditional Edisonian approach towards knowledge-discovery have not been fruitful. The perception of odorant stimuli is one such application as olfaction is the least understood among all the other senses. In this study, we employ machine learning based algorithms and data analytics to address the efficacy of using a data-driven approach to predict the perceptual attributes of an odorant namely the odorant characters (OC) of “sweet” and “musky”. We first analyze a psychophysical dataset containing perceptual ratings of 55 subjects to reveal patterns in the ratings given by subjects. We then use the data to train several machine learning algorithms such as random forest, gradient boosting and support vector machine for prediction of the odor characters and report the structural features correlating well with the odor characters based on the optimal model. Furthermore, we analyze the impact of the data quality on the performance of the models by comparing the semantic descriptors generally associated with a given odorant to its perception by majority of the subjects. The study presents a methodology for developing models for odor perception and provides insights on the perception of odorants by untrained human subjects and the effect of the inherent bias in the perception data on the model performance. The models and methodology developed here could be used for predicting odor characters of new odorants.

Get full-text (via PubEx)

Hoax News Classification using Machine Learning Algorithms

International Journal of Engineering and Advanced Technology - Regular Issue ◽

10.35940/ijeat.b3753.129219 ◽

2019 ◽

Vol 9 (2) ◽

pp. 3938-3944

Keyword(s):

Machine Learning ◽

Social Media ◽

Learning Algorithm ◽

Detection System ◽

Machine Learning Algorithms ◽

Training Data ◽

Stochastic Gradient Descent ◽

Support Vector ◽

The Impact ◽

F Measure

Hoax news on social media has had a dramatic effect on our society in recent years. The impact of hoax news felt by many people, anxiety, financial loss, and loss of the right name. Therefore we need a detection system that can help reduce hoax news on social media. Hoax news classification is one of the stages in the construction of a hoax news detection system, and this unsupervised learning algorithm becomes a method for creating hoax news datasets, machine learning tools for data processing, and text processing for detecting data. The next will produce a classification of a hoax or not a Hoax based on the text inputted. Hoax news classification in this study uses five algorithms, namely Support Vector Machine, Naïve Bayes, Decision Tree, Logistic Regression, Stochastic Gradient Descent, and Neural Network (MLP). These five algorithms to produce the best algorithm that can use to detect hoax news, with the highest parameters, accuracy, F-measure, Precision, and recall. From the results of testing conducted on five classification algorithms produced shows that the NN-MPL algorithm has an average of 93% for the value of accuracy, F-Measure, and Precision, the highest compared to five other algorithms, but for the highest Recall value generated from the algorithm SVM which is 94%. the results of this experiment show that different effects for different classifiers, and that means that the more hoax data used as training data, the more accurate the system calculates accuracy in more detail.

Get full-text (via PubEx)

A Classifier to Detect Informational vs. Non-Informational Heart Attack Tweets

Future Internet ◽

10.3390/fi13010019 ◽

2021 ◽

Vol 13 (1) ◽

pp. 19

Author(s):

Ola Karajeh ◽

Dirar Darweesh ◽

Omar Darwish ◽

Noor Abu-El-Rub ◽

Belal Alsinglawi ◽

...

Keyword(s):

Machine Learning ◽

Social Media ◽

Heart Attack ◽

Machine Learning Algorithms ◽

Support Vector ◽

Heart Attacks ◽

Learning Classifier ◽

Important Concern ◽

Starting Point ◽

First Time

Social media sites are considered one of the most important sources of data in many fields, such as health, education, and politics. While surveys provide explicit answers to specific questions, posts in social media have the same answers implicitly occurring in the text. This research aims to develop a method for extracting implicit answers from large tweet collections, and to demonstrate this method for an important concern: the problem of heart attacks. The approach is to collect tweets containing “heart attack” and then select from those the ones with useful information. Informational tweets are those which express real heart attack issues, e.g., “Yesterday morning, my grandfather had a heart attack while he was walking around the garden.” On the other hand, there are non-informational tweets such as “Dropped my iPhone for the first time and almost had a heart attack.” The starting point was to manually classify around 7000 tweets as either informational (11%) or non-informational (89%), thus yielding a labeled dataset to use in devising a machine learning classifier that can be applied to our large collection of over 20 million tweets. Tweets were cleaned and converted to a vector representation, suitable to be fed into different machine-learning algorithms: Deep neural networks, support vector machine (SVM), J48 decision tree and naïve Bayes. Our experimentation aimed to find the best algorithm to use to build a high-quality classifier. This involved splitting the labeled dataset, with 2/3 used to train the classifier and 1/3 used for evaluation besides cross-validation methods. The deep neural network (DNN) classifier obtained the highest accuracy (95.2%). In addition, it obtained the highest F1-scores with (73.6%) and (97.4%) for informational and non-informational classes, respectively.

Get full-text (via PubEx)

Solving the Problem of Class Imbalance in the Prediction of Hotel Cancelations: A Hybridized Machine Learning Approach

Processes ◽

10.3390/pr9101713 ◽

2021 ◽

Vol 9 (10) ◽

pp. 1713

Author(s):

Mohd Adil ◽

Mohd Faizan Ansari ◽

Ahmad Alahmadi ◽

Jei-Zheng Wu ◽

Ripon K. Chakrabortty

Keyword(s):

Machine Learning ◽

Class Imbalance ◽

Machine Learning Algorithms ◽

Support Vector ◽

Class Imbalance Problem ◽

Computing Power ◽

Imbalance Problem ◽

Learning Classifier ◽

Machine Learning Approach ◽

Hotel Booking

The cancelation of bookings puts a considerable strain on management decisions in the case of the hospitability industry. Booking cancelations restrict precise predictions and are thus a critical tool for revenue management performance. However, in recent times, thanks to the availability of considerable computing power through machine learning (ML) approaches, it has become possible to create more accurate models to predict the cancelation of bookings compared to more traditional methods. Previous studies have used several ML approaches, such as support vector machine (SVM), neural network (NN), and decision tree (DT) models for predicting hotel cancelations. However, they are yet to address the class imbalance problem that exists in the prediction of hotel cancelations. In this study, we have shortened this gap by introducing an oversampling technique to address class imbalance problems, in conjunction with machine learning algorithms to better predict hotel booking cancelations. A combination of the synthetic minority oversampling technique and the edited nearest neighbors (SMOTE-ENN) algorithm is proposed to address the problem of class imbalance. Class imbalance is a general problem that occurs when classifying which class has more examples compared to others. Our research has shown that, after addressing the class imbalance problem, the performance of a machine learning classifier improves significantly.

Get full-text (via PubEx)

Touch-Based Active Cloud Authentication Using Traditional Machine Learning and LSTM on a Distributed Tensorflow Framework

International Journal of Computational Intelligence and Applications ◽

10.1142/s1469026819500226 ◽

2019 ◽

Vol 18 (04) ◽

pp. 1950022

Author(s):

Dylan J. Gunn ◽

Zhipeng Liu ◽

Rushit Dave ◽

Xiaohong Yuan ◽

Kaushik Roy

Keyword(s):

Machine Learning ◽

Deep Learning ◽

Mobile Device ◽

Short Term Memory ◽

Machine Learning Algorithms ◽

Modern World ◽

Support Vector ◽

Learning Classifier ◽

Cloud Framework ◽

Computationally Intensive

In this modern world, mobile devices have been paired with the cloud environment to scale the voluminous amount of generated data. The implementation comes at the cost of privacy as proprietary data can be stolen in transit to the cloud, or victims’ phones can be seized along with synced data from cloud. The attacker can gain access to the phone through shoulder surfing, or even spoofing attacks. Our approach is to mitigate this issue by proposing an active cloud authentication framework using touch biometric pattern. To the best of our knowledge, active cloud authentication using touch dynamics for mobile cloud computing has not been explored in the literature. This research creates a proof of concept that will lead into a simulated cloud framework for active authentication. Given the amount of data captured by the mobile device from user activity, it can be a computationally intensive process for the mobile device to handle with such limited resources. To solve this, we simulated a post-transmission process of data to the cloud so that we could implement the authentication process within the cloud. We evaluated the touch data using traditional machine learning algorithms, such as Random Forest (RF), Support Vector Machine (SVM), and also using a deep learning classifier, the Long Short-Term Memory Recurrent Neural Network (LSTM-RNN) algorithms. The novelty of this work is two-fold. First, we develop a distributed tensorflow framework for cloud authentication using touch biometric pattern. This framework helps alleviate the drawback of the computationally intensive recognition of the substantial amount of raw data from the user. Second, we apply the RF, SVM, and a deep learning classifier, the LSTM-RNN, on the touch data to evaluate the performance of the proposed authentication scheme. The proposed approach shows a promising performance with an accuracy of 99.0361% using RF on the distributed tensorflow framework.

Get full-text (via PubEx)