Application of Two Unsupervised Learning Techniques to Questionable Claims: PRIDIT and Random Forest

Microgrids constitute complex systems that integrate distributed generation (DG) and feature different operational modes. The optimal coordination of directional over-current relays (DOCRs) in microgrids is a challenging task, especially if topology changes are taken into account. This paper proposes an adaptive protection approach that takes advantage of multiple setting groups that are available in commercial DOCRs to account for network topology changes in microgrids. Because the number of possible topologies is greater than the available setting groups, unsupervised learning techniques are explored to classify network topologies into a number of clusters that is equal to the number of setting groups. Subsequently, optimal settings are calculated for every topology cluster. Every setting is saved in the DOCRs as a different setting group that would be activated when a corresponding topology takes place. Several tests are performed on a benchmark IEC (International Electrotechnical Commission) microgrid, evidencing the applicability of the proposed approach.

Download Full-text

Advances in Unsupervised Learning Techniques Applied to Biosciences and Medicine

Advances in Artificial Neural Systems ◽

10.1155/2012/219860 ◽

2012 ◽

Vol 2012 ◽

pp. 1-2

Author(s):

Anke Meyer-Baese ◽

Sylvain Lespinats ◽

Juan Manuel Gorriz Saez ◽

Olivier Bastien

Keyword(s):

Unsupervised Learning ◽

Learning Techniques

Download Full-text

A Survey on Supervised and Unsupervised Learning Techniques

Proceedings of International Conference on Artificial Intelligence, Smart Grid and Smart City Applications ◽

10.1007/978-3-030-24051-6_58 ◽

2020 ◽

pp. 627-644

Author(s):

K. Sindhu Meena ◽

S. Suriya

Keyword(s):

Unsupervised Learning ◽

Supervised And Unsupervised Learning ◽

Learning Techniques

Download Full-text

Learning from Imbalanced Educational Data Using Ensemble Machine Learning Algorithms

Webology ◽

10.14704/web/v18si01/web18053 ◽

2021 ◽

Vol 18 (Special Issue 01) ◽

pp. 183-195

Author(s):

Thingbaijam Lenin ◽

N. Chandrasekaran

Keyword(s):

Machine Learning ◽

Random Forest ◽

Missing Values ◽

Machine Learning Techniques ◽

Gradient Boosting ◽

Adaptive Boosting ◽

Stochastic Gradient Boosting ◽

Ensemble Machine Learning ◽

Learning Techniques ◽

Student’S Performance

Student’s academic performance is one of the most important parameters for evaluating the standard of any institute. It has become a paramount importance for any institute to identify the student at risk of underperforming or failing or even drop out from the course. Machine Learning techniques may be used to develop a model for predicting student’s performance as early as at the time of admission. The task however is challenging as the educational data required to explore for modelling are usually imbalanced. We explore ensemble machine learning techniques namely bagging algorithm like random forest (rf) and boosting algorithms like adaptive boosting (adaboost), stochastic gradient boosting (gbm), extreme gradient boosting (xgbTree) in an attempt to develop a model for predicting the student’s performance of a private university at Meghalaya using three categories of data namely demographic, prior academic record, personality. The collected data are found to be highly imbalanced and also consists of missing values. We employ k-nearest neighbor (knn) data imputation technique to tackle the missing values. The models are developed on the imputed data with 10 fold cross validation technique and are evaluated using precision, specificity, recall, kappa metrics. As the data are imbalanced, we avoid using accuracy as the metrics of evaluating the model and instead use balanced accuracy and F-score. We compare the ensemble technique with single classifier C4.5. The best result is provided by random forest and adaboost with F-score of 66.67%, balanced accuracy of 75%, and accuracy of 96.94%.

Download Full-text

Heart Disease Prediction using Machine Learning Techniques

International Journal of Scientific Research in Science and Technology ◽

10.32628/ijsrst2183218 ◽

2021 ◽

pp. 42-47

Author(s):

Ramesh Ponnala ◽

K. Sai Sowjanya

Keyword(s):

Machine Learning ◽

Heart Disease ◽

Random Forest ◽

Linear Model ◽

Machine Learning Techniques ◽

Disease Prediction ◽

Huge Amount ◽

Healthcare Enterprise ◽

Learning Techniques ◽

Accuracy Level

Prediction of Cardiovascular ailment is an important task inside the vicinity of clinical facts evaluation. Machine learning knowledge of has been proven to be effective in helping in making selections and predicting from the huge amount of facts produced by using the healthcare enterprise. on this paper, we advocate a unique technique that pursuits via finding good sized functions by means of applying ML strategies ensuing in improving the accuracy inside the prediction of heart ailment. The severity of the heart disease is classified primarily based on diverse methods like KNN, choice timber and so on. The prediction version is added with special combos of capabilities and several known classification techniques. We produce a stronger performance level with an accuracy level of a 100% through the prediction version for heart ailment with the Hybrid Random forest area with a linear model (HRFLM).

Download Full-text

Unsupervised Learning Techniques

Nonlinear System Identification ◽

10.1007/978-3-662-04323-3_6 ◽

2001 ◽

pp. 137-155 ◽

Cited By ~ 1

Author(s):

Oliver Nelles

Keyword(s):

Unsupervised Learning ◽

Learning Techniques

Download Full-text

Semisupervised sentiment analysis method for online text reviews

Journal of Information Science ◽

10.1177/0165551520910032 ◽

2020 ◽

pp. 016555152091003

Author(s):

Gyeong Taek Lee ◽

Chang Ouk Kim ◽

Min Song

Keyword(s):

Unsupervised Learning ◽

Sentiment Analysis ◽

Supervised Learning ◽

Model Space ◽

Training Dataset ◽

Learning Approach ◽

Learning Models ◽

Text Data ◽

Learning Techniques ◽

Sentiment Dictionary

Sentiment analysis plays an important role in understanding individual opinions expressed in websites such as social media and product review sites. The common approaches to sentiment analysis use the sentiments carried by words that express opinions and are based on either supervised or unsupervised learning techniques. The unsupervised learning approach builds a word-sentiment dictionary, but it requires lengthy time periods and high costs to build a reliable dictionary. The supervised learning approach uses machine learning models to learn the sentiment scores of words; however, training a classifier model requires large amounts of labelled text data to achieve a good performance. In this article, we propose a semisupervised approach that performs well despite having only small amounts of labelled data available for training. The proposed method builds a base sentiment dictionary from a small training dataset using a lasso-based ensemble model with minimal human effort. The scores of words not in the training dataset are estimated using an adaptive instance-based learning model. In a pretrained word2vec model space, the sentiment values of the words in the dictionary are propagated to the words that did not exist in the training dataset. Through two experiments, we demonstrate that the performance of the proposed method is comparable to that of supervised learning models trained on large datasets.

Download Full-text

Classification Based on Unsupervised Learning

Statistical Techniques for Network Security ◽

10.4018/978-1-59904-708-9.ch010 ◽

2011 ◽

pp. 348-395

Author(s):

Yu Wang

Keyword(s):

Network Security ◽

Unsupervised Learning ◽

Supervised Learning ◽

Network Traffic ◽

High Speed ◽

Ad Hoc ◽

Training Data ◽

Traffic Data ◽

Response Variable ◽

Learning Techniques

The requirement for having a labeled response variable in training data from the supervised learning technique may not be satisfied in some situations: particularly, in dynamic, short-term, and ad-hoc wireless network access environments. Being able to conduct classification without a labeled response variable is an essential challenge to modern network security and intrusion detection. In this chapter we will discuss some unsupervised learning techniques including probability, similarity, and multidimensional models that can be applied in network security. These methods also provide a different angle to analyze network traffic data. For comprehensive knowledge on unsupervised learning techniques please refer to the machine learning references listed in the previous chapter; for their applications in network security see Carmines, Edward & McIver (1981), Lane & Brodley (1997), Herrero, Corchado, Gastaldo, Leoncini, Picasso & Zunino (2007), and Dhanalakshmi & Babu (2008). Unlike in supervised learning, where for each vector 1 2 ( , , , ) n X x x x = ? we have a corresponding observed response, Y, in unsupervised learning we only have X, and Y is not available either because we could not observe it or its frequency is too low to be fit ted with a supervised learning approach. Unsupervised learning has great meanings in practice because in many circumstances, available network traffic data may not include any anomalous events or known anomalous events (e.g., traffics collected from a newly constructed network system). While high-speed mobile wireless and ad-hoc network systems have become popular, the importance and need to develop new unsupervised learning methods that allow the modeling of network traffic data to use anomaly-free training data have significantly increased.

Download Full-text

Application of Two Unsupervised Learning Techniques to Questionable Claims: PRIDIT and Random Forest

Framework for Tasks Suggestion on Web Search Based on Unsupervised Learning Techniques

Anomaly Detection on Shuttle data using Unsupervised Learning Techniques

Optimal Coordination of Over-Current Relays in Microgrids Using Unsupervised Learning Techniques

Advances in Unsupervised Learning Techniques Applied to Biosciences and Medicine

A Survey on Supervised and Unsupervised Learning Techniques

Learning from Imbalanced Educational Data Using Ensemble Machine Learning Algorithms

Heart Disease Prediction using Machine Learning Techniques

Unsupervised Learning Techniques

Semisupervised sentiment analysis method for online text reviews

Classification Based on Unsupervised Learning

Export Citation Format