Predicting bend-induced heterogeneity in sediment microbial communities by integrating bacteria-based index of biotic integrity and supervised learning algorithms

There are several malware detection techniques available that are based on a signature-based approach. This approach can detect known malware very effectively but sometimes may fail to detect unknown or zero-day attacks. In this article, the authors have proposed a malware detection model that uses operation codes of malicious and benign executables as the feature. The proposed model uses opcode extract and count (OPEC) algorithm to prepare the opcode feature vector for the experiment. Most relevant features are selected using extra tree classifier feature selection technique and then passed through several supervised learning algorithms like support vector machine, naive bayes, decision tree, random forest, logistic regression, and k-nearest neighbour to build classification models for malware detection. The proposed model has achieved a detection accuracy of 98.7%, which makes this model better than many of the similar works discussed in the literature.

Download Full-text

An Auto-Adjustable Semi-Supervised Self-Training Algorithm

Algorithms ◽

10.3390/a11090139 ◽

2018 ◽

Vol 11 (9) ◽

pp. 139 ◽

Cited By ~ 5

Author(s):

Ioannis Livieris ◽

Andreas Kanavos ◽

Vassilis Tampakas ◽

Panagiotis Pintelas

Keyword(s):

Supervised Learning ◽

Predictive Models ◽

Learning Algorithm ◽

Learning Algorithms ◽

Classification Problem ◽

Classification Methods ◽

Training Algorithm ◽

Traditional Classification ◽

Supervised Learning Algorithms ◽

Significant Research

Semi-supervised learning algorithms have become a topic of significant research as an alternative to traditional classification methods which exhibit remarkable performance over labeled data but lack the ability to be applied on large amounts of unlabeled data. In this work, we propose a new semi-supervised learning algorithm that dynamically selects the most promising learner for a classification problem from a pool of classifiers based on a self-training philosophy. Our experimental results illustrate that the proposed algorithm outperforms its component semi-supervised learning algorithms in terms of accuracy, leading to more efficient, stable and robust predictive models.

Download Full-text

Evaluation of Supervised Learning Algorithms Based on Speech Features as Predictors to the Diagnosis of Mild to Moderate Intellectual Disability

3D Research ◽

10.1007/s13319-018-0207-6 ◽

2018 ◽

Vol 9 (4) ◽

Author(s):

Gaurav Aggarwal ◽

Latika Singh

Keyword(s):

Intellectual Disability ◽

Supervised Learning ◽

Learning Algorithms ◽

Moderate Intellectual Disability ◽

Supervised Learning Algorithms ◽

Speech Features

Download Full-text

Bootstrap Domain-Specific Sentiment Classifiers from Unlabeled Corpora

Transactions of the Association for Computational Linguistics ◽

10.1162/tacl_a_00020 ◽

2018 ◽

Vol 6 ◽

pp. 269-285 ◽

Cited By ~ 3

Author(s):

Andrius Mudinas ◽

Dell Zhang ◽

Mark Levene

Keyword(s):

Supervised Learning ◽

Learning Algorithms ◽

General Purpose ◽

Machine Learning Algorithms ◽

Sentiment Classification ◽

Two Phase ◽

Transductive Learning ◽

Domain Specific ◽

Sentiment Lexicon ◽

Supervised Learning Algorithms

There is often the need to perform sentiment classification in a particular domain where no labeled document is available. Although we could make use of a general-purpose off-the-shelf sentiment classifier or a pre-built one for a different domain, the effectiveness would be inferior. In this paper, we explore the possibility of building domain-specific sentiment classifiers with unlabeled documents only. Our investigation indicates that in the word embeddings learned from the unlabeled corpus of a given domain, the distributed word representations (vectors) for opposite sentiments form distinct clusters, though those clusters are not transferable across domains. Exploiting such a clustering structure, we are able to utilize machine learning algorithms to induce a quality domain-specific sentiment lexicon from just a few typical sentiment words (“seeds”). An important finding is that simple linear model based supervised learning algorithms (such as linear SVM) can actually work better than more sophisticated semi-supervised/transductive learning algorithms which represent the state-of-the-art technique for sentiment lexicon induction. The induced lexicon could be applied directly in a lexicon-based method for sentiment classification, but a higher performance could be achieved through a two-phase bootstrapping method which uses the induced lexicon to assign positive/negative sentiment scores to unlabeled documents first, a nd t hen u ses those documents found to have clear sentiment signals as pseudo-labeled examples to train a document sentiment classifier v ia supervised learning algorithms (such as LSTM). On several benchmark datasets for document sentiment classification, our end-to-end pipelined approach which is overall unsupervised (except for a tiny set of seed words) outperforms existing unsupervised approaches and achieves an accuracy comparable to that of fully supervised approaches.

Download Full-text

Predicting bend-induced heterogeneity in sediment microbial communities by integrating bacteria-based index of biotic integrity and supervised learning algorithms

Analysis of Tree Based Supervised Learning Algorithms on Medical Data

Investigating the performance of the supervised learning algorithms for estimating NPPs parameters in combination with the different feature selection techniques

Supervised learning algorithms in the classification of plant populations with different degrees of kinship

An Experimental Comparison of Semi-supervised Learning Algorithms for Multispectral Image Classification

Ordering and finding the best of K > 2 supervised learning algorithms

On incrementally using a small portion of strong unlabeled data for semi-supervised learning algorithms

An Opcode-Based Malware Detection Model Using Supervised Learning Algorithms

An Auto-Adjustable Semi-Supervised Self-Training Algorithm

Evaluation of Supervised Learning Algorithms Based on Speech Features as Predictors to the Diagnosis of Mild to Moderate Intellectual Disability

Bootstrap Domain-Specific Sentiment Classifiers from Unlabeled Corpora

Export Citation Format