Botnet Forensic Analysis Using Machine Learning

Security and Communication Networks ◽

10.1155/2020/9302318 ◽

2020 ◽

Vol 2020 ◽

pp. 1-9

Author(s):

Anchit Bijalwan

Keyword(s):

Machine Learning ◽

Ensemble Classifier ◽

Forensic Analysis ◽

Modus Operandi ◽

Botnet Detection ◽

Quality Of Results ◽

Proposed Model ◽

Improve Accuracy ◽

Rapid Pace

Botnet forensic analysis helps in understanding the nature of attacks and the modus operandi used by the attackers. Botnet attacks are difficult to trace because of their rapid pace, epidemic nature, and smaller size. Machine learning offers a way to address botnet-attack-related issues: it not only facilitates detection but also helps prevent bot attacks. The proposed inquisition model aims to improve the quality of results through comprehensive botnet detection and forensic analysis. The model has been applied in eight different combinations of ensemble classifier techniques to detect botnet evidence. The study also compares the ensemble-based classifiers with a single classifier using different parameters. The results show that the proposed model can improve accuracy over a single classifier.
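As a rough illustration of the ensemble-versus-single-classifier comparison mentioned in this abstract, hard (majority) voting can be sketched as below. This is not the paper's model: the three rule-based classifiers, the feature names (`packets_per_sec`, `unique_ports`, `mean_payload_bytes`), and the thresholds are all hypothetical stand-ins chosen only to make the example runnable.

```python
from collections import Counter
from typing import Callable, Dict, Sequence

Flow = Dict[str, float]            # one network-flow record (hypothetical features)
Classifier = Callable[[Flow], str]  # maps a flow to a label: "bot" or "benign"


# Three toy single classifiers; thresholds are illustrative assumptions.
def by_rate(flow: Flow) -> str:
    return "bot" if flow["packets_per_sec"] > 100 else "benign"


def by_ports(flow: Flow) -> str:
    return "bot" if flow["unique_ports"] > 20 else "benign"


def by_payload(flow: Flow) -> str:
    return "bot" if flow["mean_payload_bytes"] < 64 else "benign"


def majority_vote(classifiers: Sequence[Classifier], flow: Flow) -> str:
    """Return the label predicted by the largest number of classifiers."""
    votes = Counter(clf(flow) for clf in classifiers)
    return votes.most_common(1)[0][0]


sample = {"packets_per_sec": 150.0, "unique_ports": 5.0, "mean_payload_bytes": 40.0}
ensemble_label = majority_vote([by_rate, by_ports, by_payload], sample)
```

Here `by_ports` alone would label the sample benign, while the ensemble follows the two classifiers that disagree with it. An odd number of voters avoids ties in the two-class case.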

Logistic Regression for Employability Prediction

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.c8170.019320 ◽

2020 ◽

Vol 9 (3) ◽

pp. 2471-2478

Keyword(s):

Machine Learning ◽

Logistic Regression ◽

Deep Learning ◽

Weather Forecasting ◽

Previous Knowledge ◽

Recruitment Process ◽

Proposed Model ◽

Unknown Event ◽

The Right

Prediction is a conjecture about something that may happen. A prediction need not be based on previous knowledge or experience of the unknown future event of interest, but people need to foresee events and make the right decisions in order to live better. Every person makes predictions, but their quality differs, and that difference separates successful people from unsuccessful ones. To automate the prediction process and make quality predictions available to everyone, machines are trained to make predictions; this field falls under machine learning and, more recently, deep learning. Health care, weather forecasting, natural calamities, and crime prediction are some of the applications of prediction. The researchers apply prediction to see whether a model can predict the employability of a candidate in a recruitment process. Organizations use human expertise to identify skilled candidates for employment based on various factors, and they are now trying to migrate to automated systems by harnessing the exponential growth in the area of machine learning and deep learning. This investigation presents the development of a model that predicts employability using Logistic Regression. A set of candidates was tested with the proposed model, and the results are discussed in this paper.

Ens-PPI: A Novel Ensemble Classifier for Predicting the Interactions of Proteins Using Autocovariance Transformation from PSSM

BioMed Research International ◽

10.1155/2016/4563524 ◽

2016 ◽

Vol 2016 ◽

pp. 1-8 ◽

Cited By ~ 13

Author(s):

Zhen-Guo Gao ◽

Lei Wang ◽

Shi-Xiong Xia ◽

Zhu-Hong You ◽

Xin Yan ◽

...

Keyword(s):

Machine Learning ◽

Protein Interactions ◽

Biological Activities ◽

Ensemble Classifier ◽

Prediction Performance ◽

Protein Protein Interactions ◽

Rotation Forest ◽

Proposed Model ◽

Protein Amino Acids ◽

Scoring Matrix

Protein-Protein Interactions (PPIs) play vital roles in most biological activities. Although the development of high-throughput biological technologies has generated considerable PPI data for various organisms, many problems are still far from being solved. A number of computational methods based on machine learning have been developed to facilitate the identification of novel PPIs. In this study, a novel predictor was designed using the Rotation Forest (RF) algorithm combined with Autocovariance (AC) features extracted from the Position-Specific Scoring Matrix (PSSM). More specifically, the PSSMs are generated using the information in protein amino acid sequences. Then, an effective sequence-based feature representation, Autocovariance, is employed to extract features from the PSSMs. Finally, the RF model is used as a classifier to distinguish between interacting and noninteracting protein pairs. The proposed method achieves promising prediction performance on the Yeast, H. pylori, and independent PPI datasets. The good results show that the proposed model is suitable for PPI prediction and could also provide a useful supplementary tool for solving other bioinformatics problems.
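The feature-extraction step described in this abstract can be sketched as follows, using one commonly used form of the autocovariance transform: for each PSSM column and each lag, average the product of mean-centred scores at positions that are `lag` residues apart. The exact normalisation and maximum lag used in the paper are not stated in the abstract, so the formula, the function name `autocovariance_features`, and the toy matrix below are assumptions.

```python
from typing import List


def autocovariance_features(pssm: List[List[float]], max_lag: int) -> List[float]:
    """Turn an L x C score matrix into a fixed-length vector of max_lag * C values.

    Rows are sequence positions, columns are score columns (20 for a real PSSM).
    """
    n_rows = len(pssm)
    n_cols = len(pssm[0])
    if not 1 <= max_lag < n_rows:
        raise ValueError("max_lag must be at least 1 and smaller than the sequence length")

    # Mean score of each column over the whole sequence.
    means = [sum(row[j] for row in pssm) / n_rows for j in range(n_cols)]

    features: List[float] = []
    for lag in range(1, max_lag + 1):
        for j in range(n_cols):
            total = sum(
                (pssm[i][j] - means[j]) * (pssm[i + lag][j] - means[j])
                for i in range(n_rows - lag)
            )
            features.append(total / (n_rows - lag))
    return features


# Toy single-column "PSSM" that alternates 1, 3, 1, 3 (illustrative data only).
toy = [[1.0], [3.0], [1.0], [3.0]]
toy_features = autocovariance_features(toy, max_lag=2)
```

Because the output length depends only on `max_lag` and the number of columns, proteins of different lengths map to vectors of the same size, which is what allows them to be fed to a single classifier.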

An Enhanced Sentiment Analysis Framework Based on Pre-Trained Word Embedding

International Journal of Computational Intelligence and Applications ◽

10.1142/s1469026820500315 ◽

2020 ◽

Vol 19 (04) ◽

pp. 2050031 ◽

Cited By ~ 1

Author(s):

Ensaf Hussein Mohamed ◽

Mohammed ElSaid Moussa ◽

Mohamed Hassan Haggag

Keyword(s):

Machine Learning ◽

Sentiment Analysis ◽

Ensemble Classifier ◽

Word Embedding ◽

Machine Learning Techniques ◽

Bag Of Words ◽

Pos Tagging ◽

Learning Techniques ◽

Proposed Model ◽

Machine Learning Approach

Sentiment analysis (SA) is a technique that lets people in different fields such as business, economy, research, government, and politics learn about people’s opinions, which greatly affects the process of decision-making. SA techniques are classified into lexicon-based techniques, machine learning techniques, and a hybrid of both approaches. Each approach has its limitations and drawbacks: the machine learning approach depends on manual feature extraction, while the lexicon-based approach relies on sentiment lexicons that are usually unscalable, unreliable, and manually annotated by human experts. Nowadays, word-embedding techniques are commonly used in SA classification. Currently, Word2Vec and GloVe are some of the most accurate and usable word-embedding techniques, which can transform words into meaningful semantic vectors. However, these techniques ignore the sentiment information of texts and require a huge corpus of texts for training and generating accurate vectors, which are used as inputs to deep learning models. In this paper, we propose an enhanced ensemble classifier framework. Our framework is based on our previously published lexicon-based method, bag-of-words, and pre-trained word embedding. First, the sentence is preprocessed by removing stop-words, POS tagging, stemming and lemmatization, and shortening exaggerated words. Second, the processed sentence is passed to three modules: our previous lexicon-based method (Sum Votes), a bag-of-words module, and a semantic module (Word2Vec and GloVe), which produce feature vectors. Finally, these feature vectors are fed into 11 different classifiers. The proposed framework is tested and evaluated over four datasets with five different lexicons. The experimental results show that our proposed model outperforms the previous lexicon-based and machine learning methods individually.

To Enhance the Quality of HCRS using Fuzzy-Genetic Approach

10.21203/rs.3.rs-1215751/v1 ◽

2022 ◽

Author(s):

Latha Banda ◽

Karan Singh ◽

Vikash Arya ◽

Devendra Gautam ◽

Ali Ahmadian

Keyword(s):

Machine Learning ◽

Genetic Algorithm ◽

Health Care ◽

Medical Data ◽

Machine Learning Algorithms ◽

Genetic Approach ◽

Data Set ◽

Proposed Model ◽

Fuzzy Genetic

Social media is a recent generation of Recommender Systems (RS). The term Health Care Recommender System (HCRS) refers to a system that analyses medical data and then predicts a patient's disease with the help of various techniques used in RS. To ensure the quality and trustworthiness of medical data, machine learning algorithms are applied. Even so, there is a wide gap between health care diagnosis and IT solutions. To bridge this gap, a hybrid fuzzy-genetic approach is used in HCRS. In this approach, a genetic algorithm is used for similarity computations with the help of mutation and crossover operators. Fuzzy rules are then generated for the data set using additional personalized information about the user. With these approaches, the proposed model enhances the quality of recommendation in HCRS.

A deep learning-based quality assessment model of collaboratively edited documents: A case study of Wikipedia

Journal of Information Science ◽

10.1177/0165551519877646 ◽

2019 ◽

pp. 016555151987764

Author(s):

Ping Wang ◽

Xiaodan Li ◽

Renli Wu

Keyword(s):

Machine Learning ◽

Deep Learning ◽

Complete Information ◽

Classification Performance ◽

Machine Learning Algorithms ◽

Assessment Model ◽

Learning Models ◽

Proposed Model

Wikipedia is becoming increasingly critical in helping people obtain information and knowledge. Its leading advantage is that users can not only access information but also modify it. However, this presents a challenging issue: how can we measure the quality of a Wikipedia article? The existing approaches assess Wikipedia quality by statistical models or traditional machine learning algorithms. However, their performance is not satisfactory. Moreover, most existing models fail to extract complete information from articles, which degrades the model’s performance. In this article, we first survey related works and summarise a comprehensive feature framework. Then, state-of-the-art deep learning models are introduced and applied to assess Wikipedia quality. Finally, a comparison among deep learning models and traditional machine learning models is conducted to validate the effectiveness of the proposed model. The models are compared extensively in terms of their training and classification performance. Moreover, the importance of each feature and the importance of different feature sets are analysed separately.

Fast and Accurate Estimation of Quality of Results in High-Level Synthesis with Machine Learning

2018 IEEE 26th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM) ◽

10.1109/fccm.2018.00029 ◽

2018 ◽

Cited By ~ 15

Author(s):

Steve Dai ◽

Yuan Zhou ◽

Hang Zhang ◽

Ecenur Ustun ◽

Evangeline F.Y. Young ◽

...

Keyword(s):

Machine Learning ◽

High Level Synthesis ◽

Accurate Estimation ◽

Quality Of Results ◽

High Level

Resistance Monitor and Control for Vacuum Evaporation of Metals and Carbon

Proceedings, annual meeting, Electron Microscopy Society of America ◽

10.1017/s0424820100092827 ◽

1976 ◽

Vol 34 ◽

pp. 572-573

Author(s):

Russell L. Steere ◽

Eric F. Erbe ◽

J. Michael Moseley

Keyword(s):

Electronic Device ◽

Point Sources ◽

High Contrast ◽

Variable Resistor ◽

Quality Of Results ◽

Low Contrast ◽

Evaporation Source ◽

And Control ◽

At Will

We have designed and built an electronic device which compares the resistance of a defined area of vacuum evaporated material with a variable resistor. When the two resistances are matched, the device automatically disconnects the primary side of the substrate transformer and stops further evaporation. This approach to controlled evaporation in conjunction with the modified guns and evaporation source permits reliably reproducible multiple Pt shadow films from a single Pt-wrapped carbon point source. The reproducibility from consecutive C point sources is also reliable. Furthermore, the device we have developed permits us to select a predetermined resistance so that low contrast high-resolution shadows, heavy high contrast shadows, or any grade in between can be selected at will. The reproducibility and quality of results are demonstrated in Figures 1-4 which represent evaporations at various settings of the variable resistor.

A Literature Review Study of Software Defect Prediction using Machine Learning Techniques

International Journal of Emerging Research in Management and Technology ◽

10.23956/ijermt.v6i6.286 ◽

2018 ◽

Vol 6 (6) ◽

pp. 300 ◽

Cited By ~ 3

Author(s):

Feidu Akmel ◽

Ermiyas Birihanu ◽

Bahir Siraj

Keyword(s):

Machine Learning ◽

Software Metrics ◽

Quality Standard ◽

Machine Learning Techniques ◽

Software Systems ◽

Health Care Insurance ◽

Software Defect ◽

Learning Techniques ◽

Software Product

Software systems are software products or applications that support business domains such as manufacturing, aviation, health care, insurance, and so on. Software quality is a means of measuring how software is designed and how well the software conforms to that design. Some of the variables we look for in software quality are correctness, product quality, scalability, completeness, and absence of bugs. However, the quality standard used by one organization differs from that of another; for this reason, it is better to apply software metrics to measure the quality of software. Attributes gathered from source code through software metrics can be an input for a software defect predictor. Software defects are errors introduced by software developers and stakeholders. Finally, in this study we examine applications of machine learning to software defects, gathered from previous research works.

Data science in economics: comprehensive review of advanced machine learning and deep learning methods

10.31232/osf.io/4pxq2 ◽

2020 ◽

Author(s):

Saeed Nosratabadi ◽

Amir Mosavi ◽

Puhong Duan ◽

Pedram Ghamisi ◽

Ferdinand Filip ◽

...

Keyword(s):

Machine Learning ◽

Deep Learning ◽

Data Science ◽

State Of The Art ◽

Science Methods ◽

Learning Models ◽

Diverse Range ◽

Hybrid Machine ◽

Economics Research

This paper provides a state-of-the-art investigation of advances in data science in emerging economic applications. The analysis was performed on novel data science methods in four individual classes: deep learning models, hybrid deep learning models, hybrid machine learning, and ensemble models. Application domains include a wide and diverse range of economics research from the stock market, marketing, and e-commerce to corporate banking and cryptocurrency. The PRISMA method, a systematic literature review methodology, was used to ensure the quality of the survey. The findings reveal that the trends follow the advancement of hybrid models, which, based on the accuracy metric, outperform other learning algorithms. It is further expected that the trends will converge toward the advancements of sophisticated hybrid deep learning models.

Model and Method for Contributor’s Quality Assessment in Community Image Tagging Systems

Information and Control Systems ◽

10.31799/1684-8853-2018-4-45-51 ◽

2018 ◽

pp. 45-51

Author(s):

A. V. Ponomarev

Keyword(s):

Large Scale ◽

Wide Spectrum ◽

Preference Relation ◽

Pairwise Comparison ◽

Ground Truth ◽

Comparison Method ◽

Characteristic Matrix ◽

Image Tagging ◽

Proposed Model

Introduction: Large-scale human-computer systems involving people of various skills and motivation into the information processing process are currently used in a wide spectrum of applications. An acute problem in such systems is assessing the expected quality of each contributor; for example, in order to penalize incompetent or inaccurate ones and to promote diligent ones. Purpose: To develop a method of assessing the expected contributor’s quality in community tagging systems. This method should only use generally unreliable and incomplete information provided by contributors (with ground truth tags unknown). Results: A mathematical model is proposed for community image tagging (including the model of a contributor), along with a method of assessing the expected contributor’s quality. The method is based on comparing tag sets provided by different contributors for the same images, being a modification of pairwise comparison method with preference relation replaced by a special domination characteristic. Expected contributors’ quality is evaluated as a positive eigenvector of a pairwise domination characteristic matrix. Community tagging simulation has confirmed that the proposed method allows you to adequately estimate the expected quality of community tagging system contributors (provided that the contributors' behavior fits the proposed model). Practical relevance: The obtained results can be used in the development of systems based on coordinated efforts of community (primarily, community tagging systems).
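The final scoring step in this abstract, reading contributor quality off a positive eigenvector of a pairwise matrix, can be illustrated with plain power iteration. The abstract does not define the domination characteristic, so the matrix below is a made-up positive pairwise matrix, and the function name `principal_eigenvector` and the sum-to-one normalisation are assumptions of this sketch rather than the author's method.

```python
from typing import List


def principal_eigenvector(
    matrix: List[List[float]], iterations: int = 200, tol: float = 1e-12
) -> List[float]:
    """Approximate the dominant eigenvector of a positive square matrix.

    The result is scaled so that its entries sum to 1.
    """
    n = len(matrix)
    v = [1.0 / n] * n  # start from a uniform positive vector
    for _ in range(iterations):
        w = [sum(matrix[i][j] * v[j] for j in range(n)) for i in range(n)]
        total = sum(w)
        if total == 0:
            raise ValueError("matrix maps the current vector to zero")
        w = [x / total for x in w]
        if max(abs(a - b) for a, b in zip(w, v)) < tol:
            return w
        v = w
    return v


# Hypothetical 2 x 2 pairwise matrix: entry [i][j] says how strongly
# contributor i dominates contributor j (values invented for illustration).
pairwise = [
    [1.0, 2.0],
    [0.5, 1.0],
]
quality = principal_eigenvector(pairwise)
```

For a matrix with strictly positive entries, the Perron-Frobenius theorem guarantees that the dominant eigenvector can be chosen with all entries positive, which is what makes it usable as a ranking score.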
