scholarly journals Construction of Hybrid Model for English News Headline Sarcasm Detection by Word Embedding Technique

Author(s):  
S. Ayyasamy

People often use sarcasm to taunt, anger, or amuse one another. Scathing undertones can't be missed, even when using a simple sentiment analysis tool. Sarcasm may be detected using a variety of machine learning techniques, including rule-based approaches, statistical approaches, and classifiers. Since English is a widely used language on the internet, most of these terms were created to help people recognize sarcasm in written material. Convolutional Neural Networks (CNNs) are used to extract features, and Naive Bayes (NBs) are trained and evaluated on those features using a probability function. This suggested approach gives a more accurate forecast of sarcasm detection based on probability prediction. This hybrid machine learning technique is evaluated according to the stretching component in frequency inverse domain, the cluster of the words and word vectors with embedding. Based on the findings, the proposed model surpasses many advanced algorithms for sarcasm detection, including accuracy, recall, and F1 scores. It is possible to identify sarcasm in a multi-domain dataset using the suggested model, which is accurate and resilient.

2018 ◽  
Vol 34 (3) ◽  
pp. 569-581 ◽  
Author(s):  
Sujata Rani ◽  
Parteek Kumar

Abstract In this article, an innovative approach to perform the sentiment analysis (SA) has been presented. The proposed system handles the issues of Romanized or abbreviated text and spelling variations in the text to perform the sentiment analysis. The training data set of 3,000 movie reviews and tweets has been manually labeled by native speakers of Hindi in three classes, i.e. positive, negative, and neutral. The system uses WEKA (Waikato Environment for Knowledge Analysis) tool to convert these string data into numerical matrices and applies three machine learning techniques, i.e. Naive Bayes (NB), J48, and support vector machine (SVM). The proposed system has been tested on 100 movie reviews and tweets, and it has been observed that SVM has performed best in comparison to other classifiers, and it has an accuracy of 68% for movie reviews and 82% in case of tweets. The results of the proposed system are very promising and can be used in emerging applications like SA of product reviews and social media analysis. Additionally, the proposed system can be used in other cultural/social benefits like predicting/fighting human riots.


2017 ◽  
Author(s):  
Vinicius Da S. Segalin ◽  
Carina F. Dorneles ◽  
Mario A. R. Dantas

AA well-known challenge with long running time queries in database environments is how much time a query will take to execute. This prediction is relevant for several reasons. For instance, by knowing that a query will take longer to execute than desired, one resource reservation mechanism can be performed, which means reserving more resources in order to execute this query in a shorter time in a future request. In this research work, it is presented a proposal in which the use of an advance reservation mechanism in a cloud database environment, considering machine learning techniques, provides resource recommendation. The proposed model is presented, in addition to some experiments that evaluate benefits and the efficiency of this enhanced proposal.


2021 ◽  
Vol 10 (5) ◽  
pp. e13110514732
Author(s):  
Paulo César Ossani ◽  
Diogo Francisco Rossoni ◽  
Marcelo Ângelo Cirillo ◽  
Flávio Meira Borém

Specialty coffees have a big importance in the economic scenario, and its sensory quality is appreciated by the productive sector and by the market. Researches have been constantly carried out in the search for better blends in order to add value and differentiate prices according to the product quality. To accomplish that, new methodologies must be explored, taking into consideration factors that might differentiate the particularities of each consumer and/or product. Thus, this article suggests the use of the machine learning technique in the construction of supervised classification and identification models. In a sensory evaluation test for consumer acceptance using four classes of specialty coffees, applied to four groups of trained and untrained consumers, features such as flavor, body, sweetness and general grade were evaluated. The use of machine learning is viable because it allows the classification and identification of specialty coffees produced in different altitudes and different processing methods.


In order to take notes of the speech delivered by the VIPs in the short time short hand language is employed. Mainly there are two shorthand languages namely Pitman and Teeline. An automatic shorthand language recognition system is essential in order to make use of the handheld devices for speedy conversion to the original text. The paper addresses and compares the recognition of the Teeline alphabets using the Machine learning (SVM and KNN) and deep learning (CNN) techniques. The dataset has been prepared using the digital pen and the same is processed and stored using the android application. The prepared dataset is fed to the proposed system and accuracy of recognition is compared. Deep learning technique gave higher accuracy compared to machine learning techniques. MATLAB 2018b platform is used for implementation of the experimental setup.


Now days when someone decide to book a hotel, previous online reviews of the hotels play a major role in determining the best hotel within the budget of the customer. Previous Online reviews are the most important motivation for the information that are used to analyse public opinion. Because of the high impact of the reviews on business, hotel owners are always highly concerned and focused about the customer feedback and past online reviews. But all reviews are not true and trustworthy, sometime few people may intentionally generate the fake reviews to make some hotel famous of to defame. Therefore it is essential to develop and propose the techniques for analysis of reviews. With the help of various machine learning techniques viz. Supervised machine learning technique, Text mining, Unsupervised machine learning technique, Semi-supervised learning, Reinforcement learning etc we may detect the fake reviews. This paper gives some notions of using machine learning techniques in analysis of past online reviews of hotels, Based on the observation it also suggest the optimal machine learning technique for a particular situation


2020 ◽  
Vol 2 (2) ◽  
pp. 106-119
Author(s):  
Subasish Das ◽  
Minh Le ◽  
Boya Dai

Abstract Crash occurrence is a complex phenomenon, and crashes associated with pedestrians and bicyclists are even more complex. Furthermore, pedestrian- and bicyclist-involved crashes are typically not reported in detail in state or national crash databases. To address this issue, developers created the Pedestrian and Bicycle Crash Analysis Tool (PBCAT). However, it is labour-intensive to manually identify the types of pedestrian and bicycle crash from crash-narrative reports and to classify different crash attributes from the textual content of police reports. Therefore, there is a need for a supporting tool that can assist practitioners in using PBCAT more efficiently and accurately. The objective of this study is to develop a framework for applying machine-learning models to classify crash types from unstructured textual content. In this study, the research team collected pedestrian crash-typing data from two locations in Texas. The XGBoost model was found to be the best classifier. The high prediction power of the XGBoost classifiers indicates that this machine-learning technique was able to classify pedestrian crash types with the highest accuracy rate (up to 77% for training data and 72% for test data). The findings demonstrate that advanced machine-learning models can extract underlying patterns and trends of crash mechanisms. This provides the basis for applying machine-learning techniques in addressing the crash typing issues associated with non-motorist crashes.


2017 ◽  
Vol 10 (13) ◽  
pp. 489 ◽  
Author(s):  
Saheb Ghosh ◽  
Sathis Kumar B ◽  
Kathir Deivanai

Deep learning methods are a great machine learning technique which is mostly used in artificial neural networks for pattern recognition. This project is to identify the Whales from under water Bioacoustics network using an efficient algorithm and data model, so that location of the whales can be send to the Ships travelling in the same region in order to avoid collision with the whale or disturbing their natural habitat as much as possible. This paper shows application of unsupervised machine learning techniques with help of deep belief network and manual feature extraction model for better results.


2021 ◽  
Vol 27 (2) ◽  
Author(s):  
K.A. Oladapo ◽  
F.Y. Ayankoya ◽  
F.A. Adekunle ◽  
S.A. Idowu

The periodical occurrence of emergency situations represents an important issue for mankind. Over the years, the world at large has experienced multiple misadventures both natural and man-made. A recent report showed that flood have affected more individuals than any other category of disaster in the 21st century with the highest percentage of 43% of all disaster events in 2019 and Africa been the second vulnerable continent after Asia. Handling flood risk with the intention of safety and comfort of the citizens as well as saving their environment is one of the major responsibilities of the leadership in each country especially in flood prone areas. Machine learning predictive analytic applications can improve the risk management. So, it is highly important to devise a scientific method for flood risk reduction since it cannot be eradicated. The paper proposes a pluvial flood detection and prediction system based on machine learning techniques. The proposed model will employ a fuzzy rule-based classification to appraise the performance of the machine learning algorithm on pluvial flood conditioning variables.


MENDEL ◽  
2019 ◽  
Vol 25 (2) ◽  
pp. 1-10 ◽  
Author(s):  
Ivan Zelinka ◽  
Eslam Amer

Current commercial antivirus detection engines still rely on signature-based methods. However, with the huge increase in the number of new malware, current detection methods become not suitable. In this paper, we introduce a malware detection model based on ensemble learning. The model is trained using the minimum number of signification features that are extracted from the file header. Evaluations show that the ensemble models slightly outperform individual classification models. Experimental evaluations show that our model can predict unseen malware with an accuracy rate of 0.998 and with a false positive rate of 0.002. The paper also includes a comparison between the performance of the proposed model and with different machine learning techniques. We are emphasizing the use of machine learning based approaches to replace conventional signature-based methods.


Sign in / Sign up

Export Citation Format

Share Document