class labelling
Recently Published Documents


TOTAL DOCUMENTS

13
(FIVE YEARS 2)

H-INDEX

3
(FIVE YEARS 1)

Author(s):  
Faraz Ahmad ◽  
S. A. M. Rizvi

<p>Twitter is one of the most influential social media platforms, facilitates the spreading of information in the form of text, images, and videos. However, the credibility of posted content is still trailed by an interrogation mark. Introduction: In this paper, a model has been developed for finding the user’s credibility based on the tweets which they had posted on Twitter social networks. The model consists of machine learning algorithms that assist not only in categorizing the tweets into credibility classes but also helps in finding user’s credibility ratings on the social media platform. Methods and results: The dataset and associated features of 100,000 tweets were extracted and pre-processed. Furthermore, the credibility class labelling of tweets was performed using four different human annotators. The meaning cloud and natural language understanding platforms were used for calculating the polarity, sentiment, and emotions score. The K-Means algorithm was applied for finding the clusters of tweets based on features set, whereas, random forest, support vector machine, naïve Bayes, K-nearest-neighbours (KNN), J48 decision tree, and multilayer perceptron were used for classifying the tweets into credibility classes. A significant level of accuracy, precision, and recall was provided by all the classifiers for all the given credibility classes.</p>


2015 ◽  
Vol 56 (69) ◽  
pp. 285-294 ◽  
Author(s):  
M.-A.N. Moen ◽  
S.N. Anfinsen ◽  
A.P. Doulgeris ◽  
A.H.H. Renner ◽  
S. Gerland

AbstractThis paper investigates automatic segmentation and classification of C-band, polarimetric synthetic aperture radar (SAR) satellite images of Arctic sea ice under freezing conditions prior to melt. The objective is to investigate the robustness of the results obtained under slightly varying environmental conditions and different viewing geometries. Initially, three geographically overlapping SAR images from consecutive days are incidence-angle corrected and segmented into unknown classes. The segmentation is performed by an unsupervised mixture-of-Gaussian segmentation algorithm utilizing six features extracted from the polarimetric data. After segmentation, the segments are contextually smoothed. One segmented image is manually labelled based on in situ data and expert knowledge. Using this scene as reference, we consider two strategies for class labelling of the other scenes. The first manually labels the classes based on visual inspection of the reference; the second utilizes various statistical distance measures to automatically assign each unknown class to the statistically nearest reference class. These two scenes are also classified pixel-wise by a supervised classification algorithm based on the reference data. Poor classification results are obtained when the incidence angle is very different from the reference scene. Similar viewing geometries reveal good classification and labelling results, the latter regardless of the distance measure used.


2013 ◽  
Vol 8 (2) ◽  
Author(s):  
Kathryn Widhiyanti ◽  
Agus Harjoko

The research conduct a Part of Speech Tagging (POS-tagging) for text in Indonesian language, supporting another process in digitising natural language e.g. Indonesian language text parsing. POS-tagging is an automated process of labelling word classes for certain word in sentences (Jurafsky and Martin, 2000). The escalated issue is how to acquire an accurate word class labelling in sentence domain. The author would like to propose a method which combine Hidden Markov Model and Rule Based method. The expected outcome in this research is a better accurary in word class labelling, resulted by only using Hidden Markov Model. The labelling results –from Hidden Markov Model– are  refined by validating with certain rule, composed by the used corpus automatically. From the conducted research through some POST document, using Hidden Markov Model, produced 100% as the highest accurary for identical text within corpus. For different text within the referenced corpus, used words subjected in corpus, produced 92,2% for the highest accurary.


2003 ◽  
Vol &NA; (1371/1372) ◽  
pp. 21
Author(s):  
&NA;
Keyword(s):  

Sign in / Sign up

Export Citation Format

Share Document