Research on Feature Extraction Method of Social Network Text

Researchers have collected Twitter data to study a wide range of topics, one of which is a natural disaster. A social network sensor was developed in existing research to filter natural disaster information from direct eyewitnesses, none eyewitnesses, and non-natural disaster information. It can be used as a tool for early warning or monitoring when natural disasters occur. The main component of the social network sensor is the text tweet classification. Similar to text classification research in general, the challenge is the feature extraction method to convert Twitter text into structured data. The strategy commonly used is vector space representation. However, it has the potential to produce high dimension data. This research focuses on the feature extraction method to resolve high dimension data issues. We propose a hybrid approach of word2vec-based and lexicon-based feature extraction to produce new features. The Experiment result shows that the proposed method has fewer features and improves classification performance with an average AUC value of 0.84, and the number of features is 150. The value is obtained by using only the word2vec-based method. In the end, this research shows that lexicon-based did not influence the improvement in the performance of social network sensor predictions in natural disasters. HIGHLIGHTS Implementation of text classification is generally only used to perform sentiment analysis, it is still rare to use it to perform text classification for use in determining direct eyewitnesses in cases of natural disasters One of the common problems in text mining research is the extracted features from the vector space representation method generate high dimension data A hybrid approach of word2vec-based and lexicon-based feature extraction experiment was conducted in order to find a method that can generate new features with low dimensions and also improve the classification performance GRAPHICAL ABSTRACT

Download Full-text

Robust Texture Feature Extraction Method for Geometrical and Illumination Distortions

IEEJ Transactions on Electronics Information and Systems ◽

10.1541/ieejeiss.129.985 ◽

2009 ◽

Vol 129 (5) ◽

pp. 985-992 ◽

Cited By ~ 1

Author(s):

Norisuke Takao ◽

Zhuo Liu ◽

Shigeo Wada

Keyword(s):

Feature Extraction ◽

Extraction Method ◽

Texture Feature ◽

Texture Feature Extraction ◽

Feature Extraction Method

Download Full-text

IMPLEMENTATION OF HIGH PERFORMANCE FEATURE EXTRACTION METHOD USING ORIENTED FAST AND ROTATED BRIEF ALGORITHM

International Journal of Research in Engineering and Technology ◽

10.15623/ijret.2015.0402052 ◽

2015 ◽

Vol 04 (02) ◽

pp. 394-397 ◽

Cited By ~ 3

Author(s):

Prashant Aglave .

Keyword(s):

Feature Extraction ◽

Extraction Method ◽

High Performance ◽

Feature Extraction Method

Download Full-text

A Novel Feature Extraction Method for Identification of Healthy and Diseased Maize and Paddy Leaves Using ECOC Classifier

International Journal of Computer Sciences and Engineering ◽

10.26438/ijcse/v6i9.137141 ◽

2018 ◽

Vol 6 (9) ◽

pp. 137-141

Author(s):

T. Harisha Naik ◽

M. Suresha ◽

Shreekanth K. N.

Keyword(s):

Feature Extraction ◽

Extraction Method ◽

Feature Extraction Method

Download Full-text

(2D)2UFFCA: Two-directional Two-dimensional Unsupervised Feature Extraction Method with Fuzzy Clustering Ability

ACTA AUTOMATICA SINICA ◽

10.3724/sp.j.1004.2012.00549 ◽

2012 ◽

Vol 38 (4) ◽

pp. 549-562 ◽

Cited By ~ 1

Author(s):

Jun GAO ◽

Chang-Yin SUN ◽

Shi-Tong WANG

Keyword(s):

Feature Extraction ◽

Fuzzy Clustering ◽

Extraction Method ◽

Two Dimensional ◽

Feature Extraction Method ◽

Unsupervised Feature Extraction

Download Full-text

A Feature Extraction Method of Computer Viruses Based on Artificial Immune and Code Relevance

Chinese Journal of Computers ◽

10.3724/sp.j.1016.2011.00204 ◽

2011 ◽

Vol 34 (2) ◽

pp. 204-215 ◽

Cited By ~ 4

Author(s):

Wei WANG ◽

Peng-Tao ZHANG ◽

Ying TAN ◽

Xin-Gui HE

Keyword(s):

Feature Extraction ◽

Extraction Method ◽

Artificial Immune ◽

Computer Viruses ◽

Feature Extraction Method

Download Full-text

A Novel Prediction of Quaternary Structural Type of Proteins with Gene Ontology

Protein and Peptide Letters ◽

10.2174/0929866526666191014144618 ◽

2020 ◽

Vol 27 (4) ◽

pp. 313-320 ◽

Cited By ~ 1

Author(s):

Xuan Xiao ◽

Wei-Jie Chen ◽

Wang-Ren Qiu

Keyword(s):

Gene Ontology ◽

Feature Extraction ◽

Extraction Method ◽

Quaternary Structure ◽

Structural Type ◽

Sequence Information ◽

Prediction System ◽

Data Set ◽

Feature Extraction Method ◽

Prediction Rate

Background: The information of quaternary structure attributes of proteins is very important because it is closely related to the biological functions of proteins. With the rapid development of new generation sequencing technology, we are facing a challenge: how to automatically identify the four-level attributes of new polypeptide chains according to their sequence information (i.e., whether they are formed as just as a monomer, or as a hetero-oligomer, or a homo-oligomer). Objective: In this article, our goal is to find a new way to represent protein sequences, thereby improving the prediction rate of protein quaternary structure. Methods: In this article, we developed a prediction system for protein quaternary structural type in which a protein sequence was expressed by combining the Pfam functional-domain and gene ontology. turn protein features into digital sequences, and complete the prediction of quaternary structure through specific machine learning algorithms and verification algorithm. Results: Our data set contains 5495 protein samples. Through the method provided in this paper, we classify proteins into monomer, or as a hetero-oligomer, or a homo-oligomer, and the prediction rate is 74.38%, which is 3.24% higher than that of previous studies. Through this new feature extraction method, we can further classify the four-level structure of proteins, and the results are also correspondingly improved. Conclusion: After the applying the new prediction system, compared with the previous results, we have successfully improved the prediction rate. We have reason to believe that the feature extraction method in this paper has better practicability and can be used as a reference for other protein classification problems.

Download Full-text