DYPLODOC: Dynamic Plots for Document Classification

Narrative generation and analysis are still on the fringe of modern natural language processing yet are crucial in a variety of applications. This paper proposes a feature extraction method for plot dynamics. We present a dataset that consists of the plot descriptions for thirteen thousand TV shows alongside meta-information on their genres and dynamic plots extracted from them. We validate the proposed tool for plot dynamics extraction and discuss possible applications of this method to the tasks of narrative analysis and generation.

Natural Language Processing as Feature Extraction Method for Building Better Predictive Models

Artificial Intelligence ◽ 10.4018/978-1-5225-1759-7.ch078 ◽ 2017 ◽ pp. 1913-1937

Author(s): Goran Klepac ◽ Marko Velić

Keyword(s): Feature Extraction ◽ Natural Language Processing ◽ Natural Language ◽ Language Processing ◽ Predictive Models ◽ Extraction Method ◽ Feature Extraction Method

An Improved Feature Extraction Approach for Web Anomaly Detection Based on Semantic Structure

Security and Communication Networks ◽ 10.1155/2021/6661124 ◽ 2021 ◽ Vol 2021 ◽ pp. 1-11

Author(s): Zishuai Cheng ◽ Baojiang Cui ◽ Tao Qi ◽ Wenchuan Yang ◽ Junsong Fu

Keyword(s): Machine Learning ◽ Feature Extraction ◽ Anomaly Detection ◽ Language Processing ◽ Web Application ◽ Extraction Method ◽ Domain Knowledge ◽ Semantic Structure ◽ Feature Extraction Method ◽ Web Attacks

Anomaly-based Web application firewalls (WAFs) are vital for providing early reactions to novel Web attacks. In recent years, various machine learning, deep learning, and transfer learning-based anomaly detection approaches have been developed to protect against Web attacks. Most of them treat the request URL as a generic string of characters and extract features either by coarsely applying natural language processing (NLP) methods (e.g., Word2Vec and Doc2Vec) or by relying on domain knowledge. In this paper, we propose an improved feature extraction approach that exploits the semantic structure of URLs. Semantic structure is an inherent interpretative property of the URL that identifies the function and vulnerability of each part of the URL. Evaluations on CSIC-2020 show that our feature extraction method outperforms the conventional feature extraction routine, improving accuracy, recall, and F1-score by more than 5% on average.
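The contrast between treating a URL as a flat string and treating it by its parts can be illustrated with a minimal Python sketch. This is not the authors' method, which the abstract does not detail. It only assumes that "structure" means the path segments and the query parameters, and the helper names `split_url_parts`, `value_features`, and `url_features`, as well as the three toy features, are hypothetical choices made for illustration.

```python
from urllib.parse import urlsplit, parse_qsl


def split_url_parts(url):
    """Split a request URL into labelled structural parts."""
    parts = urlsplit(url)
    return {
        # Non-empty path segments, e.g. ["shop", "item.jsp"]
        "path_segments": [seg for seg in parts.path.split("/") if seg],
        # Decoded (name, value) pairs from the query string
        "params": parse_qsl(parts.query, keep_blank_values=True),
    }


def value_features(value):
    """Toy per-value features (hypothetical, not taken from the paper)."""
    return {
        "length": len(value),
        "digits": sum(ch.isdigit() for ch in value),
        "specials": sum(not ch.isalnum() for ch in value),
    }


def url_features(url):
    """Map each query parameter name to the features of its value.

    If a parameter name repeats, the last occurrence wins.
    """
    structure = split_url_parts(url)
    return {name: value_features(value) for name, value in structure["params"]}


if __name__ == "__main__":
    benign = "http://example.com/shop/item.jsp?id=42"
    suspicious = "http://example.com/shop/item.jsp?id=42&name=a%27+OR+1%3D1"
    print(url_features(benign))
    print(url_features(suspicious))
```

Because features are computed per parameter rather than over the whole string, an unusual value in one parameter (such as the injected quote and operators in `name`) stays visible instead of being averaged into the rest of the URL.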

Robust Texture Feature Extraction Method for Geometrical and Illumination Distortions

IEEJ Transactions on Electronics Information and Systems ◽ 10.1541/ieejeiss.129.985 ◽ 2009 ◽ Vol 129 (5) ◽ pp. 985-992 ◽ Cited By ~ 1

Author(s): Norisuke Takao ◽ Zhuo Liu ◽ Shigeo Wada

Keyword(s): Feature Extraction ◽ Extraction Method ◽ Texture Feature ◽ Texture Feature Extraction ◽ Feature Extraction Method

IMPLEMENTATION OF HIGH PERFORMANCE FEATURE EXTRACTION METHOD USING ORIENTED FAST AND ROTATED BRIEF ALGORITHM

International Journal of Research in Engineering and Technology ◽ 10.15623/ijret.2015.0402052 ◽ 2015 ◽ Vol 04 (02) ◽ pp. 394-397 ◽ Cited By ~ 3

Author(s): Prashant Aglave

Keyword(s): Feature Extraction ◽ Extraction Method ◽ High Performance ◽ Feature Extraction Method

Fast Neural Network Engine for Natural Science Language Processing: A Drug-Search Case.

10.26434/chemrxiv.12800348 ◽ 2020

Author(s): Vadim V. Korolev ◽ Artem Mitrofanov ◽ Kirill Karpov ◽ Valery Tkachenko

Keyword(s): Neural Network ◽ Natural Language Processing ◽ Natural Language ◽ Language Processing ◽ Natural Science ◽ Therapeutic Agent ◽ Semantic Relations ◽ Chemical Data ◽ Processing Methods ◽ Modern Natural

The main advantage of modern natural language processing methods is the ability to turn an amorphous human-readable task into a strict mathematical form. This makes it possible to extract chemical data and insights from articles and to find new semantic relations. We propose a universal engine for processing chemical and biological texts. We successfully tested it on various use cases and applied it to the search for a therapeutic agent for COVID-19 by analyzing the PubMed archive.

A Novel Feature Extraction Method for Identification of Healthy and Diseased Maize and Paddy Leaves Using ECOC Classifier

International Journal of Computer Sciences and Engineering ◽ 10.26438/ijcse/v6i9.137141 ◽ 2018 ◽ Vol 6 (9) ◽ pp. 137-141

Author(s): T. Harisha Naik ◽ M. Suresha ◽ Shreekanth K. N.

Keyword(s): Feature Extraction ◽ Extraction Method ◽ Feature Extraction Method

(2D)²UFFCA: Two-directional Two-dimensional Unsupervised Feature Extraction Method with Fuzzy Clustering Ability

ACTA AUTOMATICA SINICA ◽ 10.3724/sp.j.1004.2012.00549 ◽ 2012 ◽ Vol 38 (4) ◽ pp. 549-562 ◽ Cited By ~ 1

Author(s): Jun GAO ◽ Chang-Yin SUN ◽ Shi-Tong WANG

Keyword(s): Feature Extraction ◽ Fuzzy Clustering ◽ Extraction Method ◽ Two Dimensional ◽ Feature Extraction Method ◽ Unsupervised Feature Extraction

A Feature Extraction Method of Computer Viruses Based on Artificial Immune and Code Relevance

Chinese Journal of Computers ◽ 10.3724/sp.j.1016.2011.00204 ◽ 2011 ◽ Vol 34 (2) ◽ pp. 204-215 ◽ Cited By ~ 4

Author(s): Wei WANG ◽ Peng-Tao ZHANG ◽ Ying TAN ◽ Xin-Gui HE

Keyword(s): Feature Extraction ◽ Extraction Method ◽ Artificial Immune ◽ Computer Viruses ◽ Feature Extraction Method

A Novel Prediction of Quaternary Structural Type of Proteins with Gene Ontology

Protein and Peptide Letters ◽ 10.2174/0929866526666191014144618 ◽ 2020 ◽ Vol 27 (4) ◽ pp. 313-320 ◽ Cited By ~ 1

Author(s): Xuan Xiao ◽ Wei-Jie Chen ◽ Wang-Ren Qiu

Keyword(s): Gene Ontology ◽ Feature Extraction ◽ Extraction Method ◽ Quaternary Structure ◽ Structural Type ◽ Sequence Information ◽ Prediction System ◽ Data Set ◽ Feature Extraction Method ◽ Prediction Rate

Background: Information on the quaternary structure attributes of proteins is very important because it is closely related to their biological functions. With the rapid development of new-generation sequencing technology, we face a challenge: how to automatically identify the quaternary (four-level) attributes of new polypeptide chains from their sequence information (i.e., whether they form a monomer, a hetero-oligomer, or a homo-oligomer). Objective: In this article, our goal is to find a new way to represent protein sequences, thereby improving the prediction rate of protein quaternary structure. Methods: We developed a prediction system for protein quaternary structural type in which a protein sequence is expressed by combining the Pfam functional domain and gene ontology. We turn protein features into numerical sequences and complete the prediction of quaternary structure through specific machine learning algorithms and a verification algorithm. Results: Our data set contains 5495 protein samples. Using the method provided in this paper, we classify proteins as a monomer, a hetero-oligomer, or a homo-oligomer, and the prediction rate is 74.38%, which is 3.24% higher than that of previous studies. With this new feature extraction method, we can further classify the four-level structure of proteins, and the results are correspondingly improved. Conclusion: After applying the new prediction system, we have improved the prediction rate compared with previous results. We have reason to believe that the feature extraction method in this paper has better practicability and can be used as a reference for other protein classification problems.
