Does One Size Fit All? Exploring the Contribution of Text features, Text content, and Grade of Use on Comprehension

A method for solving the problem of classifying short-text messages in the form of sentences of customers uttered in talking via the telephone line of organizations is considered. To solve this problem, a classifier was developed, which is based on using a combination of two methods: a description of the subject area in the form of a hierarchy of entities and plausible reasoning based on the case-based reasoning approach, which is actively used in artificial intelligence systems. In solving various problems of artificial intelligence-based analysis of data, these methods have shown a high degree of efficiency, scalability, and independence from data structure. As part of using the case-based reasoning approach in the classifier, it is proposed to modify the TF-IDF (Term Frequency - Inverse Document Frequency) measure of assessing the text content taking into account known information about the distribution of documents by topics. The proposed modification makes it possible to improve the classification quality in comparison with classical measures, since it takes into account the information about the distribution of words not only in a separate document or topic, but in the entire database of cases. Experimental results are presented that confirm the effectiveness of the proposed metric and the developed classifier as applied to classification of customer sentences and providing them with the necessary information depending on the classification result. The developed text classification service prototype is used as part of the voice interaction module with the user in the objective of robotizing the telephone call routing system and making a shift from interaction between the user and system by means of buttons to their interaction through voice.

Download Full-text

Historical Trends in the Development of the English Academic Medical Written Text: Content Organization

Dagestan State Pedagogical University Journal Social and Humanitarian Sciences ◽

10.31161/1995-0667-2018-12-1-92-97 ◽

2018 ◽

Vol 12 (2) ◽

pp. 92-97

Author(s):

Larisa V. Yagenich ◽

Keyword(s):

Historical Trends ◽

Written Text ◽

Content Organization ◽

Academic Medical ◽

Text Content

Download Full-text

Systemic Lupus Erythematosus before and after COVID-19 Lockdown: How the Perception of Disease Changes through the Lenses of Narrative Medicine

Healthcare ◽

10.3390/healthcare9060726 ◽

2021 ◽

Vol 9 (6) ◽

pp. 726

Author(s):

Fulvia Ceccarelli ◽

Venusia Covelli ◽

Giulio Olivieri ◽

Francesco Natalucci ◽

Fabrizio Conti

Keyword(s):

Systemic Lupus Erythematosus ◽

Lupus Erythematosus ◽

Narrative Medicine ◽

Good Control ◽

Final Analysis ◽

Point Of View ◽

Systemic Lupus ◽

Before And After ◽

Text Content ◽

Insight Into

Background: The COVID-19 pandemic contributes to the burden of living with different diseases, including Systemic Lupus Erythematosus (SLE). We described, from a narrative point of view, the experiences and perspectives of Italian SLE adults during the COVID-19 emergency, by distinguishing the illness experience before and after the lockdown. Methods: Fifteen patients were invited to participate. Illness narratives were collected between 22 and 29 March 2020 using a written modality to capture patients’ perspectives before and after the COVID-19 lockdown. We performed a two-fold analysis of collected data by distinguishing three narrative types and a qualitative analysis of content to identify the relevant themes and sub-themes reported. Results: Eight narratives included in the final analysis (mean length 436.9 words) have been written by eight females (mean age 43.3 ± 9.9 years, mean disease duration 13.1 ± 7.4 years). Six patients provided a quest narrative, one a chaos and the remaining one a restitution narrative. By text content analysis, we identified specific themes, temporally distinct before and after the lockdown. Before COVID-19, all the patients referred to a good control of disease, however the unexpected arrival of the COVID-19 emergency broke a balance, and patients perceived the loss of health status control, with anxiety and stress. Conclusions: We provided unique insight into the experiences of people with SLE at the time of COVID-19, underlining the perspective of patients in relation to the pandemic.

Download Full-text

Text Content Based Layout Analysis

2020 17th International Conference on Frontiers in Handwriting Recognition (ICFHR) ◽

10.1109/icfhr2020.2020.00055 ◽

2020 ◽

Author(s):

Jose Ramon Prieto ◽

Vicente Bosch ◽

Enrique Vidal ◽

Dominique Stutzmann ◽

Sebastien Hamel

Keyword(s):

Layout Analysis ◽

Text Content

Download Full-text

A Densely Connected GRU Neural Network Based on Coattention Mechanism for Chinese Rice-Related Question Similarity Matching

Agronomy ◽

10.3390/agronomy11071307 ◽

2021 ◽

Vol 11 (7) ◽

pp. 1307

Author(s):

Haoriqin Wang ◽

Huaji Zhu ◽

Huarui Wu ◽

Xiaomin Wang ◽

Xiao Han ◽

...

Keyword(s):

A Comparative Analysis of Arabic Text Steganography

Applied Sciences ◽

10.3390/app11156851 ◽

2021 ◽

Vol 11 (15) ◽

pp. 6851

Author(s):

Reema Thabit ◽

Nur Izura Udzir ◽

Sharifah Md Yasin ◽

Aziah Asmawi ◽

Nuur Alifah Roslan ◽

...

Keyword(s):

Evaluation Criteria ◽

Arabic Language ◽

Text Messages ◽

Secret Message ◽

Sensitive Information ◽

Arabic Text ◽

Ideal Object ◽

Internet Users ◽

Text Steganography ◽

Text Content

Protecting sensitive information transmitted via public channels is a significant issue faced by governments, militaries, organizations, and individuals. Steganography protects the secret information by concealing it in a transferred object such as video, audio, image, text, network, or DNA. As text uses low bandwidth, it is commonly used by Internet users in their daily activities, resulting a vast amount of text messages sent daily as social media posts and documents. Accordingly, text is the ideal object to be used in steganography, since hiding a secret message in a text makes it difficult for the attacker to detect the hidden message among the massive text content on the Internet. Language’s characteristics are utilized in text steganography. Despite the richness of the Arabic language in linguistic characteristics, only a few studies have been conducted in Arabic text steganography. To draw further attention to Arabic text steganography prospects, this paper reviews the classifications of these methods from its inception. For analysis, this paper presents a comprehensive study based on the key evaluation criteria (i.e., capacity, invisibility, robustness, and security). It opens new areas for further research based on the trends in this field.

Download Full-text

Extractive Summarization Based on Dynamic Memory Network

Symmetry ◽

10.3390/sym13040600 ◽

2021 ◽

Vol 13 (4) ◽

pp. 600

Author(s):

Ping Li ◽

Jiong Yu

Keyword(s):

Dynamic Memory ◽

Extractive Summarization ◽

Model Based ◽

Network Method ◽

Comparable Performance ◽

Benchmark Datasets ◽

Memory Network ◽

Text Features

We present an extractive summarization model based on the Bert and dynamic memory network. The model based on Bert uses the transformer to extract text features and uses the pre-trained model to construct the sentence embeddings. The model based on Bert labels the sentences automatically without using any hand-crafted features and the datasets are symmetry labeled. We also present a dynamic memory network method for extractive summarization. Experiments are conducted on several summarization benchmark datasets. Our model shows comparable performance compared with other extractive summarization methods.

Download Full-text

Enhancement of email spam detection using improved deep learning algorithms for cyber security

Journal of Computer Security ◽

10.3233/jcs-200111 ◽

2021 ◽

pp. 1-34

Author(s):

Kadam Vikas Samarthrao ◽

Vandana M. Rohokale

Keyword(s):

Feature Selection ◽

Deep Learning ◽

Visual Features ◽

Spam Detection ◽

Learning Approaches ◽

Learning Technique ◽

Text Features ◽

Optimal Feature Selection ◽

Optimal Feature ◽

Email Spam

Email has sustained to be an essential part of our lives and as a means for better communication on the internet. The challenge pertains to the spam emails residing a large amount of space and bandwidth. The defect of state-of-the-art spam filtering methods like misclassification of genuine emails as spam (false positives) is the rising challenge to the internet world. Depending on the classification techniques, literature provides various algorithms for the classification of email spam. This paper tactics to develop a novel spam detection model for improved cybersecurity. The proposed model involves several phases like dataset acquisition, feature extraction, optimal feature selection, and detection. Initially, the benchmark dataset of email is collected that involves both text and image datasets. Next, the feature extraction is performed using two sets of features like text features and visual features. In the text features, Term Frequency-Inverse Document Frequency (TF-IDF) is extracted. For the visual features, color correlogram and Gray-Level Co-occurrence Matrix (GLCM) are determined. Since the length of the extracted feature vector seems to the long, the optimal feature selection process is done. The optimal feature selection is performed by a new meta-heuristic algorithm called Fitness Oriented Levy Improvement-based Dragonfly Algorithm (FLI-DA). Once the optimal features are selected, the detection is performed by the hybrid learning technique that is composed of two deep learning approaches named Recurrent Neural Network (RNN) and Convolutional Neural Network (CNN). For improving the performance of existing deep learning approaches, the number of hidden neurons of RNN and CNN is optimized by the same FLI-DA. Finally, the optimized hybrid learning technique having CNN and RNN classifies the data into spam and ham. The experimental outcomes show the ability of the proposed method to perform the spam email classification based on improved deep learning.

Download Full-text

Predicting Implicit User Preferences with Multimodal Feature Fusion for Similar User Recommendation in Social Media

Applied Sciences ◽

10.3390/app11031064 ◽

2021 ◽

Vol 11 (3) ◽

pp. 1064

Author(s):

Jenq-Haur Wang ◽

Yen-Tsang Wu ◽

Long Wang

Keyword(s):

Social Media ◽

Feature Fusion ◽

Relevant Information ◽

Image Features ◽

User Preferences ◽

User Preference ◽

Late Fusion ◽

Multimodal Features ◽

Fusion Methods ◽

Text Features

In social networks, users can easily share information and express their opinions. Given the huge amount of data posted by many users, it is difficult to search for relevant information. In addition to individual posts, it would be useful if we can recommend groups of people with similar interests. Past studies on user preference learning focused on single-modal features such as review contents or demographic information of users. However, such information is usually not easy to obtain in most social media without explicit user feedback. In this paper, we propose a multimodal feature fusion approach to implicit user preference prediction which combines text and image features from user posts for recommending similar users in social media. First, we use the convolutional neural network (CNN) and TextCNN models to extract image and text features, respectively. Then, these features are combined using early and late fusion methods as a representation of user preferences. Lastly, a list of users with the most similar preferences are recommended. The experimental results on real-world Instagram data show that the best performance can be achieved when we apply late fusion of individual classification results for images and texts, with the best average top-k accuracy of 0.491. This validates the effectiveness of utilizing deep learning methods for fusing multimodal features to represent social user preferences. Further investigation is needed to verify the performance in different types of social media.

Download Full-text

Does One Size Fit All? Exploring the Contribution of Text features, Text content, and Grade of Use on Comprehension

A Method of Readability Assessment for Web Documents Using Text Features and HTML Structures

Solving the Message Classification Problem in Voice Interaction Systems

Historical Trends in the Development of the English Academic Medical Written Text: Content Organization

Systemic Lupus Erythematosus before and after COVID-19 Lockdown: How the Perception of Disease Changes through the Lenses of Narrative Medicine

Text Content Based Layout Analysis

A Densely Connected GRU Neural Network Based on Coattention Mechanism for Chinese Rice-Related Question Similarity Matching

A Comparative Analysis of Arabic Text Steganography

Extractive Summarization Based on Dynamic Memory Network

Enhancement of email spam detection using improved deep learning algorithms for cyber security

Predicting Implicit User Preferences with Multimodal Feature Fusion for Similar User Recommendation in Social Media

Export Citation Format