On Graph Mining With Deep Learning: Introducing Model R for Link Weight Prediction

Abstract Deep learning has been successful in various domains including image recognition, speech recognition and natural language processing. However, research on its application in graph mining is still at an early stage. Here we present Model R, a neural network model created to provide a deep learning approach to the link weight prediction problem. This model uses a node embedding technique that extracts node embeddings (knowledge of nodes) from the known links’ weights (relations between nodes) and uses this knowledge to predict the unknown links’ weights. We demonstrate the power of Model R through experiments and compare it with the stochastic block model and its derivatives. Model R shows that deep learning can be successfully applied to link weight prediction, and it outperforms the stochastic block model and its derivatives by up to 73% in terms of prediction accuracy. We analyze the node embeddings to confirm that closeness in embedding space correlates with stronger relationships as measured by the link weight. We anticipate this new approach will provide effective solutions to more graph mining tasks.
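
The idea of learning node embeddings from known link weights and reusing them for prediction can be illustrated with a small sketch. This is not Model R itself: the abstract does not give the architecture, so the sketch swaps in a plain dot product between a source embedding and a target embedding, trained by stochastic gradient descent on squared error. The function names, the toy links and all hyperparameters are assumptions made for illustration.

```python
import random

def train_embeddings(links, num_nodes, dim=4, epochs=2000, lr=0.05, seed=0):
    """Learn one source and one target embedding per node from weighted links.

    Assumption: predicted weight = dot(src[u], dst[v]). This is a simplified
    stand-in for a neural predictor, not the published Model R architecture.
    """
    rng = random.Random(seed)
    src = [[rng.uniform(-0.5, 0.5) for _ in range(dim)] for _ in range(num_nodes)]
    dst = [[rng.uniform(-0.5, 0.5) for _ in range(dim)] for _ in range(num_nodes)]
    for _ in range(epochs):
        for u, v, w in links:
            pred = sum(a * b for a, b in zip(src[u], dst[v]))
            err = pred - w  # gradient of 0.5 * (pred - w)^2 with respect to pred
            for k in range(dim):
                su, dv = src[u][k], dst[v][k]  # keep old values for a simultaneous update
                src[u][k] -= lr * err * dv
                dst[v][k] -= lr * err * su
    return src, dst

def predict(src, dst, u, v):
    """Predict the weight of the link u -> v from the learned embeddings."""
    return sum(a * b for a, b in zip(src[u], dst[v]))

# Hypothetical toy graph: (source node, target node, link weight).
known_links = [(0, 1, 0.9), (1, 2, 0.8), (0, 2, 0.7), (2, 3, 0.1), (3, 0, 0.2)]
src, dst = train_embeddings(known_links, num_nodes=4)

# Mean squared error on the known links after training.
mse = sum((predict(src, dst, u, v) - w) ** 2 for u, v, w in known_links) / len(known_links)

# Weight estimate for a link that was not in the training data.
unknown_estimate = predict(src, dst, 1, 3)
```

With only five training links the estimate for the unseen link is not meaningful; the sketch only shows the mechanics of fitting embeddings to known weights and then querying an unknown pair.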

Text Data Augmentation for Deep Learning

10.21203/rs.3.rs-650804/v1 ◽

2021 ◽

Author(s):

Connor Shorten ◽

Taghi M. Khoshgoftaar ◽

Borko Furht

Keyword(s):

Deep Learning ◽

Language Processing ◽

Data Augmentation ◽

Early Stage ◽

Practical Implementation ◽

Text Data ◽

Training Strategy ◽

Local Decision ◽

Decision Boundaries

Abstract Natural Language Processing (NLP) is one of the most captivating applications of Deep Learning. In this survey, we consider how the Data Augmentation training strategy can aid in its development. We begin with the major motifs of Data Augmentation summarized into strengthening local decision boundaries, brute force training, causality and counterfactual examples, and the distinction between meaning and form. We follow these motifs with a concrete list of augmentation frameworks that have been developed for text data. Deep Learning generally struggles with the measurement of generalization and characterization of overfitting. We highlight studies that cover how augmentations can construct test sets for generalization. NLP is at an early stage in applying Data Augmentation compared to Computer Vision. We highlight the key differences and promising ideas that have yet to be tested in NLP. For the sake of practical implementation, we describe tools that facilitate Data Augmentation such as the use of consistency regularization, controllers, and offline and online augmentation pipelines, to preview a few. Finally, we discuss interesting topics around Data Augmentation in NLP such as task-specific augmentations, the use of prior knowledge in self-supervised learning versus Data Augmentation, intersections with transfer and multi-task learning, and ideas for AI-GAs (AI-Generating Algorithms). We hope this paper inspires further research interest in Text Data Augmentation.

FastPacket: Towards Pre-trained Packets Embedding based on FastText for next-generation NIDS

10.21203/rs.3.rs-555961/v1 ◽

2021 ◽

Author(s):

Khloud Al Jallad

Keyword(s):

Big Data ◽

Deep Learning ◽

Language Processing ◽

Intrusion Detection Systems ◽

Dynamic Feature ◽

Statistical Features ◽

Text Data ◽

New Approach ◽

New Era ◽

Detection Systems

Abstract Attackers use new attacks every day, but many of these go undetected by Intrusion Detection Systems (IDS), because most IDS ignore raw packet information and rely only on basic statistical information extracted from PCAP files. Using networking programs to extract fixed statistical features from packets is useful, but may not be enough to meet today's challenges. We think that it is time to utilize big data and deep learning for automatic dynamic feature extraction from packets. It is also time to take inspiration from pre-trained deep learning models in computer vision and natural language processing, so that deep learning solutions for security will have their own models, pre-trained on big datasets, for use in future research. In this paper, we propose a new approach for embedding packets based on character-level embeddings, inspired by the success of FastText on text data. We call this approach FastPacket. Results are measured on subsets of the CIC-IDS-2017 dataset, but we expect promising results from models pre-trained on big data. We suggest building a pre-trained FastPacket on the MAWI big dataset and making it available to the community, similar to FastText, in order to outperform currently used NIDS and to start a new era of packet-level NIDS that can better detect complex attacks.
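
A character-level embedding in the FastText style can be sketched as follows: a token is split into character n-grams, each n-gram is hashed into a fixed table of vectors, and the token vector is the average of those rows. The abstract does not say how FastPacket turns a packet into text, so rendering the packet bytes as a hex string is an assumption here, as are the function names, the n-gram range, the table size and the randomly initialised (untrained) table.

```python
import random
import zlib

def char_ngrams(token, n_min=3, n_max=5):
    """Return all character n-grams of the token, padded with < and > markers."""
    padded = f"<{token}>"
    return [padded[i:i + n]
            for n in range(n_min, n_max + 1)
            for i in range(len(padded) - n + 1)]

def make_table(buckets=1000, dim=8, seed=0):
    """Build a table of n-gram vectors (random here; a real model would train them)."""
    rng = random.Random(seed)
    return [[rng.uniform(-1.0, 1.0) for _ in range(dim)] for _ in range(buckets)]

def embed(token, table):
    """Average the hashed n-gram vectors of a token into one embedding."""
    dim = len(table[0])
    grams = char_ngrams(token)
    if not grams:  # token too short to yield any n-gram
        return [0.0] * dim
    vec = [0.0] * dim
    for gram in grams:
        # crc32 gives a deterministic hash, unlike the built-in hash() on str.
        row = table[zlib.crc32(gram.encode("utf-8")) % len(table)]
        for k in range(dim):
            vec[k] += row[k]
    return [x / len(grams) for x in vec]

# Assumption: a packet is represented as the hex string of its raw bytes.
packet_hex = bytes([0x45, 0x00, 0x00, 0x3C]).hex()
table = make_table()
packet_vector = embed(packet_hex, table)
```

Because the vector is built from sub-strings, two packets that share byte sequences share n-gram rows, which is the property that lets FastText-style models handle unseen tokens.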

Research on application of GPS-based wireless communication system in highway landslide

EURASIP Journal on Wireless Communications and Networking ◽

10.1186/s13638-021-02038-7 ◽

2021 ◽

Vol 2021 (1) ◽

Author(s):

Zhiwen Xiong

Keyword(s):

Machine Learning ◽

Deep Learning ◽

Wireless Communication ◽

Language Processing ◽

Learning Algorithm ◽

Detection System ◽

Early Stage ◽

Small Error ◽

Deformation Monitoring ◽

Analysis Method

Abstract Machine learning is a branch of the field of artificial intelligence. Deep learning is a complex machine learning algorithm that has unique advantages in image recognition, speech recognition, natural language processing, and industrial process control. It is widely used in the field of wireless communication. Prediction of geological disasters (such as landslides) is currently a difficult problem. Because landslides are difficult to detect at an early stage, this paper proposes a GPS-based wireless communication continuous detection system and applies it to landslide deformation monitoring to achieve early treatment and prevention. This article introduces the GPS multi-antenna detection system based on deep learning wireless communication, as well as the time series analysis method and its application. The test results show that the GPS multi-antenna detection system of the wireless communication network has great advantages in response time, with high accuracy and small error. The horizontal accuracy is controlled at 0–2 mm and the vertical accuracy is about 1 mm. The analysis method is simple and efficient, and can obtain good results for short-term deformation prediction.

Deep learning approach to link weight prediction

2017 International Joint Conference on Neural Networks (IJCNN) ◽

10.1109/ijcnn.2017.7966076 ◽

2017 ◽

Cited By ~ 1

Author(s):

Yuchen Hou ◽

Lawrence B. Holder

Keyword(s):

Deep Learning ◽

Learning Approach ◽

Link Weight ◽

Weight Prediction

Text Data Augmentation for Deep Learning

Journal Of Big Data ◽

10.1186/s40537-021-00492-0 ◽

2021 ◽

Vol 8 (1) ◽

Author(s):

Connor Shorten ◽

Taghi M. Khoshgoftaar ◽

Borko Furht

Keyword(s):

Deep Learning ◽

Language Processing ◽

Data Augmentation ◽

Early Stage ◽

Practical Implementation ◽

Text Data ◽

Training Strategy ◽

Local Decision ◽

Decision Boundaries

Abstract Natural Language Processing (NLP) is one of the most captivating applications of Deep Learning. In this survey, we consider how the Data Augmentation training strategy can aid in its development. We begin with the major motifs of Data Augmentation summarized into strengthening local decision boundaries, brute force training, causality and counterfactual examples, and the distinction between meaning and form. We follow these motifs with a concrete list of augmentation frameworks that have been developed for text data. Deep Learning generally struggles with the measurement of generalization and characterization of overfitting. We highlight studies that cover how augmentations can construct test sets for generalization. NLP is at an early stage in applying Data Augmentation compared to Computer Vision. We highlight the key differences and promising ideas that have yet to be tested in NLP. For the sake of practical implementation, we describe tools that facilitate Data Augmentation such as the use of consistency regularization, controllers, and offline and online augmentation pipelines, to preview a few. Finally, we discuss interesting topics around Data Augmentation in NLP such as task-specific augmentations, the use of prior knowledge in self-supervised learning versus Data Augmentation, intersections with transfer and multi-task learning, and ideas for AI-GAs (AI-Generating Algorithms). We hope this paper inspires further research interest in Text Data Augmentation.

Prediction of ambulatory outcome in patients with corona radiata infarction using deep learning

Scientific Reports ◽

10.1038/s41598-021-87176-0 ◽

2021 ◽

Vol 11 (1) ◽

Author(s):

Jeoung Kun Kim ◽

Yoo Jin Choo ◽

Hyunkwang Shin ◽

Gyu Sang Choi ◽

Min Cheol Chang

Keyword(s):

Deep Learning ◽

Language Processing ◽

Early Stage ◽

Model Performance ◽

Area Under The Curve ◽

Brain Magnetic Resonance Imaging ◽

Validation Dataset ◽

Corona Radiata ◽

Magnetic Resonance Imaging (MRI) ◽

Validation Set

Abstract Deep learning (DL) is an advanced machine learning approach used in diverse areas such as bioinformatics, image analysis, and natural language processing. Here, using brain magnetic resonance imaging (MRI) data obtained at early stages of infarcts, we attempted to develop a convolutional neural network (CNN) to predict the ambulatory outcome of corona radiata infarction at six months after onset. We retrospectively recruited 221 patients with corona radiata infarcts. A favorable outcome of ambulatory function was defined as a functional ambulation category (FAC) score of ≥ 4 (able to walk without a guardian’s assistance), and a poor outcome of ambulatory function was defined as an FAC score of < 4. We used a CNN algorithm. Of the included subjects, 69.7% (n = 154) were assigned randomly to the training set and the remaining 30.3% (n = 67) were assigned to the validation set to measure the model performance. The area under the curve was 0.751 (95% CI 0.649–0.852) for the prediction of ambulatory function with the validation dataset using the CNN model. We demonstrated that a CNN model trained using brain MRIs captured at an early stage after corona radiata infarction could be helpful in predicting long-term ambulatory outcomes.
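
The split percentages and the area under the curve reported above can be made concrete with a short sketch. The patient counts come from the abstract; the labels and scores in the example are invented toy values, and `roc_auc` is an illustrative helper that computes the AUC as the fraction of (favorable, poor) pairs ranked in the right order, with ties counted as half. It is not the authors' evaluation code.

```python
def roc_auc(labels, scores):
    """AUC as the probability that a positive case outscores a negative case."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative case")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

# Split sizes stated in the abstract: 154 training and 67 validation patients.
total, n_train, n_val = 221, 154, 67
train_pct = round(100 * n_train / total, 1)
val_pct = round(100 * n_val / total, 1)

# Toy validation data (hypothetical): 1 = favorable outcome, 0 = poor outcome.
toy_labels = [1, 1, 0, 0]
toy_scores = [0.9, 0.4, 0.6, 0.1]
toy_auc = roc_auc(toy_labels, toy_scores)  # 3 of 4 pairs are ordered correctly
```

An AUC of 0.5 corresponds to random ranking and 1.0 to perfect ranking, which gives a scale for reading the reported 0.751.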

Deep Learning Based High-Resolution Remote Sensing Image classification

International Journal of Advanced Research in Computer Science and Software Engineering ◽

10.23956/ijarcsse.v7i10.384 ◽

2017 ◽

Vol 7 (10) ◽

pp. 22

Author(s):

Sumit Kaur

Keyword(s):

Machine Learning ◽

Remote Sensing ◽

Deep Learning ◽

Image Classification ◽

Language Processing ◽

Object Perception ◽

Remote Sensing Image ◽

Research Area ◽

Remote Sensing Image Classification ◽

Unsupervised Algorithms

Abstract Deep learning is an emerging research area in the field of machine learning and pattern recognition, introduced with the goal of bringing machine learning closer to one of its unique objectives, Artificial Intelligence. It tries to mimic the human brain, which is capable of processing and learning from complex input data and of solving many kinds of complicated tasks well. Deep learning (DL) is based on a set of supervised and unsupervised algorithms that attempt to model higher-level abstractions in data and to learn hierarchical representations for classification. In recent years, it has attracted much attention due to its state-of-the-art performance in diverse areas such as object perception, speech recognition, computer vision, collaborative filtering and natural language processing. This paper presents a survey of different deep learning techniques for remote sensing image classification.

Fast Algorithm on Stochastic Block Model for Exploring General Communities

Journal of Software ◽

10.3724/sp.j.1001.2013.04474 ◽

2014 ◽

Vol 24 (11) ◽

pp. 2699-2709 ◽

Cited By ~ 1

Author(s):

Bian-Fang CHAI ◽

Jian YU ◽

Cai-Yan JIA ◽

Jing-Hong WANG

Keyword(s):

Fast Algorithm ◽

Block Model ◽

Stochastic Block Model

Fast Multi-Focus Fusion Based on Deep Learning for Early-Stage Embryo Image Enhancement

Sensors ◽

10.3390/s21030863 ◽

2021 ◽

Vol 21 (3) ◽

pp. 863

Author(s):

Vidas Raudonis ◽

Agne Paulauskaite-Taraseviciene ◽

Kristina Sutiene

Keyword(s):

Deep Learning ◽

Image Fusion ◽

Early Stage ◽

Image Data ◽

Cell Detection ◽

Processing Times ◽

Fused Image ◽

Stage Embryo ◽

Early Stage Embryo

Background: Cell detection and counting is of essential importance in evaluating the quality of early-stage embryos. Full automation of this process remains a challenging task due to differences in cell size and shape, the presence of incomplete cell boundaries, and partially or fully overlapping cells. Moreover, the algorithm to be developed should process a large amount of image data of varying quality in a reasonable amount of time. Methods: A multi-focus image fusion approach based on the deep learning U-Net architecture is proposed in the paper, which allows reducing the amount of data by up to 7 times without losing the spectral information required for embryo enhancement in the microscopic image. Results: The experiment includes visual and quantitative analysis by estimating image similarity metrics and processing times, which are compared to the results achieved by two well-known techniques: Inverse Laplacian Pyramid Transform and Enhanced Correlation Coefficient Maximization. Conclusion: Comparatively, the image fusion time is substantially improved for different image resolutions, whilst ensuring the high quality of the fused image.

Community detection in the sparse hypergraph stochastic block model

Random Structures and Algorithms ◽

10.1002/rsa.21006 ◽

2021 ◽

Author(s):

Soumik Pal ◽

Yizhe Zhu

Keyword(s):

Community Detection ◽

Block Model ◽

Stochastic Block Model
