data augmentation Latest Research Papers

The main objective of Personalized Tour Recommendation (PTR) is to generate a sequence of point-of-interest (POIs) for a particular tourist, according to the user-specific constraints such as duration time, start and end points, the number of attractions planned to visit, and so on. Previous PTR solutions are based on either heuristics for solving the orienteering problem to maximize a global reward with a specified budget or approaches attempting to learn user visiting preferences and transition patterns with the stochastic process or recurrent neural networks. However, existing learning methodologies rely on historical trips to train the model and use the next visited POI as the supervised signal, which may not fully capture the coherence of preferences and thus recommend similar trips to different users, primarily due to the data sparsity problem and long-tailed distribution of POI popularity. This work presents a novel tour recommendation model by distilling knowledge and supervision signals from the trips in a self-supervised manner. We propose Contrastive Trajectory Learning for Tour Recommendation (CTLTR), which utilizes the intrinsic POI dependencies and traveling intent to discover extra knowledge and augments the sparse data via pre-training auxiliary self-supervised objectives. CTLTR provides a principled way to characterize the inherent data correlations while tackling the implicit feedback and weak supervision problems by learning robust representations applicable for tour planning. We introduce a hierarchical recurrent encoder-decoder to identify tourists’ intentions and use the contrastive loss to discover subsequence semantics and their sequential patterns through maximizing the mutual information. Additionally, we observe that a data augmentation step as the preliminary of contrastive learning can solve the overfitting issue resulting from data sparsity. We conduct extensive experiments on a range of real-world datasets and demonstrate that our model can significantly improve the recommendation performance over the state-of-the-art baselines in terms of both recommendation accuracy and visiting orders.

Download Full-text

DSSAE: Deep Stacked Sparse Autoencoder Analytical Model for COVID-19 Diagnosis by Fractional Fourier Entropy

ACM Transactions on Management Information Systems ◽

10.1145/3451357 ◽

2022 ◽

Vol 13 (1) ◽

pp. 1-20

Author(s):

Shui-Hua Wang ◽

Xin Zhang ◽

Yu-Dong Zhang

Keyword(s):

Data Augmentation ◽

State Of The Art ◽

Community Acquired Pneumonia ◽

Chest Ct ◽

Two Dimensional ◽

The World ◽

Healthy Control ◽

Sparse Autoencoder ◽

Stacked Sparse Autoencoder ◽

Fractional Fourier Entropy

( Aim ) COVID-19 has caused more than 2.28 million deaths till 4/Feb/2021 while it is still spreading across the world. This study proposed a novel artificial intelligence model to diagnose COVID-19 based on chest CT images. ( Methods ) First, the two-dimensional fractional Fourier entropy was used to extract features. Second, a custom deep stacked sparse autoencoder (DSSAE) model was created to serve as the classifier. Third, an improved multiple-way data augmentation was proposed to resist overfitting. ( Results ) Our DSSAE model obtains a micro-averaged F1 score of 92.32% in handling a four-class problem (COVID-19, community-acquired pneumonia, secondary pulmonary tuberculosis, and healthy control). ( Conclusion ) Our method outperforms 10 state-of-the-art approaches.

Download Full-text

Expansion dataset COVID-19 chest X-ray using data augmentation and histogram equalization

International Journal of Electrical and Computer Engineering (IJECE) ◽

10.11591/ijece.v12i2.pp1904-1909 ◽

2022 ◽

Vol 12 (2) ◽

pp. 1904

Author(s):

Farah Flayeh Alkhalid ◽

Abdulhakeem Qusay Albayati ◽

Ahmed Ali Alhammad

Keyword(s):

Data Augmentation ◽

State Of The Art ◽

Virus Disease ◽

Histogram Equalization ◽

Vital Role ◽

X Ray ◽

Current State ◽

Proposed Model ◽

Chest X Ray ◽

Using Data

The main important factor that plays vital role in success the deep learning is the deep training by many and many images, if neural networks are getting bigger and bigger but the training datasets are not, then it sounds like going to hit an accuracy wall. Briefly, this paper investigates the current state of the art of approaches used for a data augmentation for expansion the corona virus disease 2019 (COVID-19) chest X-ray images using different data augmentation methods (transformation and enhancement) the dataset expansion helps to rise numbers of images from 138 to 5520, the increasing rate is 3,900%, this proposed model can be used to expand any type of image dataset, in addition, the dataset have used with convolutional neural network (CNN) model to make classification if detected infection with COVID-19 in X-ray, the results have gotten high training accuracy=99%

Download Full-text

A state-classification approach for light-weight robotic drilling using model-based data augmentation and multi-level deep learning

Mechanical Systems and Signal Processing ◽

10.1016/j.ymssp.2021.108480 ◽

2022 ◽

Vol 167 ◽

pp. 108480

Author(s):

Hao Lu ◽

Xingwei Zhao ◽

Bo Tao ◽

Han Ding

Keyword(s):

Deep Learning ◽

Data Augmentation ◽

Light Weight ◽

Classification Approach ◽

Model Based ◽

Robotic Drilling ◽

State Classification ◽

Multi Level

Download Full-text

Low-Resource Language Discrimination toward Chinese Dialects with Transfer Learning and Data Augmentation

ACM Transactions on Asian and Low-Resource Language Information Processing ◽

10.1145/3473499 ◽

2022 ◽

Vol 21 (2) ◽

pp. 1-21

Author(s):

Fan Xu ◽

Yangjie Dan ◽

Keyu Yan ◽

Yong Ma ◽

Mingwen Wang

Keyword(s):

Transfer Learning ◽

Language Processing ◽

Data Augmentation ◽

Semantic Representation ◽

Semantic Features ◽

Language Discrimination ◽

Low Resource ◽

Chinese Dialects ◽

Fine Tune ◽

Target Side

Chinese dialects discrimination is a challenging natural language processing task due to scarce annotation resource. In this article, we develop a novel Chinese dialects discrimination framework with transfer learning and data augmentation (CDDTLDA) in order to overcome the shortage of resources. To be more specific, we first use a relatively larger Chinese dialects corpus to train a source-side automatic speech recognition (ASR) model. Then, we adopt a simple but effective data augmentation method (i.e., speed, pitch, and noise disturbance) to augment the target-side low-resource Chinese dialects, and fine-tune another target ASR model based on the previous source-side ASR model. Meanwhile, the potential common semantic features between source-side and target-side ASR models can be captured by using self-attention mechanism. Finally, we extract the hidden semantic representation in the target ASR model to conduct Chinese dialects discrimination. Our extensive experimental results demonstrate that our model significantly outperforms state-of-the-art methods on two benchmark Chinese dialects corpora.

Download Full-text

A Deep Learning Approach for Basal Cell Carcinomas and Bowen’s Disease Recognition in Dermatopathology Image

Journal of Biomaterials and Tissue Engineering ◽

10.1166/jbt.2022.2982 ◽

2022 ◽

Vol 12 (5) ◽

pp. 879-887

Author(s):

Jiantao Zhang ◽

Xiaobo Zhang ◽

Dong Qu ◽

Yan Xue ◽

Xinling Bi ◽

...

Keyword(s):

Deep Learning ◽

Cell Carcinoma ◽

Basal Cell ◽

Data Augmentation ◽

Recognition Accuracy ◽

Clinical Samples ◽

Bowen’S Disease ◽

Deep Convolutional Neural Networks ◽

Bowen's Disease ◽

Basal Cell Carcinomas

Basal cell carcinomas and Bowen’s disease (squamous cell carcinoma in situ) are the most common cutaneous tumors. The early diagnoses of these diseases are very important due to their better prognosis. But it is a heavy workload for the pathologists to recognize a large number of pathological images and diagnose these diseases. So, there is an urgent need to develop an automatic method for detecting and classifying the skin cancers. This paper presents a recognition system of dermatopathology images based on the deep convolutional neural networks (CNN). The dermatopathology images are collected from the hospital. The deep learning model is trained using different image datasets. It can be found that the recognition accuracy of the system can be improved by using data augmentation even if the number of the clinical samples are not increased. But the recognition accuracy of the system is the highest when the number of the original histological image is increased. The experimental results that the system can correctly recognize 88.5% of patients with basal cell carcinoma and 86.5% of patients with Bowen’s disease.

Download Full-text

Diagnosis of small-sample measured electromagnetic transients in power system using DRN-LSTM and data augmentation

International Journal of Electrical Power & Energy Systems ◽

10.1016/j.ijepes.2021.107820 ◽

2022 ◽

Vol 137 ◽

pp. 107820

Author(s):

Wenxia Sima ◽

Han Zhang ◽

Ming Yang ◽

Xiuhua Li

Keyword(s):

Power System ◽

Data Augmentation ◽

Small Sample ◽

Electromagnetic Transients

Download Full-text

ANT-UNet: Accurate and Noise-Tolerant Segmentation for Pathology Image Processing

ACM Journal on Emerging Technologies in Computing Systems ◽

10.1145/3451213 ◽

2022 ◽

Vol 18 (2) ◽

pp. 1-17

Author(s):

Yufei Chen ◽

Tingtao Li ◽

Qinming Zhang ◽

Wei Mao ◽

Nan Guan ◽

...

Keyword(s):

Image Processing ◽

Network Architecture ◽

Data Augmentation ◽

Complex Nature ◽

Effective Option ◽

Segmentation Approach ◽

Speed Up ◽

Detection And Diagnosis ◽

Noise Tolerant ◽

Pathology Image

Pathology image segmentation is an essential step in early detection and diagnosis for various diseases. Due to its complex nature, precise segmentation is not a trivial task. Recently, deep learning has been proved as an effective option for pathology image processing. However, its efficiency is highly restricted by inconsistent annotation quality. In this article, we propose an accurate and noise-tolerant segmentation approach to overcome the aforementioned issues. This approach consists of two main parts: a preprocessing module for data augmentation and a new neural network architecture, ANT-UNet. Experimental results demonstrate that, even on a noisy dataset, the proposed approach can achieve more accurate segmentation with 6% to 35% accuracy improvement versus other commonly used segmentation methods. In addition, the proposed architecture is hardware friendly, which can reduce the amount of parameters to one-tenth of the original and achieve 1.7× speed-up.

Download Full-text

GaitSense: Towards Ubiquitous Gait-Based Human Identification with Wi-Fi

ACM Transactions on Sensor Networks ◽

10.1145/3466638 ◽

2022 ◽

Vol 18 (1) ◽

pp. 1-24

Author(s):

Yi Zhang ◽

Yue Zheng ◽

Guidong Zhang ◽

Kun Qian ◽

Chen Qian ◽

...

Keyword(s):

Data Augmentation ◽

Gait Recognition ◽

Wearable Sensors ◽

Human Identification ◽

Training Data ◽

Identification Accuracy ◽

Identification System ◽

Gait Patterns ◽

Training Samples ◽

Augmentation Techniques

Gait, the walking manner of a person, has been perceived as a physical and behavioral trait for human identification. Compared with cameras and wearable sensors, Wi-Fi-based gait recognition is more attractive because Wi-Fi infrastructure is almost available everywhere and is able to sense passively without the requirement of on-body devices. However, existing Wi-Fi sensing approaches impose strong assumptions of fixed user walking trajectories, sufficient training data, and identification of already known users. In this article, we present GaitSense , a Wi-Fi-based human identification system, to overcome the above unrealistic assumptions. To deal with various walking trajectories and speeds, GaitSense first extracts target specific features that best characterize gait patterns and applies novel normalization algorithms to eliminate gait irrelevant perturbation in signals. On this basis, GaitSense reduces the training efforts in new deployment scenarios by transfer learning and data augmentation techniques. GaitSense also enables a distinct feature of illegal user identification by anomaly detection, making the system readily available for real-world deployment. Our implementation and evaluation with commodity Wi-Fi devices demonstrate a consistent identification accuracy across various deployment scenarios with little training samples, pushing the limit of gait recognition with Wi-Fi signals.

Download Full-text

Edge-assisted Collaborative Image Recognition for Mobile Augmented Reality

ACM Transactions on Sensor Networks ◽

10.1145/3469033 ◽

2022 ◽

Vol 18 (1) ◽

pp. 1-31

Author(s):

Guohao Lan ◽

Zida Liu ◽

Yunfan Zhang ◽

Tim Scargill ◽

Jovan Stojkovic ◽

...

Keyword(s):

Augmented Reality ◽

Image Recognition ◽

Large Scale ◽

Data Augmentation ◽

Recognition Accuracy ◽

Image Distortion ◽

Mobile Augmented Reality ◽

The Real ◽

In The Wild ◽

Mobile Ar

Mobile Augmented Reality (AR), which overlays digital content on the real-world scenes surrounding a user, is bringing immersive interactive experiences where the real and virtual worlds are tightly coupled. To enable seamless and precise AR experiences, an image recognition system that can accurately recognize the object in the camera view with low system latency is required. However, due to the pervasiveness and severity of image distortions, an effective and robust image recognition solution for “in the wild” mobile AR is still elusive. In this article, we present CollabAR, an edge-assisted system that provides distortion-tolerant image recognition for mobile AR with imperceptible system latency . CollabAR incorporates both distortion-tolerant and collaborative image recognition modules in its design. The former enables distortion-adaptive image recognition to improve the robustness against image distortions, while the latter exploits the spatial-temporal correlation among mobile AR users to improve recognition accuracy. Moreover, as it is difficult to collect a large-scale image distortion dataset, we propose a Cycle-Consistent Generative Adversarial Network-based data augmentation method to synthesize realistic image distortion. Our evaluation demonstrates that CollabAR achieves over 85% recognition accuracy for “in the wild” images with severe distortions, while reducing the end-to-end system latency to as low as 18.2 ms.

Download Full-text

data augmentation
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

Contrastive Trajectory Learning for Tour Recommendation

DSSAE: Deep Stacked Sparse Autoencoder Analytical Model for COVID-19 Diagnosis by Fractional Fourier Entropy

Expansion dataset COVID-19 chest X-ray using data augmentation and histogram equalization

A state-classification approach for light-weight robotic drilling using model-based data augmentation and multi-level deep learning

Low-Resource Language Discrimination toward Chinese Dialects with Transfer Learning and Data Augmentation

A Deep Learning Approach for Basal Cell Carcinomas and Bowen’s Disease Recognition in Dermatopathology Image

Diagnosis of small-sample measured electromagnetic transients in power system using DRN-LSTM and data augmentation

ANT-UNet: Accurate and Noise-Tolerant Segmentation for Pathology Image Processing

GaitSense: Towards Ubiquitous Gait-Based Human Identification with Wi-Fi

Edge-assisted Collaborative Image Recognition for Mobile Augmented Reality

Export Citation Format

data augmentationRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Contrastive Trajectory Learning for Tour Recommendation

DSSAE: Deep Stacked Sparse Autoencoder Analytical Model for COVID-19 Diagnosis by Fractional Fourier Entropy

Expansion dataset COVID-19 chest X-ray using data augmentation and histogram equalization

A state-classification approach for light-weight robotic drilling using model-based data augmentation and multi-level deep learning

Low-Resource Language Discrimination toward Chinese Dialects with Transfer Learning and Data Augmentation

A Deep Learning Approach for Basal Cell Carcinomas and Bowen’s Disease Recognition in Dermatopathology Image

Diagnosis of small-sample measured electromagnetic transients in power system using DRN-LSTM and data augmentation

ANT-UNet: Accurate and Noise-Tolerant Segmentation for Pathology Image Processing

GaitSense: Towards Ubiquitous Gait-Based Human Identification with Wi-Fi

Edge-assisted Collaborative Image Recognition for Mobile Augmented Reality

data augmentation
Recently Published Documents